tombo2-progress’s diary

できるだけ毎日1時間を切り取ってここに晒す。誤字脱字気にしない。日本語が崩壊するのも気にしない。最終的にまとめて本ブログに書く

Re: [WIP] Effective storage of duplicates in B-tree index. のメーリスを眺めてみた

https://www.postgresql.org/message-id/flat/56AB6D30.2040900%40postgrespro.ru#80cf7838f6285d5eb71475ac606fa5a7 からパフォーマンス影響に関してどれくらい言及されるたのかを中心に眺めてみた

メモそのままです(番号に意味はない)

感想

  • ここに出てくる内容だと、keyの重複が多いとindex sizeを30%~50%削減できるので、パフォーマンスの懸念はほとんどなさそう
  • この機能の提案者、他メンバーが pgbench, TPC-H, TPC-Eのデータで検証していた。
  • "Mouse genome informatics" (http://www.informatics.jax.org/software.shtml)での検証もしていてやはり分析用途で使われているんだなと再認識した
  • 変更に対する指定のベンチマークセットはなさそう。(Mailing listで言及がないだけで別のところで議論されてる可能性はある)

1. 最初の投稿(提案)

Anastasia Lubennikova 2015-08-31 07:41:15

B-tree index中の重複したキーの効率的な格納

Btreeの実装を変更することに対する確認事項

(内容は省略) 1. Compatibility. 2. There are several tricks to handle non-unique keys in B-tree. 3. Microvacuum.

2

Tomas Vondra 2015-08-31 15:26:53

  • 概ね賛成
  • GINと posting listsを実装したbtreeはどう違うのか?
  • posting listsを導入することのhigh concurrencyに対する影響は?

3, 2に対する返信

Anastasia Lubennikova 2015-09-01 09:31:54

  • UNIQUE feature以外の違いについて説明The difference between btree_gin and btree is not only UNIQUE feature. あたり
  • concurrencyの向上はそれほどないと思う

4

Tomas Vondra 2015-09-01 15:41:43

  • ok

5

Peter Geoghegan 2015-09-01 18:23:44

I'm glad someone is thinking about this, because it is certainly needed.

... 他いくつか意見 ...

6

Anastasia Lubennikova 2016-01-28 14:06:57

進捗報告

  • 特別な修正なしに広報互換性は保てそう
  • 各種indexで作成されるindexサイズ比較の結果を載せてる
i - number of distinct values in the index. So i=1 means that all rows
have the same key, and i=10000000 means that all keys are different.
The other columns contain the index size (MB).

i   B-tree Old  B-tree New  GIN
1   214,234375  87,7109375  10,2109375
10  214,234375  87,7109375  10,71875
100     214,234375  87,4375     15,640625
1000    214,234375  86,2578125  31,296875
10000   214,234375  78,421875   104,3046875
100000  214,234375  65,359375   49,078125
1000000     214,234375  90,140625   106,8203125
10000000    214,234375  214,234375  534,0625

今後追加実施予定の機能について

  1. Add storage_parameter 'enable_compression' for btree access method which specifies whether the index handles duplicates. default is 'off'
  2. Bring back microvacuum functionality for compressed indexes.
  3. Improve insertion speed. Insertions became significantly slower with compressed btree, which is obviously not what we do want.
  4. Clean the code and comments, add related documentation.

7

Thom Brown 2016-01-28 17:03:02

pgbenchでパフォーマンス検証した報告

Timing
Scale: master / patch
100: 10657ms / 13555ms (rechecked and got 9745ms)
500: 56909ms / 56985ms

Size
Scale: master / patch
100: 214MB / 87MB (40.7%)
500: 1071MB / 437MB (40.8%)

No performance issues from what I can tell.

8

Peter Geoghegan 2016-01-28 17:09:36

  • I/O boundの負荷かければ結構ちがうはずだよ

9

Thom Brown 2016-01-28 20:32:38

  • 300MBくらいのデータリストアしたらエラー出た

...

10

Anastasia Lubennikova Date: 2016-03-18 17:19:37

  • WAL機能もつけて進捗報告

... 次のバージョンで入れることが可能かの議論 ...

11

Anastasia Lubennikova Date: 2019-07-04 12:06:38

  • 進捗報告
  • reviewとtestを受けられそう
  • (新旧Btreeでのサイズ比較のグラフがわかりやすい)

12

Anastasia Lubennikova Date: 2019-07-04 17:38:21

  • "Mouse genome informatics" databaseのデータを入れたらたいてい50%くらい小さくなった

13

Anastasia Lubennikova <a(dot)lubennikova(at)postgrespro(dot)ru> Date: 2019-07-05 00:06:09

  • TPC-Hのデータで検証したらindex sizeが1/3くらいにできた

14

Rafia Sabih Date: 2019-07-31 14:59:22

  • TPC-Eのi_t_st_id indexは50%小さくなった

goland on ubuntuでctrl + [ がescとして認識されないとき

.ideavimrcに設定を書いても効かなくて困った

  • 結局設定したshortcut key自体がvim (ideavim)のものとして使われていなかった。
  • ショートカットをどのpluginもしくはnativeの設定に対して使うか自体を設定する必要がある。

結論

  • Settings > Editor > Vim Emulationでkeymapごとの設定先を変更可能。
  • ctrl - [ をideavimのものにしてやればあとは設定が聞いてescになった。

定期的に忘れてハマっているのでメモ

簡単なcreate table文をパースする

簡単なcreate table文をパースする

5.7のcreate table文でparserを作る https://dev.mysql.com/doc/refman/5.7/en/create-table.html

syntax 原文

CREATE [TEMPORARY] TABLE [IF NOT EXISTS] tbl_name
    (create_definition,...)
    [table_options]
    [partition_options]

CREATE [TEMPORARY] TABLE [IF NOT EXISTS] tbl_name
    [(create_definition,...)]
    [table_options]
    [partition_options]
    [IGNORE | REPLACE]
    [AS] query_expression

CREATE [TEMPORARY] TABLE [IF NOT EXISTS] tbl_name
    { LIKE old_tbl_name | (LIKE old_tbl_name) }

create_definition:
    col_name column_definition
  | {INDEX|KEY} [index_name] [index_type] (key_part,...)
      [index_option] ...
  | {FULLTEXT|SPATIAL} [INDEX|KEY] [index_name] (key_part,...)
      [index_option] ...
  | [CONSTRAINT [symbol]] PRIMARY KEY
      [index_type] (key_part,...)
      [index_option] ...
  | [CONSTRAINT [symbol]] UNIQUE [INDEX|KEY]
      [index_name] [index_type] (key_part,...)
      [index_option] ...
  | [CONSTRAINT [symbol]] FOREIGN KEY
      [index_name] (col_name,...)
      reference_definition
  | CHECK (expr)

column_definition:
    data_type [NOT NULL | NULL] [DEFAULT default_value]
      [AUTO_INCREMENT] [UNIQUE [KEY]] [[PRIMARY] KEY]
      [COMMENT 'string']
      [COLLATE collation_name]
      [COLUMN_FORMAT {FIXED|DYNAMIC|DEFAULT}]
      [STORAGE {DISK|MEMORY}]
      [reference_definition]
  | data_type
      [COLLATE collation_name]
      [GENERATED ALWAYS] AS (expr)
      [VIRTUAL | STORED] [NOT NULL | NULL]
      [UNIQUE [KEY]] [[PRIMARY] KEY]
      [COMMENT 'string']
      [reference_definition]

data_type:
    (see Chapter 11, Data Types)

key_part:
    col_name [(length)] [ASC | DESC]

index_type:
    USING {BTREE | HASH}

index_option:
    KEY_BLOCK_SIZE [=] value
  | index_type
  | WITH PARSER parser_name
  | COMMENT 'string'

reference_definition:
    REFERENCES tbl_name (key_part,...)
      [MATCH FULL | MATCH PARTIAL | MATCH SIMPLE]
      [ON DELETE reference_option]
      [ON UPDATE reference_option]

reference_option:
    RESTRICT | CASCADE | SET NULL | NO ACTION | SET DEFAULT

table_options:
    table_option [[,] table_option] ...

table_option:
    AUTO_INCREMENT [=] value
  | AVG_ROW_LENGTH [=] value
  | [DEFAULT] CHARACTER SET [=] charset_name
  | CHECKSUM [=] {0 | 1}
  | [DEFAULT] COLLATE [=] collation_name
  | COMMENT [=] 'string'
  | COMPRESSION [=] {'ZLIB'|'LZ4'|'NONE'}
  | CONNECTION [=] 'connect_string'
  | {DATA|INDEX} DIRECTORY [=] 'absolute path to directory'
  | DELAY_KEY_WRITE [=] {0 | 1}
  | ENCRYPTION [=] {'Y' | 'N'}
  | ENGINE [=] engine_name
  | INSERT_METHOD [=] { NO | FIRST | LAST }
  | KEY_BLOCK_SIZE [=] value
  | MAX_ROWS [=] value
  | MIN_ROWS [=] value
  | PACK_KEYS [=] {0 | 1 | DEFAULT}
  | PASSWORD [=] 'string'
  | ROW_FORMAT [=] {DEFAULT|DYNAMIC|FIXED|COMPRESSED|REDUNDANT|COMPACT}
  | STATS_AUTO_RECALC [=] {DEFAULT|0|1}
  | STATS_PERSISTENT [=] {DEFAULT|0|1}
  | STATS_SAMPLE_PAGES [=] value
  | TABLESPACE tablespace_name [STORAGE {DISK|MEMORY}]
  | UNION [=] (tbl_name[,tbl_name]...)

partition_options:
    PARTITION BY
        { [LINEAR] HASH(expr)
        | [LINEAR] KEY [ALGORITHM={1|2}] (column_list)
        | RANGE{(expr) | COLUMNS(column_list)}
        | LIST{(expr) | COLUMNS(column_list)} }
    [PARTITIONS num]
    [SUBPARTITION BY
        { [LINEAR] HASH(expr)
        | [LINEAR] KEY [ALGORITHM={1|2}] (column_list) }
      [SUBPARTITIONS num]
    ]
    [(partition_definition [, partition_definition] ...)]

partition_definition:
    PARTITION partition_name
        [VALUES
            {LESS THAN {(expr | value_list) | MAXVALUE}
            |
            IN (value_list)}]
        [[STORAGE] ENGINE [=] engine_name]
        [COMMENT [=] 'string' ]
        [DATA DIRECTORY [=] 'data_dir']
        [INDEX DIRECTORY [=] 'index_dir']
        [MAX_ROWS [=] max_number_of_rows]
        [MIN_ROWS [=] min_number_of_rows]
        [TABLESPACE [=] tablespace_name]
        [(subpartition_definition [, subpartition_definition] ...)]

subpartition_definition:
    SUBPARTITION logical_name
        [[STORAGE] ENGINE [=] engine_name]
        [COMMENT [=] 'string' ]
        [DATA DIRECTORY [=] 'data_dir']
        [INDEX DIRECTORY [=] 'index_dir']
        [MAX_ROWS [=] max_number_of_rows]
        [MIN_ROWS [=] min_number_of_rows]
        [TABLESPACE [=] tablespace_name]

query_expression:
    SELECT ...   (Some valid select or union statement)

対象とする部分のみ抜き出す

まず簡単なサンプルを作るために構文を簡単にする

サポートしないもの

  • PARTITION, SUBPARTITION, PARTITION_OPTIONS
  • CREATE TABLE AS ... SELECT ...
  • CREATE TABLE LIKE ...
  • CREATE TABLE [IGNORE | REPLACE]
  • FULLTEXT, SPATIAL index
  • TABLE_OPTION
  • CONSTRAINT, CHECK CONSTRAINT
  • index_option

parseするが無視するもの

  • index_type: {BTREE | HASH}
    • BTREEを前提にする
CREATE [TEMPORARY] TABLE [IF NOT EXISTS] tbl_name
    (create_definition,...)

create_definition:
    col_name column_definition
  | {INDEX|KEY} [index_name] [index_type] (key_part,...)

column_definition:
    data_type [NOT NULL | NULL] [DEFAULT default_value]
      [AUTO_INCREMENT] [UNIQUE [KEY]] [[PRIMARY] KEY]
      [COMMENT 'string']
      [COLLATE collation_name]
      [COLUMN_FORMAT {FIXED|DYNAMIC|DEFAULT}]
      [STORAGE {DISK|MEMORY}]
  | data_type
      [COLLATE collation_name]
      [GENERATED ALWAYS] AS (expr)
      [VIRTUAL | STORED] [NOT NULL | NULL]
      [UNIQUE [KEY]] [[PRIMARY] KEY]
      [COMMENT 'string']

data_type:
    (see Chapter 11, Data Types)

key_part:
    col_name [(length)] [ASC | DESC]

index_type:
    USING {BTREE | HASH}

data_typeの構文

document 11章を参照

BIT[(M)]
TINYINT[(M)] [UNSIGNED] [ZEROFILL]
BOOL
BOOLEAN
SMALLINT[(M)] [UNSIGNED] [ZEROFILL]
MEDIUMINT[(M)] [UNSIGNED] [ZEROFILL]
INT[(M)] [UNSIGNED] [ZEROFILL]
INTEGER[(M)] [UNSIGNED] [ZEROFILL]
BIGINT[(M)] [UNSIGNED] [ZEROFILL]
DECIMAL[(M[,D])] [UNSIGNED] [ZEROFILL]
DEC[(M[,D])] [UNSIGNED] [ZEROFILL]
NUMERIC[(M[,D])] [UNSIGNED] [ZEROFILL]
FIXED[(M[,D])] [UNSIGNED] [ZEROFILL]
FLOAT[(M,D)] [UNSIGNED] [ZEROFILL]
FLOAT(p) [UNSIGNED] [ZEROFILL]
DOUBLE[(M,D)] [UNSIGNED] [ZEROFILL]
DOUBLE PRECISION[(M,D)] [UNSIGNED] [ZEROFILL]
REAL[(M,D)] [UNSIGNED] [ZEROFILL]
DATE
DATETIME[(fsp)]
TIMESTAMP[(fsp)]
TIME[(fsp)]
YEAR[(4)]
[NATIONAL] CHAR[(M)] [CHARACTER SET charset_name] [COLLATE collation_name]
[NATIONAL] VARCHAR(M) [CHARACTER SET charset_name] [COLLATE collation_name]
BINARY[(M)]
VARBINARY(M)
TINYBLOB
TINYTEXT [CHARACTER SET charset_name] [COLLATE collation_name]
BLOB[(M)]
TEXT[(M)] [CHARACTER SET charset_name] [COLLATE collation_name]
MEDIUMBLOB
MEDIUMTEXT [CHARACTER SET charset_name] [COLLATE collation_name]
LONGBLOB
LONGTEXT [CHARACTER SET charset_name] [COLLATE collation_name]
ENUM('value1','value2',...) [CHARACTER SET charset_name] [COLLATE collation_name]
SET('value1','value2',...) [CHARACTER SET charset_name] [COLLATE collation_name]

printf("%d\n", $1) とかで出力が出ないんだが?

https://www.gnu.org/software/bison/manual/bison.html#Multi_002dfunction-Calc

sysbenchでMySQLのベンチマークをしていく

sysbenchでMySQLのパフォーマンス比較をして結果を集計する作業を自動で継続的にやるためのツールを作ってる。

ある程度大枠はできたので、より使いやすい方法を模索中。GUIは面倒だから作りたくない、、、、

まずはMySQL 8.0以上のみを対象にすることにして、ベンチマークごとの環境変数設定を set persistent ... でセットしてやっていく。

メタデータを管理するバックエンドDBのためにglobal変数を出してみた。

docker-composeでテスト環境として立てていた関係でその影響の設定があって(admin_port 33062とか)、完全なdefaultではないけど項目としてはだいたいこんな感じ。

mysql -u root -p -h 127.0.0.1 -P 33062 -e "show global variables;" > tmp.txt

Variable_name    Value
activate_all_roles_on_login OFF
admin_address
admin_port  33062
auto_generate_certs ON
auto_increment_increment    1
auto_increment_offset   1
autocommit  ON
automatic_sp_privileges ON
avoid_temporal_upgrade  OFF
back_log    151
basedir /usr/
big_tables  OFF
bind_address    *
binlog_cache_size   32768
binlog_checksum CRC32
binlog_direct_non_transactional_updates OFF
binlog_encryption   OFF
binlog_error_action ABORT_SERVER
binlog_expire_logs_seconds  2592000
binlog_format   ROW
binlog_group_commit_sync_delay  0
binlog_group_commit_sync_no_delay_count 0
binlog_gtid_simple_recovery ON
binlog_max_flush_queue_time 0
binlog_order_commits    ON
binlog_rotate_encryption_master_key_at_startup  OFF
binlog_row_event_max_size   8192
binlog_row_image    FULL
binlog_row_metadata MINIMAL
binlog_row_value_options
binlog_rows_query_log_events    OFF
binlog_stmt_cache_size  32768
binlog_transaction_dependency_history_size  25000
binlog_transaction_dependency_tracking  COMMIT_ORDER
block_encryption_mode   aes-128-ecb
bulk_insert_buffer_size 8388608
caching_sha2_password_auto_generate_rsa_keys    ON
caching_sha2_password_private_key_path  private_key.pem
caching_sha2_password_public_key_path   public_key.pem
character_set_client    utf8mb4
character_set_connection    utf8mb4
character_set_database  utf8mb4
character_set_filesystem    binary
character_set_results   utf8mb4
character_set_server    utf8mb4
character_set_system    utf8
character_sets_dir  /usr/share/mysql-8.0/charsets/
check_proxy_users   OFF
collation_connection    utf8mb4_0900_ai_ci
collation_database  utf8mb4_0900_ai_ci
collation_server    utf8mb4_0900_ai_ci
completion_type NO_CHAIN
concurrent_insert   AUTO
connect_timeout 10
core_file   OFF
create_admin_listener_thread    OFF
cte_max_recursion_depth 1000
datadir /var/lib/mysql/
default_authentication_plugin   caching_sha2_password
default_collation_for_utf8mb4   utf8mb4_0900_ai_ci
default_password_lifetime   0
default_storage_engine  InnoDB
default_tmp_storage_engine  InnoDB
default_week_format 0
delay_key_write ON
delayed_insert_limit    100
delayed_insert_timeout  300
delayed_queue_size  1000
disabled_storage_engines
disconnect_on_expired_password  ON
div_precision_increment 4
end_markers_in_json OFF
enforce_gtid_consistency    OFF
eq_range_index_dive_limit   200
event_scheduler ON
expire_logs_days    0
explicit_defaults_for_timestamp ON
flush   OFF
flush_time  0
foreign_key_checks  ON
ft_boolean_syntax   + -><()~*:""&|
ft_max_word_len 84
ft_min_word_len 4
ft_query_expansion_limit    20
ft_stopword_file    (built-in)
general_log OFF
general_log_file    /var/lib/mysql/f0f16f848d76.log
group_concat_max_len    1024
group_replication_consistency   EVENTUAL
gtid_executed
gtid_executed_compression_period    1000
gtid_mode   OFF
gtid_owned
gtid_purged
have_compress   YES
have_dynamic_loading    YES
have_geometry   YES
have_openssl    YES
have_profiling  YES
have_query_cache    NO
have_rtree_keys YES
have_ssl    YES
have_statement_timeout  YES
have_symlink    DISABLED
histogram_generation_max_mem_size   20000000
host_cache_size 279
hostname    f0f16f848d76
information_schema_stats_expiry 86400
init_connect
init_file
init_slave
innodb_adaptive_flushing    ON
innodb_adaptive_flushing_lwm    10
innodb_adaptive_hash_index  ON
innodb_adaptive_hash_index_parts    8
innodb_adaptive_max_sleep_delay 150000
innodb_api_bk_commit_interval   5
innodb_api_disable_rowlock  OFF
innodb_api_enable_binlog    OFF
innodb_api_enable_mdl   OFF
innodb_api_trx_level    0
innodb_autoextend_increment 64
innodb_autoinc_lock_mode    2
innodb_buffer_pool_chunk_size   134217728
innodb_buffer_pool_dump_at_shutdown ON
innodb_buffer_pool_dump_now OFF
innodb_buffer_pool_dump_pct 25
innodb_buffer_pool_filename ib_buffer_pool
innodb_buffer_pool_in_core_file ON
innodb_buffer_pool_instances    1
innodb_buffer_pool_load_abort   OFF
innodb_buffer_pool_load_at_startup  ON
innodb_buffer_pool_load_now OFF
innodb_buffer_pool_size 134217728
innodb_change_buffer_max_size   25
innodb_change_buffering all
innodb_checksum_algorithm   crc32
innodb_cmp_per_index_enabled    OFF
innodb_commit_concurrency   0
innodb_compression_failure_threshold_pct    5
innodb_compression_level    6
innodb_compression_pad_pct_max  50
innodb_concurrency_tickets  5000
innodb_data_file_path   ibdata1:12M:autoextend
innodb_data_home_dir
innodb_deadlock_detect  ON
innodb_dedicated_server OFF
innodb_default_row_format   dynamic
innodb_directories
innodb_disable_sort_file_cache  OFF
innodb_doublewrite  ON
innodb_fast_shutdown    1
innodb_file_per_table   ON
innodb_fill_factor  100
innodb_flush_log_at_timeout 1
innodb_flush_log_at_trx_commit  1
innodb_flush_method fsync
innodb_flush_neighbors  0
innodb_flush_sync   ON
innodb_flushing_avg_loops   30
innodb_force_load_corrupted OFF
innodb_force_recovery   0
innodb_fsync_threshold  0
innodb_ft_aux_table
innodb_ft_cache_size    8000000
innodb_ft_enable_diag_print OFF
innodb_ft_enable_stopword   ON
innodb_ft_max_token_size    84
innodb_ft_min_token_size    3
innodb_ft_num_word_optimize 2000
innodb_ft_result_cache_limit    2000000000
innodb_ft_server_stopword_table
innodb_ft_sort_pll_degree   2
innodb_ft_total_cache_size  640000000
innodb_ft_user_stopword_table
innodb_io_capacity  200
innodb_io_capacity_max  2000
innodb_lock_wait_timeout    50
innodb_log_buffer_size  16777216
innodb_log_checksums    ON
innodb_log_compressed_pages ON
innodb_log_file_size    50331648
innodb_log_files_in_group   2
innodb_log_group_home_dir   ./
innodb_log_spin_cpu_abs_lwm 80
innodb_log_spin_cpu_pct_hwm 50
innodb_log_wait_for_flush_spin_hwm  400
innodb_log_write_ahead_size 8192
innodb_lru_scan_depth   1024
innodb_max_dirty_pages_pct  90.000000
innodb_max_dirty_pages_pct_lwm  10.000000
innodb_max_purge_lag    0
innodb_max_purge_lag_delay  0
innodb_max_undo_log_size    1073741824
innodb_monitor_disable
innodb_monitor_enable
innodb_monitor_reset
innodb_monitor_reset_all
innodb_numa_interleave  OFF
innodb_old_blocks_pct   37
innodb_old_blocks_time  1000
innodb_online_alter_log_max_size    134217728
innodb_open_files   4000
innodb_optimize_fulltext_only   OFF
innodb_page_cleaners    1
innodb_page_size    16384
innodb_parallel_read_threads    4
innodb_print_all_deadlocks  OFF
innodb_print_ddl_logs   OFF
innodb_purge_batch_size 300
innodb_purge_rseg_truncate_frequency    128
innodb_purge_threads    4
innodb_random_read_ahead    OFF
innodb_read_ahead_threshold 56
innodb_read_io_threads  4
innodb_read_only    OFF
innodb_redo_log_encrypt OFF
innodb_replication_delay    0
innodb_rollback_on_timeout  OFF
innodb_rollback_segments    128
innodb_sort_buffer_size 1048576
innodb_spin_wait_delay  6
innodb_stats_auto_recalc    ON
innodb_stats_include_delete_marked  OFF
innodb_stats_method nulls_equal
innodb_stats_on_metadata    OFF
innodb_stats_persistent ON
innodb_stats_persistent_sample_pages    20
innodb_stats_transient_sample_pages 8
innodb_status_output    OFF
innodb_status_output_locks  OFF
innodb_strict_mode  ON
innodb_sync_array_size  1
innodb_sync_spin_loops  30
innodb_table_locks  ON
innodb_temp_data_file_path  ibtmp1:12M:autoextend
innodb_temp_tablespaces_dir ./#innodb_temp/
innodb_thread_concurrency   0
innodb_thread_sleep_delay   10000
innodb_tmpdir
innodb_undo_directory   ./
innodb_undo_log_encrypt OFF
innodb_undo_log_truncate    ON
innodb_undo_tablespaces 2
innodb_use_native_aio   ON
innodb_version  8.0.15
innodb_write_io_threads 4
interactive_timeout 28800
internal_tmp_disk_storage_engine    InnoDB
internal_tmp_mem_storage_engine TempTable
join_buffer_size    262144
keep_files_on_create    OFF
key_buffer_size 8388608
key_cache_age_threshold 300
key_cache_block_size    1024
key_cache_division_limit    100
keyring_operations  ON
large_files_support ON
large_page_size 0
large_pages OFF
lc_messages en_US
lc_messages_dir /usr/share/mysql-8.0/
lc_time_names   en_US
license GPL
local_infile    OFF
lock_wait_timeout   31536000
locked_in_memory    OFF
log_bin ON
log_bin_basename    /var/lib/mysql/f0f16f848d76-bin
log_bin_index   /var/lib/mysql/f0f16f848d76-bin.index
log_bin_trust_function_creators OFF
log_bin_use_v1_row_events   OFF
log_error   stderr
log_error_services  log_filter_internal; log_sink_internal
log_error_suppression_list
log_error_verbosity 2
log_output  FILE
log_queries_not_using_indexes   OFF
log_slave_updates   ON
log_slow_admin_statements   OFF
log_slow_extra  OFF
log_slow_slave_statements   OFF
log_statements_unsafe_for_binlog    ON
log_throttle_queries_not_using_indexes  0
log_timestamps  UTC
long_query_time 10.000000
low_priority_updates    OFF
lower_case_file_system  OFF
lower_case_table_names  0
mandatory_roles
master_info_repository  TABLE
master_verify_checksum  OFF
max_allowed_packet  67108864
max_binlog_cache_size   18446744073709547520
max_binlog_size 1073741824
max_binlog_stmt_cache_size  18446744073709547520
max_connect_errors  100
max_connections 151
max_delayed_threads 20
max_digest_length   1024
max_error_count 1024
max_execution_time  0
max_heap_table_size 16777216
max_insert_delayed_threads  20
max_join_size   18446744073709551615
max_length_for_sort_data    4096
max_points_in_geometry  65536
max_prepared_stmt_count 16382
max_relay_log_size  0
max_seeks_for_key   18446744073709551615
max_sort_length 1024
max_sp_recursion_depth  0
max_user_connections    0
max_write_lock_count    18446744073709551615
min_examined_row_limit  0
myisam_data_pointer_size    6
myisam_max_sort_file_size   9223372036853727232
myisam_mmap_size    18446744073709551615
myisam_recover_options  OFF
myisam_repair_threads   1
myisam_sort_buffer_size 8388608
myisam_stats_method nulls_unequal
myisam_use_mmap OFF
mysql_native_password_proxy_users   OFF
mysqlx_bind_address *
mysqlx_connect_timeout  30
mysqlx_document_id_unique_prefix    0
mysqlx_idle_worker_thread_timeout   60
mysqlx_interactive_timeout  28800
mysqlx_max_allowed_packet   67108864
mysqlx_max_connections  100
mysqlx_min_worker_threads   2
mysqlx_port 33060
mysqlx_port_open_timeout    0
mysqlx_read_timeout 30
mysqlx_socket   /var/run/mysqld/mysqlx.sock
mysqlx_ssl_ca
mysqlx_ssl_capath
mysqlx_ssl_cert
mysqlx_ssl_cipher
mysqlx_ssl_crl
mysqlx_ssl_crlpath
mysqlx_ssl_key
mysqlx_wait_timeout 28800
mysqlx_write_timeout    60
net_buffer_length   16384
net_read_timeout    30
net_retry_count 10
net_write_timeout   60
new OFF
ngram_token_size    2
offline_mode    OFF
old OFF
old_alter_table OFF
open_files_limit    1048576
optimizer_prune_level   1
optimizer_search_depth  62
optimizer_switch    index_merge=on,index_merge_union=on,index_merge_sort_union=on,index_merge_intersection=on,engine_condition_pushdown=on,index_condition_pushdown=on,mrr=on,mrr_cost_based=on,block_nested_loop=on,batched_key_access=off,materialization=on,semijoin=on,loosescan=on,firstmatch=on,duplicateweedout=on,subquery_materialization_cost_based=on,use_index_extensions=on,condition_fanout_filter=on,derived_merge=on,use_invisible_indexes=off,skip_scan=on
optimizer_trace enabled=off,one_line=off
optimizer_trace_features    greedy_search=on,range_optimizer=on,dynamic_range=on,repeated_subselect=on
optimizer_trace_limit   1
optimizer_trace_max_mem_size    1048576
optimizer_trace_offset  -1
parser_max_mem_size 18446744073709551615
password_history    0
password_require_current    OFF
password_reuse_interval 0
performance_schema  ON
performance_schema_accounts_size    -1
performance_schema_digests_size 10000
performance_schema_error_size   4367
performance_schema_events_stages_history_long_size  10000
performance_schema_events_stages_history_size   10
performance_schema_events_statements_history_long_size  10000
performance_schema_events_statements_history_size   10
performance_schema_events_transactions_history_long_size    10000
performance_schema_events_transactions_history_size 10
performance_schema_events_waits_history_long_size   10000
performance_schema_events_waits_history_size    10
performance_schema_hosts_size   -1
performance_schema_max_cond_classes 100
performance_schema_max_cond_instances   -1
performance_schema_max_digest_length    1024
performance_schema_max_digest_sample_age    60
performance_schema_max_file_classes 80
performance_schema_max_file_handles 32768
performance_schema_max_file_instances   -1
performance_schema_max_index_stat   -1
performance_schema_max_memory_classes   450
performance_schema_max_metadata_locks   -1
performance_schema_max_mutex_classes    300
performance_schema_max_mutex_instances  -1
performance_schema_max_prepared_statements_instances    -1
performance_schema_max_program_instances    -1
performance_schema_max_rwlock_classes   60
performance_schema_max_rwlock_instances -1
performance_schema_max_socket_classes   10
performance_schema_max_socket_instances -1
performance_schema_max_sql_text_length  1024
performance_schema_max_stage_classes    175
performance_schema_max_statement_classes    218
performance_schema_max_statement_stack  10
performance_schema_max_table_handles    -1
performance_schema_max_table_instances  -1
performance_schema_max_table_lock_stat  -1
performance_schema_max_thread_classes   100
performance_schema_max_thread_instances -1
performance_schema_session_connect_attrs_size   512
performance_schema_setup_actors_size    -1
performance_schema_setup_objects_size   -1
performance_schema_users_size   -1
persist_only_admin_x509_subject
persisted_globals_load  ON
pid_file    /var/run/mysqld/mysqld.pid
plugin_dir  /usr/lib/mysql/plugin/
port    3306
preload_buffer_size 32768
profiling   OFF
profiling_history_size  15
protocol_version    10
query_alloc_block_size  8192
query_prealloc_size 8192
range_alloc_block_size  4096
range_optimizer_max_mem_size    8388608
rbr_exec_mode   STRICT
read_buffer_size    131072
read_only   OFF
read_rnd_buffer_size    262144
regexp_stack_limit  8000000
regexp_time_limit   32
relay_log   f0f16f848d76-relay-bin
relay_log_basename  /var/lib/mysql/f0f16f848d76-relay-bin
relay_log_index /var/lib/mysql/f0f16f848d76-relay-bin.index
relay_log_info_file relay-log.info
relay_log_info_repository   TABLE
relay_log_purge ON
relay_log_recovery  OFF
relay_log_space_limit   0
report_host
report_password
report_port 3306
report_user
require_secure_transport    OFF
rpl_read_size   8192
rpl_stop_slave_timeout  31536000
schema_definition_cache 256
secure_file_priv    NULL
server_id   101
server_id_bits  32
server_uuid ef66b495-39d8-11e9-a16a-0242ac120003
session_track_gtids OFF
session_track_schema    ON
session_track_state_change  OFF
session_track_system_variables  time_zone,autocommit,character_set_client,character_set_results,character_set_connection
session_track_transaction_info  OFF
sha256_password_auto_generate_rsa_keys  ON
sha256_password_private_key_path    private_key.pem
sha256_password_proxy_users OFF
sha256_password_public_key_path public_key.pem
show_create_table_verbosity OFF
show_old_temporals  OFF
skip_external_locking   ON
skip_name_resolve   ON
skip_networking OFF
skip_show_database  OFF
slave_allow_batching    OFF
slave_checkpoint_group  512
slave_checkpoint_period 300
slave_compressed_protocol   OFF
slave_exec_mode STRICT
slave_load_tmpdir   /tmp
slave_max_allowed_packet    1073741824
slave_net_timeout   60
slave_parallel_type DATABASE
slave_parallel_workers  0
slave_pending_jobs_size_max 134217728
slave_preserve_commit_order OFF
slave_rows_search_algorithms    INDEX_SCAN,HASH_SCAN
slave_skip_errors   OFF
slave_sql_verify_checksum   ON
slave_transaction_retries   10
slave_type_conversions
slow_launch_time    2
slow_query_log  OFF
slow_query_log_file /var/lib/mysql/f0f16f848d76-slow.log
socket  /var/run/mysqld/mysqld.sock
sort_buffer_size    262144
sql_auto_is_null    OFF
sql_big_selects ON
sql_buffer_result   OFF
sql_log_off OFF
sql_mode    ONLY_FULL_GROUP_BY,STRICT_TRANS_TABLES,NO_ZERO_IN_DATE,NO_ZERO_DATE,ERROR_FOR_DIVISION_BY_ZERO,NO_ENGINE_SUBSTITUTION
sql_notes   ON
sql_quote_show_create   ON
sql_require_primary_key OFF
sql_safe_updates    OFF
sql_select_limit    18446744073709551615
sql_slave_skip_counter  0
sql_warnings    OFF
ssl_ca  ca.pem
ssl_capath
ssl_cert    server-cert.pem
ssl_cipher
ssl_crl
ssl_crlpath
ssl_fips_mode   OFF
ssl_key server-key.pem
stored_program_cache    256
stored_program_definition_cache 256
super_read_only OFF
sync_binlog 1
sync_master_info    10000
sync_relay_log  10000
sync_relay_log_info 10000
system_time_zone    JST
table_definition_cache  2000
table_open_cache    4000
table_open_cache_instances  16
tablespace_definition_cache 256
temptable_max_ram   1073741824
thread_cache_size   9
thread_handling one-thread-per-connection
thread_stack    286720
time_zone   SYSTEM
tls_version TLSv1,TLSv1.1,TLSv1.2
tmp_table_size  16777216
tmpdir  /tmp
transaction_alloc_block_size    8192
transaction_isolation   REPEATABLE-READ
transaction_prealloc_size   4096
transaction_read_only   OFF
transaction_write_set_extraction    XXHASH64
unique_checks   ON
updatable_views_with_limit  YES
version 8.0.15
version_comment MySQL Community Server - GPL
version_compile_machine x86_64
version_compile_os  Linux
version_compile_zlib    1.2.11
wait_timeout    28800
windowing_use_high_precision    ON

結構多いし、newとかあってなんじゃこれって感じ ver4.0時代に4.1の機能をONにする変数らしい

https://dev.mysql.com/doc/refman/8.0/en/server-system-variables.html#sysvar_new

古すぎてウケる

コンパイラ入門実践 その1

最近調子が良いけど、やはり定期的なアウトプットをしたほうが良さそう。 ということで現状を書いておく。コンパイラ学習の調子は良くないw

現状

Computer Science Library9のコンパイラ入門を読みつつbison, flexで簡単な言語のコンパイラを作ってみている。

この本のSL1を解析してCSTを作る途中。

yacc, lexの写経、include周りを修正したのが↓

Fix include settings · tom--bo/compiler-study@1a15a28 · GitHub

コンパイル方法

bison -d mycc/parser.y
flex --header-file=lex.yy.h mycc/lexer.l
# bison, flex結果をcfiles dirに移動
gcc -I cfiles cfiles/*.c -o ./parser

todo

  • 宣言部がどう変換されるのか

parserでlexのheaderをincludeするのに宣言部で

%{
  #include "lex.yy.h"  
%}

するとちゃんとヘッダーが読めたがこのあたりがどうファイルに入っているか確認しないといけない。

  • LALRの構文規則を左再帰で書くか、右再帰で書くかの違い

SL1の拡張SL2案

  • PrintStmtsをプログラム中のどこでもかける蝶にする
  • 型推論を試す。変数宣言にvarとか入れる

MySQLでcustom.cnfが指定できない

docker上のmysql環境が作れない、、、

customなmy.cnfを読ませたくて、

/path/to/test/1/custom.cnf を用意して、それを/etc/mysql/conf.dにmountさせようとするも失敗、、、

docker run --name mysql80 -e MYSQL_ROOT_PASSWORD=root -p 3306:3306  -d mysql:8.0 -v /path/to/test/1:/etc/mysql/conf.d --bind-address=0.0.0.0             
19b8fdfc985f3899ef750ade06f55852498f07f47c116e75d1047ca583f37baa

起動こそするけど、接続しようとすると死ぬ。

docker run -it --link mysql80:mysql --rm mysql sh -c 'exec mysql -h"$MYSQL_PORT_3306_TCP_ADDR" -P"$MYSQL_PORT_3306_TCP_PORT" -uroot -p"$MYSQL_ENV_MYSQL_ROOT_PASSWORD"'          
mysql: [Warning] Using a password on the command line interface can be insecure.
ERROR 2003 (HY000): Can't connect to MySQL server on '172.17.0.2' (111)

github.com

公式どおり指定しているけど、古いのかな? 雰囲気でdocker使っていて、落ちた理由がなんなのかすらわからん、、、

MySQLをgdbでステップ実行できる環境がほしい

実行中のものにattachすることで関数くらいはわかるものの、その時の状態をcuiだけで確認するのは結構厳しい。

dockerでdebug-buildしたものにgdbserver経由でステップ実行する環境を作れそうな気がするので奮闘している。

github.com

これそのままでは動かないと思うので、手を入れつつどうにかしたい。 とりあえずdebug-buildところまで来たけど、gdbserverの仕組みがよくわかっていない