Refer to config details for explanations of common fields.
[extractor]
extract_type=struct
db_type=mysql
url=mysql://root:123456@127.0.0.1:3307?ssl-mode=disabled
[sinker]
url=mysql://root:@127.0.0.1:9030
sink_type=struct
db_type=starrocks
conflict_policy=interrupt
[filter]
do_dbs=test_db
ignore_dbs=
do_tbs=
ignore_tbs=
do_events=
do_structures=
[router]
db_map=
tb_map=
col_map=
[runtime]
log_level=info
log4rs_file=./log4rs.yaml
log_dir=./logs
[parallelizer]
parallel_type=serial
[pipeline]
checkpoint_interval_secs=10
buffer_size=100
[extractor]
db_type=mysql
extract_type=snapshot
url=mysql://root:123456@127.0.0.1:3307?ssl-mode=disabled
batch_size=10000
[sinker]
db_type=starrocks
sink_type=write
url=mysql://root:123456@127.0.0.1:9030
stream_load_url=mysql://root:123456@127.0.0.1:8040
batch_size=5000
[filter]
do_dbs=
ignore_dbs=
do_tbs=test_db.*
ignore_tbs=
do_events=insert
[router]
db_map=
tb_map=
col_map=
[parallelizer]
parallel_type=snapshot
parallel_size=8
[pipeline]
buffer_size=100000
buffer_memory_mb=200
checkpoint_interval_secs=10
batch_sink_interval_secs=0
[runtime]
log_level=info
log4rs_file=./log4rs.yaml
log_dir=./logs
- [sinker]
Config | Description | Example | Default |
---|---|---|---|
url | the url of StarRocks FE, used for metadata query | - | - |
stream_load_url | the url for Stream Load | - | - |
batch_size | the max record count in one Stream Load | - | - |
[extractor]
db_type=mysql
extract_type=cdc
binlog_position=5299302
binlog_filename=mysql-bin.000035
server_id=2000
url=mysql://root:123456@127.0.0.1:3307?ssl-mode=disabled
[filter]
ignore_dbs=
do_dbs=
do_tbs=test_db.*
ignore_tbs=
do_events=insert,update,delete
[sinker]
db_type=starrocks
sink_type=write
url=mysql://root:123456@127.0.0.1:9030
stream_load_url=mysql://root:123456@127.0.0.1:8040
batch_size=5000
[router]
tb_map=
col_map=
db_map=
[parallelizer]
parallel_type=table
parallel_size=8
[pipeline]
buffer_size=100000
buffer_memory_mb=200
checkpoint_interval_secs=10
batch_sink_interval_secs=15
[runtime]
log_dir=./logs
log_level=info
log4rs_file=./log4rs.yaml
- [pipeline]
Config | Description | Example | Default |
---|---|---|---|
batch_sink_interval_secs | when importing data into StarRocks by Stream Load, avoid frequent small-batch imports, as this may cause throttle errors in StarRocks. If the batch_sink_interval_secs is set, Stream Load will be triggered when either of the following conditions is met: 1) the pipeline's buffer is full, or 2) it has been more than batch_sink_interval_secs seconds since the last Stream Load. | 15 | 0 |
Refer to tutorial for the differences between hard delete and soft delete.
The differences with soft delete:
[parallelizer]
parallel_type=rdb_merge
[sinker]
hard_delete=true