Shard based recovery #799

Rachelint · 2023-03-31T04:57:42Z

Describe This Problem

Now table recovery in on table level but wal's storing in on shard level.
The recovery performance may be unsatisfied especially in kafka based wal.

Proposal

Split actual table recovery from schema, and refactor table engine
Impl shard based table meta recovery
Impl shard based table data recovery

1. Split actual table recovery from schema, and refactor table engine

We should begin at modifying the high level interface(Schema and TableEngine) for adapting to the new recovery process.

Now the path about Schema and TableEngine when opening tables on shard is like:

For modify interfaces above to open whole tables on shard together rather than respectively, the most troublesome place is:

Tables on the same shard may belong to different schema, so we are unable to add the api like open_tables_on_shard to Schema.

My solution about this is:

Split actual table recovery from schema, we just call the TableEngine directly, and just register the opened tables to Schema feat: introduce TableOperator to encasulate operation of tables #808

In this stage, we still keep the origin interface of TableEngine, the path may be like:

Refactor the TableEngine interface to support shard level opening
tbc...
Furthermore, split Schema and TableEngine completely?

Just keep register_table and unregister_table in Schema
Remove all table operation(create,drop,open,close), and call them directly in TableEngine

2. Impl shard based table meta recovery

Refactor manifest module
- Rename the original TableData to TableContext, and extract members which will be recovered from manifest as the new TableData.
- Register the TableData into manifest when create table and open table, and unregister it when drop table and close table.
- When do snapshot in Manifest, we just make use of the hold TableData rather than scanning the persist wals.
Place TableDatas into Manifest, and we update the memory and storage in just one place.
Shard based manifest recovery

3. Impl shard based table data recovery

Region based wal replay feat: support region based wal replay #976
Make wal replay more concurrently

4. other

Add integration test about recovery chore: add integration test about recovery #996
[WIP] retry when open shard failed
Support limited retry in kafka client feat: use instead forked rskafka to support limited retry #1005
Fix logs deleting in wal on kafka

Additional Context

No response

The text was updated successfully, but these errors were encountered:

ShiKaiWi · 2023-04-03T02:58:43Z

@Rachelint In the #800, the separate runtime for recovery has been mentioned, and I guess it should be taken into considerations together with this.

## Which issue does this PR close? Closes # Part of #799 ## Rationale for this change Now, we update `TableData` and store its wal seperately. The order of two operations above is maintained by developer, that will lead to a big bug if developer is not so careful when modifying related codes.  ## What changes are included in this PR? + Place table datas into manifest. + Update it both in memory and storage in the same call.  ## Are there any user-facing changes? None.  ## How does this change test Test by ut.

## Which issue does this PR close? Closes # Part of apache#799 ## Rationale for this change Now, we update `TableData` and store its wal seperately. The order of two operations above is maintained by developer, that will lead to a big bug if developer is not so careful when modifying related codes.  ## What changes are included in this PR? + Place table datas into manifest. + Update it both in memory and storage in the same call.  ## Are there any user-facing changes? None.  ## How does this change test Test by ut.

## Which issue does this PR close? Closes # Part of #799 ## Rationale for this change + Add `open_shard` and `close_shard` methods into `TableEngine`. + Impl the methods above on demand. ## What changes are included in this PR? See above. ## Are there any user-facing changes? None. ## How does this change test Test by ut.

## Rationale Part of #799 ## Detailed Changes - Define `WalReplayer` to carry out replay work. - Support both `TableBased`(original) and `RegionBased` replay mode in `WalReplayer`. - Expose related configs. ## Test Plan - Modify exist unit tests to cover the `RegionBased` wal replay. - Refactor the integration test to cover recovery logic(TODO).

## Rationale Part of #799 Now we run the test about recovery manually that is so tired, this pr add this into integration tests which will be run automatically in ci. ## Detailed Changes + Add integration test about recovery. + Add above test to ci. ## Test Plan None.

## Rationale Part of #799 We use `rskafka` as our kafka client. However I found it will retry without limit even though kafka service is unavailable... (see [https://github.com/influxdata/rskafka/issues/65](https://github.com/influxdata/rskafka/issues/65)) Worse, I found `rskafka` is almostis no longer maintained... For quick fix, I forked it to support limited retry. ## Detailed Changes + Use the instead forked `rskafka`(supporting limited retry). + Add more logs in recovery path for better debugging. ## Test Plan Test manually.

## Rationale Part of #799 ## Detailed Changes see title. ## Test Plan None.

## Rationale Part of apache#799 ## Detailed Changes - Define `WalReplayer` to carry out replay work. - Support both `TableBased`(original) and `RegionBased` replay mode in `WalReplayer`. - Expose related configs. ## Test Plan - Modify exist unit tests to cover the `RegionBased` wal replay. - Refactor the integration test to cover recovery logic(TODO).

## Rationale Part of apache#799 Now we run the test about recovery manually that is so tired, this pr add this into integration tests which will be run automatically in ci. ## Detailed Changes + Add integration test about recovery. + Add above test to ci. ## Test Plan None.

) ## Rationale Part of apache#799 We use `rskafka` as our kafka client. However I found it will retry without limit even though kafka service is unavailable... (see [https://github.com/influxdata/rskafka/issues/65](https://github.com/influxdata/rskafka/issues/65)) Worse, I found `rskafka` is almostis no longer maintained... For quick fix, I forked it to support limited retry. ## Detailed Changes + Use the instead forked `rskafka`(supporting limited retry). + Add more logs in recovery path for better debugging. ## Test Plan Test manually.

## Rationale Part of apache#799 ## Detailed Changes see title. ## Test Plan None.

jiacai2050 · 2024-03-12T09:00:04Z

Most tasks are finished, so closing.

Rachelint added the feature New feature or request label Mar 31, 2023

ShiKaiWi mentioned this issue Mar 31, 2023

Tracking Issue: accelerate recover speed #800

Closed

2 tasks

Rachelint added help wanted Extra attention is needed and removed help wanted Extra attention is needed labels Mar 31, 2023

Rachelint mentioned this issue Apr 11, 2023

feat: refactor manifest to get snapshot in memory #825

Merged

Rachelint mentioned this issue May 4, 2023

feat: place table datas into manifest, update them together #863

Merged

Rachelint mentioned this issue May 16, 2023

feat: add shard related methods to table engine #897

Merged

Rachelint mentioned this issue Jun 7, 2023

feat: support region based wal replay #976

Merged

Rachelint mentioned this issue Jun 15, 2023

chore: add integration test about recovery #996

Merged

Rachelint mentioned this issue Jun 20, 2023

feat: use instead forked rskafka to support limited retry #1005

Merged

Rachelint mentioned this issue Jun 20, 2023

chore: add logs and metric to recovery #1007

Merged

Rachelint added a commit that referenced this issue Jun 20, 2023

chore: add logs and metric to recovery (#1007)

9a9c0f7

## Rationale Part of #799 ## Detailed Changes see title. ## Test Plan None.

dust1 pushed a commit to dust1/ceresdb that referenced this issue Aug 9, 2023

chore: add logs and metric to recovery (apache#1007)

0f4a3c5

## Rationale Part of apache#799 ## Detailed Changes see title. ## Test Plan None.

jiacai2050 closed this as completed Mar 12, 2024

jiacai2050 mentioned this issue Mar 12, 2024

Tracking issue for supporting shard level replay #412

Closed

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shard based recovery #799

Shard based recovery #799

Rachelint commented Mar 31, 2023 •

edited

Loading

ShiKaiWi commented Apr 3, 2023

jiacai2050 commented Mar 12, 2024

Shard based recovery #799

Shard based recovery #799

Comments

Rachelint commented Mar 31, 2023 • edited Loading

Describe This Problem

Proposal

1. Split actual table recovery from schema, and refactor table engine

2. Impl shard based table meta recovery

3. Impl shard based table data recovery

4. other

Additional Context

ShiKaiWi commented Apr 3, 2023

jiacai2050 commented Mar 12, 2024

Rachelint commented Mar 31, 2023 •

edited

Loading