Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: [FLASH-663] Pure memory block buffer for fracture write #320

Closed
wants to merge 65 commits into from
Closed

WIP: [FLASH-663] Pure memory block buffer for fracture write #320

wants to merge 65 commits into from

Conversation

leiysky
Copy link
Contributor

@leiysky leiysky commented Nov 13, 2019

No description provided.

flowbehappy and others added 30 commits November 4, 2019 18:12
disable unittest of zookeeper
use gtest for testing
add code coverage report
add test cases for DeltaMerge::(StorageDeltaMerge / DeltaMergeStore / Segment / DiskValueSpace / Chunk)
* add test cases for PageStorage

* split PageStorage gc stage into small helper functions

* add test cases for PageStorage gc concurrency

* add stress test and dump utils of PageStorage

* Fix bug:
1. ensure PageFile with meta && data, in case gc drop file but be killed before drop dir
2. turn PageStorage::Config::sync_on_write = true by default
3. avoid PageStorage::gc run by multi-threads
4. print PageFile's path if checksum is not correct

* throw exception for must_exist==false and errno != ENOENT
…97)

* Page storage bug fix (#87)

* add test cases for PageStorage

* split PageStorage gc stage into small helper functions

* add test cases for PageStorage gc concurrency

* add stress test and dump utils of PageStorage

* Fix bug:
1. ensure PageFile with meta && data, in case gc drop file but be killed before drop dir
2. turn PageStorage::Config::sync_on_write = true by default
3. avoid PageStorage::gc run by multi-threads
4. print PageFile's path if checksum is not correct

* throw exception for must_exist==false and errno != ENOENT

* add RefPage && ref counting

* 1. move PageCacheMap into isolated file
2. rename PageCacheMap -> PageEntryMap

* accept non-exist Page reference/del while being used by WriteBatch or Gc

* adjust gc on RefPages

* add more test on gc RefPages

* add memory usage comparing between vec/hashmap/treemap

* fix tests code format

* Bug fix: Add RefPage to non exist Page

* Add unit test for PageStorage

* fix bug

* stress test for benchmark

* fix bug of PageEntryMap updating RefPage

* Keep RefPage entry ref-count if it has been gc and move to another place

* MVCC on PageMap, All previse tests passed. TODO: refactor on read_mutex on PageStorage

* PageStorage gc: only delete file that are not used by any version

* fix bug under multi-threads;
remove unused code && format codes

* 1. support snapshot read
2. standalone VersionSet/MultiVersionCountable template

* 1. Apply merge delta if no read ref
2. Apply generate new delta if has read ref
3. Delta merged when snapshot released
4. Add simple restore test
5. Use template and constexpr instead of copying codes in PageEntryMap

* add test for PageStorage snapshot read

* use typed test

* Bug fix: Add RefPage to non exist Page

* fix bug

* Delta merged when view is finished

* 1. VersionDeltaSet: add snapshots for snapshot linked-list, so that gc can collect valid PageFiles
2. fix some bugs

* do compact on delta -> base

* rebase after do compact on delta

* fix bugs

* avoid duplicted codes

* 1. new find API
2. all tests passed

* 1. remove legacy API
2. remove duplicated codes

* reformat

* apply inplace if there are no readers

* fix bug of applying edits

* fix bug of gc apply

* fix bug of merging PageEntryMap

* fix bug of invalid ref page

* fix bug:
1. listAllLiveFiles
2. double compaction on same version

* tmp

* fixbug

* split PageEntryMapView to single cpp file

* minor fix

* 1. Avoid visit same version multiple time in PageEntryMapDeltaVersionSet::listAllLiveFiles
2. use template to reduce duplicated code for gcApply

* todo mark

* fix bug: RefPage -> RefPage record may become invalid record when PageStorage GC

* remove unused function

* simple fix on unittest

* fix bug: PageEntryMapView::isRefId

* fix bug: PageEntryMapDeltaBuilder::applyPut

* fix

* fix bug: PageEntryView::validNormalPageIds filter out tombstone of PageEntry

* iterator over PageEntryMap

* remove unused tests

* Refactor on PageEntryMapDeltaVersionSet

* Reduce lock range when PageStorage Snapshot is release

* Add metrics for PageStorage MVCC

* rename some classes

* reformat.

* use exception instead

* adress comment

* adress comment

* fix compiler error under gcc

* fix compiler error under gcc
* Segment support range scan

* Bug fix: ColumnVector insertRangeFrom out of range

* Only read tag column when necessary

* print more info on sidLowerBound exception

* Refactor DeltaMergeBlockInputStream

* Optimize DMVersionFilterBlockInputStream, try unroll loops.

* Support multi range read

* bug fix

* Optimize synchronization mechanism between read/write thread, decrease latency.

* Fix testes

* Address comments

* fix typo
* ci run unittests defined using gtest

* fix bug in CMakeLists

* 1. disable zookeeper test cases
2. use template struct SchemaBuilder instead of template class
3. more portable run-gtest.sh

* use EXCLUDE_FROM_ALL in add_subdirectory instead of add_executable

* new container of tics just for gtest

* disable tics-gtest in unused command

* address comment

* address comment

* add continue_on_error option in run-gtest.sh

* ci run unittests defined using gtest

* fix bug in CMakeLists

* 1. disable zookeeper test cases
2. use template struct SchemaBuilder instead of template class
3. more portable run-gtest.sh

* use EXCLUDE_FROM_ALL in add_subdirectory instead of add_executable

* new container of tics just for gtest

* disable tics-gtest in unused command

* address comment

* address comment

* add continue_on_error option in run-gtest.sh
* Atomic read/write on DeltaMergeStore level

* Bug fix

* bugfix 2

* bug fix

* fix compile error

* remove useless code

* address comments

* fix compile error
* cast DataType while reading from PageStorage

* add isLossyCast function

* alter for StroageDeltaMerge, WIP

* add Alter for StorageDeltaMerge

* add Alter for StorageDeltaMerge

* add TableInfo in StroageDeltaMerge

* rename table for StorageDeltaMerge

* fix bug: the TableInfo from TiDB

* add comments for DeltaMerge flush && cache

* more faster(?) way to cast mismatch datatype

* support cast for numeric type null/not null

* support for other data type just change null / not null

* isLossyCast -> isSupportedDataTypeCast

* isSupportedDataTypeCast add decimal detect

* rename function

* fix compile errors in gtests

* small fix

* fix broken tests

* support new column with non-zero default value

* remove unused code

* fix compile error in CI

* fix bug in table rename

* small fix

* minor fix

* refine cast function in chunk

* update DeltaMergeStore's segments within lock

* [WIP]Add test cases for default value ddl. Still has bugs

* fix bugs after rebasing to latest master

* add some TODO marks

* fix compile error in gtests && remove unused comments

* fix broken gtests

* flush cached chunks in delta instead of doing DeltaMerge when ddl-changes apply

* use TypeIndex instead of typeid_cast

* clean up data after tests

* address comment
JaySon-Huang and others added 10 commits November 4, 2019 18:23
* Throw exception with region status; Use reverseGetColumnInfo

* Use IManageableStorage instead of StorageMergeTree

* Fix args of mock_tidb_table && ensure engine is DeltaMerge

* For learner read from TiDB/TiSpark, we set num_streams by mvcc_query_info.concurrent

* fix compile error

* address comment; add log for level trace
* refactor default_value

* Apply suggestions from code review

* enable alter default value test

* add default_value to make sure read can fill value with it

* fix issue with reading Float default value

* add test for read Float default value

* resolve suggestions

* add abs

* fix issue with DateTime default value

* add test for DateTime default value

* modify test case

* handle Decimal64

* modify expect values

* format

* fix issue with Decimal default_value

* modify test case

* resolve suggestions

* fix bug: column in StorageDeltaMerge but not in TiDB::TableInfo::columns
* do handle range filter in

* add status support for DM

* compact continue insert

* bug fix, and temporary split big insert block

* enable split logical

* add store restore log

* fix lock in PageEntriesVersionSetWithDelta::listAllLiveFiles

* disable posix_fadvise in PageStorage

* Update default DM settings

* optimize DMVersionFilterBlockInputStream

* optimize DMVersionFilterBlockInputStream v2

* added some metrics

* add dm_insert_max_rows setting

* Fix compilation error of TiDB.h

* address comment

* fix broken unit tests

* fix unit tests of DeltaTree

* Bug fix: DiskValueSpace::createAppendTask remove data pages by mistake

* remove debug code
* Add test case for migrate ref-page to ref-page

* check if ref-page is valid

* only remove PageFile's data

* Add MOVE_NORMAL_PAGE for only move normal page

* enhance utils_get_valid_pages
…315)

* Add concurrency for wait index of regions while dong learner read

* Add some status about StoragePool
* Remove obsoleted data in DeltaMerge when region is removed

* Add test case for region removed

* Update GC safe point from PD to remove obsolated data

* applySnapshot: TMT don't required to lock data

* only update gc safe point in background merge delta

* Add dm_safe_point_update_interval_seconds

* fix bug: acquire lock && ensure not null pointer

* fix uninitialized variables

* Add cache in PDClientHelper for gc safe point
…316)

* add configuration to disable background flush

* Add log to trace disable_bg_flush

* Add log for debug

* enable background KVStore persist

* judge whether if a region is dirty with result

* add time count for snapshot

* enable background flush task in tests

* Apply suggestions from code review

Co-Authored-By: JaySon <jayson.hjs@gmail.com>

* add disable_bg_flush to example configs
* Add callback for gc

* Add helper for list capacity of PageFile

* Add external page scanner for get external pages when gc begin

* fix compile error of tests
@leiysky leiysky requested a review from JaySon-Huang November 13, 2019 05:57
* Check region version while doing learner read
* remove unused code
* use global_context for getting PDClient in doing DM bg tasks
* Ensure external_pages_remover will be called each time PageStorage gc run
* If callback of external_pages is empty, return empty live_normal_pages
@leiysky
Copy link
Contributor Author

leiysky commented Nov 21, 2019

/run-integration-tests

@leiysky
Copy link
Contributor Author

leiysky commented Nov 21, 2019

/rebuild

@leiysky
Copy link
Contributor Author

leiysky commented Nov 21, 2019

/run-integration-tests

1 similar comment
@leiysky
Copy link
Contributor Author

leiysky commented Nov 22, 2019

/run-integration-tests

@JaySon-Huang
Copy link
Contributor

Closed. Will be done in FLASH-854.

JaySon-Huang pushed a commit to JaySon-Huang/tiflash that referenced this pull request Dec 3, 2024
Signed-off-by: Calvin Neo <calvinneo1995@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants