Refactor tuple forming function for bulk decompression of text columns #6448

akuzm · 2023-12-20T12:04:45Z

This commit makes some mechanical changes to the tuple forming function make_next_tuple() to prepare it for the subsequent introduction of bulk decompression for text columns. It also simplifies the data layout of the compressed columns so that it contains fewer indirections. This matters because tuple forming is a hot function that we try to keep simple.

Disable-check: force-changelog-file


github-actions · 2023-12-20T12:05:04Z

@konskov, @gayyappan: please review this pull request.

Powered by pull-review

akuzm · 2023-12-20T12:05:49Z

tsl/src/nodes/decompress_chunk/compressed_batch.h

+/* How to obtain the decompressed datum for individual row. */
+typedef enum
 {
-	/* For row-by-row decompression. */
-	DecompressionIterator *iterator;
-
+	DT_Default = -2,
+	DT_Iterator = -1,
+	DT_Invalid = 0,
 	/*
-	 * For bulk decompression and vectorized filters, mutually exclusive
-	 * with the above.
+	 * Any positive number is also valid for the decompression type. It means
+	 * arrow array of a fixed-size by-value type, with size given by the number.
 	 */
-	ArrowArray *arrow;
+} DecompressionType;


This enum will later get two new options for arrow arrays of text columns: with dictionary encoding and without.
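To illustrate how a single type field can drive tuple forming, here is a minimal sketch. The enum values are copied from the diff above; everything else (the `SketchColumn` struct, its fields, and `sketch_get_value()`) is hypothetical and is not the actual code in compressed_batch.c. It only shows the idea that a positive type value doubles as the byte width of a flat values buffer, so the hot path needs one switch and no extra pointer chasing.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Values copied from the diff above. */
typedef enum
{
	DT_Default = -2,
	DT_Iterator = -1,
	DT_Invalid = 0,
	/* Any positive number: arrow array of a fixed-size by-value type of that width. */
} DecompressionType;

/* Hypothetical, simplified per-column state (an assumption, not the real struct). */
typedef struct
{
	int decompression_type; /* a DecompressionType value or a positive byte width */
	const void *values;     /* flat values buffer, used for positive types */
	int64_t default_value;  /* used for DT_Default */
} SketchColumn;

/* Return the value of the given row, widened to 64 bits. */
static int64_t
sketch_get_value(const SketchColumn *col, int row)
{
	const char *base = (const char *) col->values;

	switch (col->decompression_type)
	{
		case DT_Default:
			/* The same value for every row of the batch. */
			return col->default_value;
		case 2:
		{
			int16_t v;
			memcpy(&v, base + 2 * (size_t) row, sizeof(v));
			return v;
		}
		case 4:
		{
			int32_t v;
			memcpy(&v, base + 4 * (size_t) row, sizeof(v));
			return v;
		}
		case 8:
		{
			int64_t v;
			memcpy(&v, base + 8 * (size_t) row, sizeof(v));
			return v;
		}
		default:
			/* DT_Iterator and DT_Invalid are not handled in this sketch. */
			assert(0 && "unsupported decompression type");
			return 0;
	}
}
```

With a column of type 4 pointing at an `int32_t` buffer, `sketch_get_value(&col, 1)` reads the second element; with `DT_Default`, the row index is ignored.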

akuzm · 2023-12-20T12:07:31Z

tsl/src/nodes/decompress_chunk/compressed_batch.c

+		/* No variable-width columns support bulk decompression. */
+		Assert(false);


The implementation for text arrow arrays will go here.
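For context on what a variable-width branch has to do, here is a rough sketch of reading one value from a non-dictionary text array. It assumes the standard Arrow variable-size binary layout (an offsets buffer with n + 1 `int32` entries plus a contiguous data buffer). The `SketchTextArray` struct and `sketch_text_at()` are hypothetical names for illustration, and whether the follow-up change uses exactly this layout is an assumption.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical view of an Arrow-style text array (not a real TimescaleDB struct). */
typedef struct
{
	const int32_t *offsets; /* n + 1 entries; row i spans offsets[i] .. offsets[i + 1] */
	const uint8_t *data;    /* concatenated bytes of all values, no terminators */
} SketchTextArray;

/* Point *start at the bytes of the given row and return their length. */
static size_t
sketch_text_at(const SketchTextArray *arr, int row, const uint8_t **start)
{
	const int32_t begin = arr->offsets[row];
	const int32_t end = arr->offsets[row + 1];

	/* Offsets never decrease in a well-formed array. */
	assert(end >= begin);

	*start = arr->data + begin;
	return (size_t) (end - begin);
}
```

Unlike the fixed-size case, the value width is not known from the type alone, which is why a positive "size" value cannot describe text columns and separate options are needed.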

codecov · 2023-12-20T12:12:39Z

Codecov Report

Attention: 5 lines in your changes are missing coverage. Please review.

Comparison is base (4f2f658) 87.33% compared to head (ec41710) 87.31%.

Files	Patch %	Lines
tsl/src/nodes/decompress_chunk/compressed_batch.c	92.75%	3 Missing and 2 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #6448      +/-   ##
==========================================
- Coverage   87.33%   87.31%   -0.03%     
==========================================
  Files         187      187              
  Lines       41820    41775      -45     
  Branches     9313     9289      -24     
==========================================
- Hits        36525    36474      -51     
- Misses       3623     3627       +4     
- Partials     1672     1674       +2


github-actions bot assigned akuzm Dec 20, 2023

github-actions bot requested review from gayyappan and konskov December 20, 2023 12:05


akuzm mentioned this pull request Dec 20, 2023

Vectorize text equality and LIKE #6189

Merged

7 tasks

svenklemm approved these changes Dec 20, 2023


antekresic approved these changes Dec 21, 2023


akuzm merged commit ed59c05 into timescale:main Dec 21, 2023
47 of 48 checks passed

akuzm deleted the tuple-forming branch December 21, 2023 10:33
