[NSE-1170] Setting correct row number in batch scan w/ partition columns #1172

zhouyuan · 2022-11-24T05:58:12Z

What changes were proposed in this pull request?

This patch fixes the row number in batch scan w/ partition columns

Signed-off-by: Yuan Zhou yuan.zhou@intel.com

How was this patch tested?

pass jenkins

This patch fixes the row number in batch scan w/ partition columns Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>

github-actions · 2022-11-24T05:58:25Z

#1170

PHILO-HE · 2022-11-24T11:45:39Z

Great work! The patch is workable on my side.

…ap-project#1172)

* [NSE-1170] Set correct row number in batch scan w/ partition columns (#1172) * [NSE-1171] Throw RuntimeException when reading duplicate fields in case-insensitive mode (#1173) * throw exception if one more columns matched in case insensitive mode * add schema check in arrow v2 * bump h2/pgsql version (#1176) * bump h2/pgsql version Signed-off-by: Yuan Zhou <yuan.zhou@intel.com> * ignore one failed test Signed-off-by: Yuan Zhou <yuan.zhou@intel.com> Signed-off-by: Yuan Zhou <yuan.zhou@intel.com> * [NSE-956] allow to write parquet with compression (#1014) This patch adds support for writing parquet with compression df.coalesce(1).write.format("arrow").option("parquet.compression","zstd").save(path) Signed-off-by: Yuan Zhou yuan.zhou@intel.com * [NSE-1161] Support read-write parquet conversion to read-write arrow (#1162) * add ArrowConvertExtension * do not convert parquet fileformat while writing to partitioned/bucketed/sorted output * fix cache failed * care about write codec * disable convertor extension by default * add some comments * remove wrong compress type check (#1178) Since the compresssion has been supported in #1014 . The extra compression check in ArrowConvertorExtension can be remove now. * fix to use right arrow branch (#1179) fix to use right arrow branch Signed-off-by: Yuan Zhou <yuan.zhou@intel.com> * [NSE-1171] Support merge parquet schema and read missing schema (#1175) * Support merge parquet schema and read missing schema * fix error * optimize null vectors * optimize code * optimize code * change code * add schema merge suite tests * add test for struct type * to use 1.5 branch arrow Signed-off-by: Yuan Zhou <yuan.zhou@intel.com> Signed-off-by: Yuan Zhou <yuan.zhou@intel.com> Signed-off-by: Yuan Zhou yuan.zhou@intel.com Co-authored-by: Jacky Lee <lijunqing@baidu.com>

Setting correct row number in batch scan w/ partition columns

1fce7d8

This patch fixes the row number in batch scan w/ partition columns Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>

PHILO-HE merged commit 21af97d into oap-project:main Nov 25, 2022

zhouyuan added a commit to zhouyuan/native-sql-engine that referenced this pull request Dec 14, 2022

[NSE-1170] Set correct row number in batch scan w/ partition columns (o…

3eb5477

…ap-project#1172)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NSE-1170] Setting correct row number in batch scan w/ partition columns #1172

[NSE-1170] Setting correct row number in batch scan w/ partition columns #1172

zhouyuan commented Nov 24, 2022

github-actions bot commented Nov 24, 2022

PHILO-HE commented Nov 24, 2022

[NSE-1170] Setting correct row number in batch scan w/ partition columns #1172

[NSE-1170] Setting correct row number in batch scan w/ partition columns #1172

Conversation

zhouyuan commented Nov 24, 2022

What changes were proposed in this pull request?

How was this patch tested?

github-actions bot commented Nov 24, 2022

PHILO-HE commented Nov 24, 2022