-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[C++][Parquet] Supports write BinaryView/StringView to Parquet file #43244
Comments
Besides, can we separate it into two tasks:
I think this making finishing the task more smothly |
I'm currently trying on part of this feature. arrow/cpp/src/parquet/encoding.cc Lines 251 to 255 in de17643
However, this trait will return false on arrow/cpp/src/arrow/type_traits.h Lines 1124 to 1142 in de17643
I didn't find a trait that represents |
@IndifferentArea Nice catch, I think we can do some changes like:
Then dispatch by view or binary type? |
For now, I implement a local bool is_binary_view_like(::arrow::Type::type type_id){
return type_id == ::arrow::Type::BINARY_VIEW || type_id == ::arrow::Type::STRING_VIEW;
} But I still think this trait may be necessary in arrow/cpp/src/arrow/array/array_binary.cc Lines 35 to 38 in c66b3f1
However, BinaryViewArray 's constructor is implemented as:arrow/cpp/src/arrow/array/array_binary.cc Lines 93 to 96 in c66b3f1
As a result, we can't build a BinaryViewArray from StringViewArray . Comparing to BinaryArray and StringArray , I think it's not expected?
|
Lol, this is funny but
|
#42247 |
Describe the enhancement requested
This part requires write arrow's StringView/BinaryView/LargeStringView/LargeBinaryView to Parquet file.
The Parquet library has the layers below:
In this part, we should:
View
types inparquet/arrow/schema
module, and add tests.Dictionary<View>
might also be testTypedColumnWriterImpl<ByteArrayType>::WriteArrowDense
and allowing it write view typeDeltaLengthByteArrayEncoder<DType>::Put
,DeltaByteArrayEncoder<DType>::Put
,DictEncoderImpl<ByteArrayType>::Put
andPlainEncoder<ByteArrayType>::Put
supports view typeComponent(s)
C++, Parquet
The text was updated successfully, but these errors were encountered: