@pjfanning @raboof could you please help here?
I've come across tailing raw HDFS files (e.g. raw text stored in an HDFS file that is being appended to), but I haven't come across this for Parquet. I am no expert on Parquet, but I wasn't aware that such files could be (i) appended to or (ii) tailed. This link seems to indicate that Parquet files can't be appended to:
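To illustrate why appending is a problem: a Parquet file begins with the 4-byte magic `PAR1` and ends with the footer metadata, a 4-byte little-endian footer length, and the magic `PAR1` again. The sketch below is only a framing check in plain Python, not a Parquet reader; the function name and the sample byte strings are made up for illustration and are not real Parquet data.

```python
import struct

PARQUET_MAGIC = b"PAR1"

def read_footer_length(data: bytes):
    """Return the footer length if `data` has Parquet framing, else None."""
    # Smallest possible framing: leading magic + footer length + trailing magic.
    if len(data) < 12:
        return None
    if data[:4] != PARQUET_MAGIC or data[-4:] != PARQUET_MAGIC:
        return None
    # The 4 bytes before the trailing magic hold the footer length (little-endian).
    (footer_len,) = struct.unpack("<I", data[-8:-4])
    # The footer must fit between the leading magic and the last 8 bytes.
    if footer_len > len(data) - 12:
        return None
    return footer_len

# Hypothetical stand-in bytes that only mimic the framing of a finished file.
complete = PARQUET_MAGIC + b"rowgroup-bytes" + b"meta!" + struct.pack("<I", 5) + PARQUET_MAGIC
# The same bytes with more data appended after the footer.
appended = complete + b"more-bytes"
```

With these stand-in bytes, the framing is found for `complete` but not for `appended`, because a reader locates the metadata from the end of the file. That suggests a consumer should wait for whole, closed files rather than tail one that is still growing.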
If we use FileTailSource for reading Parquet files, we'd be working with a lower-level stream of bytes, which is not ideal for structured formats like Parquet. FileTailSource is designed for streaming content like logs or text files where new data is appended. Since Parquet files are columnar and binary and would have to be parsed from raw byte streams, would the Pekko Streams File connectors be ideal for this use case?
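For context, tail-style reading can be sketched as follows. This is not Pekko's implementation, just a standard-library Python illustration of the idea; `read_appended` is a hypothetical helper. It returns whatever bytes were added since the last known offset, with no knowledge of the file's format.

```python
import os

def read_appended(path: str, offset: int):
    """Return (new_bytes, new_offset) for data appended after `offset`."""
    size = os.path.getsize(path)
    if size <= offset:
        # Nothing new has been written since the last read.
        return b"", offset
    with open(path, "rb") as f:
        f.seek(offset)
        chunk = f.read(size - offset)
    return chunk, offset + len(chunk)
```

For a log file, each returned chunk can be split into lines and processed immediately. For Parquet, the chunks would be fragments of column data whose metadata has not been written yet, so they could not be decoded on their own.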