ARROW-18253: [C++][Parquet] Add additional bounds safety checks #14592

emkornfield · 2022-11-05T00:12:15Z

No description provided.

github-actions · 2022-11-05T00:12:36Z

https://issues.apache.org/jira/browse/ARROW-18253

github-actions · 2022-11-05T00:12:37Z

⚠️ Ticket has not been started in JIRA, please click 'Start Progress'.

cpp/src/parquet/arrow/reader.cc

pitrou · 2022-11-05T11:14:40Z

Is this something that can happen due to bad user input? Or are you simply checking for internal invariants?

emkornfield · 2022-11-08T18:11:42Z

Is this something that can happen due to bad user input? Or are you simply checking for internal invariants?

A little bit of both. The only one as far as I can determine that would be a function of bad parquet files is the overflow when down-casting. But I think there are other safe guard in place here. In terms of bad user input, if we are talking about code that consumes these libraries I think most of the places do the negative checks are guards against here (the one exception being PathWriter), which I believe is completely internal. These were raised by another team doing a code audit and "yellow flags". If any of them strike you as superfluous I can revert them.

Co-authored-by: Antoine Pitrou <pitrou@free.fr>

ursabot · 2022-11-10T01:42:17Z

Benchmark runs are scheduled for baseline = 28a1152 and contender = 147b5c9. 147b5c9 is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
Conbench compare runs links:
[Finished ⬇️0.0% ⬆️0.0%] ec2-t3-xlarge-us-east-2
[Finished ⬇️0.3% ⬆️0.03%] test-mac-arm
[Finished ⬇️0.0% ⬆️0.0%] ursa-i9-9960x
[Finished ⬇️0.14% ⬆️0.0%] ursa-thinkcentre-m75q
Buildkite builds:
[Finished] 147b5c92 ec2-t3-xlarge-us-east-2
[Finished] 147b5c92 test-mac-arm
[Finished] 147b5c92 ursa-i9-9960x
[Finished] 147b5c92 ursa-thinkcentre-m75q
[Finished] 28a1152a ec2-t3-xlarge-us-east-2
[Finished] 28a1152a test-mac-arm
[Finished] 28a1152a ursa-i9-9960x
[Finished] 28a1152a ursa-thinkcentre-m75q
Supported benchmarks:
ec2-t3-xlarge-us-east-2: Supported benchmark langs: Python, R. Runs only benchmarks with cloud = True
test-mac-arm: Supported benchmark langs: C++, Python, R
ursa-i9-9960x: Supported benchmark langs: Python, R, JavaScript
ursa-thinkcentre-m75q: Supported benchmark langs: C++, Java

emkornfield added 2 commits November 5, 2022 00:09

ARROW-18253: Add additional bounds safety checks

ab5ebba

remove checks

790585e

emkornfield requested a review from pitrou November 5, 2022 00:12

github-actions bot added Component: C++ Component: Parquet labels Nov 5, 2022

add cast

ebaeaed

pitrou reviewed Nov 5, 2022

View reviewed changes

cpp/src/parquet/arrow/reader.cc Outdated Show resolved Hide resolved

Update cpp/src/parquet/arrow/reader.cc

7c2b44c

Co-authored-by: Antoine Pitrou <pitrou@free.fr>

wjones127 approved these changes Nov 8, 2022

View reviewed changes

pitrou changed the title ~~ARROW-18253: Add additional bounds safety checks~~ ARROW-18253: [C++][Parquet] Add additional bounds safety checks Nov 9, 2022

pitrou approved these changes Nov 9, 2022

View reviewed changes

pitrou merged commit 147b5c9 into apache:master Nov 9, 2022

asfimport mentioned this pull request Nov 10, 2022

[C++][Parquet] Improve bounds checking on some inputs #20488

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARROW-18253: [C++][Parquet] Add additional bounds safety checks #14592

ARROW-18253: [C++][Parquet] Add additional bounds safety checks #14592

emkornfield commented Nov 5, 2022

github-actions bot commented Nov 5, 2022

github-actions bot commented Nov 5, 2022

pitrou commented Nov 5, 2022

emkornfield commented Nov 8, 2022

ursabot commented Nov 10, 2022

ARROW-18253: [C++][Parquet] Add additional bounds safety checks #14592

ARROW-18253: [C++][Parquet] Add additional bounds safety checks #14592

Conversation

emkornfield commented Nov 5, 2022

github-actions bot commented Nov 5, 2022

github-actions bot commented Nov 5, 2022

pitrou commented Nov 5, 2022

emkornfield commented Nov 8, 2022

ursabot commented Nov 10, 2022