ARROW-203: Python: Basic filename based Parquet read/write #83
Closed
28 commits
- 0463995 ARROW-203: Python: Basic filename based Parquet read/write (xhochy)
- 7192cfb Add const to slicing parameters (xhochy)
- 081db5f Limit and document chunk_size (xhochy)
- 0fbed3f Remove obsolete parquet files (xhochy)
- be6415c Incorportate review comments (xhochy)
- 9b06e41 Make tests templated (xhochy)
- 5d4929a Add test-util.h (xhochy)
- b505feb Install parquet-cpp via conda (xhochy)
- 81f501e No need to install conda in travis_script_python anymore (xhochy)
- 6a41d23 Re-use conda installation from C++ (xhochy)
- cd3b9a9 Also search for Parquet in PyArrow (xhochy)
- 9520c39 Use PARQUET from miniconda path (xhochy)
- 2006e70 Rewrite test py.test style (xhochy)
- 2dffc14 Fix min mistake, use equals instead of == (xhochy)
- 443de8b Add miniconda to the LD_LIBRARY_PATH (xhochy)
- 5706db2 Use length and offset instead of slicing (xhochy)
- 066c08a Add missing functions to smart pointers (xhochy)
- 4a80116 Handle Python3 strings correctly (xhochy)
- 00c1461 Also ensure correct OSX compiler flags in PyArrow (xhochy)
- f583b61 Fix rpath for libarrow_parquet (xhochy)
- 77bd21a Add pandas roundtrip to tests (xhochy)
- 0514d01 Handle exceptions on RowGroupWriter::Close better (xhochy)
- 8f6010a Linter fixes (xhochy)
- 000e1e3 Use unique_ptr and shared_ptr from Cython (xhochy)
- 8d90d3f Do not set LD_LIBRARY_PATH in python build (xhochy)
- ec07768 Set LD_LIBRARY_PATH in python build (xhochy)
- 38d786c Make code more readable by using using (xhochy)
- 405f85d Remove FindParquet duplication (xhochy)
@@ -0,0 +1,26 @@

    #!/usr/bin/env bash

    set -e

    if [ "$TRAVIS_OS_NAME" == "linux" ]; then
      MINICONDA_URL="https://repo.continuum.io/miniconda/Miniconda-latest-Linux-x86_64.sh"
    else
      MINICONDA_URL="https://repo.continuum.io/miniconda/Miniconda-latest-MacOSX-x86_64.sh"
    fi

    wget -O miniconda.sh "$MINICONDA_URL"
    export MINICONDA=$TRAVIS_BUILD_DIR/miniconda
    bash miniconda.sh -b -p "$MINICONDA"
    export PATH="$MINICONDA/bin:$PATH"
    conda update -y -q conda
    conda info -a

    conda config --set show_channel_urls yes
    conda config --add channels conda-forge
    conda config --add channels apache

    conda install --yes conda-build jinja2 anaconda-client

    # faster builds, please
    conda install -y nomkl
Is this needed? In theory this should not be necessary (handled by conda's shared library patching)
Or is this only for running the C++ unit tests (which don't have the rpath of the conda environment)? If so that's OK
It seems to be required to load libparquet.so: https://travis-ci.org/apache/arrow/jobs/136655629
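For reference, a minimal sketch of what the LD_LIBRARY_PATH workaround under discussion might look like. This is an illustration, not the code from the PR: the helper name `prepend_ld_path` and the `$HOME/miniconda` fallback are assumptions, and it only relies on conda placing shared libraries under the install prefix's `lib` directory.

```shell
# Hypothetical helper (not from the PR): prepend a directory to
# LD_LIBRARY_PATH unless it is already listed there.
prepend_ld_path() {
  case ":${LD_LIBRARY_PATH:-}:" in
    *":$1:"*) ;;  # already present, leave the path untouched
    *) export LD_LIBRARY_PATH="$1${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}" ;;
  esac
}

# MINICONDA is the install prefix exported by the CI script; the fallback
# value here is an assumption so the sketch runs on its own.
MINICONDA="${MINICONDA:-$HOME/miniconda}"
prepend_ld_path "$MINICONDA/lib"
echo "LD_LIBRARY_PATH=$LD_LIBRARY_PATH"
```

Checking for an existing entry keeps the variable from growing when the script is sourced more than once.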
I think the issue is that parquet-cpp is not being installed in the conda environment where the pyarrow library is built and its tests are run (https://github.com/apache/arrow/blob/master/ci/travis_script_python.sh#L32).
Two related issues, then:
- parquet-cpp needs to be added to the build and runtime requirements in python/conda.recipe/meta.yaml
- parquet-cpp must be installed after `conda create` has run; it's currently only being installed in the primary / top-level environment

I think if you do this you can get rid of the LD_LIBRARY_PATH hack.
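
The second point could be sketched roughly as follows. This is an illustration under stated assumptions, not the change that landed: the environment name `pyarrow-test` and the `run` wrapper are hypothetical, and the sketch assumes the `apache` channel has already been added with `conda config` so that `parquet-cpp` resolves. By default it only prints the commands; set `DRY_RUN=0` to execute them.

```shell
# Hypothetical environment name; the real one is defined in
# ci/travis_script_python.sh.
ENV_NAME="${ENV_NAME:-pyarrow-test}"

# Hypothetical wrapper: print each command, and execute it only
# when DRY_RUN is set to something other than 1.
run() {
  echo "+ $*"
  if [ "${DRY_RUN:-1}" != "1" ]; then
    "$@"
  fi
}

# 1. Create the environment that pyarrow is built and tested in.
run conda create -y -q -n "$ENV_NAME" python

# 2. Install parquet-cpp into that same environment (-n), rather than
#    into the top-level miniconda environment.
run conda install -y -q -n "$ENV_NAME" parquet-cpp
```

With the library installed in the environment that actually runs the tests, conda's own library handling should be able to locate libparquet.so, which is what would make the LD_LIBRARY_PATH export unnecessary.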