Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update docs to advise use of pbzip2 for new track corpus #955

Merged
merged 1 commit into from
Apr 7, 2020

Conversation

dliappis
Copy link
Contributor

@dliappis dliappis commented Apr 7, 2020

Related to #947

@dliappis dliappis added enhancement Improves the status quo :Docs Changes to the documentation labels Apr 7, 2020
@dliappis dliappis added this to the 1.5.0 milestone Apr 7, 2020
@dliappis dliappis self-assigned this Apr 7, 2020
Copy link
Member

@danielmitterdorfer danielmitterdorfer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM


bzip2 -9 -c documents.json > documents.json.bz2
pbzip2 -9 -k -m2000 -v documents.json
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

2GB turns out to be the maximum size (for compression); note this is a hint on the max memory it could gobble up, provided it's available.

@dliappis dliappis merged commit 763aebf into elastic:master Apr 7, 2020
@dliappis dliappis deleted the example-pbzip2-docs branch April 7, 2020 09:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Docs Changes to the documentation enhancement Improves the status quo
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants