Hive guide

Quick start guide for new users working with Hive partitioned data in R or Python.

Files

To create the data in a container using Podman or Docker:

Build the container podman build -t demo_data .
Run the container podman run demo_data
Check the container hash podman ps --all
Fetch the sample data from the container with: podman cp <container hash>:/home/r-environment/test_data.tar.gz test_data.tar.gz
Decompress the archive and try the examples.

If you're interested in some Arrow benchmarks, check out this repo: https://github.com/mikerspencer/arrow_test/, which was used for an EdinbR talk.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
R_hive_example.md		R_hive_example.md
R_hive_example.qmd		R_hive_example.qmd
generate_test_data.R		generate_test_data.R
generate_test_data_container.R		generate_test_data_container.R
hive_guide.Rproj		hive_guide.Rproj