Benchmark Scripts for Evaluating Query Languages and Systems for High-Energy Physics Data

This repository contains benchmark scripts for running the implementations of the High-Energy Physics (HEP) analysis queries from the IRIS HEP benchmark for various general-purpose query processing systems. The results have been published in the following paper:

Dan Graur, Ingo Müller, Mason Proffitt, Ghislain Fourny, Gordon T. Watts, Gustavo Alonso. Evaluating Query Languages and Systems for High-Energy Physics Data. In: PVLDB 15(2), 2022. DOI: 10.14778/3489496.3489498.

Please cite both the paper and the software when referring to this work in academic contexts.

Overview of the repository

This repository contains the scripts for producing the datasets, the scripts for running the experiments, and the scripts for plotting the results used in the paper mentioned above.

We recommend getting started with the scripts in the following order:

1. Get individual queries to run on the systems you are interested in, using the small sample datasets provided for each system. For that purpose, look at the general instructions in the experiments folder as well as the system-specific instructions in the subfolders of the respective systems.

2. Generate the full datasets as described in the datasets folder, then upload them to cloud storage and/or load them as per the system-specific instructions.

3. Run the actual experiments using the system-specific scripts from the subfolders of the respective systems. Running all experiments takes several days and costs at least several hundred dollars of cloud credits, so it is a good idea to start with a small subset and extend it as you gain experience and confidence.

4. Re-generate the plots with the scripts in the plots folder. We provide the data we used for the plots in the original paper, but you can also copy over your own measurement data and plot that.
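
The advice above to start with a small subset of experiments and extend it gradually can be planned with a few lines of Python. The following is only a minimal sketch and not part of the repository's scripts: the function `growing_subsets`, the system names, and the query names are all hypothetical placeholders, and the doubling schedule is an arbitrary choice.

```python
from itertools import product


def growing_subsets(runs, first=1, factor=2):
    """Yield prefixes of `runs` that grow by `factor` per stage, ending with all runs."""
    if first < 1 or factor < 2:
        raise ValueError("first must be >= 1 and factor must be >= 2")
    size = first
    while size < len(runs):
        yield runs[:size]
        size *= factor
    # The final stage always covers the complete list of runs.
    yield list(runs)


# Hypothetical placeholders, not names used by the repository.
systems = ["system-a", "system-b"]
queries = ["query-1", "query-2", "query-3"]

# One run per (system, query) pair; order the list so the cheapest runs come first.
runs = list(product(systems, queries))

for stage, subset in enumerate(growing_subsets(runs), start=1):
    print(f"stage {stage}: {len(subset)} of {len(runs)} runs")
```

Each stage repeats the runs of the previous one, so in practice you would skip runs whose results you already have. The real commands for each run are described in the system-specific instructions in the experiments folder.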
