This repository performs automated retrieval of results from splice-experiment-runs. It was originally derived from build-abi-containers, a GitHub pipeline I wrote previously but that we did not end up using. We use GitHub in an effort to provide reproducible analyses without requiring a complex, privileged HPC system. Artifacts are retrieved via a GitHub workflow that runs the script get_artifacts.py; this is feasible because the data is small (a matrix of results across compilers).
Make a virtual environment and install dependencies:
$ python -m venv env
$ source env/bin/activate
$ pip install -r requirements.txt
If you take this approach, you'll also need sqlite3. As an alternative, you can use the provided Dockerfile environment (also available as an automated build alongside the repository). You'll need to bind the data:
$ docker run -it -v $PWD/artifacts:/code/artifacts ghcr.io/buildsi/splice-experiment-results
You can also build the container from scratch locally:
$ docker build -t ghcr.io/buildsi/splice-experiment-results .
In the above, you'll have Python with dependencies and sqlite3 ready to go! We recommend the container as it will give you a clean environment separate from the cloned repository on your local machine. If you want to keep results, be sure to bind the entire local directory (and be careful about writing files and permissions).
$ docker run -it -v $PWD/:/code ghcr.io/buildsi/splice-experiment-results
The following section is informational only.
To download artifacts, we did the following:
$ export GITHUB_TOKEN=xxxxxxxxxxx
$ python get_artifacts.py
You can also ask for a specific run; this is how we were able to run jobs in batches (a range of libraries per batch). If you need to customize the artifacts repository:
$ export REPO_NAME=splice-experiment-another-run
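Under the hood, retrieval amounts to listing a workflow run's artifacts through the GitHub REST API and downloading each zip. A minimal sketch of the listing step, assuming the standard GitHub Actions endpoints (the helper names here are illustrative, not the actual functions in get_artifacts.py):

```python
import os

API_ROOT = "https://api.github.com"


def artifacts_url(repo, run_id=None):
    """Build the GitHub API URL for listing artifacts.

    With a run_id, only artifacts from that workflow run are listed;
    without one, all artifacts in the repository are listed.
    """
    if run_id:
        return f"{API_ROOT}/repos/{repo}/actions/runs/{run_id}/artifacts"
    return f"{API_ROOT}/repos/{repo}/actions/artifacts"


def auth_headers(token=None):
    """Headers for an authenticated API request, reading GITHUB_TOKEN if unset."""
    token = token or os.environ.get("GITHUB_TOKEN")
    if not token:
        raise SystemExit("Please export GITHUB_TOKEN")
    return {"Authorization": f"token {token}",
            "Accept": "application/vnd.github+json"}
```

A request to `artifacts_url(...)` with `auth_headers()` returns JSON containing an `artifacts` list, each entry with an `archive_download_url` to fetch the zip.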
Here is how the artifacts present in this repository were retrieved:
$ rm -rf artifacts/*
$ python get_artifacts.py 3056175039 # high fidelity results (not used)
$ python get_artifacts.py 3017137349 # fedora results 0-50
$ python get_artifacts.py 3017172931 # fedora results 50-100
$ python get_artifacts.py 3017684511 # fedora results 100-150
$ python get_artifacts.py 3018521212 # fedora results 150-250
$ python get_artifacts.py 3019031567 # fedora results 250-350
$ python get_artifacts.py 3019032895 # fedora results 350-450
$ python get_artifacts.py 3023207569 # fedora results 450-500
$ python get_artifacts.py 3019727038 # fedora results 550-650
$ python get_artifacts.py 3019728826 # fedora results 650-750
$ python get_artifacts.py 3020343625 # fedora results 750-850
$ python get_artifacts.py 3027789022 # fedora results 850-900
$ python get_artifacts.py 3027790466 # fedora results 900-950
$ python get_artifacts.py 3023223782 # fedora results 950-1000 (34 vs all others had runner issues)
$ python get_artifacts.py 3023226112 # fedora results 1000-1050 (36 vs 37 runner had issues)
$ python get_artifacts.py 3025057440 # fedora results 1050-1100
$ python get_artifacts.py 3026209175 # fedora results 1100-1150
$ python get_artifacts.py 3026210109 # fedora results 1150-1200
$ python get_artifacts.py 3027040794 # fedora results 1200-1250
$ python get_artifacts.py 3027334558 # fedora results 1300-1350 # this is where results start to peter out
$ python get_artifacts.py 3027336141 # fedora results 1350-1400
$ python get_artifacts.py 3027772125 # fedora results 1400-1450
$ python get_artifacts.py 3027773113 # fedora results 1450-1500
$ python get_artifacts.py 3028703062 # fedora results 1500-1550
$ python get_artifacts.py 3028703764 # fedora results 1550-1600
$ python get_artifacts.py 3030132290 # fedora results 1600-1650
$ python get_artifacts.py 3029103477 # fedora results 1650-1700
You should not need to re-download these artifacts; in any case, they will likely have been purged from GitHub after 90 days.
The following steps will run the full analysis. If you want to do this programmatically, you can also just use run.sh to run the same commands for you and generate the output.
- Export results to CSV (results.csv, an intermediate file) and use it to generate the SQLite database results.sqlite3
$ python3 make_database.py
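The core transformation in this step, a flat CSV becoming a queryable SQLite table, can be sketched as below. This is an illustrative stand-in, not the actual make_database.py; the real script's schema and column names may differ:

```python
import csv
import io
import sqlite3


def csv_to_sqlite(csv_text, db_path=":memory:"):
    """Load a CSV string (header row first) into a 'results' table."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]
    conn = sqlite3.connect(db_path)
    # Create one TEXT column per CSV header field
    cols = ", ".join(f'"{c}" TEXT' for c in header)
    conn.execute(f"CREATE TABLE results ({cols})")
    placeholders = ", ".join("?" for _ in header)
    conn.executemany(f"INSERT INTO results VALUES ({placeholders})", data)
    conn.commit()
    return conn
```

With the database built, the analysis scripts below can work from SQL queries instead of re-parsing raw JSON artifacts.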
- Extract error messages from JSON results
$ python3 error_messages.py
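The JSON results nest error output at varying depths, so extraction is essentially a recursive walk. A sketch of that idea, assuming messages live under a "message" key (the key name is an assumption, and this is not the actual error_messages.py logic):

```python
def collect_messages(node, key="message"):
    """Recursively collect all string values stored under `key` in nested JSON data."""
    found = []
    if isinstance(node, dict):
        for k, v in node.items():
            if k == key and isinstance(v, str):
                found.append(v)
            else:
                found.extend(collect_messages(v, key))
    elif isinstance(node, list):
        for item in node:
            found.extend(collect_messages(item, key))
    return found
```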
- Parse error messages
$ for f in *.messages; do echo $f; perl parse_messages.pl $f; echo; done > messages.summary
- Run analyses
These can be run in any order.
Calculate counts matrix (Main tables in Results section)
$ python3 counts_matrix.py
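A counts matrix here is a tally of outcomes per predictor. A minimal sketch of that tabulation, assuming records with "predictor" and "outcome" fields (field names are assumptions, not the real script's schema):

```python
from collections import Counter


def counts_matrix(records):
    """Tally (predictor, outcome) pairs into a nested dict:
    matrix[predictor][outcome] -> count."""
    counts = Counter((r["predictor"], r["outcome"]) for r in records)
    matrix = {}
    for (predictor, outcome), n in counts.items():
        matrix.setdefault(predictor, {})[outcome] = n
    return matrix
```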
Fraction of libraries where all predictors agree
$ python3 agreement.py
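Agreement means every predictor reports the same outcome for a given library. A hedged sketch of that calculation (again using assumed "library" and "outcome" field names, not the actual agreement.py internals):

```python
def agreement_fraction(records):
    """Fraction of libraries for which every predictor reports the same outcome."""
    by_library = {}
    for r in records:
        by_library.setdefault(r["library"], set()).add(r["outcome"])
    if not by_library:
        return 0.0
    # A library agrees if only one distinct outcome was reported for it
    agree = sum(1 for outcomes in by_library.values() if len(outcomes) == 1)
    return agree / len(by_library)
```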
Average fraction of libraries by predictor and filename case. Only shown for the changed case, since unchanged cases are complementary.
$ python3 libraries_by_predictor.py
At least one predictor detects a breakage
$ python3 predictor_breakages.py
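"At least one predictor detects a breakage" is a simple existence check per library. A sketch of that filter, under the same assumed record fields (not the actual predictor_breakages.py code):

```python
def libraries_with_breakage(records, broken="failed"):
    """Libraries where at least one predictor reports a breakage outcome."""
    return sorted({r["library"] for r in records if r["outcome"] == broken})
```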
Fraction of libraries missing in the three-predictor case
$ python3 three_predictors_missing.py
Show error messages for a JSON file. Used for debugging.
$ python3 read_errors.py
BUILDSI is under the same umbrella as Spack, which is distributed under the terms of both the MIT license and the Apache License (Version 2.0). Users may choose either license, at their option.
All new contributions must be made under both the MIT and Apache-2.0 licenses.
See LICENSE-MIT, LICENSE-APACHE, COPYRIGHT, and NOTICE for details.
SPDX-License-Identifier: (Apache-2.0 OR MIT)
LLNL-CODE-811652