-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ALTO: Integrate CERN data model with IETF ALTO format #37
Comments
After meeting with Mario, we have a better understanding of the application-level data format used by Rucio with CERN. The next step is to design a JSON schema that both:
|
The attached files provide a view of my current thoughts re: a new data model. The file "rucio-non-alto.json" is an example of Rucio's current data format; the file "alto-rucio.json" represents the same data under the proposed new format. The idea is that the latter file would be what is returned by an ALTO server. The new format is based on RFC 8189. The main new feature beyond that
If we pass this value of
Additionally, note that some of the dimensions of
Note that the first two dimensions are not associated with arrays, but scalars. The result of this is that the output is constrained to have unit "mbps" and measure based on the 95th percentile of data, but no new dimension is added to the output array:
I hope that this format makes sense and is logical. The next step would be to figure out how to elegantly include timestamp information. |
Apologies; I uploaded an outdated
|
Get access to CERN data and write a script to transform it into a format that can be processed by our hypothetical ML model.
The text was updated successfully, but these errors were encountered: