Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support multiple derivation prov charts and multiple source prov charts per object #449

Open
gothub opened this issue Jan 22, 2018 · 3 comments
Assignees

Comments

@gothub
Copy link
Contributor

gothub commented Jan 22, 2018

The prov editor doesn't allow a package member to have both input/outputs from a program and directly from a source or to a destination file.

screen shot 2018-01-22 at 11 44 01 am

In the above example, the package member Meierbachtol_inclinometry_allSensors.csv currently has a derived program, and the derived program has an output.

The user wishes to add another derivation to Meierbachtol_inclinometry_allSensors.csv that is
not an output from the program, but is instead a direct derivation from the .csv.

The prov editor should be able to allow both types of derivations for a package member.

In terms of the prov model, the following relationships should be able to be added:

Here is the use case as relayed by one of the datateam members:

t would be helpful to add the following functionality to the prov arrows: (1) allow multiple arrows to stem from the same dataset, such as a large zip file whose components are processed in 2+ ways, (2) allow file-less arrows with comments on how the files may have been manually modified (extracted from a zip, changed " " to "_" in column names, converted xlsx to csv, etc.)
@laurenwalker
Copy link
Member

I know we talked about supporting multiple prov charts per object before. I thought we had created a Github issue for that before. I'm going to go ahead and change the title of this issue to better reflect that feature.

@laurenwalker
Copy link
Member

As for #2 - (2) allow file-less arrows with comments on how the files may have been manually modified (extracted from a zip, changed " " to "_" in column names, converted xlsx to csv, etc.)

I think this is information that needs to get added to the science metadata.

@laurenwalker laurenwalker changed the title Prov editor doesn't allow multiple dependency/derivation types Support multiple derivation prov charts and multiple source prov charts per object Jan 23, 2018
@laurenwalker laurenwalker added this to the 2.1.0 milestone May 24, 2018
@laurenwalker
Copy link
Member

This is the most high priority provenance issue, so this should be prioritized for 2.1.0 over other prov issues.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants