Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[MISC] Clarify participant_id in participants.tsv file if it exists #738

Merged
merged 10 commits into from
Mar 3, 2021

Conversation

adam2392
Copy link
Member

Closes: #737

Question: "In case of single-session studies, this file has one compulsory column participant_id that consists of sub-, followed by a list of optional columns describing participants. " is in the current spec.

What does that mean? In case of single-session studies? Does that mean in single-session studies, this file is compulsory with participant_id a certain way?

@sappelhoff
Copy link
Member

Thanks for taking a stab at this @adam2392 --> I think it's good to clarify it.

With regards to your question --> I have no clue. Perhaps the other @bids-standard/maintainers know.

I had a look at the Longitudinal and multi-site studies section to see whether it clarifies things, but I think it should rather be clarified itself (using SHOULD / MUST terminology).

Let me give a few examples of what I think should change:

  • Session label can MUST consist only of alphanumeric characters [a-zA-Z0-9] and should SHOULD be consistent across subjects.
  • If numbers are used in session labels we recommend RECOMMEND using zero padding
  • Acquisition time of session can be defined in the sessions file (see below for details) [insert markdown link for "sessions file"]
  • The extra session layer (at least one /ses- subfolder) should MUST be added for all subjects if at least one subject in the dataset has more than one session. Skipping the session layer for only some subjects in the dataset is not allowed.

And then under Sessions file I find it confusing that the text refers to a "participant key file" ... probably, sessions.tsv is meant by that, so that should be clarified as well IMHO

In case of multiple sessions there is an option of adding additional participant key files describing variables changing between sessions. In such case one file per participant should be added. These files need to include compulsory session_id column and describe each session by one and only one row. Column names in per participant key files have to be different from group level participant key column names.

@adam2392
Copy link
Member Author

@sappelhoff

  • addressed all your comments, and
  • assuming the participants.tsv file is always RECOMMENDED, I added explicit language to clarify the question I had.

Comment on lines 11 to 14
The extra session layer (at least one `/ses-<label>` subfolder) should
The extra session layer (at least one `/ses-<label>` subfolder) SHOULD
be added for all subjects if at least one subject in the dataset has more than
one session. Skipping the session layer for only some subjects in the dataset is
not allowed. If a `/ses-<label>` subfolder is included as part of the
directory hierarchy, then the same `ses-<label>` tag must also be
one session. If a `/ses-<label>` subfolder is included as part of the
directory hierarchy, then the same `ses-<label>` tag MUST also be
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Skipping the session layer for only some subjects in the dataset is
not allowed

I actually think that the original text here meant to say that:

The extra session layer (at least one /ses-<label> subfolder) MUST
be added for all subjects if at least one subject in the dataset has more than
one session.

instead of the SHOULD you used here.

opinions @Remi-Gau @tsalo ?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

on the other hand, that doesn't really make sense, because there are experiments where some subjects perform an extra session 🤔

Copy link
Member Author

@adam2392 adam2392 Feb 28, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think skipping the ses-<X> in some subjects is allowed if say not all subjects have the same sessions. Rn this results in a warning in validator, which basically all my data has this warning :p.

Re MUST vs SHOULD:

The extra session layer (at least one /ses-<label> subfolder) MUST
be added for all subjects if at least one subject in the dataset has more than
one session.

seems reasonable to me... The problem is how does one validate this? If one has multiple "sessions" per subject, then that implies there are potentially different electrode/channels across those sessions, so the user would want to store it as such. Does this basically mean if one has:

sub-01/
   sub-01_ses-01_task-blah_ieeg.vhdr
   ...

it's allowed, but if you have:

sub-01/
   sub-01_ses-01_task-blah_ieeg.vhdr
   sub-01_ses-02_task-blah_ieeg.vhdr
   ...

it's not?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think skipping the ses- in some subjects is allowed if say not all subjects have the same sessions. Rn this results in a warning in validator, which basically all my data has this warning

yes, I experienced that warning multiple times myself 🙄 I think this case is more common than thought when this warning was originally implemented. That is, more often it's a choice, rather than an overlooked error.

re: your example --> yes 🤔 but I think your current changes (apart from my comments below) are actually fine and clarify the situation without changing the state of anything (which is what we should aim for), so I'd be fine with leaving the SHOULD, not interpreting it as a MUST post-hoc.

Copy link
Collaborator

@Remi-Gau Remi-Gau Mar 2, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So basically turning it into a MUST would make the following non-BIDS, correct?

├── sub-1
│   └── eeg
└── sub-2
    ├── ses-1
    │   └── eeg
    └── ses-2
        └── fmri

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, that's how I interpret it as well. What's annoying is that first the text says "should" (not MUST), and then the follow up sentence is:

Skipping the session layer for only some subjects in the dataset is not allowed.

so basically the "should" is actually a MUST.

However:

  1. I am not sure why this rule makes sense
  2. I am pretty sure that I have seen datasets not conforming to this rule
  3. There is no validator support for this rule as far as I know

🙄 perhaps we can discuss this tonight in the maintainers meeting

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah the inconsistency should be fixed or clarified.

I mean I find the folder structure I described above "ugly", but thank goodness we don't just rely on my sense of aesthetics.

😉

src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
src/03-modality-agnostic-files.md Outdated Show resolved Hide resolved
src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
Co-authored-by: Stefan Appelhoff <stefan.appelhoff@mailbox.org>
src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
Comment on lines 11 to 14
The extra session layer (at least one `/ses-<label>` subfolder) should
The extra session layer (at least one `/ses-<label>` subfolder) SHOULD
be added for all subjects if at least one subject in the dataset has more than
one session. Skipping the session layer for only some subjects in the dataset is
not allowed. If a `/ses-<label>` subfolder is included as part of the
directory hierarchy, then the same `ses-<label>` tag must also be
one session. If a `/ses-<label>` subfolder is included as part of the
directory hierarchy, then the same `ses-<label>` tag MUST also be
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think skipping the ses- in some subjects is allowed if say not all subjects have the same sessions. Rn this results in a warning in validator, which basically all my data has this warning

yes, I experienced that warning multiple times myself 🙄 I think this case is more common than thought when this warning was originally implemented. That is, more often it's a choice, rather than an overlooked error.

re: your example --> yes 🤔 but I think your current changes (apart from my comments below) are actually fine and clarify the situation without changing the state of anything (which is what we should aim for), so I'd be fine with leaving the SHOULD, not interpreting it as a MUST post-hoc.

Copy link
Member

@tsalo tsalo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some minor wording/formatting requests.

src/03-modality-agnostic-files.md Outdated Show resolved Hide resolved
src/03-modality-agnostic-files.md Outdated Show resolved Hide resolved
src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
adam2392 and others added 3 commits March 1, 2021 12:53
Co-authored-by: Taylor Salo <tsalo006@fiu.edu>
Co-authored-by: Taylor Salo <tsalo006@fiu.edu>
Copy link
Member

@tsalo tsalo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The changes look good to me. Thanks!

Copy link
Collaborator

@effigies effigies left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Only remaining suggestion is to split sentences with newlines.

@effigies effigies added this to the 1.6.0 milestone Mar 2, 2021
src/03-modality-agnostic-files.md Outdated Show resolved Hide resolved
src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
src/06-longitudinal-and-multi-site-studies.md Outdated Show resolved Hide resolved
@sappelhoff
Copy link
Member

Thanks @adam2392!

@sappelhoff sappelhoff merged commit 99eec00 into bids-standard:master Mar 3, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

If participants.tsv file is present, the participant_id column MUST be present and MUST follow certain format
5 participants