[FIX] clarify that filters should be specified as object of objects #348

sappelhoff · 2019-10-16T10:43:44Z

closes #339

In this PR, I clarify that for SoftwareFilters and HardwareFilters we expect an object of objects.

Previously, the text said we expect "a list" ... although the text thereafter and the included example showed an object of objects.

The alternative was to allow "lists of objects", which might have been a more elegant solution because it would naturally encode the order of applied temporal filters. However, as this would break backward compatibility, I vote against that change.

With the proposed object of objects, an order of filters (if required) can be indicated with additional, custom key-value pairs.
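To illustrate the point about ordering, here is a minimal Python sketch of an object of objects in which a custom key records the order of application. The `LowPass` entry, its `Cutoff` value, and the `FilterOrder` key are assumptions made for this example; none of them is defined by the specification.

```python
import json

# Hypothetical sidecar fragment: each filter name maps to an object of parameters.
# "FilterOrder" is a custom key invented for this sketch, not a spec field.
sidecar = {
    "SoftwareFilters": {
        "LowPass": {"Cutoff": 40.0, "FilterOrder": 2},
        "SpatialCompensation": {"GradientOrder": 3, "FilterOrder": 1},
    }
}


def filters_in_order(filters, order_key="FilterOrder"):
    """Return the filter names sorted by the custom order key."""
    return sorted(filters, key=lambda name: filters[name][order_key])


ordered_names = filters_in_order(sidecar["SoftwareFilters"])

print(json.dumps(sidecar, indent=2))
print(ordered_names)
```

Because the order lives in an ordinary key-value pair, the outer structure stays an object of objects and existing readers can ignore the extra key.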

effigies

Fine with me. As I've stated elsewhere, the only condition under which I would strenuously object is if order can ever matter, for example, if nonlinear filters can be used. If order can matter, I think the cost in fixing examples and potentially invalidating datasets is worth it.

Also worth considering whether people previously read "list" and interpreted it literally; you may be invalidating datasets that were valid under that interpretation. Checking ///openneuro via datalad (which doesn't include all current datasets...), the only examples I see are `"n/a"`, `{"SpatialCompensation":{"GradientOrder":"3rd"}}` and `{"SpatialCompensation":{"GradientOrder": 3}}`. So this isn't a definite concern...

I'll leave approving reviews to ephys people, who know more about these filters than I do.

jasmainak · 2019-10-17T04:53:04Z

Thinking about this issue once more, I am not sure if it's really a big difference to have a list or not. It's true that the order could matter but in practice, since this is not a derivatives specification yet, the software filter field is for storing the bare minimum preprocessing that has been done to make the files shareable. This could be, for instance, Maxfilter for Elekta systems and gradient compensation for CTF systems. We don't want to encourage users to hack this to share what could be considered derivative files.

sappelhoff · 2019-10-17T09:18:16Z

> Also worth considering whether people previously read "list" and interpreted it literally; you may be invalidating datasets that were valid under that interpretation.

That is true, but these datasets would not have passed the validator, so these cases are confined to those where people only read the spec and never used the validator.

I interpret @jasmainak's comment above as another pro for dealing with the issue in the way proposed in this PR. Or did you mean it in another way, Mainak?

src/04-modality-specific-files/02-magnetoencephalography.md

jasmainak · 2019-10-17T16:56:04Z

yes it is pro. Do you think though that the current phrasing introduces more flexibility in the specification? I'd prefer something like dictionary as opposed to object

sappelhoff · 2019-10-18T10:23:53Z

> yes it is pro. Do you think though that the current phrasing introduces more flexibility in the specification? I'd prefer something like dictionary as opposed to object

That's why I put a link there to "JSON OBJECT", where it is unambiguously defined what we expect: https://www.w3schools.com/js/js_json_objects.asp

jasmainak · 2019-10-18T13:52:26Z

right but a JSON object can contain almost anything unless I'm mistaken?

jasmainak · 2019-10-18T13:54:26Z

If you change to something like dictionary of dictionaries, where the values of the key-value pairs are restricted to be strings, then you reduce the risk of breaking backwards compatibility. However, this would need a tweak to the JSON schemas in the validator.

effigies · 2019-10-18T14:44:53Z

We seem to use "object" and "dictionary" interchangeably in Common principles.

> a simple data dictionary in a JSON format

> a description of the corresponding column, using an object containing the following fields:

> a dictionary of possible values (keys) and their descriptions (values).

I don't see either used elsewhere, so I don't think there's a strong precedent.

I also don't think it's correct to say that "object" is more general than "dictionary". "Object" corresponds to a JSON notion, while "dictionary" is a Python name (although the JSON docs associate the two, presumably to help orient Python programmers), so this is a question of style, not of semantics.

> If you change to something like dictionary of dictionaries, where the values of the key-value pairs are restricted to be strings, then you reduce the risk of breaking backwards compatibility.

Some values are numeric, so this would definitely break backwards compatibility. This level of specification feels like it's headed in an unnecessarily restrictive direction. "Object of objects"/"dictionary of dictionaries" makes the depth clear; a programmer knows they can make a two-level nested loop over entries, and get some value. It is not our job to say what all those values can or must be.
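The two-level loop mentioned above might look like the following Python sketch. The `LowPass` entry is a made-up example, `flatten_filters` is a hypothetical helper rather than part of any BIDS tool, and treating a bare `"n/a"` string as "no filters" is an assumption of this sketch.

```python
# Example value of a filters field: an object of objects.
software_filters = {
    "SpatialCompensation": {"GradientOrder": "3rd"},
    "LowPass": {"Cutoff": 40.0},  # hypothetical entry for illustration
}


def flatten_filters(filters):
    """Return (filter name, parameter name, value) triples.

    A bare "n/a" string is treated as "no filters" in this sketch.
    """
    if filters == "n/a":
        return []
    rows = []
    for filter_name, params in filters.items():  # first level: filters
        for param_name, value in params.items():  # second level: parameters
            rows.append((filter_name, param_name, value))
    return rows


rows = flatten_filters(software_filters)
for filter_name, param_name, value in rows:
    print(f"{filter_name}.{param_name} = {value!r}")
```

The loop makes no assumption about the type of each value, which matches the point that values may be strings or numbers.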

If we want to be more specific about what's found in the value of certain keys, then we can specify fields using dot notation, as in Derived dataset and pipeline description:

| Field                | Description |
|----------------------|-------------|
| TopField.SecondLevel | ...         |
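One possible reading of the dot notation, sketched in Python: each dot-separated part names a key one level deeper in the nested object. `get_dotted` is a hypothetical helper written for this illustration, and it assumes that key names themselves contain no dots.

```python
def get_dotted(obj, dotted_field):
    """Look up a nested value using a dot-separated field name."""
    current = obj
    for key in dotted_field.split("."):
        current = current[key]  # raises KeyError if a level is missing
    return current


example = {"TopField": {"SecondLevel": "some value"}}
value = get_dotted(example, "TopField.SecondLevel")
print(value)
```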

jasmainak · 2019-10-18T15:51:31Z

I just wanted to prevent people from hacking into this and providing the filter parameters through arbitrary nesting. The JSON schema gives straightforward ways to do this without adding any complexity to the validator code. But I'm not sure how this would work with the dot notation.

Anyway, I recognize @sappelhoff just intended this PR as a clarification. So, it's fine by me as it is.

sappelhoff · 2019-10-21T10:06:47Z

> I just wanted to prevent people from hacking into this and providing the filter parameters through arbitrary nesting. The JSON schema gives straightforward ways to do this without adding any complexity to the validator code.

I see your point! Perhaps we can make this as a separate improvement:

- in the spec, provide some more guidelines (e.g., nested objects are only allowed up to X levels of depth)
- in the validator, use a schema that allows only a maximum depth
- provide some RECOMMENDED guidelines for parameter names and their expected value types (e.g. Cutoff, float)

However, as said above - I would see that as separate from this PR.
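As a rough illustration of the depth guideline, the check could be sketched in plain Python. This is a stand-in for the idea only, not the JSON schema mechanism the validator would use; `nesting_depth`, `check_max_depth`, and the default limit of 2 are assumptions of this sketch, and objects nested inside lists are not inspected.

```python
def nesting_depth(value):
    """Depth of nested objects: a scalar is 0, {} is 1, {"a": {}} is 2."""
    if not isinstance(value, dict):
        return 0
    return 1 + max((nesting_depth(v) for v in value.values()), default=0)


def check_max_depth(filters, max_depth=2):
    """Return True if the filters object is nested no deeper than max_depth."""
    return nesting_depth(filters) <= max_depth


ok = check_max_depth({"SpatialCompensation": {"GradientOrder": 3}})
too_deep = check_max_depth({"SpatialCompensation": {"GradientOrder": {"Value": 3}}})
print(ok, too_deep)
```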

Can we have some more approving or challenging voices?

effigies · 2019-10-30T13:31:12Z

Looks like everybody's happy with this. Merging.

clarify that filters should be specified as object of objects

84be32a

sappelhoff requested review from effigies, robertoostenveld, CPernet and jasmainak October 16, 2019 10:43

sappelhoff changed the title ~~clarify that filters should be specified as object of objects~~ FIX: clarify that filters should be specified as object of objects Oct 16, 2019

add missing code specifier

e7ddb64

effigies reviewed Oct 17, 2019

jasmainak reviewed Oct 17, 2019

src/04-modality-specific-files/02-magnetoencephalography.md (outdated)

jasmainak reviewed Oct 17, 2019

src/04-modality-specific-files/02-magnetoencephalography.md (outdated)

address code review by @jasmainak

3690b88

jasmainak approved these changes Oct 18, 2019

CPernet approved these changes Oct 22, 2019

effigies merged commit b1f05f5 into bids-standard:master Oct 30, 2019

franklin-feingold added a commit that referenced this pull request Oct 30, 2019

[DOC] Auto-generate changelog entry for PR #348

264e1b2

sappelhoff deleted the filter branch March 31, 2020 18:18

effigies mentioned this pull request May 26, 2020

[ENH] BEP 003: Common Derivatives #265

Merged

5 tasks

sappelhoff changed the title ~~FIX: clarify that filters should be specified as object of objects~~ [FIX] clarify that filters should be specified as object of objects Aug 8, 2020
