Make heuristics a simple set of "if .. elif .. else" statements and use a dictionary instead of variables. #209

smoia · 2020-04-10T20:18:30Z

Closes #89, #44

Proposed Changes

All the heuristics set run to run-01 rather than run-00 (closes 0- vs 1-indexing the run keyword in use_heuristic() #89 )
The tests have been updated consequently to this change
All the "functional code" of the heuristics moved to phys2bids.use_heuristics rather than in the heuristic file, leaving it cleaner for the user (closes Making heuristic functions a simple set of if elif else statements #44 ).
Instead of variables, the heuristic file populates a dictionary (this will be used to read yaml files too).
The documentation has been updated to reflect the changes.

Now, I have two doubts:

Is this a breaking change? It discontinues support for older heuristics, although not that much.
I added a docstring to the heuristic file for completeness, but in this way the first code that the user should change comes after a long while - and a whole lot of information that probably is not necessary to them. I wouldn't see anything bad in removing the docstring in the heuristics to decrease the possibility of confounding the user, as in any case there's a full tutorial on how to use this function, we're not testing it, and we're not going to report it in the API.

codecov · 2020-04-10T20:25:33Z

Codecov Report

Merging #209 into master will decrease coverage by 0.13%.
The diff coverage is 90.90%.

@@            Coverage Diff             @@
##           master     #209      +/-   ##
==========================================
- Coverage   94.46%   94.32%   -0.14%     
==========================================
  Files           7        7              
  Lines         578      582       +4     
==========================================
+ Hits          546      549       +3     
- Misses         32       33       +1

Impacted Files	Coverage Δ
phys2bids/phys2bids.py	`90.00% <90.90%> (-0.45%)`	⬇️

eurunuela

LGTM! 👍

I really like the idea of using a dictionary to have the BIDS keys accessible at all times. It makes the code much easier to read imo.

I'm not worried about the decrease in coverage. I think we can ignore it this time.

phys2bids/phys2bids.py

eurunuela · 2020-04-11T18:08:47Z

phys2bids/heuristics/heur_euskalibur.py

@@ -0,0 +1,93 @@
+import fnmatch


I assume this one is an example?

Yep, why not.
it's the one used for a dataset that will be shared at a certain point.

smoia · 2020-04-18T18:15:37Z

@eurunuela I addressed your point - hopefully also the coverage decrease.

@rmarkello @RayStick I had the same issue as #208 related to the deleted file. I think I solved it but @RayStick and @vinferrer please confirm that for our aims this old file: https://osf.io/u5dq8/ and this new file: https://osf.io/5829m/ are the same.

rmarkello

Hey @smoia! Thanks for all this 🙌

I made a few (very minor) suggestions below, but otherwise this looks good 💯

One lingering question that this all raised for me, though: is there a way to handle the heuristic files such that users don't have to specify the exact run number if they have multiple runs of the same task? Could we use a list to store the info and just loop through? I know heudiconv uses an approach like that so I'm wondering if it would be feasible here!

That said, I think we can merge this as-is in and open another issue to address this stuff more broadly!

phys2bids/phys2bids.py

docs/heuristic.rst

phys2bids/phys2bids.py

RayStick · 2020-04-20T18:12:38Z

@rmarkello @RayStick I had the same issue as #208 related to the deleted file. I think I solved it but @RayStick and @vinferrer please confirm that for our aims this old file: https://osf.io/u5dq8/ and this new file: https://osf.io/5829m/ are the same.

Apologies for that @smoia. I deleted them for a reason, but it was a stupid reason. Instead of explaining, I have actually just put those text files back now. They may be slightly different lengths (i.e. different number of samples) compared to the original file, but otherwise they are the same. You can use either of those files, that you linked to (though 'Test3' will have have a new hyperlink now). Let me know if there are any issues.

Co-Authored-By: Ross Markello <rossmarkello@gmail.com>

smoia · 2020-04-20T19:12:31Z

@RayStick, no worries! If it's shorter than the one we were using before and it has the same characteristics as the previous one that we use to test, then it's even better!
I just want to be sure.

@rmarkello, it's a great idea and we should implement it - not only for heuristics but also for mappings!
I would close this PR as is (with the suggestions you gave), and open an issue to add that feature (that will come extremely helpful once #206 is complete).
The only doubt that I still have is about the docstring in the heuristic file. Maybe it's not only not necessary, but also problematic. But if you think it's good as it is, think about which label to apply to this PR (if it's a breaking change or not - if it is the label is "majormod"), and merge it in!

rmarkello

@smoia: a few thoughts on the heuristics files, then!

I vote to remove the logger import and statement from all the examples. Handle this inside phys2bids (i.e., if task isn't set then raise an error).
It is unclear from the doc-string in the heuristic files that the parameter physinfo is the filename. You currently say "Name of the file or partial match" but it's always a filename. It would be great to clarify this (as you have done in the documentation)!
Is it possible to avoid pass info to the heuristic function? I know it comes from the calling function but can't you just define info as a dict inside the heur() function, update it accordingly, and then update the info dict in the calling function? I'm thinking something like:

filename = 'my_physio_data.txt'
info = {'sub': '001', 'ses': '01'}
info.update(heuristic.heur(filename))

That way heur() only has one parameter (the filename, for now—eventually more info!).

Let me know what you think.

… than passing it as an argument

…ad of reassigning it.

…/heuristic_refactor

smoia · 2020-04-21T18:45:10Z

@rmarkello good ideas! I implemented them in the last commit.
I'm not totally sure about initialising the dictionary in the heuristic file vs passing it as an argument, but for sure updating the dictionary is a much better idea!

smoia · 2020-04-21T19:02:39Z

Ok, codecov is not happy about the decrease in testing due to the new line that is in the function.
Since we don't have breaking tests (do we?), I would move on and merge it anyway, if @rmarkello you're happy with the latest version.

rmarkello

This is great! Thanks so much for this @smoia !

rmarkello · 2020-04-21T19:45:32Z

This might now count as a breaking change... Let me know and I'll update the labels + merge in accordingly.

smoia · 2020-04-21T21:06:57Z

@rmarkello I trust you - up to you to decide which label to add (majormod or minormod) and merge in!

Stefano Moia added 8 commits April 10, 2020 19:57

Start from 1 in run numeration

cd55bf0

Delete heur_ex and heur

38a7142

Restructure heuristics to use info dictionary

c4e62b8

Adapt use_heuristic to accept new heuristics and use info dictionary

2f9146a

Update heuristics for tutorial and tests

fbc30fd

Update documentation

28570e8

Correct folder path

7abb36e

Update tests to match new heuristic (run-00→01)

bd46897

smoia requested review from rmarkello and eurunuela April 10, 2020 20:18

smoia assigned rmarkello Apr 10, 2020

smoia changed the title ~~Enh/heuristic refactor~~ Make heuristics a simple set of "if .. elif .. else" statements and use a dictionary instead of variables. Apr 10, 2020

smoia added the Refactoring Improve nonfunctional attributes label Apr 10, 2020

eurunuela approved these changes Apr 11, 2020

View reviewed changes

Stefano Moia added 3 commits April 18, 2020 19:45

Change dictionary name from 'info' to 'bids_keys'

447c176

Add pytest parametrisation not to loose coverage

a7cf2c6

Change file link for deleted file

e21c138

rmarkello approved these changes Apr 20, 2020

View reviewed changes

phys2bids/phys2bids.py Outdated Show resolved Hide resolved

phys2bids/phys2bids.py Outdated Show resolved Hide resolved

docs/heuristic.rst Outdated Show resolved Hide resolved

phys2bids/phys2bids.py Outdated Show resolved Hide resolved

Stefano Moia and others added 4 commits April 20, 2020 21:03

Update phys2bids/phys2bids.py

c9a54e1

Co-Authored-By: Ross Markello <rossmarkello@gmail.com>

Update phys2bids/phys2bids.py

0ba2ec5

Co-Authored-By: Ross Markello <rossmarkello@gmail.com>

Update docs/heuristic.rst

8158e41

Co-Authored-By: Ross Markello <rossmarkello@gmail.com>

Update phys2bids/phys2bids.py

c0d9954

Co-Authored-By: Ross Markello <rossmarkello@gmail.com>

smoia added Minormod This PR generally closes an `Enhancement` issue. It increments the minor version (0.+1.0) and removed Minormod This PR generally closes an `Enhancement` issue. It increments the minor version (0.+1.0) labels Apr 20, 2020

rmarkello reviewed Apr 21, 2020

View reviewed changes

Remove logger from heuristic, initialise dictionary internally rather…

4eecc21

… than passing it as an argument

Stefano Moia added 4 commits April 21, 2020 20:20

Add logger that was in the heuristics before, update dictionary inste…

6c94afe

…ad of reassigning it.

Merge remote-tracking branch 'origin/enh/heuristic_refactor' into enh…

ebe5614

…/heuristic_refactor

Tiny parenthesis matter

34fa496

Update documentations

2d64201

It's always the whitespaces. Always.

3387c34

rmarkello approved these changes Apr 21, 2020

View reviewed changes

smoia added the Majormod This PR breaks compatibility, and increments the major version (+1.0.0) label Apr 22, 2020

rmarkello merged commit 0ec1338 into physiopy:master Apr 22, 2020

smoia deleted the enh/heuristic_refactor branch April 22, 2020 16:53

smoia added the released This issue/pull request has been released. label Oct 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make heuristics a simple set of "if .. elif .. else" statements and use a dictionary instead of variables. #209

Make heuristics a simple set of "if .. elif .. else" statements and use a dictionary instead of variables. #209

smoia commented Apr 10, 2020

codecov bot commented Apr 10, 2020 •

edited

Loading

eurunuela left a comment

eurunuela Apr 11, 2020

smoia Apr 18, 2020

smoia commented Apr 18, 2020

rmarkello left a comment

RayStick commented Apr 20, 2020

smoia commented Apr 20, 2020

rmarkello left a comment

smoia commented Apr 21, 2020

smoia commented Apr 21, 2020

rmarkello left a comment

rmarkello commented Apr 21, 2020

smoia commented Apr 21, 2020

Make heuristics a simple set of "if .. elif .. else" statements and use a dictionary instead of variables. #209

Make heuristics a simple set of "if .. elif .. else" statements and use a dictionary instead of variables. #209

Conversation

smoia commented Apr 10, 2020

Proposed Changes

codecov bot commented Apr 10, 2020 • edited Loading

Codecov Report

eurunuela left a comment

Choose a reason for hiding this comment

eurunuela Apr 11, 2020

Choose a reason for hiding this comment

smoia Apr 18, 2020

Choose a reason for hiding this comment

smoia commented Apr 18, 2020

rmarkello left a comment

Choose a reason for hiding this comment

RayStick commented Apr 20, 2020

smoia commented Apr 20, 2020

rmarkello left a comment

Choose a reason for hiding this comment

smoia commented Apr 21, 2020

smoia commented Apr 21, 2020

rmarkello left a comment

Choose a reason for hiding this comment

rmarkello commented Apr 21, 2020

smoia commented Apr 21, 2020

codecov bot commented Apr 10, 2020 •

edited

Loading