Refactor unit/cluster data #674

ajtritt · 2018-10-15T21:39:28Z

ajtritt · 2018-10-15T21:40:22Z

@tjd2002 @bendichter @NileGraddis please provide any feedback

bendichter · 2018-10-15T22:18:54Z

I like the idea of incorporating UnitTimes into the units table, especially now that you've made it possible to put region references in dynamic tables. This would solve some usability issues and bring everything together in a more cohesive framework. Though it would be redundant to keep "Unit" in the name if it's in the units table, so I would suggest calling the column "SpikeTimes" rather than "UnitTimes."

It would be nice to remove the redundancy of supporting both Clustering and UnitTimes, since they hold the same info, but I have a few concerns about deprecating Clustering and ClusterWaveforms

Clustering and ClusterWaveform are easier to stream to, because they progress in time. Clustering is more similar in form to other data standards for spike times, e.g. NeuroScope, where spikes can be written in as they are recorded. UnitTimes is better if you are writing data by cell but that is not usually the case unless you are writing simulation output. I've heard one possible solution for this: explicitly allowing multiple regions in the spike_times dataset for a single unit. This would allow us to iterate through every unit at every buffer cycle and write the spikes of each unit for that cycle, and then sort them at the end so that all regions of a specific unit are contiguous. I think this makes streaming technically possible, but painful in comparison to just using Clustering. We'd have num_cycles * num_units regions that we would need to sort, whereas it seems like streaming to Clustering would probably be very straightforward. A similar argument can be made for ClusterWaveforms.
The form of Clustering caters better to time window queries than UnitTimes does, so it may be better for some analyses
With Clustering it was easy to indicate which shank a unit was from by providing multiple Clustering objects, one for each shank. There appears to be no standard way to do that right now in the units table. One possible solution to this is to have a default column "electrodes_group" and have each group correspond to a shank. This would need to be an optional column or be allowed to be some pre-specified default value because simulation output will not have electrodes nor electrode_groups in the units table.

ajtritt · 2018-10-15T23:23:05Z

@bendichter

With Clustering it was easy to indicate which shank a unit was from by providing multiple Clustering objects, one for each shank.

This is one of the unmentioned motivations of making UnitTimes a DynamicTable. UnitTimes would contain optional columns for specifying which group or electrode a unit came from.

Given that there Clustering has some query/streaming capabilities that UnitTimes does not, it makes sense to keep it around.

With ClusterWaveform in UnitTimes, do you still see a reason to keep it around?

bendichter · 2018-10-16T00:12:35Z

ClusterWaveform is fine to move to the units table since it's just mean and std of the waveform.

bendichter · 2018-10-16T00:16:36Z

see https://github.com/NeurodataWithoutBorders/pynwb/issues/675 for a proposal for dealing with Clustering

tjd2002 · 2018-10-16T14:43:59Z

Though it would be redundant to keep "Unit" in the name if it's in the units table, so I would suggest calling the column "SpikeTimes" rather than "UnitTimes."

I think the reason we opted not to call the Unit table the 'Spike table' in the first place was to account for use cases other than spikes. E.g. time of detected calcium transients. So I'd vote either for 'UnitTimes', 'timestamps', or 'Times'

ajtritt self-assigned this Oct 15, 2018

ajtritt added this to the NWB 2.0 Full milestone Oct 15, 2018

oruebel mentioned this issue Oct 17, 2018

Release Notes: Refactor unit/cluster data NeurodataWithoutBorders/nwb-schema#196

Closed

ajtritt mentioned this issue Oct 27, 2018

enh/units dynamic table #684

Merged

5 tasks

ajtritt closed this as completed in #684 Oct 30, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor unit/cluster data #674

Refactor unit/cluster data #674

ajtritt commented Oct 15, 2018 •

edited

Loading

ajtritt commented Oct 15, 2018

bendichter commented Oct 15, 2018

ajtritt commented Oct 15, 2018

bendichter commented Oct 16, 2018

bendichter commented Oct 16, 2018

tjd2002 commented Oct 16, 2018 •

edited

Loading

Refactor unit/cluster data #674

Refactor unit/cluster data #674

Comments

ajtritt commented Oct 15, 2018 • edited Loading

Problem/Use Case

Checklist

ajtritt commented Oct 15, 2018

bendichter commented Oct 15, 2018

ajtritt commented Oct 15, 2018

bendichter commented Oct 16, 2018

bendichter commented Oct 16, 2018

tjd2002 commented Oct 16, 2018 • edited Loading

ajtritt commented Oct 15, 2018 •

edited

Loading

tjd2002 commented Oct 16, 2018 •

edited

Loading