-
Notifications
You must be signed in to change notification settings - Fork 241
Implement select_sorting_periods in metrics
#4302
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
|
This is OK for me. |
Co-authored-by: Chris Halcrow <[email protected]>
Co-authored-by: Chris Halcrow <[email protected]>
|
@chrishalcrow I refactored a few metrics to make sure durations, spike counts, and bins are properly accounted for when slicing with periods. Happy to discuss about this! |
select_sorting_periodsselect_sorting_periods in metrics
yger
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good to me, I think this is a great extension to be able to compute metrics on sub_periods only. A couple of minor comments, and I think everything is not too slow, but good work!
|
|
||
| # Check that all spikes in the sliced sorting are within the periods | ||
| for segment_index in range(sorting.get_num_segments()): | ||
| periods_in_segment = periods[periods["segment_index"] == segment_index] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we assume not too many periods, just to be sure that all these masks won't take too long?
| for segment_index in range(sorting.get_num_segments()): | ||
| global_indices_segment = all_global_indices[segment_index] | ||
| # filter periods by segment | ||
| periods_in_segment = periods[periods["segment_index"] == segment_index] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
assuming not too many periods, these masks would be fine. Otherwise, we'll need to optimize
|
We don't want to extend these quality metrics to the extension() ? What if someone wants to only get the ISI, CCG or anything else only on the periods? Would it be easy to slice the sorting, and then compute only on the sub sorting? Are the extension robust w.r.t. periods? |
…select_sorting_periods_core
I think that extension by extension we could use |
|
@samuelgarcia @chrishalcrow tests added! all good now :) |
| num_segments = len(segment_samples) | ||
| bin_duration_samples = int(bin_duration_s * sorting.sampling_frequency) | ||
|
|
||
| if periods is not None: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When there are no good periods, then periods will be an empty array? Should we have a general strategy for when this is true - empty units often cause pain/nans...
| """ | ||
|
|
||
| sorting = sorting_analyzer.sorting | ||
| sorting = sorting.select_periods(periods) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When I choose periods so that only one unit is good, and get the num spikes, get
import spikeinterface.full as si
import numpy as np
from spikeinterface.core.base import unit_period_dtype
rec, sort = si.generate_ground_truth_recording(seed=1205)
sa = si.create_sorting_analyzer(sort, rec)
periods = np.array((0,1000,4000,0), dtype=unit_period_dtype)
quality_metrics = si.compute_quality_metrics(sa, periods=periods, metric_names = ['num_spikes'])
print(quality_metrics['num_spikes'].values)>>> [162 173 135 154 158 144 151 141 153 162]But this sorting only has spikes from unit index 0, right? Is count_num_spikes_per_unit below using an old cache or something?
Depends on #4316
Slices a sorting object based on an array ov valid periods. Periods are defined as a structured dtype as:
EDIT:
Refactored computation of spike train metrics, to make sure that periods are consistently taken into account. Added 2 utils functions to compute durations per unit and bin edges per unit, that optionally use the provided periods