Add "intersect" method to GenomicArray #340

etal · 2018-04-08T22:44:39Z

Just collect the results of "iter_ranges" into a single DataFrame and return a new GenomicArray.

This was an odd oversight, but apparently CNVkit has been using iter_ranges instead to do what it needs.

etal · 2018-05-02T19:15:06Z

For speed: Collect the matched row indices and do a single slice operation on the original dataframe -- should be much faster than the current approach of extracting each row into a tuple and collecting the tuples into a new dataframe. (See #346)

That won't work on its own if mode='trim' is specified. In that case, iter_ranges provides a simple solution. It may still be significantly faster to do the index-only approach as with mode='inner', and then separately collect just the bins that need to be trimmed, generate them separately, and concatenate the two dataframes.

…340)

etal added the skgenome label Apr 8, 2018

etal added a commit that referenced this issue May 11, 2018

skgenome.intersect: separate func to get intersecting indices (#346, #…

5dd27f3

…340)

etal closed this as completed in f41eeb4 Jun 30, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add "intersect" method to GenomicArray #340

Add "intersect" method to GenomicArray #340

etal commented Apr 8, 2018

etal commented May 2, 2018 •

edited

Loading

Add "intersect" method to GenomicArray #340

Add "intersect" method to GenomicArray #340

Comments

etal commented Apr 8, 2018

etal commented May 2, 2018 • edited Loading

etal commented May 2, 2018 •

edited

Loading