tSNE: Update to use new implementation #292

pavlin-policar · 2018-09-05T12:08:39Z

Issue

Once biolab/orange3#3192 is merged, the tSNE widget here will no longer work due to the API change.

Should not be merged until biolab/orange3#3192.

Description of changes

The tSNE widget now works. I've also removed the MulticoreTSNE code, since the current implementation also implements Barnes-Hut and is about as fast as MulticoreTSNE.

I've set the step size to 50 (chosen for no reason in particular), so the visualization is updated every 50 iterations. This is primarily done so the optimization can be stopped in between; otherwise, one would have to wait until all iterations had finished before the widget became responsive again.
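To illustrate the chunked optimization described above, here is a minimal sketch. It is not the widget's actual code: `run_in_chunks`, `optimize` and `should_stop` are hypothetical names, and `optimize(n)` stands in for whatever call advances the embedding by `n` iterations.

```python
from typing import Callable, Iterator

STEP_SIZE = 50  # iterations per chunk, as in the description above


def run_in_chunks(
    total_iterations: int,
    optimize: Callable[[int], None],
    should_stop: Callable[[], bool],
    step_size: int = STEP_SIZE,
) -> Iterator[int]:
    """Run `optimize` in chunks, yielding the iteration count after each chunk."""
    done = 0
    while done < total_iterations and not should_stop():
        # The last chunk may be shorter than step_size.
        n = min(step_size, total_iterations - done)
        optimize(n)  # hypothetical: advance the embedding by n iterations
        done += n
        yield done  # the caller can redraw the plot here
```

Because control returns to the caller after every chunk, a stop request is honoured after at most `step_size` iterations instead of only once the whole run has finished.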

One thing to note is the early exaggeration phase. The previous version was limited in the sense that the number of early exaggeration iterations was hardcoded to 250 (in sklearn and MulticoreTSNE themselves). The early exaggeration factor was 1, so it behaved like the regular optimization. However, this meant that we could not optimize for fewer than 250 steps. With the new implementation, there is no such limitation, so we can actually run a single iteration if we want. Both the previous and current implementations completely remove the early exaggeration phase. This is generally not a good idea and can lead to worse visualizations. The point of early exaggeration is to correct poor initializations and clump similar points together.
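As a rough sketch of why a factor of 1 makes the early exaggeration phase a no-op: early exaggeration multiplies the affinities by a factor during the first iterations, then drops back to 1. The function name and the factor of 12 below are assumptions for illustration only, not values taken from this PR.

```python
def exaggeration_factor(iteration: int, early_iters: int, early_factor: float) -> float:
    """Factor applied to the affinities at the given (0-based) iteration."""
    return early_factor if iteration < early_iters else 1.0


# Behaviour described above: 250 hardcoded iterations with factor 1,
# which is indistinguishable from the regular optimization.
old = [exaggeration_factor(i, early_iters=250, early_factor=1.0) for i in range(300)]

# Illustrative schedule with a real exaggeration phase (12 is an assumed value).
new = [exaggeration_factor(i, early_iters=250, early_factor=12.0) for i in range(300)]
```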

I've set the optimization scheme to automatically switch to FFT interpolation when the number of points exceeds 10k. This was arbitrarily chosen and is likely not the best cutoff point for Barnes-Hut. However, I am certain that FFT is faster than BH at 10k points.
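The switch described above amounts to a simple threshold check. A minimal sketch, assuming hypothetical names (`choose_gradient_method`, and the strings `"bh"` and `"fft"` as method labels); the widget's real code may look different.

```python
FFT_THRESHOLD = 10_000  # cutoff from the description above; chosen arbitrarily


def choose_gradient_method(n_points: int, threshold: int = FFT_THRESHOLD) -> str:
    """Return "fft" when the number of points exceeds the threshold, else "bh"."""
    return "fft" if n_points > threshold else "bh"
```

Keeping the cutoff in a single named constant makes it easy to adjust later if benchmarking suggests a better value.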

Also, I've removed the old code with the TODO to remove once merged into core. I am fairly certain that code has long since been merged into core.

Also, from what I could tell, the previous implementation always

Includes

Code changes
Tests
Documentation

mstrazar · 2018-10-12T07:55:47Z

@lanzagar @pavlin-policar
It would be really useful to have this soon, to handle larger datasets. Is there any news on this?

codecov-io · 2018-10-21T16:19:56Z

Codecov Report

Merging #292 into master will increase coverage by 0.04%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master     #292      +/-   ##
==========================================
+ Coverage   61.36%   61.41%   +0.04%     
==========================================
  Files          28       28              
  Lines        6264     6240      -24     
==========================================
- Hits         3844     3832      -12     
+ Misses       2420     2408      -12

lanzagar · 2018-11-08T10:54:02Z

orangecontrib/single_cell/widgets/owtsne.py

-    create_annotated_table, create_groups_table, ANNOTATED_DATA_SIGNAL_NAME)
+    create_annotated_table, create_groups_table, ANNOTATED_DATA_SIGNAL_NAME,
+    get_unique_names,
+)


 RE_FIND_INDEX = r"(^{} \()(\d{{1,}})(\)$)"


This is not used anymore.

pavlin-policar changed the title ~~[NOMERGE] tSNE: Update to use new implementation~~ tSNE: Update to use new implementation Sep 12, 2018

JakaKokosar requested a review from lanzagar September 19, 2018 07:33

pavlin-policar added 2 commits October 21, 2018 18:05

OWtSNE: Fix widget to use faster implementation of tSNE

4706c45

OWtSNE: Remove old functions that have since been added to core

6de8a51

pavlin-policar force-pushed the tsne-update branch from 6026264 to 6de8a51 Compare October 21, 2018 16:06

lanzagar reviewed Nov 8, 2018

OWtSNE: Remove unneeded regex

9663399

lanzagar merged commit 7383050 into biolab:master Nov 8, 2018

pavlin-policar deleted the tsne-update branch November 9, 2018 11:28
