
Loading large JSON data causes error because of Firefox threshold #13

Closed
eharkins opened this issue Aug 20, 2018 · 5 comments

@eharkins
Contributor

jupyterlab/jupyterlab#4015

@metasoarous
Member

metasoarous commented Aug 20, 2018

I see a few things that might help resolve this:

  • Right now the JSON we spit out is pretty-printed into the file, which makes the string that gets loaded up in JS much bigger than it needs to be. Changing this in the build_olmsted_data.py script of cft will likely improve things considerably (see the first sketch after this list).
  • There may also be a way to zip/compress the JSON data, which could resolve the issue (also covered in that sketch).
  • Right now, for seed-lineage-pruning reconstructions, we end up sending along sequence metadata for all of the sequences, not just the sequences chosen as representatives for the seed lineage. For minadcl, we do an aggregation step that leaves us with metadata only for the representative sequences; it's merely an oversight that, because this step isn't necessary for the seed-lineage trees, we end up not reducing the dimensionality of the metadata. This should be a relatively easy fix in the cft pipeline (see the second sketch below).
  • Ultimately, breaking up the data payloads will be the real silver bullet here. The fundamental problem is that we're sending over not just all of the clonal families, but all of their trees, sequences, and sequence metadata. If instead we loaded this additional data lazily as folks click on specific trees, the problem would be largely mitigated. The cost of doing this is a) much more architectural complexity in loading data on demand, and b) tree/alignment details won't render instantly, since they'll have to wait for the data to come in. My suggestion is that we push this off as long as we can and see where we end up.
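For the first two bullets, here is a minimal sketch of what compact (non-pretty-printed) and optionally gzipped output could look like in Python. The function name, arguments, and paths are hypothetical stand-ins, not the actual build_olmsted_data.py code:

```python
import gzip
import json

def write_olmsted_json(data, path, compress=False):
    """Hypothetical helper: serialize data compactly, optionally gzipped."""
    if compress:
        # Gzipped output; the app (or the server, via Content-Encoding: gzip)
        # would need to handle decompression on load.
        with gzip.open(path + ".gz", "wt", encoding="utf-8") as out:
            json.dump(data, out, separators=(",", ":"))
    else:
        # Compact separators drop the whitespace that pretty-printing adds,
        # which can shrink the serialized string substantially.
        with open(path, "w", encoding="utf-8") as out:
            json.dump(data, out, separators=(",", ":"))
```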

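For the third bullet, a rough sketch of the kind of filtering step involved; the function name, argument shapes, and key names are assumptions, not the actual cft pipeline code:

```python
def filter_seed_lineage_metadata(seq_metadata, representative_ids):
    """Hypothetical sketch: keep metadata only for representative sequences.

    seq_metadata: mapping of sequence id -> metadata dict for every sequence
    representative_ids: ids of the sequences kept by seed-lineage pruning
    """
    kept = set(representative_ids)
    # Mirrors the aggregation step that minadcl reconstructions already get.
    return {seq_id: meta for seq_id, meta in seq_metadata.items() if seq_id in kept}
```
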
@metasoarous
Member

@eharkins If you are keen to get your hands dirty with some data-processing work, you could take a stab at the third of the steps above (filtering seed-lineage downsampled metadata).

@eharkins eharkins self-assigned this Aug 22, 2018
@metasoarous metasoarous added the scale (Having to do with scaling out to more data) and data-in (Change in shape of data going into app) labels Sep 13, 2018
metasoarous pushed a commit to matsengrp/cft that referenced this issue Sep 25, 2018
…249)

* added height calculation for evenly spaces leaves

* floating point division

* adding nt sequences for Olmsted#17

* github.com/matsengrp/olmsted/issues/13

* including multiplicity for Olmsted(11)

* tabs -> spaces

* nt seqs dict using tripl lookup instead of fasta parser

* comment white space
@eharkins
Contributor Author

Should we close this for now, now that matsengrp/cft#249 has been merged? I don't see an error on Firefox, but I also never checked whether I could reproduce the error in the first place. @metasoarous?

@metasoarous
Member

I guess for now let's close this, since the issue isn't pressing anymore now that matsengrp/cft#249 has been merged. One of the items above has already been broken off into #42. The other two could be worth revisiting eventually, but aren't high priority.

metasoarous pushed a commit to matsengrp/cft that referenced this issue Oct 8, 2018
* added height calculation for evenly spaces leaves

* floating point division

* adding nt sequences for Olmsted#17

* github.com/matsengrp/olmsted/issues/13

* including multiplicity for Olmsted(11)

* tabs -> spaces

* nt seqs dict using tripl lookup instead of fasta parser

* comment white space

* first try on #250; using ecgtheow script to color pruned nodes

* set prune_count back to default
@metasoarous
Member

Since we increased the number of clonal families sampled per unseeded sample, we're hitting this issue again. The best solution is probably #42 (split clonal family details into separate files, which only get loaded once the given clonal family is selected). This will add a bit of delay the first time those detail views load, but it will also reduce the initial dataset load time and solve the FF loading problem for large datasets. A rough sketch of what that split could look like is below.
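As a rough illustration only (not the actual implementation planned in #42), the pipeline could emit a small summary file for the initial load plus one detail file per clonal family, which the app fetches lazily when that family is selected. The field names and output paths here are made up:

```python
import json
import os

def split_dataset(clonal_families, out_dir):
    """Hypothetical sketch: write a lightweight summary plus per-family detail files."""
    os.makedirs(out_dir, exist_ok=True)
    summaries = []
    for family in clonal_families:
        # Keep only lightweight fields in the summary used for the initial load;
        # the heavy fields listed here are assumed names, not the real schema.
        summaries.append({k: v for k, v in family.items()
                          if k not in ("trees", "sequences", "seq_metadata")})
        # Full details go in a per-family file the app requests on selection.
        detail_path = os.path.join(out_dir, "clonal_family.{}.json".format(family["id"]))
        with open(detail_path, "w") as out:
            json.dump(family, out, separators=(",", ":"))
    with open(os.path.join(out_dir, "clonal_families.json"), "w") as out:
        json.dump(summaries, out, separators=(",", ":"))
```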
