Porting `compare.R` to JavaScript? #16762

Trott · 2017-11-04T22:42:17Z

Version: 10.0.0-pre
Platform: all
Subsystem: benchmarks

Any chance the benefits of having benchmark/compare.R functionality in JavaScript would outweigh any downsides?

No opinion from me on it. Just pondering. I know R and Python tend to be the languages of choice for this sort of thing but maybe there's something to be said for a stdlib-js approach to it? /cc @kgryte

(For that matter, would a Python 2 port make sense? At least we'd have one less external tool we rely on, as we need Python 2 for our build tool chain.)

The text was updated successfully, but these errors were encountered:

kenany · 2017-11-04T23:03:47Z

#12585 has some discussion on doing the stats tests in JS/Python.

refack · 2017-11-04T23:30:36Z

The main benefit of R is the ggplot library, but ATM we don't use it in our main benchmarking flows.

I asked an R ninja to do this porting, but he lost interest.
IMHO if we port this as-is, there should be no difference, we just need an implementation of the T-table

joyeecheung · 2017-11-05T02:12:19Z

Should we mark this as good-first-issue? One don't really need to know too much about core to do this but if they are a stats ninja they might be interested.

kgryte · 2017-11-05T22:18:23Z

@Trott Thanks for the ping. In stdlib, we have implemented T-test functionality, which seems to be the main feature of the compare.R script not readily achievable in JavaScript. For an implementation, see here. While we would like to be able to say that you can just npm install the package directly, this is not possible at the moment, as we have yet to flip the switch and publish separate packages to npm.

While not available at the moment to use out-of-the-box, the code should provide some insight into what would be required to "roll your own" implementation. Most importantly, a proper T-test implementation won't rely on a T-table. Instead, it will rely on computing the CDF of a Students t-distribution, which can be found here. And computing the CDF, requires computing the incomplete beta function, which is not straightforward.

So, my assessment is that this is not a good first issue. You would need to put in considerable time to actually implement something comparable to R/Python, as we have.

As a stop gap, if you are wanting to rid yourselves of the R dependency, then use SciPy. The talk about using Pandas in the PR thread mentioned above is misguided. The SciPy functionality should work for Python 2.7 and above. You can achieve the box plot functionality using Matplotlib.

Once we have decomposed stdlib into individual packages, you'll be able to do everything in JS. But until then, I would opt for Python.

refack · 2017-11-06T00:30:07Z

@kgryte thank you so more for the input.

As a stop gap, if you are wanting to rid yourselves of the R dependency, then use SciPy.

Getting SciPy installed on Windows while doable is IMHO just a ted more cumbersome than installing the R runtime. So I'd say that's not a big win.

💡
🤓 What we could do is write a t-test-as-a-service, that way we replace the dependency on R with internet access.

But just to put things in perspective, we are not looking for scientific paper grade statistics, we just need a way to measure significance, so IMHO having a precalculated T-table as a rough approximation of the CDF of a Student's t-distribution seems good enough from my POV.

Planeshifter · 2017-11-06T01:37:03Z

As a slight addendum to @kgryte's post, I would just like to note that the reason we haven't published to npm is that the final project structure is not cast in stone yet. Almost all of the existing packages are fully functional and thoroughly tested. Also, while we make no guarantees at this point, it's not very likely that the API of the t-test will change in the future.

As of now, our recommended approach to use stdlib-js is to create a bundle of the required functions. We provide a bundling tool for this purpose. Alternatively, we provide UMD bundles for the entire library (https://github.com/stdlib-js/stdlib/tree/develop/dist).

Trott added benchmark Issues and PRs related to the benchmark subsystem. discuss Issues opened for discussions and feedbacks. python PRs and issues that require attention from people who are familiar with Python. labels Nov 4, 2017

Trott closed this as completed Jan 5, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Porting `compare.R` to JavaScript? #16762

Porting `compare.R` to JavaScript? #16762

Trott commented Nov 4, 2017

kenany commented Nov 4, 2017

refack commented Nov 4, 2017

joyeecheung commented Nov 5, 2017 •

edited

Loading

kgryte commented Nov 5, 2017

refack commented Nov 6, 2017

Planeshifter commented Nov 6, 2017 •

edited

Loading

Porting compare.R to JavaScript? #16762

Porting compare.R to JavaScript? #16762

Comments

Trott commented Nov 4, 2017

kenany commented Nov 4, 2017

refack commented Nov 4, 2017

joyeecheung commented Nov 5, 2017 • edited Loading

kgryte commented Nov 5, 2017

refack commented Nov 6, 2017

Planeshifter commented Nov 6, 2017 • edited Loading

Porting `compare.R` to JavaScript? #16762

Porting `compare.R` to JavaScript? #16762

joyeecheung commented Nov 5, 2017 •

edited

Loading

Planeshifter commented Nov 6, 2017 •

edited

Loading