Use JSC C API for better invocation speed #1037

robertjpayne · 2015-04-27T05:39:50Z

This converts the existing JSEvaluateScript call for require('<ModuleName>').<MethodName>.apply(null, <args>); to native JSC C API methods which shaves off about 10-15% of invocation time on average, I used pretty primitive profiling methods to track the minimum, maximum and average invocation time so would appreciate any extra eyes on the performance.

If the argument count is zero the method is invoked directly with no arguments, if the argument count is 1 it's invoked directly with just that argument. If there is more than 1 argument then apply is used and the arguments are passed as the second parameter.

Ensured all existing tests pass and instruments leaks shows nothing is leaking.

This converts the existing JSEvaluateScript call for `require('<ModuleName>').<MethodName>.apply(null, <args>);` to native JSC C API methods which shaves off about 10-15% of invocation time on average.

ide · 2015-04-27T09:11:14Z

Nice find...

Have you looked into the Obj-C API? I believe it would slim the code down to 1/4 its size, partly because it's more succinct and also because you could write [module invokeMethod:methodName withArguments:arguments]. (This would also fix what I think is a bug in your diff where thisObject is missing or referring to the global object instead of the module object.)

robertjpayne · 2015-04-27T09:42:52Z

@ide I had looked into it but saw a previous issue ticket that stated the Objective-C API was avoided due to corrupted memory issues.

Mind elaborating on the thisObject bug?

ide · 2015-04-27T10:03:27Z

Left some comments on the diff regarding the thisObject bug.

I took a look at the issue mentioning memory corruption and it sounded like the problem was with JSExport or perhaps the wrapper layer between Obj-C and JS objects. There's also some chance that the issue was due to bridged native objects getting freed at non-deterministic times if they weren't registered with the GC (just a guess). For this code at hand the Obj-C layer is straightforward without JSExport magic so I expect it not to have strange bugs.

robertjpayne · 2015-04-27T10:37:35Z

@ide I'm looking at the thisObject stuff, because I'm pretty sure I tried what you're mentioning and found the behavior to be unexpected ( the C API seemed to be acting up ) but I'll double check.

As for the Objective-C API, it would presumably work just fine, but it'll be less performant due to how JSC internally maps objects and stuff. Since this method is executed every frame would it be best to keep it as performant as possible? I suppose the slowest bit really is the JSON conversion that will need to remain either way.

robertjpayne · 2015-04-27T10:42:45Z

@ide tidied up the thisObject, doesn't seem to affect anything but it's at least correct now.

vjeux · 2015-04-27T15:15:55Z

This is awesome! I'm sure that there's a lot of low hanging fruits in terms of performance at the bridge level :)

You mention a 10-15% reduction, mind sharing your methodology so that we can reproduce it?

robertjpayne · 2015-04-27T15:25:09Z

@vjeux The 10-15% was very rudimentary profiling:

I added:

static CFTimeInterval lowInterval = CGFLOAT_MAX;
static CFTimeInterval avgInterval = 0;
static CFTimeInterval highInterval = CGFLOAT_MIN;

CFTimeInterval before = CACurrentMediaTime();

.... bridge code ....

CFTimeInterval after = CACurrentMediaTime();
CFTimeInterval elapsed = after - before;

lowInterval = MIN(lowInterval, elapsed);
avgInterval = (avgInterval + elapsed) / 2.0;
highInterval = MAX(highInterval, elapsed);

printf("%f, %f, %f\n", lowInterval, avgInterval, highInterval);

Running on device I was looking mostly at the lowInterval and it's "idle" performance was 10-15% lower than it's counter part. I should probably look the interval markers only when payloads are being passed (such as scroll events).

robertjpayne · 2015-04-27T15:46:32Z

It's actually pretty hard to formulate a good test plan to profile this in a "fair" way. I think mostly what I was going off was the fastest execution time marked by the low interval. Since a lot of the messages going over the bridge are nearly identical ( scroll events for instance ) it was a decent mark to play around for awhile.

I've tested it again only measuring messages that have more than 300 JSON characters in length and it's still measuring 10-15% faster on my 4S

robertjpayne · 2015-04-27T16:08:49Z

@vjeux here's the code I have in place now, trying to formulate how I could make this more predictable / comparable but the nature of the bridge is hard.

    static CFTimeInterval lowInterval = CGFLOAT_MAX;
    static CFTimeInterval totalInterval = 0;
    static NSUInteger totalIntervalCount = 0;
    static CFTimeInterval highInterval = CGFLOAT_MIN;

    CFTimeInterval before = CACurrentMediaTime();

    JSValueRef errorJSRef = NULL;
    JSValueRef resultJSRef = NULL;
    JSGlobalContextRef contextJSRef = JSContextGetGlobalContext(strongSelf->_context.ctx);
    JSObjectRef globalObjectJSRef = JSContextGetGlobalObject(strongSelf->_context.ctx);

#pragma mark - C API

//    NSError *error;
//    NSString *argsString = (arguments.count == 1) ? RCTJSONStringify(arguments[0], &error) : RCTJSONStringify(arguments, &error);
//    if (!argsString) {
//      RCTLogError(@"Cannot convert argument to string: %@", error);
//      onComplete(nil, error);
//      return;
//    }
//    
//    // get require
//    JSStringRef requireNameJSStringRef = JSStringCreateWithUTF8CString("require");
//    JSValueRef requireJSRef = JSObjectGetProperty(contextJSRef, globalObjectJSRef, requireNameJSStringRef, &errorJSRef);
//    JSStringRelease(requireNameJSStringRef);
//    
//    if (requireJSRef != NULL && errorJSRef == NULL) {
//      
//      // get module
//      JSStringRef moduleNameJSStringRef = JSStringCreateWithCFString((__bridge CFStringRef)name);
//      JSValueRef moduleNameJSRef = JSValueMakeString(contextJSRef, moduleNameJSStringRef);
//      JSValueRef moduleJSRef = JSObjectCallAsFunction(contextJSRef, (JSObjectRef)requireJSRef, NULL, 1, (const JSValueRef *)&moduleNameJSRef, &errorJSRef);
//      JSStringRelease(moduleNameJSStringRef);
//      
//      if (moduleJSRef != NULL && errorJSRef == NULL) {
//        
//        // get method
//        JSStringRef methodNameJSStringRef = JSStringCreateWithCFString((__bridge CFStringRef)method);
//        JSValueRef methodJSRef = JSObjectGetProperty(contextJSRef, (JSObjectRef)moduleJSRef, methodNameJSStringRef, &errorJSRef);
//        JSStringRelease(methodNameJSStringRef);
//        
//        if (methodJSRef != NULL && errorJSRef == NULL) {
//          
//          // direct method invoke with no arguments
//          if (arguments.count == 0) {
//            resultJSRef = JSObjectCallAsFunction(contextJSRef, (JSObjectRef)methodJSRef, (JSObjectRef)moduleJSRef, 0, NULL, &errorJSRef);
//          }
//          // direct method invoke with 1 argument
//          else if(arguments.count == 1) {
//            JSStringRef argsJSStringRef = JSStringCreateWithCFString((__bridge CFStringRef)argsString);
//            JSValueRef argsJSRef = JSValueMakeFromJSONString(contextJSRef, argsJSStringRef);
//            resultJSRef = JSObjectCallAsFunction(contextJSRef, (JSObjectRef)methodJSRef, (JSObjectRef)moduleJSRef, 1, &argsJSRef, &errorJSRef);
//            JSStringRelease(argsJSStringRef);
//          }
//          // apply invoke with array of arguments
//          else {
//            // get apply
//            JSStringRef applyNameJSStringRef = JSStringCreateWithUTF8CString("apply");
//            JSValueRef applyJSRef = JSObjectGetProperty(contextJSRef, (JSObjectRef)methodJSRef, applyNameJSStringRef, &errorJSRef);
//            JSStringRelease(applyNameJSStringRef);
//            
//            if (applyJSRef != NULL && errorJSRef == NULL) {
//              // invoke apply
//              JSStringRef argsJSStringRef = JSStringCreateWithCFString((__bridge CFStringRef)argsString);
//              JSValueRef argsJSRef = JSValueMakeFromJSONString(contextJSRef, argsJSStringRef);
//              
//              JSValueRef args[2];
//              args[0] = JSValueMakeNull(contextJSRef);
//              args[1] = argsJSRef;
//              
//              resultJSRef = JSObjectCallAsFunction(contextJSRef, (JSObjectRef)applyJSRef, (JSObjectRef)moduleJSRef, 2, args, &errorJSRef);
//              JSStringRelease(argsJSStringRef);
//            }
//          }
//        }
//      }
//      
//    }

#pragma mark - EVALUATE SCRIPT

      NSError *error;
      NSString *argsString = RCTJSONStringify(arguments, &error);
      if (!argsString) {
        RCTLogError(@"Cannot convert argument to string: %@", error);
        onComplete(nil, error);
        return;
      }

    NSString *script = [NSString stringWithFormat:@"require('%@').%@.apply(null, %@);", name, method, argsString];
    JSStringRef scriptJSStringRef = JSStringCreateWithCFString((__bridge CFStringRef)script);
    resultJSRef = JSEvaluateScript(contextJSRef, scriptJSStringRef, globalObjectJSRef, NULL, 1, &errorJSRef);
    JSStringRelease(scriptJSStringRef);

    if(argsString.length > 300) {
      CFTimeInterval after = CACurrentMediaTime();
      CFTimeInterval elapsed = after - before;

      lowInterval = MIN(lowInterval, elapsed);
      totalInterval += elapsed;
      highInterval = MAX(highInterval, elapsed);

      totalIntervalCount++;

      printf("%f, %f, %f, %f\n", lowInterval, totalInterval, highInterval, elapsed);
    }

vjeux · 2015-04-27T16:20:32Z

cc @bryceredd who has been looking at performance measurement lately

ide · 2015-04-27T19:24:11Z

As for the Objective-C API, it would presumably work just fine, but it'll be less performant due to how JSC internally maps objects and stuff.

Keeping the NSObject -> JSON -> JSValue marshalling, the Obj-C API would perform the same work as the C API plus Obj-C's message passing. I think it's a good tradeoff, given how much more easily the code reads.

robertjpayne · 2015-04-27T22:25:36Z

@ide hmm really need to get some proper profiling in place. I implemented the Objective-C API locally ( though managing errors is a bit unfortunate as it's a global catch all rather than a per-function catch ) and the performance is worse than JSEvaluateScript.

The internals of JSC do a lot of weak ref and value mapping for JSValue ontop of the raw C-api.

I'm going to try and get something setup that can properly profile communication over the bridge at different payload sizes.

tadeuzagallo · 2015-05-06T18:02:36Z

React/Executors/RCTContextExecutor.m

+              args[0] = JSValueMakeNull(contextJSRef);
+              args[1] = argsJSRef;
+
+              resultJSRef = JSObjectCallAsFunction(contextJSRef, (JSObjectRef)applyJSRef, (JSObjectRef)moduleJSRef, 2, args, &errorJSRef);


Here it should be methodJSRef instead of moduleJSRef, since the context of apply is the actual function, not the module.

tadeuzagallo · 2015-05-06T18:12:19Z

Hey, I'm merging it internally, I wrote some micro benchmarks, and it seems pretty faster, thanks! :D

I wrote a test case for that, the code won't be on the open source repo yet, but I create a gist (https://gist.github.com/tadeuzagallo/04ee3ac7017cc4e1bc8e) so you can have a look and share what you think.

I tested with a minimal amount of data, because it will have to be stringified and passed across the bridge anyway, so I don't think it should matter. Also, garbage collection make the tests a lot noisier.

The numbers are slightly volatile, but I took some time to try to make them more predictable, the tests still vary a little bit, but every test runs 400k times, and I ran them a lot of times, and the following results had a variation of less than 5%.

I just made 1 comment, but I already fixed it internally, and also had to update RCTJSONStringify to accept fragments to make the tests run.

Here are the results I got:

Previous: 20903.009869, New: 11474.709258 -> 0x1.6695ac8f62c14p+13
You made JS calls with 0 argument(s) 45.10% faster :) - Remember to update the tests with the new value: 0x1.6695ac8f62c14p+13
Previous: 22317.600661, New: 13314.305765 -> 0x1.a0127234e8a24p+13
You made JS calls with 1 argument(s) 40.34% faster :) - Remember to update the tests with the new value: 0x1.a0127234e8a24p+13
Previous: 22095.529905, New: 14783.073747 -> 0x1.cdf897088a593p+13
You made JS calls with 2 argument(s) 33.09% faster :) - Remember to update the tests with the new value: 0x1.cdf897088a593p+13
Previous: 22359.833559, New: 15148.703574 -> 0x1.d965a0eb9bf81p+13
You made JS calls with 3 argument(s) 32.25% faster :) - Remember to update the tests with the new value: 0x1.d965a0eb9bf81p+13

robertjpayne · 2015-05-06T22:36:38Z

@tadeuzagallo that looks awesome, so your tests indicating a 32-45% faster rate?

Thanks for the fix ups too, I'm not a JS dev so the module vs method context was a bit of naivety there.

tadeuzagallo · 2015-05-06T22:45:34Z

That's right, and practically it's closer to ~40%, since 99% of the calls are batched, and just have one argument (the array of calls to be executed).
The values of the result are in nanoseconds, so it'll be ~9 microseconds / call.

robertjpayne · 2015-05-06T22:47:38Z

That's still awesome though 👍 9 microseconds adds up quite quickly at 60fps! Thanks again for profiling this properly.

tadeuzagallo · 2015-05-06T23:02:06Z

Sure, it was a lot of fun! Hehe
Thanks for doing that, I didn't mean to diminish it at all, it's great! Sorry if it sounded like that... I was just mentioning, because I noticed I dropped a lot of context-less values.

brentvatne · 2015-05-06T23:59:12Z

Wow great job folks

Summary: This converts the existing JSEvaluateScript call for `require('<ModuleName>').<MethodName>.apply(null, <args>);` to native JSC C API methods which shaves off about 10-15% of invocation time on average, I used pretty primitive profiling methods to track the minimum, maximum and average invocation time so would appreciate any extra eyes on the performance. If the argument count is zero the method is invoked directly with no arguments, if the argument count is 1 it's invoked directly with just that argument. If there is more than 1 argument then apply is used and the arguments are passed as the second parameter. Ensured all existing tests pass and instruments leaks shows nothing is leaking. Closes facebook#1037 Github Author: Robert Payne <robertpayne@me.com> Test Plan: Imported from GitHub, without a `Test Plan:` line.

Move to compliant pool for Ubuntu

Use JSC C API for extra invocation speed

592db07

This converts the existing JSEvaluateScript call for `require('<ModuleName>').<MethodName>.apply(null, <args>);` to native JSC C API methods which shaves off about 10-15% of invocation time on average.

reactjs-bot added the GH Review: review-needed label Apr 27, 2015

robertjpayne changed the title ~~Use JSC C API for extra invocation speed~~ Use JSC C API for better invocation speed Apr 27, 2015

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 27, 2015

Altered some thisObject arguments

e170043

nicklockwood added the Internal diff created label May 1, 2015

tadeuzagallo reviewed May 6, 2015
View reviewed changes

tadeuzagallo removed the GH Review: review-needed label May 6, 2015

vjeux closed this May 15, 2015

brentvatne mentioned this pull request May 31, 2015

Why not use JSExport to implement the bridge? #573

Closed

snyk-bot mentioned this pull request Sep 3, 2021

[Snyk] Security upgrade sharp from 0.17.3 to 0.19.0 hixio-mh/react-native#17

Open

dannyvv added a commit to dannyvv/react-native that referenced this pull request Mar 4, 2022

Merge pull request facebook#1037 from dannyvv/pr/UbuntuPool

9540993

Move to compliant pool for Ubuntu

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use JSC C API for better invocation speed #1037

Use JSC C API for better invocation speed #1037

robertjpayne commented Apr 27, 2015

ide commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

ide commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

vjeux commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

vjeux commented Apr 27, 2015

ide commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

tadeuzagallo May 6, 2015

tadeuzagallo commented May 6, 2015

robertjpayne commented May 6, 2015

tadeuzagallo commented May 6, 2015

robertjpayne commented May 6, 2015

tadeuzagallo commented May 6, 2015

brentvatne commented May 6, 2015

Use JSC C API for better invocation speed #1037

Use JSC C API for better invocation speed #1037

Conversation

robertjpayne commented Apr 27, 2015

ide commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

ide commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

vjeux commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

vjeux commented Apr 27, 2015

ide commented Apr 27, 2015

robertjpayne commented Apr 27, 2015

tadeuzagallo May 6, 2015

Choose a reason for hiding this comment

tadeuzagallo commented May 6, 2015

robertjpayne commented May 6, 2015

tadeuzagallo commented May 6, 2015

robertjpayne commented May 6, 2015

tadeuzagallo commented May 6, 2015

brentvatne commented May 6, 2015