
Perf issues after m9 deployment #232

Closed
rafrombrc opened this issue Aug 29, 2017 · 15 comments

@rafrombrc
Member

Folks are reporting major lag and STMO performance issues that started conspicuously after the last deployment went out; the reports have come in via Bugzilla (https://bugzilla.mozilla.org/show_bug.cgi?id=1394468) as well as multiple times in various IRC and Slack channels. This is impacting people's work, so hopefully we can figure this out and get a 9.1 (or whatever the next minor rev is) out ASAP.

@alison985 added this to the 10 milestone Aug 29, 2017
@alison985 added the bug label Aug 29, 2017
@alison985 self-assigned this Aug 29, 2017
@madalincm

The issue is most visible on the new query page and reproduces for all of the data sources in production. On stage the issue is only visible for the Athena data source. Here is a link to a profile for a query written against the Athena data source: http://bit.ly/2vrU6Ag

I have also observed another issue that might be related: when writing a query, the first letter the user types is duplicated. This only reproduces in production. Please view this screencast:
[screencast: duplicatefirstletter]

@alison985

I can reproduce the slowness in prod on OS X in Nightly and 55.0.3 (64-bit). Nightly is giving a JavaScript error, though it only points at a minified variable name. I'm going to create/update a local test environment on my Nightly machine.

@alison985

So... in Nightly with Postgres, autocomplete on current Mozilla master works fine and shows the drop-down both on my local environment and in production.

Production is missing the dropdown and has a speed issue on Athena and Presto.

@alison985

Just merged PR #233 to staging. I don't think it will fix the issue, but it will be helpful for diagnostic purposes, since Athena and Presto behave the same on production and Athena is available on both staging and production.

@wlach

wlach commented Aug 30, 2017

I did a quick profile of this yesterday, and it seems like the client-side lag is coming from some long-running JavaScript in Angular's digest cycle: https://www.sitepoint.com/understanding-angulars-apply-digest/ (a rough sketch of that failure mode follows the list below).

Several possibilities here come to mind:

  1. We're processing more data in the digest cycle than we were before due to a server-side change
  2. The digest cycle is being triggered more often than before due to a client-side change
  3. We're doing more in the digest cycle due to a client-side change
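
For illustration only, here is a minimal sketch (not Redash's actual code) of how heavy synchronous work inside Angular's digest produces this kind of lag: anything registered through $watch re-runs on every digest, so an expensive recomputation there blocks the UI on every keystroke.

// Hypothetical AngularJS snippet, for illustration only -- not the Redash code.
// The watcher body re-runs on every digest; if it walks thousands of schema
// tokens synchronously, each keystroke pays that cost before the page repaints.
angular.module('demo', []).controller('EditorCtrl', ['$scope', function ($scope) {
  $scope.$watch('queryText', function (newText) {
    // stand-in for a large synchronous map/filter over every table and column
    $scope.suggestions = ($scope.schema || []).map(function (table) {
      return table.name;
    });
  });
}]);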

@alison985

So the Redash JavaScript currently has a 5000-token limit that turns the autocompleter on and off, with a code comment noting that the limit exists for performance reasons. Also, this isn't reproducible in Nightly on staging or prod for the metadata data source.

I lowered the Redash JavaScript token-count check to 3000 as part of PR #234 and will see if that helps on staging (a rough sketch of that kind of guard follows the list below). Given that the problem doesn't happen in Chrome (and in at least one older version of Firefox), I currently think this was happening because of a combination of:

  1. the list of fields/tables in Presto and Athena got longer, and
  2. the latest Firefox/Firefox Nightly is doing something different with JavaScript interpretation that makes it take longer.
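
To make the mechanism concrete, a guard like the one described above could look roughly like this; the names, the counting rule, and the option wiring are assumptions for illustration, not the actual PR #234 diff.

// Rough sketch of a token-count guard (illustrative names, not the Redash code).
// Above the limit, live autocompletion is switched off so the editor never pays
// the per-keystroke cost of rebuilding suggestions from a huge schema.
const AUTOCOMPLETE_TOKEN_LIMIT = 3000; // lowered from 5000

function shouldEnableAutocomplete(schema) {
  // count one token per table plus one per column
  const tokenCount = (schema || []).reduce(
    (sum, table) => sum + 1 + (table.columns ? table.columns.length : 0),
    0
  );
  return tokenCount <= AUTOCOMPLETE_TOKEN_LIMIT;
}

// ace exposes a live-autocompletion option that can be toggled per editor
editor.setOptions({
  enableLiveAutocompletion: shouldEnableAutocomplete($scope.schema),
});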

@wlach

wlach commented Aug 31, 2017

This problem is totally reproducible in Chrome, so I think we can rule out (2). The problem is less grave there, but it's still definitely present.

Losing autocompletion kind of sucks. :( Looking at a Firefox performance profile, it seems like much of the time is being spent in this function:

const schemaCompleter = {

(I'm not sure why, but Chrome's profiler is refusing to give me more detail on this machine...)

If you have time, it might be worth investigating where the time is being spent using an unminified version of the site sources to see if there's anything we could improve. In particular, I'm wondering if we're unnecessarily recomputing stuff every time the autocomplete callback is executed -- for instance, why are we recreating $scope.autoCompleteSchema every time the function is called?

$scope.autoCompleteSchema = removeExtraSchemaInfo($scope.schema);

Could we not get away with only calling that function if it does not already exist? In addition to likely being expensive in itself, recreating it also requires us to rebuild the keywords property later in the function:

$scope.autoCompleteSchema.keywords = map(keywords, (v, k) =>

@alison985

Hi @wlach. Thanks for continuing to investigate. This is what I ended up doing: https://github.com/mozilla/redash/compare/master@%7B1day%7D...master It seems to be working nicely on staging in Nightly with the Athena data source, but it does eliminate autocomplete.

FWIW:
A) The getCompletions function is inside a const variable declaration (schemaCompleter), so it should only have gotten called once anyway. I had also tried moving it elsewhere within the file, but that's the only place where $scope.schema wasn't undefined.
B) Autocomplete was already lost above the 5000-token mark for some data sources, but the function was still being called in the background, adding lag without any benefit. Now there is no autocomplete, but there is also no lag.
C) The next time I do a push to staging I'll try raising it to the 5000 mark again to see if autocomplete can come back when the schema browser toggle is turned on (to eliminate the _v tables).

@wlach

wlach commented Aug 31, 2017

@alison985 have you verified that getCompletions is only being called once? It looked like it was being called multiple times from what I could tell in my profile.

@alison985

@wlach I tested again and it is getting called multiple times. Firefox organizes the dev console differently than Chrome (it aggregates repeated prints), so I missed it the first time around.

However, if you print langTools before the langTools.addCompleter call, you can see that it only stores the function itself, not the computed values. I've tried changing that to no avail. I think this is the way the autocomplete framework was built to work, and it has also always been true.

The thing that changed within Redash related to autocomplete was adding logic to strip the [P] (indicating a partition key) and (column_type) characters out of the autocomplete list.

The thing that would have changed on the data source side is the number of tokens generated by the table and field count in a data source. I did find a line of code that was duplicating some tokens, but removing it still hasn't put us beneath the 5000-token mark for Athena.
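
For context on the langTools observation above: ace's language_tools extension keeps the completer object itself and calls its getCompletions callback on every completion request, so printing langTools only ever shows the registered function, never any precomputed suggestion values. Here is a minimal sketch of that registration, with illustrative helper names rather than the Redash source:

const langTools = ace.require('ace/ext/language_tools');

const schemaCompleter = {
  // ace calls this each time it needs suggestions; addCompleter stores the
  // completer object itself, so any caching has to live inside this callback.
  getCompletions(editor, session, pos, prefix, callback) {
    const suggestions = buildKeywordList(); // hypothetical helper
    callback(null, suggestions);
  },
};

langTools.addCompleter(schemaCompleter);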

@wlach

wlach commented Aug 31, 2017

@alison985 So my suggestion was that maybe we could cache some or all of what getCompletions returns, so subsequent calls to it don't have to regenerate the set of search suggestions from the schema. :) This might be as easy as putting something like this logic into the function:

getCompletions(state, session, pos, prefix, callback) {
  if ($scope.autoCompleteSchema === undefined) {
    // make a variable for the auto completion in the query editor
    $scope.autoCompleteSchema = removeExtraSchemaInfo($scope.schema);
  }
  ...

Could you give that a try and see if it helps? It looks like whoever wrote this originally was trying to cache the result in $scope, but the extra call to removeExtraSchemaInfo totally defeated that effort.
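
Spelled out a little more fully, the cached version might look roughly like the following; the completion-object shape and the source of keywords are assumptions pieced together from the snippets quoted earlier, not the actual Redash code.

const schemaCompleter = {
  getCompletions(state, session, pos, prefix, callback) {
    if ($scope.autoCompleteSchema === undefined) {
      // build the pruned schema once and cache it on the scope
      $scope.autoCompleteSchema = removeExtraSchemaInfo($scope.schema);
      // rebuild the keyword list only when the schema itself was rebuilt
      $scope.autoCompleteSchema.keywords = map(keywords, (v, k) => ({
        name: k,
        value: k,
        score: 0,
        meta: v,
      }));
    }
    // later calls reuse the cached keyword list
    callback(null, $scope.autoCompleteSchema.keywords);
  },
};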

@alison985

I had actually already tried that if statement, along with many others. Regardless of what I do, the framework stores the function call, not the results.

Have you seen the latest code? https://github.com/mozilla/redash/blob/master/client/app/pages/queries/query-editor.js#L37

@wlach

wlach commented Aug 31, 2017

Yeah, I think it's ok that the editor calls the function repeatedly -- that's probably just what it's designed to do. Your latest code looks like it should be fast, though -- is the function still showing up in profiles?

@alison985

@wlach I have absolutely no idea how to check a profile. It's on staging if you want to take a look.
