
Smarter use of eth_getLogs #1260

Merged: 4 commits merged into master on Oct 1, 2019
Conversation

leoyvens (Collaborator):

Currently we merge all log filters for a subgraph into a single large filter. This can be bad for two reasons:

  • It returns false positives, which may be slowing down the calls.
  • The filter can become too broad with many data sources, timing out on Alchemy or returning too many logs on Infura, requiring the range to be further split up.

This PR splits the eth_getLogs calls so that we don't request contract-event combinations we don't actually need, and even splits them up a bit more than strictly necessary to allow for more parallel requests.

I tested with the NuoNetwork subgraph for speed and correctness, and we no longer get timeouts on it, probably because we're splitting up the call for the thousands of dynamic data sources per event, which results in five parallel requests. I also tested that wildcard event signatures still work.

The ERC20 subgraph didn't spend too much time getting logs, though I didn't wait for it to get very far and I'm not sure what the situation was before. At least on a cold DB the slowest thing was downloading receipts, as usual.

I expect we'll be tweaking this based on the behaviour of specific subgraphs on specific Ethereum node providers, but for now I'll consider that this fixes #1130.

As it is, this goes from a single call to way too many calls
because `eth_log_filters` is naive, but it already avoids false positives.
Do a clever graph algorithm to make neither too few
nor too many eth_getLogs calls.
This did not cause any bugs; it's just to make the logs look nicer.
leoyvens requested a review from a team on September 30, 2019 18:22
@Jannis (Contributor) commented Oct 1, 2019:

I was concerned the reduced parallel block ranges might make scanning logs slower for subgraphs with e.g. just one data source, so I benchmarked this PR with both Gravity (1 data source, 2 events) and Melon (9 data sources, 16 events if I've counted correctly). Here are the results:

Gravity

Scanning past logs for Gravity against Alchemy, before and after:

Before

Oct 01 13:45:19.369 INFO Start subgraph, data_sources: 1, subgraph_id: Qmdp5oSxE28VNF17LsoojjdYKMMDKDvfLxQLzXacW3kCSC, component: SubgraphInstanceManager
Oct 01 13:48:00.589 DEBG Scanning blocks [6000001, 6010000], subgraph_id: Qmdp5oSxE28VNF17LsoojjdYKMMDKDvfLxQLzXacW3kCSC, component: SubgraphInstanceManager > BlockStream

Time: 02:41 mins

After

Oct 01 13:57:25.438 INFO Start subgraph, data_sources: 1, subgraph_id: Qmdp5oSxE28VNF17LsoojjdYKMMDKDvfLxQLzXacW3kCSC, component: SubgraphInstanceManager
Oct 01 14:00:32.118 DEBG Scanning blocks [6000001, 6010000], subgraph_id: Qmdp5oSxE28VNF17LsoojjdYKMMDKDvfLxQLzXacW3kCSC, component: SubgraphInstanceManager > BlockStream

Time: 02:53 mins

Melon

Scanning past logs for Melon against Alchemy, before and after:

Before

Oct 01 14:23:01.647 INFO Start subgraph, data_sources: 9, subgraph_id: QmbfdXRXpeQL3vafKVLYDjev7avzDoUGp1BKQBfgJhpRo9, component: SubgraphInstanceManager
Oct 01 14:26:37.194 DEBG Scanning blocks [6000001, 6010000], subgraph_id: QmbfdXRXpeQL3vafKVLYDjev7avzDoUGp1BKQBfgJhpRo9, component: SubgraphInstanceManager > BlockStream

Time: 3:36 mins

After

Oct 01 14:09:53.718 INFO Start subgraph, data_sources: 9, subgraph_id: QmbfdXRXpeQL3vafKVLYDjev7avzDoUGp1BKQBfgJhpRo9, component: SubgraphInstanceManager
Oct 01 14:13:03.868 DEBG Scanning blocks [6000001, 6010000], subgraph_id: QmbfdXRXpeQL3vafKVLYDjev7avzDoUGp1BKQBfgJhpRo9, component: SubgraphInstanceManager > BlockStream

Time: 3:10 mins

Conclusion

This is just a small sample, but it suggests that performance has degraded slightly for subgraphs with just one data source and few events, and improved slightly for subgraphs with many data sources and events.

Overall the differences are small enough not to be a concern, especially since subgraphs with just one data source are rare, so we're likely to see an improvement for most subgraphs.

@Jannis (Contributor) left a review:

Love it! I've got a few small comments but this is close to ready.

if self.contracts.len() == 1 {
    write!(
        f,
        "contract {}, {} events",
@Jannis (Contributor):

I noticed in the logs that the contract address is ellipsized in the middle (e.g. contract 0x1bfd…59ba). Could we make logs contain the complete address so it's easier to copy and paste it?

@leoyvens (Collaborator, Author):

Ah yeah, that's annoying.

} else if self.event_sigs.len() == 1 {
    write!(
        f,
        "event {}, {} contracts",
@Jannis (Contributor):

The same goes for the event signature here; it's also shortened.

use tiny_keccak::keccak256;
use web3::types::*;

use super::types::*;
use crate::prelude::*;

pub type EventSig = H256;
@Jannis (Contributor):

Any reason to shorten this name to EventSig? How about EventSignature? 😉

#[derive(Clone)]
pub struct EthGetLogsFilter {
    pub contracts: Vec<Address>,
    pub event_sigs: Vec<EventSig>,
@Jannis (Contributor):

Same here – I'd prefer event_signatures.

contracts_and_events_graph: GraphMap<LogFilterNode, (), petgraph::Undirected>,

// Event sigs with no associated address, matching on all addresses.
wildcard_events: HashSet<EventSig>,
@Jannis (Contributor):

I wonder if these could be represented as another LogFilterNode variant to avoid the special casing. It might even make the filter splitting algorithm more efficient?

@leoyvens (Collaborator, Author):

I preferred to keep this separate because imo it makes the code more obvious and readable.

I considered putting it into the graph, but we'd also need a special contract vertex representing all contracts. There might be an optimization there if a wildcard event also appears tied to a contract, but the ideal behaviour is unclear. Wildcards are a niche feature anyhow; people usually want to control which contract is emitting an event so that random contracts can't put data into their subgraph.

@Jannis (Contributor):

That's true. It was just an idea.

self.contracts_and_events_graph
    .all_edges()
    .any(|(s, t, ())| {
        let contract = LogFilterNode::Contract(log.address.clone());
@Jannis (Contributor):

I'd create contract and event upfront to avoid creating them over and over again.

@leoyvens (Collaborator, Author):

Good catch.

@leoyvens (Collaborator, Author) commented Oct 1, 2019:

@Jannis Great that you did some benchmarking! It does seem like Melon might have benefited from the parallelization; I don't know about Gravity, since nothing should have changed there. Previously we'd only parallelize if we hit an Alchemy timeout or the Infura log cap; mostly it was just single requests.

leoyvens merged commit 32af3cb into master on Oct 1, 2019
leoyvens deleted the leo/dont-merge-eth-get-logs branch on October 1, 2019 16:37
Successfully merging this pull request may close these issues.

Optimize historical event scanning