-
Notifications
You must be signed in to change notification settings - Fork 987
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Address certain corner cases of time-travel queries by block hash #1405
Comments
I wonder if all this could be simplified to the following:
The thought (and hope) behind this is that the first thing that happens when we know that we have to revert blocks is that the old block is marked as an ommer in the block explorer; as that would happen before the subgraph gets reorged, it would also be quicker in detecting queries against invalid blocks, and it would avoid the complication of storing per subgraph block hashes. |
There's one other complication, somewhat unrelated, that the current implementation does not address: it is possible to query at a block that is on the main chain, but that the subgraph hasn't caught up to yet. We need to ensure that queries at a block only happen for blocks before the subgraph's current head. |
Here's a simpler way to solve these two problems. For each subgraph, we will track three pieces of data:
We then run queries doing the following:
It's probably not needed, but |
If I understand correctly thegraph transform blockHash query into blockNumber query when the blockHash in question is not part of the chain that the graph-node track If that is the case, I don't think this is the right approach. as a client I need to query consistent data for a particular block. and so if I use a query for |
That's not quite what can go wrong here; the situation described in this ticket can only arise if we processed a reorg for a certain block; with that, we would translate both the hash for the reorged block and the one for the block that replaced that block (at the same height) into the same block number, and would therefore produce the same query result.
With the exception of what I described above, that's what happens, and using a hash that is not the hash of a valid block will cause an error. And we do plan on addressing that corner-case at some point, it's just not entirely straightforward without affecting query performance. |
Thanks for the reply @lutter You got me confused with the main thing important to me is whether a query for a particular blockHash produce the same output in all circumstances (error are fine) ? Is it the case or not ? in my opinion there should be zero exception for it. |
Time-travel queries introduced in PR #1397 might still return incorrect results in certain circumstances. A query like
query { things(block: { hash: $hash }) { ... } }
might return data from different blocks if eitherUsers can make sure they do not encounter this problem by only querying per block hash for blocks that are final, and by ensuring that the block hash is on the main chain.
Problem H1 can happen if we downloaded a block with
$hash
that was later removed from the subgraph's chain because of a reorganization, and is possible for any block, no matter whether it can be considered final or not. The query should return an error, as the block is not on the main chain, but might return successfully. The data returned will be from the block on the main chain that has the same number as block$hash
.Problem H2 can only happen for blocks that can not be considered final yet; for
graph_node
, that means that they are withinREORG_THRESHOLD
of the current chain head, but in practice only affects blocks within two or three blocks of the chain head. The query should return an error, as the block$hash
is no longer on the main chain when the query finishes, but will return successfully with data that might come from either the block with$hash
or one of the siblings with the same number thatgraph-node
downloaded while it was processing the query.Solving H1
Solving this issue requires that we have the infrastructure in place to answer the question "Is the block
$hash
on the chain starting at the subgraph's current head?"Phase 1 of the block explorer will make it possible to distinguish between blocks on the main chain and ommers. If block
$hash
is at leastREORG_THRESHOLD
away from the chain head, we reject the query if the block is an ommer. For blocks$hash
withinREORG_THRESHOLD
of the chain head, we will keep a list of(block_hash, block_number)
for each subgraph, tying the subgraph's current head to the finalized chain, and reject the query if$hash
is not in that list.Solving H2
For this, we need to detect that the subgraph's view of the chain was reorganized while the query was running; this is only necessary for blocks that are not final yet, i.e., that are within
REORG_THRESHOLD
of the chain head. To support this check, we will add ageneration
attribute to eachSubgraphDeployment
that is incremented every time the subgraph's chain is reorganized. We retrieve thegeneration
attribute before we start executing the query, and check that it has not changed after we have finished executing the query. If it did change, we report an error to the user.Time-travel queries by block number
Problem H1 does not affect these queries, but problem H2 could. It might be worth putting these checks in place when doing a time-travel query by block number, i.e. get the hash of the block with the given number before executing the query, and then executing as if the user had given us that hash. That ensures that the result of such a query comes from a consistent state of the chain.
The text was updated successfully, but these errors were encountered: