Use correct range to highlight line continuation error #12016

dhruvmanila · 2024-06-25T03:14:19Z

Summary

This PR fixes the range highlighted for the line continuation error.

Previously, it would highlight an incorrect range:

1 | call(a, b, \\\
  |           ^^ Syntax Error: unexpected character after line continuation character
2 | 
3 | def bar():
  |

And now:

  |
1 | call(a, b, \\\
  |             ^ Syntax Error: unexpected character after line continuation character
2 | 
3 | def bar():
  |

This is implemented by avoiding to update the token range for the Unknown token which is emitted when there's a lexical error. Instead, the push_error helper method will be responsible to update the range to the error location.

This actually becomes a requirement which can be seen in follow-up PRs.

Test Plan

Update and validate the snapshot.

github-actions · 2024-06-25T03:33:50Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

MichaReiser · 2024-06-25T06:21:57Z

crates/ruff_python_parser/src/lexer.rs

+        // For `Unknown` token, the `push_error` method updates the current range.
+        if !matches!(self.current_kind, TokenKind::Unknown) {
+            self.current_range = self.token_range();
+        }


It's a bit unfortunate that we now have this branch in the very hot next_token function for the very rare case of an unknown token. But I don't see a better way of solving this that doesn't require a lot of repetition on the token range.

Yeah, I agree. I thought of doing something like what you've suggested (computing the token range at the place where it's emitted) but wanted to see if this actually impacts the performance. It doesn't seem to be which is why I'm fine moving ahead with this.

## Summary This PR updates the unterminated string error range to not include the final newline character. This is a follow-up to #12016 and required for #12019 This is not done for when the unterminated string goes till the end of file (not a newline character). The unterminated f-string range is correct. ### Why is this required for #12019 ? Because otherwise the token ranges will overlap. For example: ```py f"{" f"{foo!r" ``` Here, the re-lexing logic recovers from an unterminated f-string and thus emitting a `Newline` token for the one at the end of the first line. But, currently the `Unknown` and the `Newline` token would overlap because the `Unknown` token (unterminated string literal) range would include the newline character. ## Test Plan Update and validate the snapshot.

## Summary This PR updates the parser test infrastructure to validate the token ranges. From the code documentation: ``` /// Verifies that: /// * the ranges are strictly increasing when loop the tokens in insertion order /// * all ranges are within the length of the source code ``` Follow-up from #12016 and #12017 resolves: #11938 ## Test Plan Make sure that there are no failures.

This PR reverts #12016 with a small change where the error location points to the continuation character only. Earlier, it would also highlight the whitespace that came before it. The motivation for this change is to avoid panic in #11950. For example: ```py \) ``` Playground: https://play.ruff.rs/87711071-1b54-45a3-b45a-81a336a1ea61 The range of `Unknown` token and `Rpar` is the same. Once #11950 is enabled, the indexer would panic. It won't panic in the stable version because we stop at the first `Unknown` token.

Use correct range to highlight line continuation error

0d2726e

dhruvmanila added the parser Related to the parser label Jun 25, 2024

dhruvmanila requested a review from MichaReiser as a code owner June 25, 2024 03:14

This was referenced Jun 25, 2024

Do not include newline for unterminated string range #12017

Merged

Update parser tests to validate token ranges #12019

Merged

MichaReiser approved these changes Jun 25, 2024

View reviewed changes

dhruvmanila merged commit 9c1b6ec into main Jun 25, 2024
20 checks passed

dhruvmanila deleted the dhruv/unknown-range branch June 25, 2024 08:05

dhruvmanila mentioned this pull request Jun 28, 2024

Revert "Use correct range to highlight line continuation error" #12089

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use correct range to highlight line continuation error #12016

Use correct range to highlight line continuation error #12016

dhruvmanila commented Jun 25, 2024

github-actions bot commented Jun 25, 2024

MichaReiser Jun 25, 2024

dhruvmanila Jun 25, 2024

Use correct range to highlight line continuation error #12016

Use correct range to highlight line continuation error #12016

Conversation

dhruvmanila commented Jun 25, 2024

Summary

Test Plan

github-actions bot commented Jun 25, 2024

ruff-ecosystem results

Linter (stable)

Linter (preview)

Formatter (stable)

Formatter (preview)

MichaReiser Jun 25, 2024

Choose a reason for hiding this comment

dhruvmanila Jun 25, 2024

Choose a reason for hiding this comment

`ruff-ecosystem` results