lexer+parser: many improvements and cleanups #985

feds01 · 2023-09-18T20:00:25Z

token: remove DelimiterVariant
lexer: flatten token stream
parser: remove AstGenFrame::token_at()
parser: cleanup accesses to offset in AstGenFrame
token: introduce -> and => tokens to simplify parsing & error reporting
token: introduce :: tokens to simplify parsing access expr/ty/pats
parser: cleanup begins_pat() implementation
parser: avoid using peek_nth() in binary expression parsing
token: introduce .., ..<, ... tokens to simplify spread/range parsing
parser: fix typo in diagnostics
parser: several cleanups, and stricter use of token stream API
parser: Integrated new lexer into the parser
parser: directly use TokenCursor API
parser: avoid using confusing next_pos() function
analysis: use indexmap in pattern bind analysis to enforce stable error order
parser: name errors consistently, and remove a bunch of old un-used variants
parser: remove use of confusing next_pos() and replace with eof_pos() or expected_pos()
source: Change ByteRange to be inclusive on both ends
lexer: cleanup + greatly improve lexer errors

…orting

…arsing

- This change moves away from the parser bits "directly" accessing `TokenKind::Tree(..)`s in preparation for using the new lexing system. - Furthermore, the parser now uses either `skip_token()` which in the future will be a "safe" variant of setting the cursor, or `skip_fast()` which should be used to skip atomic tokens (like `;` or `,`). - Remove access to `backtrack()` completely. - Use `peek_kind()` where possible to simplify token matching. - Prepare for removing `peek_nth()` - Prepare for switching over to a new `TokenCursor` API.

This commit switches over the sources of the `v2` lexer over the original lexer source and hooks it up with the rest of the parsing pipeline. This commit uses the new `v2` experimental lexer for the parser which now accepts the "flat" version of the token stream. Since much work was done before this commit to abstract away dealing with the token trees. The migration was reasonably simple (with some minor span calculation adjustments).

…or order

…ariants

…s()` or `expected_pos()`

compiler/hash-parser/src/parser/pat.rs

Clicked the wrong button

feds01 added 19 commits September 18, 2023 13:45

token: remove DelimiterVariant

1c39a41

lexer: flatten token stream

601a1a6

parser: remove AstGenFrame::token_at()

559b4be

parser: cleanup accesses to offset in AstGenFrame

2ec86ca

token: introduce -> and => tokens to simplify parsing & error rep…

dab3d11

…orting

token: introduce :: tokens to simplify parsing access expr/ty/pats

eed9ef2

parser: cleanup begins_pat() implementation

8b46510

parser: avoid using peek_nth() in binary expression parsing

8c314d8

token: introduce .., ..<, ... tokens to simplify spread/range p…

e905519

…arsing

parser: fix typo in diagnostics

9a6df8b

parser: directly use TokenCursor API

fe21fa5

parser: avoid using confusing next_pos() function

6ab48d2

analysis: use indexmap in pattern bind analysis to enforce stable err…

61d6030

…or order

parser: name errors consistently, and remove a bunch of old un-used v…

b3562cb

…ariants

parser: remove use of confusing next_pos() and replace with `eof_po…

0e2d585

…s()` or `expected_pos()`

source: Change ByteRange to be inclusive on both ends

6c9e599

lexer: cleanup + greatly improve lexer errors

5c6dfb6

feds01 self-assigned this Sep 18, 2023

feds01 requested a review from kontheocharis September 18, 2023 20:00

feds01 added parser Issues related with parsing sub-system. interface Issues that are regarding the compiler ui, specifically how the user interacts with the compiler labels Sep 18, 2023

kontheocharis previously approved these changes Sep 19, 2023

View reviewed changes

compiler/hash-parser/src/parser/pat.rs Outdated Show resolved Hide resolved

compiler/hash-parser/src/parser/pat.rs Outdated Show resolved Hide resolved

kontheocharis self-requested a review September 19, 2023 14:33

parser: debug assert in skip_token() the desired token kind

0a1f509

feds01 force-pushed the lexer-experiment branch from c95e77f to 0a1f509 Compare September 19, 2023 15:29

kontheocharis approved these changes Sep 19, 2023

View reviewed changes

feds01 merged commit 54432fe into main Sep 19, 2023
1 check passed

feds01 deleted the lexer-experiment branch September 19, 2023 15:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lexer+parser: many improvements and cleanups #985

lexer+parser: many improvements and cleanups #985

feds01 commented Sep 18, 2023

lexer+parser: many improvements and cleanups #985

lexer+parser: many improvements and cleanups #985

Conversation

feds01 commented Sep 18, 2023