The `$` prefix is ambiguous #347

gamesaucer · 2023-02-22T18:50:04Z

Steps to reproduce: Parse the following grammar:

$Start = $Part
$Part = .

Expected result: The validity of this Peggy grammar is decided by whether rule names can start with $.
Actual result: It is not a valid Peggy grammar because rule Part is not defined.

Because Peggy allows the dollar sign $ to start rule names AND has a $ prefix operator, one of the following must be true:

The first $ is always parsed as an operator. In this case, a rule cannot reliably be matched simply by writing its name. There is also no way to match $Part without returning the text rather than the match. This is the current implementation.
The first $ is always parsed as part of the identifier. In this case, to return the matched text of a nonterminal, a space is required before the nonterminal to disambiguate between $ Part (return text of rule Part) and $Part (return match of rule $Part), or the expression needs to be wrapped in parentheses. This makes it inconsistent with the operator's behaviour before terminals.
The $ is parsed as part of the identifier if that identifier exists. In this case, adding a rule called Part would break existing references to a rule called $Part or vice versa. It would also it becomes ambiguous what a $ does before a nonterminal and require an extra parsing pass to determine how it is interpreted.

All those options seem undesirable to me. Instead, I'd advocate for either disallowing $ at the start of rule names, or changing the $ operator to something that cannot appear at the start of an identifier. Unfortunately, any alteration to this behaviour would be a breaking change. So if it's decided this won't be fixed, I understand why that's the case.

The text was updated successfully, but these errors were encountered:

hildjj · 2023-02-22T18:54:13Z

This is a valid issue, and I'm surprised it hasn't come up before. I don't have a better solution that disallowing $ as a start character in identifiers.

@Mingun any insights?

Mingun · 2023-02-22T19:10:44Z

Yes, I think we should forbid that symbol in identifiers (only at start or everywhere if something$something could be treated ambiguously). It's amazing how it hasn't been noticed before

hildjj · 2023-02-24T19:48:08Z

I think $ is fine in the middle of identifier.

top = fb

fb = $foo$bar

foo$bar = foo $bar

$foo$bar = 'baz'

foo = 'foo'

bar = 'b' 'a'+ 'r'

this matches "foobaaar", but not "baz".

* main: (21 commits) Update CHANGELOG.md Update version number & rebuild Update dependencies Update test/unit/compiler/passes/report-infinite-repetition.spec.js Fixes peggyjs#357. Do not allow infinite recursion in repetition delimiter. Update changelog Allow extra semicolons between rules. Fix an error in the code generator for "repeated" node Update changelog Fixes peggyjs#329 Update changelog Fixes peggyjs#359. Clarifies documentation about reserved words. Fix more HTML indentation. Test that the generated parser also works without errors Remove use of expect.to.not.throw() Add Rene Saarsoo to AUTHORS Typo in test description Add test to ensure special non-reserved keywords are allowed Comment out unnecessary reserved words Fixes peggyjs#347. Makes $ invalid as an identifier start character. ...

hildjj added a commit to hildjj/peggy that referenced this issue Feb 24, 2023

Fixes peggyjs#347. Makes $ invalid as an identifier start character.

89d77cb

hildjj mentioned this issue Feb 24, 2023

Makes $ invalid as an identifier start character #356

Merged

hildjj closed this as completed in #356 Feb 25, 2023

hildjj added a commit to hildjj/peggy that referenced this issue Feb 25, 2023

Fixes peggyjs#347. Makes $ invalid as an identifier start character.

457ea8e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The `$` prefix is ambiguous #347

The `$` prefix is ambiguous #347

gamesaucer commented Feb 22, 2023

hildjj commented Feb 22, 2023

Mingun commented Feb 22, 2023

hildjj commented Feb 24, 2023

The $ prefix is ambiguous #347

The $ prefix is ambiguous #347

Comments

gamesaucer commented Feb 22, 2023

hildjj commented Feb 22, 2023

Mingun commented Feb 22, 2023

hildjj commented Feb 24, 2023

The `$` prefix is ambiguous #347

The `$` prefix is ambiguous #347