fix #34105, unescape triple-quoted strings after dedenting #35001

JeffBezanson · 2020-03-04T19:54:45Z

It seems clear to me that triple-quoted strings should only specially handle whitespace that directly occurs in the source text, not including escape sequences. This also makes it easier to fix #34967, since if unescaping is done first then you get a UTF-8 error when we try to strip whitespace if you write invalid UTF-8 strings using escape sequences (which is allowed of course).
But, this could be breaking since it obviously changes the values of some string literals.

fix #34105

Keno · 2020-03-04T20:10:33Z

Yes, I'm in favor. I came across this when implementing the unindenting in one of the pure Julia parsers (can't remember which one) and thought it was an odd behavior. I concur with your assessment that this needs to happen in the opposite order.

StefanKarpinski · 2020-03-04T20:12:44Z

While this is a behavior change, it's hard to argue that this wasn't a bug. 💯

davidanthoff · 2020-03-11T20:24:15Z

@ZacLN probably something we need to replicate in CSTParser as well?

GunnarFarneback · 2020-03-29T10:56:40Z

This PR changed the parsing of

"""\n
"""

from \n to \n\n.

Is this unavoidable collateral damage or something that should be fixed?

(Real code example that gets an extra newline: https://github.com/JuliaRegistries/RegistryTools.jl/blob/cbebbed17e13e7a18556a7dd62346d8610e867ea/src/types.jl#L126)

StefanKarpinski · 2020-03-29T17:05:03Z

It's unclear to me why that had the \n after the opening """ in the first place.

GunnarFarneback · 2020-03-29T21:43:56Z

It's unclear to me too, but one reason might be that

julia> """
       \na
       """
"       \na\n       "

before this PR. That's another thing that has changed, probably for the better.

…uliaLang#35001)

JeffBezanson added parser Language parsing and surface syntax triage This should be discussed on a triage call minor change Marginal behavior change acceptable for a minor release labels Mar 4, 2020

JeffBezanson force-pushed the jb/triplequoteescapes branch from f854f9f to a723195 Compare March 10, 2020 20:22

JeffBezanson removed the triage This should be discussed on a triage call label Mar 10, 2020

fix #34105, unescape triple-quoted strings after dedenting

a723195

JeffBezanson merged commit d5d71d7 into master Mar 11, 2020

JeffBezanson deleted the jb/triplequoteescapes branch March 11, 2020 18:47

vtjnash mentioned this pull request Mar 11, 2020

triple quotes lose escaped whitespace #21542

Closed

ravibitsgoa pushed a commit to ravibitsgoa/julia that referenced this pull request Apr 9, 2020

fix JuliaLang#34105, unescape triple-quoted strings after dedenting (J…

4fc6f77

…uliaLang#35001)

KristofferC pushed a commit that referenced this pull request Apr 11, 2020

fix #34105, unescape triple-quoted strings after dedenting (#35001)

18f1248

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix #34105, unescape triple-quoted strings after dedenting #35001

fix #34105, unescape triple-quoted strings after dedenting #35001

JeffBezanson commented Mar 4, 2020

Keno commented Mar 4, 2020

StefanKarpinski commented Mar 4, 2020

davidanthoff commented Mar 11, 2020

GunnarFarneback commented Mar 29, 2020

StefanKarpinski commented Mar 29, 2020

GunnarFarneback commented Mar 29, 2020 •

edited

Loading

fix #34105, unescape triple-quoted strings after dedenting #35001

fix #34105, unescape triple-quoted strings after dedenting #35001

Conversation

JeffBezanson commented Mar 4, 2020

Keno commented Mar 4, 2020

StefanKarpinski commented Mar 4, 2020

davidanthoff commented Mar 11, 2020

GunnarFarneback commented Mar 29, 2020

StefanKarpinski commented Mar 29, 2020

GunnarFarneback commented Mar 29, 2020 • edited Loading

GunnarFarneback commented Mar 29, 2020 •

edited

Loading