Unicode modifiers for adjoint operator #34507

simeonschaub · 2020-01-24T21:46:53Z

The original motivation for this is having a nicer syntax for transpose and conj, where the most appropriate I could come up with was M'ᵀ and z'ᶜ. Currently this would only be possible by special casing transpose and conj at a parser level, but if #33683 was merged, one could extend the concept of Unicode modifiers for infix operators to ' very nicely. This would also be useful for packages like Zygote, which like to pun on ' as notation for taking the derivative, which could then export e.g. 'ᴰ instead.
A problem is that currently, a'ᵀb is valid syntax for adjoint(a) * (ᵀb), which is quite unfortunate, since this is different from other infix operators like +, where the modifier gets parsed as part of the operator, even if there is no whitespace in between. I therefore believe that parsing these as part of the operator will make for more consistency, but as this is technically breaking, it might be necessary to deprecate this syntax for one minor release first. Eventually it might make sense to disallow modifiers in front of variable names altogether, but that would be a separate issue.

The text was updated successfully, but these errors were encountered:

stevengj · 2020-01-27T21:34:00Z

Yes, I guess it is kind of weird that we allow category Lm (Letter, modifier) to start identifiers. We probably should have discussed that in #6805 😢. The same issue was also discussed in #28441. Unfortunately, it would be breaking to disallow identifiers starting with Lm now, and I'm skeptical that this counts as a "minor change" that we can do in 1.x.

In any case, allowing Unicode modifiers for ' seems reasonable, analogous to #22089, I guess?

stevengj · 2020-01-27T21:36:25Z

Duplicate of #28494?

See also JuliaLang/LinearAlgebra.jl#410 where a'ᵀ was proposed.

simeonschaub · 2020-01-27T22:25:33Z

The unfortunate thing is that I don't see any non-breaking way to introduce this feature. Currently, even a'ᵀ is valid syntax, and I would argue that it's less breaking to throw a clear error here than just to silently interpret it as something different. Whether it is then still worth making this change is up to discussion.

Duplicate of #28494?

Oh, I didn't discover that. Also seems to propose some of the changes made in #33683. The concrete proposal is a bit different though, so should I still leave this issue open?

StefanKarpinski · 2020-01-27T22:26:47Z

Any easy way to see if any packages are using this feature is to make it a syntax error and then run PkgEval.

simeonschaub · 2020-01-27T22:46:33Z

What would be the usual protocol for that? Should I open a PR here?

StefanKarpinski · 2020-01-28T04:05:34Z

It would be to make a [NO NOT MERGE] PR that causes the relevant syntax to be an error and then ask someone to trigger PkgEval. Might be easier to grep through all the registered packages though.

stevengj · 2020-01-28T14:07:27Z

Created a PR in #34549 if someone wants to trigger PkgEval on that.

c42f · 2020-02-27T01:56:15Z

As mentioned in #34549 (comment), a survey of the fairly small number of packages which were broken by trying this out identified the following being used as postfix operators in category Lm:

(x)ᵀ   ↦  x'ᵀ    (category Lm)
(x)ˣ   ↦  x'ˣ    (category Lm)

But there was also the following in AbstractTensors

(x)⁻¹  ↦  x'⁻¹   (category Sm,No)
(x)₊   ↦  x'₊    (category Sm)
(x)₋   ↦  x'₋    (category Sm)
(x)ǂ             (category Lo)

Currently it seems we allow a lot of category Sm to begin an identifier, for example:

julia> ₋x = 1
1

So we'd also have trouble with parsing things like x'⁻¹y which currently produces

julia> :(x'⁻¹y)
:(x' * ⁻¹y)

Maybe this isn't a problem but it's kind of annoying.

simeonschaub · 2020-03-03T14:37:39Z

What does triage think would be the best way forward here? Is the change in #34549 acceptable for a minor release, considering what PkgEval revealed? A probably less breaking alternative would be to only change the parsing of modifiers right after ' to be consistent with how we do it for infix operators, although in this case, it might make sense to have a deprecation period, where make this syntax a syntax error.

baggepinnen · 2020-03-25T20:55:17Z

While I like the proposals in this issue, I just wanted to share that many characters don't render well in Chrome for Android.

simeonschaub · 2020-04-16T19:34:22Z

Just bumping this again. Is there any consensus forming?

stevengj added parser Language parsing and surface syntax speculative Whether the change will be implemented is speculative unicode Related to unicode characters and encodings labels Jan 27, 2020

stevengj mentioned this issue Jan 28, 2020

(DO NOT MERGE) don't allow identifiers to start with Lm (Letter, modifier) #34549

Open

MasonProtter mentioned this issue Jan 29, 2020

RFC: parse a' as call expression #33683

Open

simeonschaub mentioned this issue Feb 3, 2020

RFC: lower a' as var"'"(a) #34634

Merged

MasonProtter mentioned this issue Apr 25, 2020

new postfix transpose() operator different from ' #35593

Closed

simeonschaub mentioned this issue Aug 27, 2020

allow unicode modifiers after ' #37247

Merged

JeffBezanson closed this as completed in #37247 Oct 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unicode modifiers for adjoint operator #34507

Unicode modifiers for adjoint operator #34507

simeonschaub commented Jan 24, 2020

stevengj commented Jan 27, 2020

stevengj commented Jan 27, 2020 •

edited

Loading

simeonschaub commented Jan 27, 2020

StefanKarpinski commented Jan 27, 2020

simeonschaub commented Jan 27, 2020

StefanKarpinski commented Jan 28, 2020

stevengj commented Jan 28, 2020

c42f commented Feb 27, 2020

simeonschaub commented Mar 3, 2020

baggepinnen commented Mar 25, 2020

simeonschaub commented Apr 16, 2020

Unicode modifiers for adjoint operator #34507

Unicode modifiers for adjoint operator #34507

Comments

simeonschaub commented Jan 24, 2020

stevengj commented Jan 27, 2020

stevengj commented Jan 27, 2020 • edited Loading

simeonschaub commented Jan 27, 2020

StefanKarpinski commented Jan 27, 2020

simeonschaub commented Jan 27, 2020

StefanKarpinski commented Jan 28, 2020

stevengj commented Jan 28, 2020

c42f commented Feb 27, 2020

simeonschaub commented Mar 3, 2020

baggepinnen commented Mar 25, 2020

simeonschaub commented Apr 16, 2020

stevengj commented Jan 27, 2020 •

edited

Loading