Ad-hoc operator overloading #399

SiegeLord · 2014-10-15T18:29:08Z

Rendered.

This RFC is in many ways an alternative to #392.

Gankra · 2014-10-15T19:13:30Z

How do multi-method operators like slice work?

SiegeLord · 2014-10-15T19:20:55Z

How do multi-method operators like slice work?

Each method gets its own #[operator] attribute.

sfackler · 2014-10-15T19:27:35Z

How will I as a consumer of a library be able to easily find out if I can use + or whatever with a type defined in that library?

Gankra · 2014-10-15T19:28:07Z

Can slice be partially implemented, then? Only implement [..n] or whatever?

SiegeLord · 2014-10-15T19:42:01Z

How will I as a consumer of a library be able to easily find out if I can use + or whatever with a type defined in that library?

Typically you will still implement the traits from core::ops, so that your type can be used in most generic code (same reason why you'd implement Clone rather than make your own duplication method). Additionally, I imagine we could enhance rustdoc to indicate what attributes are attached to the method.
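To illustrate the point about generic code, here is a minimal sketch in present-day Rust syntax (which differs from the 2014 syntax used elsewhere in this thread). `Meters` and `total` are hypothetical names chosen only for this example; the sketch shows that a generic function can only name `+` through the trait bound, so a type that wants to flow through such code implements the standard trait.

```rust
use std::ops::Add;

// Hypothetical newtype used only for illustration.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Meters(u32);

// Implementing the standard trait enables `a + b` for Meters.
impl Add for Meters {
    type Output = Meters;
    fn add(self, rhs: Meters) -> Meters {
        Meters(self.0 + rhs.0)
    }
}

// Generic code can only refer to the operation through the trait bound.
fn total<T: Add<Output = T> + Copy>(first: T, rest: &[T]) -> T {
    rest.iter().fold(first, |acc, &x| acc + x)
}
```

The same `total` works for built-in integers and for `Meters`, which is the reuse argument made above.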

Can slice be partially implemented, then? Only implement [..n] or whatever?

Yeah.

ftxqxd · 2014-10-16T04:32:48Z

I like this idea. Currently I think it is bad that we describe operators using traits: people will try to use those operator traits as trait bounds, which in my opinion is a bad idea as the operator could mean anything: for numbers it’s addition, but for strings it’s concatenation. Using + for concatenation is not inherently bad, but is an example of how one shouldn’t assume that + represents addition (and thus is commutative, transitive, and so on). I think we should instead still have an Add trait in the standard library, with a #[operator="add"] decoration, but say that Add represents numerical addition, and anything else wanting to implement + in a non-number-like way would implement it as an inherent method, or perhaps as an impl of another concatenation/whatever trait (with an #[operator="add"] decoration on the relevant method).

I also like the way that slicing could potentially be split up with this. Strings really shouldn’t implement [a..b], [a..], and [..b], because they would probably take byte indices, which aren’t normally what you want. However, the [] syntax is very useful. It would be great if we could split Slice into AsSlice and SliceRange or something.

I’d also like to make the suggestion that the #[operator] use the actual operator symbol instead of just some arbitrary name, i.e., use #[operator="+"] and #[operator="[a..]"] instead of #[operator="add"] and #[operator="slice_from"]. That would mean that we could extend this to arbitrary operators in the future (albeit probably with some extra complications).

blaenk · 2014-10-16T05:22:03Z

I agree with @P1start's final point about making it use the actual operator, to at least leave open the possibility of arbitrary operators.

netvl · 2014-10-16T05:27:48Z

How do you write things like generic ranges in this case? Currently range() is defined like this:

pub fn range<A: Add<A, A> + PartialOrd + Clone + One>(start: A, stop: A) -> Range<A>

There is a nice and clean bound which only requests what is absolutely needed for the range. How will this function look with ad-hoc operator overloading? It works in C++, for example, because templates there are just raw substitutions. But I don't think we want to introduce C++-like templates into Rust.
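For reference, a rough equivalent of that bound in present-day Rust might look like the sketch below. It is not the standard library's definition: `range_vec` is a hypothetical helper, the step value `one` is passed explicitly because today's standard library has no `One` trait, and the result is collected into a `Vec` to keep the example short.

```rust
use std::ops::Add;

// Hypothetical half-open range [start, stop), stepping by `one`.
fn range_vec<A>(start: A, stop: A, one: A) -> Vec<A>
where
    A: Add<Output = A> + PartialOrd + Clone,
{
    let mut out = Vec::new();
    let mut cur = start;
    while cur < stop {
        out.push(cur.clone());
        // `+` is available here only because of the `Add` bound.
        cur = cur + one.clone();
    }
    out
}
```

Without a trait bound that names `+`, the body of a function like this would have no way to be type-checked ahead of instantiation, which is the concern raised here.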

BTW, Haskell is not a valid example of ad-hoc overloading. You can't overload the (+) function without implementing a type class. If you do write your own (+) for the type you need without implementing the Num type class, this new definition will take over the original one (or there will be a conflict with the imported name, I don't remember), so you won't be able to add numbers, for example. There is no function overloading in any form in Haskell.

bill-myers · 2014-10-16T07:00:14Z

Thinking about this, it makes a lot of sense.

After all, when you write "x.foo(y)", the compiler doesn't look up a Foo trait and then call its foo method, but rather it just performs method lookup for foo (and obviously this is a good thing).

There doesn't seem to be any reason for "x + y" to work differently.

Traits with operators are still possible and would work.
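As a point of comparison, the sketch below shows how the trait-based design spells this in present-day Rust: `a + b` is resolved through the `Add` trait, while `a.add(b)` goes through ordinary method lookup. `V2` and `three_spellings` are hypothetical names used only for illustration.

```rust
use std::ops::Add;

#[derive(Debug, Clone, Copy, PartialEq)]
struct V2(i32, i32);

impl Add for V2 {
    type Output = V2;
    fn add(self, rhs: V2) -> V2 {
        V2(self.0 + rhs.0, self.1 + rhs.1)
    }
}

// Three spellings of the same call in the trait-based design.
fn three_spellings(a: V2, b: V2) -> (V2, V2, V2) {
    let sugar = a + b;             // operator syntax, resolved via the Add trait
    let method = a.add(b);         // ordinary method lookup; needs `Add` in scope
    let explicit = Add::add(a, b); // fully explicit trait call
    (sugar, method, explicit)
}
```

The RFC's proposal would, roughly, make the first spelling behave like the second.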

bill-myers · 2014-10-16T07:04:44Z

active/0000-ad-hoc-operators.md

+```rust
+trait MyTrait {
+    #[operator="add"]
+    fn add(&self, rhs: &uint) -> uint;


Why not just use a C++-ish syntax like this:

fn op +(&self, rhs: &uint) -> uint;

or a Scala-ish notation like this:

fn +(&self, rhs: &uint) -> uint;

or even this daring syntax (not sure if this is unambiguous and parsable):

fn (&self + rhs: &uint) -> uint;

One could also imagine allowing any sequence of non-[A-Za-z0-9] characters (with exceptions to avoid conflicts) as an operator.

SiegeLord · 2014-10-16

It's a possibility. I went with a more conservative choice for implementation ease.

brendanzab · 2014-10-17

Without committing to supporting this RFC, the Haskell style fn (+)(lhs: T, rhs: U) -> V might make more sense, seeing as binary operators are called in a different way to regular functions.

reem · 2014-10-16T07:06:39Z

This RFC would be more convincing if it included examples of useful patterns this would allow that can't be created using today's system. Personally I'm not really aware of many cases, but I could be easily convinced if I saw code that was much nicer to write or work with under this RFC.

huonw · 2014-10-16T07:44:31Z

How do you write things like generic ranges in this case? Currently range() is defined like this:

@netvl in some ways this is a bug; range could/should be using a Range trait with succ and pred like Haskell's Enum type class. This allows it to be used with general enumerated types, like char.
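A minimal sketch of that idea in present-day Rust follows. The `Succ` trait and `range_by_succ` function are hypothetical names for illustration only (they are not a real standard library API), and only the `succ` half of Haskell's `Enum` is modeled.

```rust
// Hypothetical trait, loosely modeled on the `succ` half of Haskell's Enum.
trait Succ: Sized {
    // Returns the next value, or None when there is no successor.
    fn succ(&self) -> Option<Self>;
}

impl Succ for u8 {
    fn succ(&self) -> Option<u8> {
        self.checked_add(1)
    }
}

impl Succ for char {
    fn succ(&self) -> Option<char> {
        // from_u32 rejects values that are not valid chars (e.g. surrogates).
        char::from_u32(*self as u32 + 1)
    }
}

// Collects the half-open range [start, stop) using only `succ` and comparison.
fn range_by_succ<T: Succ + PartialOrd>(start: T, stop: T) -> Vec<T> {
    let mut out = Vec::new();
    let mut cur = Some(start);
    while let Some(v) = cur {
        if v >= stop {
            break;
        }
        cur = v.succ();
        out.push(v);
    }
    out
}
```

Because nothing here requires `Add`, the same function covers `char`, which an `Add`-based bound cannot.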

huonw · 2014-10-16T07:50:26Z

Currently I think it is bad that we describe operators using traits: people will try to use those operator traits as trait bounds, which in my opinion is a bad idea as the operator could mean anything: for numbers it’s addition, but for strings it’s concatenation. Using + for concatenation is not inherently bad, but is an example of how one shouldn’t assume that + represents addition (and thus is commutative, transitive, and so on).

We could say that impls of Add should be a commutative, associative operation (and remove the broken ones in the stdlib). It's perfectly reasonable to define a trait to have specific meaning that's not strictly enforced by the compiler, as long as it doesn't lead to memory unsafety without using unsafe. In fact, capturing shared behaviour like this is basically the point of using traits. (That is, an "invalid" impl of Add would just lead to confusion and annoyance at the library author.)

(This is not a comment about this RFC specifically, just pointing out that that reasoning doesn't necessarily apply.)
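To make the "law that the compiler does not enforce" concrete, here is a small sketch in present-day Rust. `commutes` and `Text` are hypothetical names; `Text` is a wrapper that concatenates on `+`, standing in for a non-commutative impl of `Add`.

```rust
use std::ops::Add;

// Checks one instance of the (unenforced) law a + b == b + a.
fn commutes<T>(a: T, b: T) -> bool
where
    T: Add<Output = T> + Clone + PartialEq,
{
    a.clone() + b.clone() == b + a
}

// Hypothetical wrapper whose `+` is concatenation.
#[derive(Debug, Clone, PartialEq)]
struct Text(String);

impl Add for Text {
    type Output = Text;
    fn add(self, rhs: Text) -> Text {
        Text(self.0 + &rhs.0)
    }
}
```

Both impls type-check equally well; only a test or a documented convention distinguishes the one that obeys the law from the one that does not.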

ftxqxd · 2014-10-16T09:16:53Z

We could say that impls of Add should be a commutative, associative operation (and remove the broken ones in the stdlib). It's perfectly reasonable to define a trait to have specific meaning that's not strictly enforced by the compiler, as long as it doesn't lead to memory unsafety without using unsafe.

Yes, but then you don’t get the nice a + b syntax for non-addition-like things. My point was that at the moment, Add is implemented as a way of doing operator overloading (which just happens to be done through a trait), but it’s treated in bounds as a normal trait (with the associated not-enforced-by-the-compiler invariants). I agree that traits should be allowed to have properties that aren’t enforced by the compiler, and in fact that was part of my point: we should have a way of having an Add trait with the related invariants (commutativity, transitivity) while at the same time having the ability to use the quite unrelated syntactic sugar of a + b without having to obey those invariants (and thus not implementing the Add trait specifically).

Basically, I’m trying to say that we are ‘abusing’ traits by making Add (and most other operator traits) represent two different things: the syntax a + b, which is purely sugar and should be unrelated to its actual functionality, and the idea of addition, which is what traits should actually be used for.

huonw · 2014-10-16T09:23:40Z

Why would you want to use + for something that's not addition-like? That seems like an abuse of operator overloading, i.e. a canonical example of what proponents of no-overloaded-operators complain about (yes, I don't like our current impl of Add for String). It's very disingenuous to say that the a + b syntax is unrelated to the concept of addition, since + is the symbol for addition.

japaric · 2014-10-16T15:00:27Z

This RFC would be more convincing if it included examples of useful patterns this would allow that can't be created using today's system. Personally I'm not really aware of many cases, but I could be easily convinced if I saw code that was much nicer to write or work with under this RFC.

This would make the indexing and slicing operators actually useful without having to wait for HKT. The current method signatures of index and slice are too limiting and pretty much only work for Vec/[T] and String/str. With this proposal you could define:

Matrix indexing that returns rows:

// Self = Mat<T>
fn index<'a>(&'a self, row: &uint) -> Row<'a, T> { .. }
let row = mat[0];
let elem = mat[1][2];

Matrix slicing that returns a sub matrix view

// Self = Mat<T>
fn slice_or_fail<'a>(&'a self, start: (uint, uint), end: (uint, uint)) -> View<'a, T> { .. }
assert_eq!(mat[(1, 2) .. (3, 4)].size(), (2, 2));

Slicing of strided slices, etc

None of that is possible with the current Index/Slice traits
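The limitation can still be seen in present-day Rust, where `Index::index` must return `&Self::Output`. The sketch below uses hypothetical `Mat` and `Col` types: contiguous rows can be handed out through `Index` as a borrowed slice, but a strided column view is a freshly built value and has to come from a named method instead.

```rust
use std::ops::Index;

// Hypothetical row-major matrix used only for illustration.
struct Mat<T> {
    data: Vec<T>,
    rows: usize,
    cols: usize,
}

// A strided column view; it is a fresh value, not a reference into `Mat`.
struct Col<'a, T> {
    mat: &'a Mat<T>,
    col: usize,
}

impl<T> Mat<T> {
    // A view type has to be returned from a named method, because
    // `Index::index` must return `&Self::Output`.
    fn col(&self, col: usize) -> Col<'_, T> {
        assert!(col < self.cols);
        Col { mat: self, col }
    }
}

// Rows are contiguous, so they can be handed out as a borrowed slice.
impl<T> Index<usize> for Mat<T> {
    type Output = [T];
    fn index(&self, row: usize) -> &[T] {
        assert!(row < self.rows);
        &self.data[row * self.cols..(row + 1) * self.cols]
    }
}

// Element access inside the view works, since elements live in `data`.
impl<'a, T> Index<usize> for Col<'a, T> {
    type Output = T;
    fn index(&self, row: usize) -> &T {
        &self.mat.data[row * self.mat.cols + self.col]
    }
}
```

So `mat[1][2]` works, while the column view needs `mat.col(1)[0]` rather than operator syntax on `mat` itself.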

About the proposal, I have two questions:

Will it be possible to overload operations on types not defined in the current crate? Example

// crate foo
trait CharIndex {
    #[operator="[]"]
    fn index(&self, pos: uint) -> char;
}

impl CharIndex for String {}

What happens in this case:

impl Foo {
    #[operator="[]"]
    fn index(&self, index: uint) -> Bar { .. }
}

trait MyIndex {
    #[operator="[]"]
    fn index(&self, index: uint) -> Baz;
}

// is it possible to overload the operator again?
impl MyIndex for Foo { .. }

foo[0]  // what's the return value: Bar or Baz?

Other thoughts:

This RFC could prevent the scenario of getting the stdlib cluttered with HKT and non-HKT variants of traits used for operator overloading.
This RFC would allow unsafe (no bounds checked) indexing on *const [T]: unsafe { raw_slice[0] } (I think @gankro wanted this)
This RFC would probably remove the need of having two very similar indexing traits: Index and IndexGet.

SiegeLord · 2014-10-16T15:40:44Z

I’d also like to make the suggestion that the #[operator] use the actual operator symbol instead of just some arbitrary name, i.e., use #[operator="+"] and #[operator="[a..]"] instead of #[operator="add"] and #[operator="slice_from"]. That would mean that we could extend this to arbitrary operators in the future (albeit probably with some extra complications).

Yeah, that might be a good idea. My only concern is the arbitrariness of the a in [a..]. We'd have to spec what exactly can go in there and what it would mean, if anything.

BTW, Haskell is not a valid example of ad-hoc overloading.

Yes, I'm aware this is not practical to take advantage of in Haskell. Nevertheless, if you want to have a hard time (i.e. only use your non-Num operators in a module that doesn't import Num's functions), the possibility is there. That's sort of a general issue with Haskell, and not specific to operator overloading.

@japaric in all cases those would act like normal methods. E.g. you will be able to add new operators to types outside the crate (you'll have to use your own trait or a wrapper type).

In the second case, it'd be a conflict that you'd have two options for resolving:

Using UFCS and the raw method names (you lose syntax sugar in this case)
Selectively not importing the trait, or using trait bounds (today's workaround for the lack of UFCS). This does allow you to keep using the syntax sugar.
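The first option can be sketched with ordinary methods in present-day Rust, where the fully qualified call syntax exists. `Foo`, `IndexBar`, and `IndexBaz` are hypothetical names; no `#[operator]` attribute is involved because that attribute was never added to the language.

```rust
struct Foo;

// Two hypothetical traits that both provide a method named `index`.
trait IndexBar {
    fn index(&self, i: usize) -> String;
}
trait IndexBaz {
    fn index(&self, i: usize) -> usize;
}

impl IndexBar for Foo {
    fn index(&self, i: usize) -> String {
        format!("bar{}", i)
    }
}
impl IndexBaz for Foo {
    fn index(&self, i: usize) -> usize {
        i * 2
    }
}

// With both traits in scope, `foo.index(1)` is rejected as ambiguous,
// so each call names its trait explicitly.
fn call_both(foo: &Foo) -> (String, usize) {
    (IndexBar::index(foo, 1), <Foo as IndexBaz>::index(foo, 1))
}
```

As noted above, the explicit form resolves the conflict at the cost of the operator sugar.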

Gankra · 2014-10-16T16:21:04Z

Re usecases: some people would also like to be able to unsafely index or +/- a ptr.

This RFC makes me feel some feels. I think this is the sort of solution that makes the most sense long term. I'm not sure if it would be appropriate for 1.0, though. Lots of work to get this going, I imagine.

There will need to be some checks for arity involved (I don't recall seeing this in the proposal?). If actual symbols are used, then something will need to be done to handle arity-overloaded operators. "-" being the most obvious one. For those interested in generalizing this to a future with "any operator", how would this be handled? Would all operators be candidates for unary and binary forms? Or perhaps we could have a syntax like "_ - _", "- _", "[_ ...]", etc. to distinguish.
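For comparison, the trait-based design in present-day Rust sidesteps the arity question by giving unary and binary minus separate traits, `Neg` and `Sub`. The `Offset` type below is a hypothetical example.

```rust
use std::ops::{Neg, Sub};

#[derive(Debug, Clone, Copy, PartialEq)]
struct Offset(i32);

// Binary minus: `a - b`.
impl Sub for Offset {
    type Output = Offset;
    fn sub(self, rhs: Offset) -> Offset {
        Offset(self.0 - rhs.0)
    }
}

// Unary minus: `-a`.
impl Neg for Offset {
    type Output = Offset;
    fn neg(self) -> Offset {
        Offset(-self.0)
    }
}
```

A symbol-based attribute would need some equivalent way to tell these two forms apart.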

It might also make sense to put some sanity constraints on the ownership of the LHS. For instance, += with &LHS makes basically no sense in decent code, but this proposal seems to admit it as possible.
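In present-day Rust this constraint is built into the trait: `AddAssign::add_assign` takes `&mut self`, so `+=` always has mutable access to its left-hand side. A small sketch, with `Counter` and `bump` as hypothetical names:

```rust
use std::ops::AddAssign;

#[derive(Debug, PartialEq)]
struct Counter(u32);

// The trait fixes the receiver as `&mut self`; a shared `&self` is not an option.
impl AddAssign<u32> for Counter {
    fn add_assign(&mut self, rhs: u32) {
        self.0 += rhs;
    }
}

fn bump(mut c: Counter) -> Counter {
    c += 5; // calls AddAssign::add_assign(&mut c, 5)
    c
}
```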

Ericson2314 · 2014-10-16T17:51:55Z

I think also allowing #[operator="..."] on normal functions would be more consistent. And overloading is probably a real pain for type checking, so I think the choices should be non-overloaded #[operator="..."] (pretty close to the Haskell way), or the current system.

arielb1 · 2014-10-16T20:25:46Z

@Ericson2314

This shouldn't cause new type-checking problems – a <OP> b will just become a.<OP>(b) but with different autoref rules.

Anyway, it will be interesting to know how this handles Index vs. IndexMut.
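For context, this is how the trait-based design handles the pair in present-day Rust: the same `[]` syntax dispatches to `Index` or `IndexMut` depending on whether the place is read or written. `Grid` and `grid_demo` are hypothetical names; how the RFC's attribute scheme would make the same choice is the open question raised here.

```rust
use std::ops::{Index, IndexMut};

struct Grid {
    cells: Vec<i32>,
    width: usize,
}

impl Index<(usize, usize)> for Grid {
    type Output = i32;
    fn index(&self, (r, c): (usize, usize)) -> &i32 {
        &self.cells[r * self.width + c]
    }
}

impl IndexMut<(usize, usize)> for Grid {
    fn index_mut(&mut self, (r, c): (usize, usize)) -> &mut i32 {
        &mut self.cells[r * self.width + c]
    }
}

fn grid_demo() -> (i32, i32) {
    let mut g = Grid { cells: vec![0; 4], width: 2 };
    g[(1, 0)] = 7;        // assignment context: uses IndexMut::index_mut
    let read = g[(1, 0)]; // read context: uses Index::index
    g[(0, 1)] += read;    // compound assignment also goes through index_mut
    (read, g[(0, 1)])
}
```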

Ericson2314 · 2014-10-16T21:24:59Z

Ah ok, the overloading and methods-only restriction go together. I am not sure how I feel about exploiting identifier ambiguities in this way however---the system is analogous to an unhygienic macro. But yes, it does get the job done.

ftxqxd · 2014-10-17T02:26:23Z

@huonw

Why would you want to use + for something that's not addition-like? That seems like an abuse of operator overloading, i.e. a canonical example of what proponents of no-overloaded-operators complain about (yes, I don't like our current impl of Add for String). It's very disingenuous to say that the a + b syntax is unrelated to the concept of addition, since + is the symbol for addition.

+ represents addition in the context of mathematics, but in programming that’s not necessarily the case. Languages like Python use + for string concatenation; it’s quite useful and it’s obvious what it does. It’s not great being forced to sacrifice nice syntax simply because it doesn’t match the precise mathematical definition of the symbol. Yes, adding other operators is a better solution (for this particular case) in my opinion, but until Rust gets that, using + for concatenation can do. Python and similar languages had the option to make a new operator for concatenation, but didn’t as they saw it unnecessary.

And + isn’t the only example of an operator that can be overloaded nicely without preserving their primary meaning. |, &, and ! can be useful in various scenarios, ranging from types representing patterns or PEGs to DSL-like constructs to generate a mini language. It seems strange to me to forcefully couple the syntax with one particular meaning, instead of allowing a broader scope of meanings, with common sense determining what is an appropriate use. Operators are basically the same as method names, and I see no reason, for example, to force all methods named add to represent mathematical addition: adding items to a set could use the same name. The reason that this seems fine is because add can have multiple meanings that depend on the context. I see no reason why we can’t accept that + can have multiple meanings, mathematical or not, in the same way that names can have multiple meanings.
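A minimal sketch of the pattern-DSL use of `|`, written in present-day Rust against the existing `BitOr` trait. `Pat` is a hypothetical type invented for this example; the point is only that the operator builds an alternation rather than computing a bitwise OR.

```rust
use std::ops::BitOr;

// Hypothetical mini pattern type: matches a literal or either of two patterns.
#[derive(Debug, Clone)]
enum Pat {
    Lit(&'static str),
    Or(Box<Pat>, Box<Pat>),
}

impl Pat {
    fn matches(&self, input: &str) -> bool {
        match self {
            Pat::Lit(s) => input == *s,
            Pat::Or(a, b) => a.matches(input) || b.matches(input),
        }
    }
}

// `a | b` builds an alternation instead of performing a bitwise OR.
impl BitOr for Pat {
    type Output = Pat;
    fn bitor(self, rhs: Pat) -> Pat {
        Pat::Or(Box::new(self), Box::new(rhs))
    }
}
```

With this, `Pat::Lit("yes") | Pat::Lit("y")` reads as "yes or y", which is the kind of context-dependent meaning argued for above.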

pcwalton · 2014-10-23T21:30:37Z

I am sympathetic to this but I think that Haskell is not a valid comparison, because being able to overload + is not something you can do on a type-based basis in the same scope. To me the strongest argument is that it's similar to how dot notation works today: a.add(b) works with multiple signatures of add. This is something that some Haskellers don't like :) But it is how Rust works, and extending to operators makes sense to me for consistency's sake (and we do something similar with Deref/Index and friends right now anyhow).

I believe this can be added backwards compatibly post 1.0, so I'll mark this as postponed.

brendanzab · 2014-10-23T23:46:05Z

Languages like Python use + for string concatenation;

As a counterexample, Haskell and Scala use (++) for both string and vector concatenation, and D uses (~).

ghost · 2014-10-28T01:48:30Z

I'm glad to see something like this being proposed. The current approach, while conceptually simple, just does not seem to scale very well at all.

Basically, I’m trying to say that we are ‘abusing’ traits by making Add (and most other operator traits) represent two different things: the syntax a + b, which is purely sugar and should be unrelated to its actual functionality, and the idea of addition, which is what traits should actually be used for.

I agree with this 100%. I have a semigroup library where having an operator like * makes sense for each impl, but the operation does not necessarily represent arithmetic multiplication. Due to the fact that operators are tied to traits, and given coherence restrictions, I have to go through various contortions to make it work.

huonw · 2014-10-28T03:32:55Z

@darinmorrison, that's a perfectly reasonable multiplication; in particular, it is associative. I'm not concerned about arithmetic operations, mainly mathematical convention. I'm afraid you'll need to be more specific about what the contortions are because that doesn't seem outrageous to me.

ghost · 2014-10-28T04:14:10Z

I'm afraid you'll need to be more specific about what the contortions are because that doesn't seem outrageous to me.

Sorry, I should have been more specific. In particular, it's a nuisance to have to wrap everything in S in order to get * to work reasonably across the various impls. In an earlier version I made Semigroup a trivial extension of Mul but this just doesn't work because you can't provide an impl of Mul for things like Option. Worse, I can't define a new operator (as far as I know), so the solution is to wrap everything in a trivial struct S which creates an unnecessary layer everywhere which you can see here.
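The wrapper workaround described here can be sketched as follows in present-day Rust. This is not the commenter's library: `Semigroup`, `combine`, and the impls are hypothetical stand-ins, and only the name `S` is taken from the comment.

```rust
use std::ops::Mul;

// Hypothetical trait standing in for the commenter's Semigroup.
trait Semigroup {
    fn combine(self, other: Self) -> Self;
}

impl Semigroup for String {
    fn combine(mut self, other: String) -> String {
        self.push_str(&other);
        self
    }
}

// Option lifts any Semigroup; None acts as "nothing to combine".
impl<T: Semigroup> Semigroup for Option<T> {
    fn combine(self, other: Option<T>) -> Option<T> {
        match (self, other) {
            (Some(a), Some(b)) => Some(a.combine(b)),
            (a, None) => a,
            (None, b) => b,
        }
    }
}

// Coherence forbids `impl<T> Mul for Option<T>` outside the standard library,
// so a local wrapper carries the operator instead.
#[derive(Debug, PartialEq)]
struct S<T>(T);

impl<T: Semigroup> Mul for S<T> {
    type Output = S<T>;
    fn mul(self, rhs: S<T>) -> S<T> {
        S(self.0.combine(rhs.0))
    }
}
```

Every use of `*` then has to wrap and unwrap through `S`, which is the extra layer being complained about.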

I suppose there are ways to work around this using macros (see here) but that seems a bit unsatisfying, not to mention complicated.

ghost · 2014-10-28T04:50:52Z

@darinmorrison, that's a perfectly reasonable multiplication; in particular, it is associative. I'm not concerned about arithmetic operations, mainly mathematical convention.

I'm not sure mathematical convention is a good measure because it varies too much depending on your perspective. For instance, * would still make sense as an operator for Magma. In that case, we have nothing other than the fact that there is an operation. Indeed, we might choose to start there instead and define Semigroup as AssociativeMagma, etc.

brendanzab · 2014-10-28T21:14:46Z

@darinmorrison Have you seen my num-rs library? It shows one way to do it using the current operator traits.

ghost · 2014-10-28T23:10:33Z

@bjz I hadn't seen that. Looks very nice! I'll have to take a closer look, thanks.

ghost · 2014-10-28T23:23:54Z

@bjz Ah, having looked at it closer now, I think I see what you mean. This is actually how I was doing Semigroup originally (using std::ops::Mul<_,_>). Like I mentioned earlier though, this is a problem if you intend to provide impls of the algebraic traits for non-numeric types, which is my primary interest here. It seems that a choice is forced between either providing a wrapper for all builtin types and providing std::ops::Mul<_,_> impls for each wrapper or just doing it once like with S. The latter seemed to be less noisy and simpler but still not ideal.

SiegeLord added 2 commits October 15, 2014 14:24

Ad-hoc operators

1841fa3

Reword

63a503c

Mention operator overloading traits with multiple methods

cf7163a

bill-myers reviewed Oct 16, 2014

pcwalton added the postponed label Oct 23, 2014

pcwalton closed this Oct 27, 2014

pcwalton mentioned this pull request Oct 27, 2014

More flexible operator overloading #420

Open

SiegeLord mentioned this pull request Oct 31, 2014

Operator dispatch rust-lang/rust#18486

Merged

aturon mentioned this pull request Nov 3, 2014

Unsafe Indexing and Slicing #392

Closed

Gankra mentioned this pull request Nov 3, 2014

Improve unsafe ergonomics #433

Open

Centril added the T-lang, A-typesystem, A-operator, A-resolve, and A-attributes labels Nov 27, 2018
