-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Intersection: The Ordered Layers Formulation #48
Comments
Moving here from #38. Thank you @mark-todd for the quick response and the provided context, that was very helpful! I am still by far not up to date on all past discussions, but the more I read, the more I realize that I probably did not bring anything new to the discussion but just regurgitated other peoples' ideas without giving them credit, especially mikeshardmind, who basically opened #38 just to pursue the same idea; sorry for that :( I did not read thoroughly enough and gave the existing ideas not enough thought. I agree with the following (non-exhaustive list of) points made in #38:
I also agree with your approach in this issue. I am not sure if introducing the concept of "layering" is beneficial, as it is basically just inheritance with some special cases. Maybe just "extending" the rules of inheritance for the special cases would make more sense/ would be more easily understood? Currently the structural type will contain the "behaviour" of object (the python class) if a non-protocol type is in the parameter list, correct? Would it be beneficial if we could split the result into two structural parts with one being the ObjectProtocol and the other one being everything "interesting"/non-default? This would then effectively be a way to extract a protocol from a type (which we are already implicitely doing in this PEP, right?). Or should we always "remove" the default object behaviour from the resulting protocol as the ObjectProtocol is basically of no use for annotations? Regarding the issue with Any: I just posted another idea to the issue that you mentioned in the draft. I am curious to see what the reactions will be, but even if nothing happens, we could maybe transfer the idea from there and provide two options for how to treat Any (either LSP-compatible or not)? |
For what it's worth, I believe we should just do a true (unordered) intersection, and I don't think there are any significant issues standing in the way of doing so, after the clarifications to the typing spec around the meaning of Any in gradual typing that were made last May (in some part motivated by all the intersection discussions). I think that the questions around the meaning of If we do add an "ordered inheritance type", I would prefer that it be little or nothing more than syntactic sugar for "inline definition of an anonymous empty class type with given bases", so that it doesn't require new special handling or complexity in type checkers or in the type system. This would imply that no type would be valid in such a type that isn't already valid in the bases of a class, nor would any type have a different interpretation than it would have in the bases of a class. It would also imply that type checkers would check LSP validity of such a type in the same way they would check it for creation of a real class. |
I believe the prior efforts at trying to minimize impact showed fatal flaws to any approximation of it via ordering. I don't have the energy to re-detail all of that right now, but I agree with @carljm that we have sound definitions available for a true unordered intersection and it's just a matter of someone motivated enough to get it from those definitions to a full proposal with an implementation. I think there's maybe two things which should be cleared up as a prerequisite to this though, because there's going to be issues with the sound definitions if this isn't. The implied types imposed by the datamodel (specifically the parts of the datamodel that are enforced by the interpreter/c-api) should be always understood as the lower bound.
|
Thanks for thinking through edge cases!
Do you have an example of where this needs to be special-cased in a way that isn't already encoded in typeshed annotations?
Not sure I understand what specific behavior you're suggesting here, or why it's related to intersections specifically. I think |
I believe it's just
Hashable works as I expect currently, it's a bit underspecified, but pyright and mypy at least are both working around it currently. I'd like to ensure we have a consistent way of handling
This is fixable by changing the way we type Going that route, there's an implied negation type required to fully express the set of types here, because it's possible for a type to remove hashability and then a subtype of that type to re-add it. The other route for fixing it would be that the first non-object base defines whether something is Hashable or not, and it is an LSP violation to change that in further subclasses. This would be my preferred method as it is simpler in nature, but it reraises issues with the fact that in 3.11, subclassing Any was specified as allowed without defining the semantics of that. The obvious route here is that subclassing Any is ignored for this calculation and the type is assumed compatible. |
Ok, I can certainly see that people will be tempted to write In a vacuum I think my preference would be to change |
I'll try and get something together tonight or tomorrow for discussion on that for discourse, I think it's the only remaining major blocker (I understand the feeling of things not moving forward...). There are other things I'd prefer handling before then, but all of those seem like things which are less important and less likely to be encountered at a point that leaks into public interfaces of libraries. (And therefore things that we can just accept might have some minor sharp edges we can smooth over in the future) |
@mikeshardmind and @carljm can tell us the state of affairs? I am sorry I really lost track. Edit:Okay in the answer below I didn't know that if you overwrite
|
The thread on |
I might be able to put it into specification language in the near future, but I'm a bit concerned about some of the questions it poses about LSP compatibility in general, and that there's likely more that needs doing. This isn't the only place where an addition removes capability. The obvious other cases change variance, but there are a few that are more involved than that. Fixing this one is likely enough to unblock intersections, as it is the most likely to be interacted with due to how fundamental hashability is in python, but I don't think the other known issues stop being issues. (Some other places require larger changes to how variance is handled, a few would benefit from more narrowly scoping LSP exceptions in a similar way, but those would be more disruptive to existing code, even when highlighting real issues) |
Summary
I'm starting this thread as a fresh start on the Intersection PEP. We've come across various difficulties in formulating the Intersection PEP, and it's been a long road of discussion - I'm keen for all that hard work not to go to waste.
What's the idea?
This PEP will build a lot on the ideas of the "Ordered Intersection" that's been discussed a lot over the past few weeks, but conceptually is perhaps slightly different. I'm hoping that perhaps these differences will offer solutions to some of the issues we've been facing. The main concepts:
Intersection[A,B]
is A layered on top of B. There will be more details on exactly what I mean by layering to follow.What isn't part of this PEP
&
will not be considered, to avoid confusion with unordered intersections. While it may be possible to include, as in the previous PEP there was a subcase dealing with this, for a first PEP I think it will be simpler to exclude this from discussion. It's always possible to add this syntax at a later date, but I leave this to a future PEP.Other considerations
TypedDict
,Callable
andProtocol
will be considered "structural types", and will not have the instance restriction.Callable
will act exactly as a form of the__call__
method, andTypedDict
will effectively define the type for the__getitem__
method.What is layering?
The concept of layering is inspired by the way a lot of photo editing software works. Layers are built up - where features overlap the highest priority layer appears in the final image. In this analogy each class/type is effectively a layer, and the stack of layers is the intersection. A layer does not need to a concrete class and can be a protocol. It only represents the exposed method and attribute types of the class/protocol.
This might be quite controversial, but it seems to me so far that the two concepts at the core of the idea deal with a lot of the discussed edge cases. The PEP will put quite a lot of emphasis on the "How to teach" section as these concepts will be at the very heart of the idea.
The text was updated successfully, but these errors were encountered: