[RFC] externally implementable functions #3632

m-ou-se · 2024-05-10T09:40:13Z

An alternative to this is #3635

Rendered

Tracking:

Tracking Issue for externally implementable items rust#125418

text/0000-externally-implementable-functions.md

jdonszelmann · 2024-05-10T09:55:14Z

How do you deal with the following case?

crate A defines an extern function f.

crate B imports A and implements f.
crate C imports A and implements f.

crate D imports B and C but cannot because both of them implement f and they conflict.

I agree conflicting implementations are a compiler error, but if libraries do it then you make some libraries mutually exclusive.

Edit: as I noted below, you might make it so B and C can both be imported as long as D also implements f

bjorn3 · 2024-05-10T09:56:03Z

text/0000-externally-implementable-functions.md

+    and outside (e.g. global logger). This just makes it much easier (and safer) to get right.
+
+# Rationale and alternatives
+


Should we allow grouping multiple functions together like global_allocator in this RFC? Or should that be left as future potential improvement?

you could work around that with a TAIT:

pub trait MyFunctions { fn fn1() -> String; fn fn2(a: String, b: u32); } pub type MyFunctionsImpl = impl MyFunctions; fn f(v: Infallible) -> MyFunctionsImpl { my_functions(v) } pub extern impl fn my_functions(v: Infallible) -> impl MyFunctions; pub fn fn3() -> String { MyFunctionsImpl::fn1() }

I think that'd be part of a potential future (more compplicated) RFC, such as #2492

Possibly global_alloc could at least use the same internal mechanism, even if it's not visible to the user?

#[global_alloc] static ALLOC: MyAlloc = ...;

could expand to something like

static ALLOC: MyAlloc = ...; impl fn alloc::alloc::alloc(layout: Layout) -> *mut u8 { ALLOC.alloc(layout) } impl fn alloc::alloc::dealloc(layout: Layout) -> *mut u8 { ALLOC.dealloc(layout) } // ...

Then codegen and Miri would only have to support one such mechanism. :)

text/0000-externally-implementable-functions.md

m-ou-se · 2024-05-10T10:05:07Z

I agree conflicting implementations are a compiler error, but if libraries do it then you make some libraries mutually exclusive.

Yes, those would be mutually exclusive. Exactly how today panic-halt, panic-semihosting, and panic-reset are all mutually exclusive.

jdonszelmann · 2024-05-10T10:08:14Z

Can libraries define a new default implementation of an extern function? Something like

// crate A:
extern impl fn logger() -> Logger {
    Logger::to_stdout().with_colors()
}

// crate B imports A:
extern impl fn a::logger() -> Logger {
    Logger::to_stderr().with_colors()
}

// crate C imports B:
impl fn b::logger() -> Logger {
    Logger::to_file("log.txt")
}

m-ou-se · 2024-05-10T10:10:52Z

Can libraries define a new default implementation of an extern function?

No, I don't think we should do that. It'd be hard to define the priority of the multiple defaults.

juntyr · 2024-05-10T10:14:28Z

How do you deal with the following case?

crate A defines an extern function f.

crate B imports A and implements f. crate C imports A and implements f.

crate D imports B and C but cannot because both of them implement f and they conflict.

I agree conflicting implementations are a compiler error, but if libraries do it then you make some libraries mutually exclusive.

Edit: as I noted below, you might make it so B and C can both be imported as long as D also implements f

On the one hand, I would hope that crates that implement externally implementable functions would be minimal, i.e. only have this implementation and nothing else, so that including both in the crate graph would always be a clear and understandable error (and not a mistake because you'd want functionality from both).

On the other hand, the example of logging shows that there are clear cases where you might want to combine different loggers into some super-logger that is specific for your use case. Logging implementation crates could then either ship a companion crate that only implements the function, or could have a non-default feature to enable the implementation.

jdonszelmann · 2024-05-10T10:16:20Z

Simply feature gating these implementations solves a lot of the problems I think.

kennytm · 2024-05-10T10:41:35Z

text/0000-externally-implementable-functions.md

+impl fn core::panic::panic_handler(_: &PanicInfo) -> ! {
+    loop {}
+}


I don't like this particular syntax very much. It is too close to existing impl $t:ty { } syntax when $t is an fn type.

#![feature(rustc_attrs)] impl fn(_: &core::panic::PanicInfo) -> ! { #[rustc_allow_incoherent_impl] pub fn what() {} }

Granted, there is no ambiguity at the moment for the actual syntax proposed here since an fn type can't specify a name (right?) plus you aren't really allowed to impl on an fn type anyway unless you are the standard library.

Given that users aren't allowed to have an impl block for fn types anyway, and that the syntax is unambiguous regardless, I'm not too worried about this.

But I also don't care that much about the syntax. We can consider other syntaxes before stabilization of course.

text/0000-externally-implementable-functions.md

Diggsey · 2024-05-10T11:46:05Z

If none are found, the result is either an error, or, if the extern impl fn has a default body, an implementation is generated that calls that default body.

I think the RFC needs to clarify how this works with different crate types. Presumably this check is not required when building a library crate (or else the feature would be useless). What about when building a dylib or cdylib?

Also, I think it would be good if the root (binary) crate could resolve the issue with multiple implementations by providing its own implementation.

m-ou-se · 2024-05-10T11:47:13Z

I think it would be good if the root (binary) crate could resolve the issue with multiple implementations by providing its own implementation.

That's a possible extension. But that's not what we currently have for the panic handler or global allocator, right?

fbstj · 2024-05-10T12:45:37Z

text/0000-externally-implementable-functions.md

+- The syntax re-uses existing keywords. Alternatively, we could:
+  - Use the `override` reserved keyword.
+  - Add a new (contextual) keyword (e.g. `existential fn`).
+  - Use an attribute (e.g. `#[extern_impl]`) instead.


should the following alternative be mentioned / discussed:

multiple impl's are allowed

the root crate must import the impl they want

the normal-default impl is imported via the prelude

also just had the thought: does use crateA::different_name as panic_handler work similar to how (I believe) it works for main?

burdges · 2024-05-10T12:53:23Z

Very nice RFC.

It's unlikely the root binary crate should provide these or micro-manage them too much.

It's most likely library crate features control these, which enables many options. If you do want micro-managment by the root binary crate then these could live in micro-crates too.

I'd think pub const and pub static would be the most likely future extensions here. pub fn groups could probably wait for more existential types discussions ala #2492

Co-authored-by: Josh Triplett <josh@joshtriplett.org>

m-ou-se · 2024-05-15T15:56:03Z

Process question for @rust-lang/lang: should this be an experimental feature gate instead of an RFC?

joshtriplett · 2024-05-16T11:49:21Z

@m-ou-se Definitely not "instead of", but possibly "in addition to". This feature absolutely needs an RFC in order to be a future stable feature, but it could also be an experimental feature gate in order to implement it without waiting for the RFC to be merged.

That said, I really hope we accept and merge this soon, and I'm hoping we get to it in the next lang meeting. I would like to prioritize it.

petrochenkov · 2024-05-17T12:57:27Z

At high level, I would expect something like this to be implemented using traits.

Basically we have

An interface defined in one place - possibly including multiple functions like with allocators, possibly having default implementaitons.
An implementation of that interface defined in a different place.

That screams "use traits for this", because all kinds of interfaces and implementations are always done through traits in Rust.
But the RFC defines something that is similar to trait impls, has its own set of surface rules similar to trait impls, but is not trait impls.

m-ou-se · 2024-05-17T14:42:47Z

That screams "use traits for this"

Yeah, I agree that would be a good fit, but then we basically end up with #2492, which was not accepted at the time because it involved too much complexity.

I'd be happy if we could pick that route and make that all work. This RFC is just an attempt to do something much more basic to start with, since a much more complicated change like #2492 seems unlikely to work out any time soon.

burdges · 2024-05-17T20:41:57Z

As written, linkers can do this resolution, no? Isn't that enough reason this should still exist, even if some non-linker-friendly trait based scheme emerges in 5 years or whatever?

jdonszelmann · 2024-05-17T22:28:54Z

Just like global registration I don't think you want to implement this feature through the linker, at the very least initially. Personally, I'd be worried about the errors that are generated and which are hard to bring to the same standard as other rust compiler errors. I also like this comment about it.

bjorn3 · 2024-05-18T09:14:25Z

I'd be worried about the errors that are generated and which are hard to bring to the same standard as other rust compiler errors.

My suggestion is to have the check for a downstream crate implementing it be in rustc, but the doing the actual tying up with the linker, like is currently done for #[panic_handler] and #[global_allocator] (for the latter the default is handled using a compiler generated shim, but when you actually use #[global_allocator] no shim is used at all.). In the end some form of tying up by the linker is necessary anyway unless you want to have a global constructor which at runtime sets a static to point to the external implementation of the function.

Co-authored-by: Josh Triplett <josh@joshtriplett.org>

joshtriplett · 2024-05-22T16:03:15Z

@rfcbot resolve statically-unused-extern-impl-fn

Resolved by the use of cfg.

petrochenkov · 2024-05-22T16:06:06Z

@m-ou-se

That screams "use traits for this"

Yeah, I agree that would be a good fit, but then we basically end up with #2492, which was not accepted at the time because it involved too much complexity.

I'd be happy if we could pick that route and make that all work. This RFC is just an attempt to do something much more basic to start with, since a much more complicated change like #2492 seems unlikely to work out any time soon.

I don't see why traits/impls used for this feature cannot be restricted to prohibit, for example, associated types or generics, if they complicate the minimal design.

joshtriplett · 2024-05-22T16:22:47Z

I'd be happy to see an implementation using traits that is initially restricted in what can appear in the trait, if that makes it simple enough to ship.

I do hope eventually we can ship a version that allows, for instance, associated constants, and that we can use those associated constants in generic bounds on functions in the standard library. (For instance, conditionally providing impl From<u64> for usize.) However, I know that'd be more complex, and I think we should ship something without that support first if we can do that more easily.

m-ou-se · 2024-05-22T16:36:26Z

I don't see why traits/impls used for this feature cannot be restricted to prohibit, for example, associated types or generics, if they complicate the minimal design.

So you're proposing basically #2492 with some restrictions to make the implementation much simpler?

If you have time to write down a more concrete proposal (not necessariliy an RFC, but some clear examples or something), that would be valuable.

petrochenkov · 2024-05-22T17:32:12Z

I don't have a concrete proposal, just a general suggestion to resyntax this

// log crate:

extern impl fn logger() -> Logger {
    Logger::default()
}

// user:

impl fn log::logger() -> Logger {
    Logger::to_stdout().with_colors()
}

into something like

// log crate:

#[some_extern_impl_attribute]
trait LoggerInterface {
  fn create_logger() -> Logger {
    Logger::default() // either default body for the default
  }
}

// or a separate impl for the default
struct DefaultLoggerCreator;
#[maybe_some_other_extern_impl_attribute_if_really_necessary]
impl LoggerInterface for DefaultLoggerCreator {
  fn create_logger() -> Logger {
      Logger::default()
  }
}

// user:

struct MyLoggerCreator;
#[maybe_some_third_extern_impl_attribute_if_really_necessary]
impl log::LoggerInterface for MyLoggerCreator {
  fn create_logger() -> Logger {
      Logger::to_stdout().with_colors()
  }
}

EDIT: Not just resyntax, this should have all the usual trait semantics (safety, visibility, signature subtyping, etc) until we reach the codegen stage.

joshtriplett · 2024-05-22T18:26:46Z

(Reiterating that all of this is speculation on a different proposal, not a blocker on this proposal.) I would generally expect that a trait-based proposal should separate the concept of implementing a trait from the concept of setting a specific implementer of that trait as a global. Or, in other words, something more like:

/// log

trait LoggerInterface {
  fn create_logger() -> Logger;
}

impl LoggerInterface for DefaultLoggerCreator {
    fn create_logger() -> Logger { ... }
}

pub extern type LoggerCreator: LoggerInterface = DefaultLoggerCreator;

// user code

impl log::LoggerInterface for MyLoggerCreator {
    fn create_logger() -> Logger { ... }
}

extern type log::LoggerCreator = MyLoggerCreator;

(With appropriate restrictions on what trait LoggerInterface can contain to make this implementable.)

traviscross · 2024-05-22T18:30:49Z

We discussed this in the lang meeting today. We developed a consensus that this is addressing an important problem and one that we would like to solve.

In the meeting, there were various alterations and alternatives put forward, including by @Amanieu and @tmandry. We also wanted to cross-check this against the recenty-accepted RFC:

Unsafe Extern Blocks #3484

That RFC adopts a conceptual separation between the unsafety of declaring an extern block (and verifying that the signatures within are correct) and the unsafety of calling (or otherwise using) an extern item that may have other invariants that may need to be upheld. We just need to check that whatever we do here is consistent with that conceptually (maybe it already is).

While we allow some time for these things, in the interest of not blocking experimentation, we've decided to approve this to go forward as a lang experiment under our process for that. @joshtriplett has offered to be the liaison.

We've opened a tracking issue for this experiment here:

Tracking Issue for externally implementable items rust#125418

traviscross · 2024-05-22T19:39:40Z

@rustbot labels -I-lang-nominated

Since the next step here is to discuss the full set of options, let's nominate the tracking issue in place of this RFC.

Amanieu · 2024-05-22T21:43:45Z

I wrote up my alternative proposal in #3645, which is heavily based on this one. It keeps the basic idea of just having functions that defined downstream and resolved by the linker, but changes the syntax to look more like traits. This works better for things like the global allocator which consists of multiple functions and allows safety to be defined separately on the trait (unsafe to implement) and its functions (unsafe to call).

tmandry · 2024-05-23T00:11:49Z

Move extern impls to blocks

My proposal is to move both the declarations and implementations into blocks. That would let us differentiate between functions that are unsafe to implement and functions that are unsafe to call. It would look something like this:¹

// alloc::global:

extern unsafe impl {
    fn allocate(layout: Layout) -> Result<NonNull<[u8]>, AllocError>;
    unsafe fn deallocate(ptr: NonNull<u8>, layout: Layout);
}

// user:

// Note the use of a path here – already allowed, except it's a module not a type!
// This means we won't have to add a new case to the syntax, and keeps things nicely grouped together.
unsafe impl alloc::global {
    fn allocate(layout: Layout) -> Result<NonNull<[u8]>, AllocError> {
        todo!()
    }
    
    unsafe fn deallocate(ptr: NonNull<u8>, layout: Layout) {
        todo!()
    }
}

From here we can, optionally, do the following:

Unify declarations with `extern "Rust" {}` blocks

We can make extern "Rust" {} work exactly like extern impl above. This includes preserving namespacing of function names, unlike the "C" ABI.

If we do this we should transition the default ABI inside extern {} blocks to be "Rust". This can be done over an edition.

Also as a delta to #3484, extern "Rust" {} blocks would not need unsafe. In fact, it wouldn't make sense to mark them as such, because the compiler checks the signatures for you. We would not want to use unsafe extern to mean "unsafe to override"; instead, we should keep the extern unsafe impl {} syntax.

The first proposal is forward-compatible with this one.

As I prepare to post this I see that it's quite close to a proposal for extern mod in the other thread, though it isn't clear where to hang the unsafe for impls in that proposal. ↩

Jules-Bertholet · 2024-05-23T00:43:17Z

Unify declarations with extern "Rust" {} blocks

No, just because you happen to be using the Rust ABI for some extern declarations, you should not thereby be forced to adopt the new compiler checks? (For example, maybe you are linking to an object file that was compiled with the same version of rustc, but is otherwise completely opaque to you.

Also, this "checked extern" feature would IIUC be able to support fns with generic type and const parameters, which is not something today's extern blocks can support; which suggests to my mind that these are distinct and non-unifiable features, despite their similarity.

tmandry · 2024-05-23T21:59:08Z

For the record I don't think extern "Rust" {} is useful for anything today (possibly due to a bug); it seems to rename the function according to the C ABI, but a matching definition does not. All my attempts to use it resulted in linker errors and the only results on Github are uses of the cxx crate.

For example, maybe you are linking to an object file that was compiled with the same version of rustc, but is otherwise completely opaque to you.

Given what I said above, that's a new feature. I would argue that the default Rust ABI should check externs and require an rmeta file at minimum. But we could add unsafe "unchecked" Rust externs in the future that work across, e.g., staticlib/cdylib boundaries.

Also, this "checked extern" feature would IIUC be able to support fns with generic type and const parameters, which is not something today's extern blocks can support; which suggests to my mind that these are distinct and non-unifiable features, despite their similarity.

I can see your argument here. Ultimately it is just a question of how much we decide to semantically group the features, despite various differences in their capabilities. It would be helpful to imagine how we might represent stable ABI boundaries and see how these would fit in (cc @Amanieu).

As I said in my comment above though, it would be fine to defer the question of unifying with extern "Rust" until later, since the first part of what I propose is forward compatible with it.

Jules-Bertholet · 2024-05-23T22:57:21Z

For the record I don't think extern "Rust" {} is useful for anything today (possibly due to a bug); it seems to rename the function according to the C ABI, but a matching definition does not.

Even with #[no_mangle]? That sounds like a bug

programmerjake · 2024-05-23T23:01:54Z

Even with #[no_mangle]? That sounds like a bug

I think he's complaining that #[no_mangle] is required rather than that #[no_mangle] doesn't work...

lolbinarycat · 2024-06-10T03:38:56Z

to conditionally provide make other functions

i assume this is a typo?

also, if i'm not mistaken, this same functionality can be provided with no_mangle and extern "Rust", cam it not?

programmerjake · 2024-06-10T03:49:21Z

also, if i'm not mistaken, this same functionality can be provided with no_mangle and extern "Rust", cam it not?

if there's a default body: only if your platform happens to support weak symbols, and even then it's unsafe.

m-ou-se added the T-lang Relevant to the language team, which will review and decide on the RFC. label May 10, 2024

m-ou-se changed the title ~~Add RFC for externally implementable functions.~~ [RFC] externally implementable functions. May 10, 2024

m-ou-se changed the title ~~[RFC] externally implementable functions.~~ [RFC] externally implementable functions May 10, 2024

m-ou-se force-pushed the extern-impl-fn branch from bfb4071 to ecdc89c Compare May 10, 2024 09:41

m-ou-se mentioned this pull request May 10, 2024

#[distributed_slice] aka a method to enumerate tests rust-lang/testing-devex-team#3

Open

bjorn3 reviewed May 10, 2024

View reviewed changes

text/0000-externally-implementable-functions.md Outdated Show resolved Hide resolved

bjorn3 reviewed May 10, 2024

View reviewed changes

epage reviewed May 10, 2024

View reviewed changes

text/0000-externally-implementable-functions.md Outdated Show resolved Hide resolved

m-ou-se force-pushed the extern-impl-fn branch from ecdc89c to 3b5e17e Compare May 10, 2024 09:56

programmerjake reviewed May 10, 2024

View reviewed changes

text/0000-externally-implementable-functions.md Outdated Show resolved Hide resolved

epage reviewed May 10, 2024

View reviewed changes

text/0000-externally-implementable-functions.md Outdated Show resolved Hide resolved

text/0000-externally-implementable-functions.md Outdated Show resolved Hide resolved

m-ou-se force-pushed the extern-impl-fn branch from 471fb81 to 7b9cdfd Compare May 10, 2024 10:09

m-ou-se force-pushed the extern-impl-fn branch from 7b9cdfd to 0934349 Compare May 10, 2024 10:40

Add RFC for externally implementable functions.

b0acfc6

m-ou-se force-pushed the extern-impl-fn branch from 0934349 to b0acfc6 Compare May 10, 2024 10:41

kennytm reviewed May 10, 2024

View reviewed changes

text/0000-externally-implementable-functions.md Outdated Show resolved Hide resolved

m-ou-se added the I-lang-nominated Indicates that an issue has been nominated for prioritizing at the next lang team meeting. label May 10, 2024

Update future possibilities.

13e0a58

Update.

c84a3c8

fbstj reviewed May 10, 2024

View reviewed changes

burdges mentioned this pull request May 15, 2024

Move std::io::Error out of std. rust-lang/project-error-handling#11

Open

m-ou-se and others added 2 commits May 15, 2024 17:48

Update text/0000-externally-implementable-functions.md

01fbc99

Co-authored-by: Josh Triplett <josh@joshtriplett.org>

Update.

d15cc37

Add note on cfg().

1b56e58

Co-authored-by: Josh Triplett <josh@joshtriplett.org>

traviscross mentioned this pull request May 22, 2024

Tracking Issue for externally implementable items rust-lang/rust#125418

Open

15 tasks

rustbot removed the I-lang-nominated Indicates that an issue has been nominated for prioritizing at the next lang team meeting. label May 22, 2024

Amanieu mentioned this pull request May 22, 2024

Externally implementable traits #3645

Open

		and outside (e.g. global logger). This just makes it much easier (and safer) to get right.

		# Rationale and alternatives

[RFC] externally implementable functions #3632

Are you sure you want to change the base?

[RFC] externally implementable functions #3632

Conversation

m-ou-se commented May 10, 2024 • edited Loading

jdonszelmann commented May 10, 2024 • edited Loading

bjorn3 May 10, 2024

Choose a reason for hiding this comment

programmerjake May 10, 2024

Choose a reason for hiding this comment

m-ou-se May 10, 2024

Choose a reason for hiding this comment

RalfJung May 10, 2024 • edited Loading

Choose a reason for hiding this comment

m-ou-se commented May 10, 2024

jdonszelmann commented May 10, 2024 • edited Loading

m-ou-se commented May 10, 2024

juntyr commented May 10, 2024

jdonszelmann commented May 10, 2024

kennytm May 10, 2024

Choose a reason for hiding this comment

m-ou-se May 13, 2024

Choose a reason for hiding this comment

Diggsey commented May 10, 2024

m-ou-se commented May 10, 2024

fbstj May 10, 2024

Choose a reason for hiding this comment

burdges commented May 10, 2024 • edited Loading

m-ou-se commented May 15, 2024

joshtriplett commented May 16, 2024

petrochenkov commented May 17, 2024

m-ou-se commented May 17, 2024

burdges commented May 17, 2024 • edited Loading

jdonszelmann commented May 17, 2024

bjorn3 commented May 18, 2024

joshtriplett commented May 22, 2024

petrochenkov commented May 22, 2024 • edited Loading

joshtriplett commented May 22, 2024

m-ou-se commented May 22, 2024

petrochenkov commented May 22, 2024 • edited Loading

joshtriplett commented May 22, 2024 • edited Loading

traviscross commented May 22, 2024

traviscross commented May 22, 2024

Amanieu commented May 22, 2024

tmandry commented May 23, 2024

Move extern impls to blocks

Unify declarations with extern "Rust" {} blocks

Footnotes

Jules-Bertholet commented May 23, 2024

tmandry commented May 23, 2024

Jules-Bertholet commented May 23, 2024

programmerjake commented May 23, 2024 • edited Loading

lolbinarycat commented Jun 10, 2024

programmerjake commented Jun 10, 2024

m-ou-se commented May 10, 2024 •

edited

Loading

jdonszelmann commented May 10, 2024 •

edited

Loading

RalfJung May 10, 2024 •

edited

Loading

jdonszelmann commented May 10, 2024 •

edited

Loading

burdges commented May 10, 2024 •

edited

Loading

burdges commented May 17, 2024 •

edited

Loading

petrochenkov commented May 22, 2024 •

edited

Loading

petrochenkov commented May 22, 2024 •

edited

Loading

joshtriplett commented May 22, 2024 •

edited

Loading

Unify declarations with `extern "Rust" {}` blocks

programmerjake commented May 23, 2024 •

edited

Loading