Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unsafe Extern Blocks #3484

Merged
merged 29 commits into from
May 20, 2024
Merged
Changes from 6 commits
Commits
Show all changes
29 commits
Select commit Hold shift + click to select a range
63441ea
add rfc text
Lokathor Sep 7, 2023
726478f
Update text/0000-unsafe-extern-blocks.md
Lokathor Sep 7, 2023
47949ec
per https://github.com/rust-lang/rfcs/pull/3484#issuecomment-1758275493
Lokathor Mar 25, 2024
0637684
typo: missing "have"
Lokathor Mar 25, 2024
3fa1a61
corrections from Zulip feedback
Lokathor Mar 25, 2024
7754241
typo: remove "of"
Lokathor Mar 25, 2024
2144ac3
Update text/0000-unsafe-extern-blocks.md
Lokathor Mar 31, 2024
676383f
Update text/0000-unsafe-extern-blocks.md
Lokathor Apr 1, 2024
d5bb7db
Update text/0000-unsafe-extern-blocks.md
Lokathor Apr 2, 2024
6dba902
Cleanup whitespace
traviscross May 6, 2024
23f0acf
Improve wording of the drawback
traviscross May 6, 2024
1cef026
Improve wording of where `safe` is allowed
traviscross May 6, 2024
842bd55
Fix typo
traviscross May 6, 2024
176d73f
Clarify extent of UB
traviscross May 6, 2024
60631ce
Clarify what we're replacing in the Reference
traviscross May 6, 2024
fc53654
Add reference to Rust issue 46188
traviscross May 6, 2024
5cc4cc3
Clarify that we will "eventually" lint
traviscross May 6, 2024
4684d53
Unwrap lines
traviscross May 6, 2024
b423b2b
Lowercase "undefined behavior"
traviscross May 6, 2024
09a088c
Address feedback and questions
traviscross May 6, 2024
ca7713c
Add alternative of fixing LLVM (if it is a fix)
traviscross May 7, 2024
efc671c
Clarify about fixing LLVM despite C
traviscross May 7, 2024
2c106c3
Clarify about `unsafe_code` and edition migration
traviscross May 7, 2024
c1192da
Fix typo
traviscross May 7, 2024
9f36a92
Remove issue 46188 as a motivation
traviscross May 7, 2024
c198396
Remove unused "prior art" section
traviscross May 7, 2024
39795a0
Fix optionality of `safe`/`unsafe` in guide section
traviscross May 7, 2024
3b6ae2b
Prepare RFC 3484 to be merged
traviscross May 19, 2024
45590fe
Do some copyediting
traviscross May 19, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
121 changes: 121 additions & 0 deletions text/0000-unsafe-extern-blocks.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,121 @@

- Feature Name: `unsafe_extern`
- Start Date: 2023-05-23
- RFC PR: [rust-lang/rfcs#0000](https://github.com/rust-lang/rfcs/pull/0000)
- Rust Issue: [rust-lang/rust#0000](https://github.com/rust-lang/rust/issues/0000)

# Summary
[summary]: #summary

In Edition 2024 it is `unsafe` to declare an `extern` function or static, but external functions and statics *can* be safe to use after the initial declaration.

# Motivation
[motivation]: #motivation

Simply declaring extern items, even without ever using them, can cause Undefined Behavior.
When performing cross-language compilation, attributes on one function declaration can flow to the foreign declaration elsewhere within LLVM and cause a miscompilation.
traviscross marked this conversation as resolved.
Show resolved Hide resolved
In Rust we consider all sources of Undefined Behavior to be `unsafe`, and so we must make declaring extern blocks be `unsafe`.
The up-side to this change is that in the new style it will be possible to declare an extern fn that's safe to call after the initial unsafe declaration.

# Guide-level explanation
[guide-level-explanation]: #guide-level-explanation

Rust can utilize functions and statics from foreign code that are provided during linking, though it is `unsafe` to do so.

An `extern` block can be placed anywhere a function declaration could appear (generally at the top level of a module).

* On editions >= 2024, you *must* write all `extern` blocks as `unsafe extern`.
* On editions < 2024, you *may* write `unsafe extern`, or you can write an `extern` block without the `unsafe` keyword. Writing an `extern` block without the `unsafe` keyword is provided for compatibility only, and will generate a warning.
traviscross marked this conversation as resolved.
Show resolved Hide resolved
* `unsafe extern` interacts with the `unsafe_code` lint, and a `deny` or `forbid` with that lint will deny or forbid the unsafe external block.
traviscross marked this conversation as resolved.
Show resolved Hide resolved

Within an `extern` block is zero or more declarations of external functions and/or external static values.
An extern function is declared with a `;` instead of a function body (similar to a method of a trait).
An extern static value is also declared with a `;` instead of an expression (similar to an associated const of a trait).
In both cases, the actual function body or value is provided by whatever external source (which is probably not even written in Rust).

When an `unsafe extern` block is used, all declarations within that `extern` block *should* have the `unsafe` or `safe` keywords as part of their signature.
Lokathor marked this conversation as resolved.
Show resolved Hide resolved
If one of the two keywords is not explicitly provided, the declaration is assumed to be `unsafe`, and also a warning is generated.
Lokathor marked this conversation as resolved.
Show resolved Hide resolved
The `safe` keyword is a contextual keyword, it is currently only used within `extern` blocks.
traviscross marked this conversation as resolved.
Show resolved Hide resolved

If an `extern` block is used in an older edition without the `unsafe` keyword, declarations *cannot* specify `safe` or `unsafe`.
Code must update to `unsafe extern` style blocks if it wants to make `safe` declarations.

```rust
unsafe extern {
// sqrt (from libm) can be called with any `f64`
pub safe fn sqrt(x: f64) -> f64;

// strlen (from libc) requires a valid pointer,
// so we mark it as being an unsafe fn
pub unsafe fn strlen(p: *const c_char) -> usize;

// this function doesn't say safe or unsafe, so it defaults to unsafe
pub fn free(p: *mut core::ffi::c_void);
traviscross marked this conversation as resolved.
Show resolved Hide resolved

pub safe static IMPORTANT_BYTES: [u8; 256];

pub safe static LINES: SyncUnsafeCell<i32>;
}
```

`extern` blocks are `unsafe` because if the declaration doesn't match the actual external function, or the actual external data, then it causes compile time Undefined Behavior (UB).
traviscross marked this conversation as resolved.
Show resolved Hide resolved

Once they are unsafely declared, a `safe` item can be used outside the `extern` block as if it were any other safe function or static value declared within rust.
The unsafe obligation of ensuring that the correct items are being linked to is performed by the crate making the declaration, not the crate using that declaration.

Items declared as `unsafe` *must* still have a correctly matching signature at compile time, but they *also* have some sort of additional obligation for correct usage at runtime.
They can only be used within an `unsafe` block.

# Reference-level explanation
[reference-level-explanation]: #reference-level-explanation

The grammar of the langauge is updated so that:
traviscross marked this conversation as resolved.
Show resolved Hide resolved

* Editions >= 2024 *must* prefix all `extern` blocks with `unsafe`.
* Editions < 2024 *should* prefix `extern` blocks with `unsafe`, this is a warn-by-default compatibility lint when `unsafe` is missing.
traviscross marked this conversation as resolved.
Show resolved Hide resolved
traviscross marked this conversation as resolved.
Show resolved Hide resolved

Replace the *Functions* and *Statics* sections with the following:
traviscross marked this conversation as resolved.
Show resolved Hide resolved

### Functions
Functions within external blocks are declared in the same way as other Rust functions, with the exception that they must not have a body and are instead terminated by a semicolon. Patterns are not allowed in parameters, only IDENTIFIER or _ may be used. The function qualifiers `const`, `async`, and `extern` are not allowed. If the function is unsafe to call, then the function should use the `unsafe` qualifier. If the function is safe to call, then the function should use the `safe` qualifier (a contextual keyword). Functions that are not qualified as `unsafe` or `safe` are assumed to be `unsafe`.
traviscross marked this conversation as resolved.
Show resolved Hide resolved

If the function signature declared in Rust is incompatible with the function signature as declared in the foreign code it is Undefined Behavior to compile and link the code.

Functions within external blocks may be called by Rust code, just like functions defined in Rust. The Rust compiler will automatically use the correct foreign ABI when making the call.

When coerced to a function pointer, a function declared in an extern block has type
```rust
extern "abi" for<'l1, ..., 'lm> fn(A1, ..., An) -> R
```
where `'l1`, ... `'lm` are its lifetime parameters, `A1`, ..., `An` are the declared types of its parameters and `R` is the declared return type.

### Statics
Statics within external blocks are declared in the same way as statics outside of external blocks, except that they do not have an expression initializing their value. If the static is unsafe to access, then the static should use the `unsafe` qualifier. If the static is safe to access (and immutable), then the static should use the `safe` qualifier (a contextual keyword). Statics that are not qualified as `unsafe` or `safe` are assumed to be `unsafe`.
traviscross marked this conversation as resolved.
Show resolved Hide resolved

Extern statics can be either immutable or mutable just like statics outside of external blocks. An immutable static must be initialized before any Rust code is executed. It is not enough for the static to be initialized before Rust code reads from it. A mutable extern static is always `unsafe` to access, the same as a Rust mutable static.
Lokathor marked this conversation as resolved.
Show resolved Hide resolved

# Drawbacks
[drawbacks]: #drawbacks

* It is very unfortunate to have to essentially reverse the status quo.
traviscross marked this conversation as resolved.
Show resolved Hide resolved
* Hopefully, allowing people to safely call some foreign functions will make up for the churn caused by this change.

# Rationale and alternatives
[rationale-and-alternatives]: #rationale-and-alternatives

Incorrect extern declarations can cause UB in current Rust, but we have no way to automatically check that all declarations are correct, nor is such a thing likely to be developed. Making the declarations `unsafe` so that programmers are aware of the dangers and can give extern blocks the attention they deserve is the minimum step.

# Prior art
[prior-art]: #prior-art

None we are aware of.

# Unresolved questions
[unresolved-questions]: #unresolved-questions

* Extern declarations are actually *always* unsafe and able to cause UB regardless of edition. This RFC doesn't have a specific answer on how to improve pre-2024 code.

# Future possibilities
[future-possibilities]: #future-possibilities

None are apparent at this time.