Safe Transmute RFC #5

jswrenn · 2020-08-03T15:37:22Z

Rendered

For a (quicker) summary of the proposed API surface, see this rustdoc.

rfcs/0000-safe-transmute.md

@jonas-schievink

typo fixes from @jonas-schievink Co-authored-by: Jonas Schievink <[email protected]>

rfcs/0000-safe-transmute.md

rylev · 2020-08-03T16:24:05Z

rfcs/0000-safe-transmute.md

+#### Stability & Transmutation
+A `Src` type is *stably* transmutable into a `Dst` type *only if* `<Src as PromiseTransmutableInto>::Archetype` is transmutable, stability notwithstanding, into `<Dst as PromiseTransmutableFrom>::Archetype`; formally:
+```rust
+unsafe impl<Src, Dst> TransmuteFrom<Src> for Dst


Hmmm... you mention TransmuteFrom here which you've hinted at above but never formally introduce, and you never explicitly state that this will be implemented by the compiler on behalf of the user automatically.

rylev · 2020-08-03T16:26:33Z

rfcs/0000-safe-transmute.md

+
+The type `<Src as PromiseTransmutableInto>::Archetype` exemplifies the furthest extreme of non-breaking changes that could be made to the layout of `Src` that could affect its use as a source type in transmutations. Conversely, `<Dst as PromiseTransmutableFrom>::Archetype` exemplifies the furthest extreme of non-breaking changes that could be made to the layout of `Dst` that could affect its use as a destination type in transmutations. If a transmutation between these extremities is valid, then so is `Src: TransmuteInto<Dst>`.
+
+#### Common Use-Case: Promising a Fixed Layout


I've mention this before but I would really love a short-hand for types that implement PromiseTransmutableFrom<Archetype=Self> and PromiseTransmutableInto<Archetype=Self> - something like FixedLayout.

This would also allow us to introduce FixedLayout first if we wanted to let that use case bake before we allowed flexibly declaring stability guarantees.

@rylev I'm unsure how useful this shorthand would be.

For end-users, I anticipate that PromiseTransmutableFrom and PromiseTransmutableInto will virtually never appear outside of the context of implementing those traits. In writing the examples for this RFC, I haven't yet encountered an instance where I needed to use either of those traits in a where bound. So, I don't think this shorthand would be useful outside of the context of implementing the traits.

And, within the context of implementing those traits, nearly all users will be using the shorthand #[derive(PromiseTransmutableFrom, PromiseTransmutableInto)]. So here, #[derive(FixedLayout)] would be a shorthand for a shorthand.

Is there a use-case you have in mind?

Short-hand for short-hand is not necessarily a bad thing 😄 . I believe a vast majority of the time that users will want #[derive(PromiseTransmutableFrom, PromiseTransmutableInto)]. Given the long names and the frequency of its use #[derive(FixedLayout)] strikes me as an ergonomic choice. But I realize that it's not super elegant to introduce a new trait just as a short-hand for deriving.

No, that's a better trait name for sure. People will understand it more when looking at docs and source.

@rylev Defining a trait is rather tricky (since those derives are a little more nuanced than just Archetype = Self to account for field stability declarations), but it's very technically feasible to provide a derive(PromiseTransmutableFromAndInto) macro that would be equivalent to derive(PromiseTransmutableFrom, PromiseTransmutableInto) — without a PromiseTransmutableFromAndInto trait.

Defining a trait is rather tricky (since those derives are a little more nuanced than just Archetype = Self to account for field stability declarations)

To clarify a little more about why having a FixedLayout shorthand/trait is tricky, it's because the concept of "fixed layout" is a little tricky.

First, there's the name: PromiseTransmutableFrom isn't really making a promise about layout, it's making a promise about future transmutability. Transmutability involves factors beyond layout, like constructability. If we wanted to create a derive shorthand for derive(PromiseTransmutableFrom, PromiseTransmutableInto), the name PromiseFixedLayout (or PromiseStableLayout, etc...) misrepresents what's actually being promised.

Second, there's the semantics: what does promising a fixed layout mean? You're probably mean to promise that you won't make changes to the type's transmutability, but what about others? What if your type's fields come from other crates? What if your type's fields are generic?

The behavior of #[derive(PromiseTransmutableFrom, PromiseTransmutableInto)] reflects this inherent complexity: it doesn't mean your type has totally-assured-transmutability, it means your type has as-assured-as-possible-transmutability. Concretely: When you write #[derive(PromiseTransmutable{From,Into})], your type inherits the instability of its fields. If its fields have strong stability promises, so too will your type; if its fields have weak stability promises, so too will your type.

To conclude, I don't think there's a good mapping of a mem-markers-style FixedLayout abstraction onto PromiseTransmutableFrom and PromiseTransmutableInto. While it might seem simpler, at first glance, to just present one trait (FixedLayout) instead of two (PromiseTransmutableFrom and PromiseTransmutableInto), the concept of a FixedLayout isn't well-defined, and the complexities of PromiseTransmutable{From,Into} reflect the complexity inherent to the problem of transmutation stability.

What I'm arguing here is that there is a use case for "POD" that will be quite common. By "POD" in this case I mean types where the in memory-representation of that type is intrinsic to its functionality and changing that would be a breaking change. This is common in the parsing scenario for instance. Having some sort of short hand for saying this type's layout/alignment/etc cannot change would be a simple "happy path" for many use cases. I'm not arguing this as a replacement to PromiseTransmutable{From,Into} but as a compliment. This doesn't have to be a derive either. I can imagine also #[repr(C, fixed)] for instance.

Per a conversation with @rylev, ed172fe proposes a #[derive(PromiseTransmutable)] ergonomic extension. I'm fairly convinced that adding such a shorthand is the right thing to do and I refer to it in the Guide-Level Explanation, but it's formally defined as an Extension because it's technically non-central to the RFC and poses a few unusual design quirks.

rfcs/0000-safe-transmute.md

rylev · 2020-08-03T16:35:59Z

rfcs/0000-safe-transmute.md

+
+The question is, then: *how can the author of a type reason about transmutations they did not write, from-or-to types they did not write?* We address this problem by introducing two traits which both allow an author to opt-in to stability guarantees for their types, and allow third-parties to reason at compile-time about what guarantees are provided for such types.
+
+#### `PromiseTransmutableFrom` and `PromiseTransmutableInto`


How do you feel about these names? Open to bike shedding?

I'm fond of these names because Promise reflects that the user implements these traits to make a promise, and Transmutable{From,Into} corresponds exactly to what they're promising. I'm quite open to bikeshedding, though!

rylev · 2020-08-04T15:44:39Z

First, thank you @jswrenn. This is an amazing document, and I think you're really on to something here.

In general, I'm a bit concerned that the flexibility brings a lot of the complexity forward when the user might often need to think about the complexity. That's what I suggested the #[derive(FixedLayout)] because I imagine this is the use-case for a large proportion of users (any change to the type would be considered a breaking change). I don't have any additional suggestions to help with this beyond good documentation that starts with the presumed common case and then expands to the more advanced use case where the flexibility is needed.

Beyond this, I'm wondering if validity currently captures two distinct ideas and whether it's worth it to treat these as separate. As far as I can tell validity captures both the idea of in memory values being valid representations of a type (e.g., 3u8 is not a valid representation of bool) as well as whether the value is a valid instance of the type given the current context. For instance, given a pointer wrapper type struct MyPointer(*const T), a usize is always valid to transmute to that type. In other words, all possible values of a usize could be a valid value of MyPointer, but a given usize might not be valid given a specific context (namely that a valid T doesn't reside at the memory address that the usize represents). Is it worth separating these ideas? Is it useful to say that usize -> MyPointer is always a valid transmute, but might not be a correct one?

rfcs/0000-safe-transmute.md

Resolves https://github.com/rust-lang/project-safe-transmute/pull/5/files#r464502312

jswrenn · 2020-08-04T16:05:03Z

As far as I can tell validity captures both the idea of in memory values being valid representations of a type (e.g., 3u8 is not a valid representation of bool) as well as whether the value is a valid instance of the type given the current context.

@rylev I think the distinction you're referring to is captured by the distinction the RFC draws between soundness and safety. usize -> MyPointer is a sound transmutation, because usize is a bit-valid instance of MyPointer. However, the resulting MyPointer value is not necessarily safe to use. The *const T field of MyPointer is private, and we should assume it's private because MyPointer imposes invariants on it that make it safe to dereference.

I actually like this MyPointer example quite a bit, because raw pointers are virtually always subjected to validity invariants by the types they appear in. I might change my example of implicit constructibility to instead use a type containing a raw pointer!

jswrenn

Per #5 (comment)

rfcs/0000-safe-transmute.md

rust-lang#5 (comment)

rfcs/0000-safe-transmute.md

rust-lang#5 (comment)

rfcs/0000-safe-transmute.md

Kelly -> Kelley

jswrenn

Fix #5 (comment)

rfcs/0000-safe-transmute.md

rust-lang#5 (comment)

rfcs/0000-safe-transmute.md

rust-lang#5 (comment)

rfcs/0000-safe-transmute.md

rust-lang#5 (comment) Co-authored-by: Florian Gilcher <[email protected]>

JulianKnodt · 2020-08-28T20:28:35Z

Just curious, why is the versioning baked into the type system? Is this not solved by Cargo and version locking? I think this was one of the largest complexities I found while reading the proposal

jswrenn · 2020-08-28T21:08:40Z

Just curious, why is the versioning baked into the type system? Is this not solved by Cargo and version locking? I think this was one of the largest complexities I found while reading the proposal

@JulianKnodt This is a great question!

Why do we need stability?

We need a stability system for at least two reasons:

The usual rules SemVer stability dictates that if a trait is implemented in version m.a.b, it'll continue to be implemented for all versions m.x.y, where x ≥ a and y ≥ b. TransmuteFrom<Src, NeglectStability> is the exception to this rule. I think the drawbacks of introducing exceptional behavior are small compared to the drawbacks of not solving safer transmutation, but it would be irresponsible to do nothing to mitigate this stability hazard. The compromise of this RFC is that TransmuteFrom should be stable-by-default: Dst: TransmuteFrom<Src> follows the usual SemVer rules; Dst: TransmuteFrom<Src, NeglectStability> does not.
I believe we'll need to use the simplified definition of constructability (or it will be a very long time before safer transmutation is realized), but that definition has a soundness hole. We have three options:
- Pretend it doesn't exist. I don't view this as a real option.
- Give up on completely safe transmutation; only offer unsafe transmutation. This option fails to remove any unsafe blocks from end-users code.
- Allow safe transmutation only when the type authors have promised they're not creating a situation where this soundness hole arises. This is the option the RFC recommends, and stability declaration provides a natural mechanism for making this promise.

Why is stability so complicated?

I go into a ton of depth exploring the implications and design rationale behind stability, but I think it might be one of the simpler parts of the RFC. Whereas ensuring soundness and safety requires non-trivial compiler support, stability doesn't—it's just two normal traits and an impl:

pub trait PromiseTransmutableFrom
{
    type Archetype
        : TransmuteInto<Self, NeglectStability>
        + PromiseTransmutableFrom;
}

pub trait PromiseTransmutableInto
{
    type Archetype
        : TransmuteFrom<Self, NeglectStability>
        + PromiseTransmutableInto;
}

unsafe impl<Src, Dst> TransmuteFrom<Src, ()> for Dst
where
    Src: PromiseTransmutableInto,
    Dst: PromiseTransmutableFrom,

    <Dst as PromiseTransmutableFrom>::Archetype:
        TransmuteFrom<
            <Src as PromiseTransmutableInto>::Archetype,
            NeglectStability>
{}

That's the entire stability system!

In short: I think the depth I go into when documenting stability might make it seem more complicated than it is. I'm increasingly thinking that removing the in-depth explanations of stability from the RFC might be a good idea.

UPDATE: I've incorporated this answer into the RFC. I've also removed the 'Dissecting Stability' and 'Uncommon Use-Case: Weak Stability Guarantees' sections. If anyone has objections, please let me know.

This improves the rendering of rustdoc, and leaves us free to remove that bound if need be without violating stability.

jswrenn · 2020-08-31T23:28:41Z

The RFC is ready to formally submit! I'm going to merge this PR. I'll file a PR against rust-lang/rfcs either tonight or tomorrow morning.

Continuation of rust-lang/project-safe-transmute#5

jswrenn · 2020-09-01T00:01:59Z

RFC submitted!

lovasoa · 2022-02-23T16:41:07Z

rfcs/0000-safe-transmute.md

+//    |                           ^^^^^^^^^^^^^^ the trait `TransmuteFrom<foo::Foo, _>` is not implemented for `u32`
+//    |
+//   = note: required because of the requirements on the impl of `TransmuteInto<u32, _>` for `foo::Foo`
+//   = note: byte 8 of the source type may be uninitialized; byte 8 of the destination type cannot be uninitialized.


Shouldn't this say "bit 8" instead of "byte 8" ?

Create 0000-safe-transmute.md

8e0dc26

This comment has been minimized.

Sign in to view

rylev mentioned this pull request Aug 3, 2020

Integrate prior art from Typic #3

Open

nikomatsakis mentioned this pull request Aug 3, 2020

project-safe-transmute rust-lang/lang-team#21

Open

jonas-schievink reviewed Aug 3, 2020

View reviewed changes

Apply suggestions from code review

651e3bf

typo fixes from @jonas-schievink Co-authored-by: Jonas Schievink <[email protected]>

jswrenn commented Aug 3, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

Fix derive expansion.

7a88370

rylev reviewed Aug 3, 2020

View reviewed changes

jswrenn commented Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

reword motivation

32bc4b9

Resolves https://github.com/rust-lang/project-safe-transmute/pull/5/files#r464502312

jswrenn commented Aug 4, 2020

View reviewed changes

Constrained -> NonEmptySlice

6ea3705

rust-lang#5 (comment)

rylev reviewed Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Show resolved Hide resolved

rylev reviewed Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

missing word

5eafe9f

rust-lang#5 (comment)

zachlute reviewed Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

zachlute reviewed Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

zachlute reviewed Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

zachlute reviewed Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Show resolved Hide resolved

jswrenn commented Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

Apply suggestions from code review

3a4e576

Kelly -> Kelley

jswrenn commented Aug 4, 2020

View reviewed changes

add lifetime parameter to NonEmptySlice

00167d6

rust-lang#5 (comment)

jswrenn commented Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

zachlute reviewed Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

TransmuteFrom connotes conversion

cf20426

rust-lang#5 (comment)

jswrenn commented Aug 4, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

skade reviewed Aug 27, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

jswrenn added 2 commits August 27, 2020 16:31

factor out extensions

c4f5d60

rework motivation; cut case studies

1fb9949

danielhenrymantilla reviewed Aug 28, 2020

View reviewed changes

rfcs/0000-safe-transmute.md Outdated Show resolved Hide resolved

jswrenn and others added 3 commits August 28, 2020 15:52

lingering Type -> Archetype

579a7a0

abusing -> using

8295f5b

rust-lang#5 (comment) Co-authored-by: Florian Gilcher <[email protected]>

update minimal impl

2eaa505

jswrenn added 14 commits August 28, 2020 17:18

transmutation definition

9081e53

union repr

099896a

REMOVE Dissecting Stability

7fe2146

REMOVE Uncommon Use-Case: Weak Stability Guarantees

ed02a09

syntax highlighting

c0063c4

rework stability rationale

079324e

note on endianness

19e629e

undefined *or* unspecified behavior

aadfec7

improve stability explanation

9ed6b6d

add explanatory note to error message

93d8394

prior art: haskell

b150c3f

typo: TransmutableFromArchetype -> TransmutableIntoArchetype

9fe396e

change casting section heading

61448fb

Move one of Archetype's bounds to where

3217d4a

This improves the rendering of rustdoc, and leaves us free to remove that bound if need be without violating stability.

jswrenn merged commit 038f193 into rust-lang:master Aug 31, 2020

jswrenn added a commit to jswrenn/rfcs that referenced this pull request Aug 31, 2020

Safer Transmute

1a8561e

Continuation of rust-lang/project-safe-transmute#5

jswrenn mentioned this pull request Aug 31, 2020

Safer Transmute rust-lang/rfcs#2981

Closed

kstasik-legion mentioned this pull request Aug 6, 2021

Structured Data Model legion-labs/legion#107

Closed

lovasoa reviewed Feb 23, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Safe Transmute RFC #5

Safe Transmute RFC #5

jswrenn commented Aug 3, 2020 •

edited

Loading

This comment has been minimized.

rylev Aug 3, 2020

rylev Aug 3, 2020

jswrenn Aug 3, 2020 •

edited

Loading

rylev Aug 4, 2020

Lokathor Aug 4, 2020

jswrenn Aug 4, 2020

jswrenn Aug 5, 2020

rylev Aug 5, 2020

jswrenn Aug 5, 2020

rylev Aug 3, 2020

jswrenn Aug 3, 2020

rylev commented Aug 4, 2020

jswrenn commented Aug 4, 2020

jswrenn left a comment

jswrenn left a comment

JulianKnodt commented Aug 28, 2020

jswrenn commented Aug 28, 2020 •

edited

Loading

jswrenn commented Aug 31, 2020

jswrenn commented Sep 1, 2020

lovasoa Feb 23, 2022


		The type `<Src as PromiseTransmutableInto>::Archetype` exemplifies the furthest extreme of non-breaking changes that could be made to the layout of `Src` that could affect its use as a source type in transmutations. Conversely, `<Dst as PromiseTransmutableFrom>::Archetype` exemplifies the furthest extreme of non-breaking changes that could be made to the layout of `Dst` that could affect its use as a destination type in transmutations. If a transmutation between these extremities is valid, then so is `Src: TransmuteInto<Dst>`.

		#### Common Use-Case: Promising a Fixed Layout


		The question is, then: how can the author of a type reason about transmutations they did not write, from-or-to types they did not write? We address this problem by introducing two traits which both allow an author to opt-in to stability guarantees for their types, and allow third-parties to reason at compile-time about what guarantees are provided for such types.

		#### `PromiseTransmutableFrom` and `PromiseTransmutableInto`

Safe Transmute RFC #5

Safe Transmute RFC #5

Conversation

jswrenn commented Aug 3, 2020 • edited Loading

This comment has been minimized.

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jswrenn Aug 3, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rylev commented Aug 4, 2020

jswrenn commented Aug 4, 2020

jswrenn left a comment

Choose a reason for hiding this comment

jswrenn left a comment

Choose a reason for hiding this comment

JulianKnodt commented Aug 28, 2020

jswrenn commented Aug 28, 2020 • edited Loading

Why do we need stability?

Why is stability so complicated?

jswrenn commented Aug 31, 2020

jswrenn commented Sep 1, 2020

Choose a reason for hiding this comment

jswrenn commented Aug 3, 2020 •

edited

Loading

jswrenn Aug 3, 2020 •

edited

Loading

jswrenn commented Aug 28, 2020 •

edited

Loading