Encoding traits, Path + Entry impls #23

sgwilym · 2024-07-09T11:21:18Z

- Add a compact_width module
- Add a max_power module
- Add Encoder and Decoder traits which work with Ufotofu consumers and producers
- Implement Encoder + Decoder on PathRc
- Implement Encoder + Decoder on Entry
- Add an earthstar crate with nascent NamespaceId and IdentityId structs
  - These are built on top of a WIP Cinn25519 implementation with validated shortnames.
  - I did this to implement Arbitrary on these types so we could use realistic values in fuzz tests, as per "Willow parameters for testing?" (#17).

This is already enough to review; I will do the relative encodings in a separate branch off of this.

Still needs:

- A non-local Ufotofu dependency

Frando · 2024-07-18T12:58:11Z

Hi, I am having a look at this at the moment and the implications for usage in iroh-willow.

My question is: Is it necessary to couple the Encoder and Decoder traits to ufotofu?

My guess is that the reasoning is that with this design encode is async and can yield on-the-fly once the write buffer is full? In my encoder trait in iroh-willow, the trait has a required method encoded_len instead, and when sending I yield if the buffer doesn't fit the full message. The cost is, I guess, that I have to calculate the message length in advance. However, I would think that this is reasonably cheap.

With the trait coupled to ufotofu traits it becomes near-impossible to use willow-rs struct encoders and decoders if the higher-level protocol implementation does not use ufotofu but other stream and channel primitives, no? I did not yet look into implementing the ufotofu traits for other channels though.

Edit: After another look at ufotofu, I think I can implement the traits for my channel struct. Will report back once I have tried it out.

AljoschaMeyer · 2024-07-18T13:42:20Z

@Frando

> The cost is, I guess, that I have to calculate the message length in advance.

To me, the true cost is the write buffer itself. You need to allocate it, and you need to copy its contents into the channel primitive. The ufotofu design allows for zero-copy serialisation, where you write directly to the channel (even if its internal buffer is smaller than the total size of the encoding) without intermediate allocations.

This is a win, and the resulting code is also more (or, at least, quite) elegant (for example, the entry encoder).
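A minimal sketch of the idea, assuming a simplified synchronous interface: the channel hands out whatever part of its small internal buffer is free, and the encoder fills it fragment by fragment, so no buffer the size of the whole encoding is ever allocated. `SlotConsumer`, `TinyChannel`, and `encode_bytes` are hypothetical names for illustration and are not the ufotofu API (which is async and richer than this).

```rust
use std::convert::Infallible;

/// Hypothetical, synchronous stand-in for a channel (NOT the ufotofu traits).
trait SlotConsumer {
    type Error;
    /// Expose a non-empty writable part of the internal buffer,
    /// flushing first if the buffer is full.
    fn slots(&mut self) -> Result<&mut [u8], Self::Error>;
    /// Mark the first `n` bytes of the last exposed slots as written.
    fn commit(&mut self, n: usize);
}

/// A channel whose internal buffer holds only 4 bytes.
/// `sink` stands in for the underlying transport.
struct TinyChannel {
    buf: [u8; 4],
    filled: usize,
    sink: Vec<u8>,
}

impl TinyChannel {
    fn new() -> Self {
        TinyChannel { buf: [0; 4], filled: 0, sink: Vec::new() }
    }

    /// Push the buffered bytes to the transport.
    fn flush(&mut self) {
        self.sink.extend_from_slice(&self.buf[..self.filled]);
        self.filled = 0;
    }
}

impl SlotConsumer for TinyChannel {
    type Error = Infallible;

    fn slots(&mut self) -> Result<&mut [u8], Infallible> {
        if self.filled == self.buf.len() {
            self.flush();
        }
        Ok(&mut self.buf[self.filled..])
    }

    fn commit(&mut self, n: usize) {
        self.filled += n;
    }
}

/// Encode `data` in fragments whose size is dictated by the consumer.
fn encode_bytes<C: SlotConsumer>(data: &[u8], c: &mut C) -> Result<(), C::Error> {
    let mut rest = data;
    while !rest.is_empty() {
        let slot = c.slots()?;
        let n = slot.len().min(rest.len());
        slot[..n].copy_from_slice(&rest[..n]);
        c.commit(n);
        rest = &rest[n..];
    }
    Ok(())
}
```

Encoding ten bytes into a `TinyChannel` and then calling `flush` delivers all ten bytes to `sink`, even though the channel buffer never held more than four at a time.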

The big drawback is having to adopt ufotofu, of course. For obvious reasons I don't consider that a drawback conceptually, but I cannot talk away the real engineering cost (and risk?), especially since the ufotofu codebase is immature and still undergoing some changes. In particular, you'd need to code against the slice_helpers branch, which should more accurately be called refactor_all_the_things.

I'd be up for a call to help with ufotofu integration and just general high-bandwidth sharing.

Frando · 2024-07-18T14:46:42Z

> You need to allocate it, and you need to copy its contents into the channel primitive.

Not necessarily. My encoder trait encodes into &mut [u8] with the invariant that the slice is at least self.encoded_len() bytes long.

My Channel::send takes item: impl Encoder, checks if the available buffer is at least item.encoded_len, yields if not, and then calls item.encode with a mut slice of the channel buffer.

It does not use MaybeUninit, but that is an orthogonal concern.

I do like the design of the ufotofu traits; I am just unclear whether the coupling to these new and still unstable traits is required for the quite low-level encoder trait.

Edit: Links to the traits in iroh-willow:

traits: https://github.com/n0-computer/iroh/blob/willow/iroh-willow%2Fsrc%2Futil%2Fcodec.rs

usage in channel send:
https://github.com/n0-computer/iroh/blob/willow/iroh-willow%2Fsrc%2Futil%2Fchannel.rs#L234

(exact signatures are different from what I wrote pseudocodish above but conceptually the same)
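A minimal sketch of the encoded_len-based design described in this comment, assuming a synchronous `try_send` that returns `false` where the real async send would yield. All names here (`Encoder`, `encode_into`, `Message`, `Channel`, `try_send`) are hypothetical and do not reflect the actual iroh-willow signatures.

```rust
/// Hypothetical encoder trait modelled on the description above.
trait Encoder {
    /// Exact number of bytes `encode_into` will write.
    fn encoded_len(&self) -> usize;
    /// Invariant: `out` is at least `self.encoded_len()` bytes long.
    fn encode_into(&self, out: &mut [u8]);
}

/// Example message: a single u32, encoded as 4 big-endian bytes.
struct Message(u32);

impl Encoder for Message {
    fn encoded_len(&self) -> usize {
        4
    }

    fn encode_into(&self, out: &mut [u8]) {
        out[..4].copy_from_slice(&self.0.to_be_bytes());
    }
}

/// Channel with a fixed-capacity buffer (hypothetical).
struct Channel {
    buf: Vec<u8>,
    len: usize,
}

impl Channel {
    fn with_capacity(cap: usize) -> Self {
        Channel { buf: vec![0; cap], len: 0 }
    }

    /// Encode `item` directly into the free part of the channel buffer.
    /// Returns `false` if the whole message does not fit; an async
    /// implementation would yield at this point and retry later.
    fn try_send(&mut self, item: &impl Encoder) -> bool {
        let n = item.encoded_len();
        if self.buf.len() - self.len < n {
            return false;
        }
        item.encode_into(&mut self.buf[self.len..self.len + n]);
        self.len += n;
        true
    }
}
```

Note that in this sketch a message can only ever be sent if the channel buffer is at least `encoded_len()` bytes in total, which is the constraint the following replies discuss.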

Note that I am not opposed to the traits introduced here conceptually, but interested if the encoding can be made less coupled to the io traits without perf or ergonomic downsides.

AljoschaMeyer · 2024-07-18T15:19:06Z

> My encoder trait encodes into &mut [u8] with the invariant that the slice is at least self.encoded_len() bytes long.

I'm not sure whether we are talking past each other. I'm basically interested in the setting where the buffer of the channel (say, the OS-managed buffer to write TCP data into) is smaller than the encoding. As far as I understand the sentence I quoted, you'd require a channel that can allocate arbitrarily large buffers on demand, which is what the ufotofu design avoids.

Then again, the code you linked uses io::Write and not &mut [u8], and io::Write does not have this problem. Which is why I feel like I'm misunderstanding something.

In any case, the ufotofu design is pretty much the same as that of io::Write, except it fixes a couple of glaring issues (hardcoded item and error type (the latter being important for encoding), inability to pipe from reader to writer without an intermediate buffer, weirdness around uninitialised memory).
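To illustrate the first of these points, here is a minimal sketch of a consumer trait whose item and error types are associated types instead of being hardcoded to `u8` and `io::Error`. The names `Consumer`, `Bounded`, `Full`, and `encode_u16_be` are hypothetical; this is a simplified synchronous stand-in, not the ufotofu traits themselves.

```rust
/// Hypothetical consumer with generic item and error types.
trait Consumer {
    type Item;
    type Error;
    fn consume(&mut self, item: Self::Item) -> Result<(), Self::Error>;
}

/// Error of the example consumer below: no capacity left.
#[derive(Debug, PartialEq)]
struct Full;

/// A consumer that accepts at most `cap` items of any type `T`.
struct Bounded<T> {
    items: Vec<T>,
    cap: usize,
}

impl<T> Consumer for Bounded<T> {
    type Item = T;
    type Error = Full;

    fn consume(&mut self, item: T) -> Result<(), Full> {
        if self.items.len() == self.cap {
            return Err(Full);
        }
        self.items.push(item);
        Ok(())
    }
}

/// An encoder only constrains the item type to bytes; the error type
/// is whatever the consumer reports, passed through unchanged.
fn encode_u16_be<C: Consumer<Item = u8>>(value: u16, c: &mut C) -> Result<(), C::Error> {
    for byte in value.to_be_bytes() {
        c.consume(byte)?;
    }
    Ok(())
}
```

Because the error is an associated type, a caller of `encode_u16_be` sees the consumer's own error (`Full` here) and can match on it precisely, which is harder when every failure is folded into a single fixed error type.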

> It does not use MaybeUninit, but that is an orthogonal concern.

Agreed. I keep switching between being happy or unhappy with the ufotofu approach. But irrespective of those feelings, it will stay the way it is, because it should have the ability to work with uninitialised memory. I just wish it was less of a pain...

> interested if the encoding can be made less coupled to the io traits without perf or ergonomic downsides

Yeah, I share the sentiment, but I didn't find a satisfying way to do it. I think the ability for the channel to dictate fragment sizes is crucial, and that is what the "io[-like] traits" are all about. Buf and BufMut from bytes are surprisingly close in some respects, but they have no error handling...

In my opinion, there is simply a void in terms of popular traits/apis in the rust ecosystem for general, io-like traits. And ufotofu fills that void in a - to me - satisfying manner. It just isn't a popular API (yet?).

Feel free to reach out on discord if you want to schedule a call.

sgwilym · 2024-07-19T17:29:55Z

Now with zero-copy Path changes merged in.

AljoschaMeyer · 2024-07-20T10:21:08Z

@Frando Just published ufotofu version 0.2.0 fyi, that one should be fairly stable to code against.

sgwilym added 4 commits July 5, 2024 09:52

Add compact width encoding utils

4809236

Add path encodings + some fuzz testing

7980ea0

Use TestConsumer in path fuzz test

31da9a3

Add random path encoding fuzz test

f3ef506

sgwilym added the enhancement label Jul 9, 2024

sgwilym added 4 commits July 9, 2024 15:39

Entry encoding start...

9c99d05

Earthstar crate, Entry encoding + fuzz, Encoder + Decoder traits

a7529f3

Impl Encoder on PathRc

36dd89d

Implement Encoder + Decoder on Entry

8684a96

sgwilym requested a review from AljoschaMeyer July 10, 2024 15:50

AljoschaMeyer reviewed Jul 11, 2024 (20 review threads, all resolved):

- data-model/src/encoding/compact_width.rs (outdated)
- data-model/src/encoding/compact_width.rs (outdated)
- data-model/src/encoding/compact_width.rs (outdated)
- data-model/src/encoding/compact_width.rs (outdated)
- data-model/src/encoding/error.rs (outdated)
- data-model/src/encoding/max_power.rs (outdated)
- data-model/src/encoding/parameters.rs (outdated)
- data-model/src/encoding/parameters.rs (outdated)
- data-model/src/encoding/parameters.rs (outdated)
- data-model/src/entry.rs
- data-model/src/entry.rs (outdated)
- data-model/src/entry.rs (outdated)
- data-model/src/entry.rs (outdated)
- data-model/src/entry.rs
- data-model/src/parameters.rs (outdated)
- data-model/src/path.rs (outdated)
- data-model/src/path.rs (outdated)
- data-model/src/path.rs (outdated)
- fuzz/fuzz_targets/entry_encoding.rs
- fuzz/fuzz_targets/entry_encoding_random.rs

sgwilym marked this pull request as ready for review July 11, 2024 09:26

sgwilym added 4 commits July 11, 2024 11:54

Move Consumer / Producer param out of Encoder / Decoder trait and into fn

bdaf353

Improve Encoder / Decoder docs

7554380

Refactor CompactWidth a bit

57ac313

Implement U8BE, U16BE, U32BE, U64BE + Encode, Decode traits

12e77ea

AljoschaMeyer reviewed Jul 11, 2024 (2 review threads, both resolved):

- data-model/src/encoding/compact_width.rs (outdated)
- data-model/src/encoding/unsigned_int.rs (outdated)

sgwilym added 10 commits July 12, 2024 06:34

Make CompactWidth::new pub(crate)

1804222

Use size_of

d2e69c1

Pull max_power encoding out into its own fn

4ede526

Remove unnecessary length check

bfde231

Better UintBE rustdocs

8da1e47

Genericise encoding fuzz tests into a reusable fn

1de2102

Add UintBE fuzz tests

17f5e2d

Refactor IsAuthorisedWrite

8856458

Arbitrary feature gating

c748d90

Implement Error on EncodingConsumerError, DecodeError

7fad5c1

sgwilym force-pushed the encodings branch from 557681f to 7fad5c1 July 14, 2024 15:02

sgwilym mentioned this pull request Jul 14, 2024

Relative encodings #24

Merged

6 tasks

Merge branch 'zerocopypath'

4f967ba

sgwilym added 2 commits July 20, 2024 10:41

Use ufotofu 0.2.0, fix some warnings

b92529f

Merge branch 'main' into encodings

61bb7ab

sgwilym merged commit 06457bd into main Jul 20, 2024
1 check passed

sgwilym deleted the encodings branch July 20, 2024 09:44
