Added doc on simulator API. #565

mwhittaker · 2023-08-25T17:15:43Z

No description provided.

spetrovic77 · 2023-08-25T18:30:54Z

docs/randomized_testing.md

+        Gen:  func(*rand.Rand) int { ... },
+        Func: func(ctx context.Context, replies map[int]int, x int, a A) error {
+            if y, err := a.A(ctx, x); err == nil {
+                replies[x] = y


I don't like this option because it's difficult to explain what instance of replies will be passed to the function. In reality, for each "workload", you'll be creating a state, and running a series of ops on it. But this isn't as encapsulated as it is in Option 3.

spetrovic77 · 2023-08-25T18:31:31Z

docs/randomized_testing.md

+    return nil
+}
+
+func (*Workload) GenCallA(r *rand.Rand) int {


We discussed possibly defining a generator as a different type, like we did for routing.

Replied to your other comment with an example of this proposal.

spetrovic77 · 2023-08-25T18:32:27Z

docs/randomized_testing.md

+
+### Decision
+
+NOTE(mwhittaker): I don't love any of these options. I have a slight preference


I vote for Option 3 or a tweaked version of it, because it encapsulates a "run" or a "workload" really well, along with its state etc.

Here are a few cosmetic tweaks to Option 3 that might make it more appealing. First, we can move the generator to a separate struct, as you suggested. Second, we can change the Init method to return all fakes using the existing weavertest.Fake function. Note that I'm showing the simulator in a sim package, but it will probably end up in the weavertest package as well.

type Workload struct { a weaver.Ref[A] b weaver.Ref[B] replies map[int]int } func (w *Workload) Init(context.Context) ([]weavertest.FakeComponent, error) { w.replies = map[int]int{} return []weavertest.FakeComponent{weavertest.Fake[B](&fakeb{})}, nil } func (w *Workload) CallA(ctx context.Context, x int) error { if y, err := w.a.Get().A(ctx, x); err == nil { w.replies[x] = y } return nil } func (w *Workload) CallB(ctx context.Context) error { w.b.Get().B(ctx) return nil } type Generator struct {} func (Generator) CallA(r *rand.Rand) int { ... } func TestWorkload(t *testing.T) { s, err := sim.New[Workload, Generator](sim.Options{...}) if err != nil { t.Fatal(err) } results, err := s.Simulate(10 * time.Second) if err != nil { t.Fatal(err) } }

mwhittaker · 2023-08-30T00:25:42Z

@ghemawat @spetrovic77 Added the options we discussed today to the doc.

ghemawat

Thanks for the clear writeup of the options and the pros and cons. Very helpful.

docs/randomized_testing.md

ghemawat · 2023-08-30T16:07:50Z

docs/randomized_testing.md

+
+**Cons.**
+Magic struct tags. Also, it is awkward to fake a component to which we don't
+need a direct reference.


What if we change to something like:

type Workload struct { a weaver.Ref[A] b weavertest.Fake[B,fakeb] replies map[int]int }

Would Fake[I,F] be something applicable to weavertest as well, which might make it more palatable.

E.g., chain_test.go's fake could instead by supplied as follows:

runner.Test(t, func(t *testing.T, a weavertest.Fake[A, fake]) { f := a.Fake() // type *fake i := a.Component() // type A ... })

We can then get rid of weavertest.Fake and weavertest.FakeComponent. We should try writing some user documentation for this and see how it looks.

Added this option to the doc. I can work on writing user docs.

docs/randomized_testing.md

ghemawat · 2023-08-30T16:12:49Z

docs/randomized_testing.md

+- The `Generator[T]` interface is clunky. A value of type `T` generates a value
+  of type `T`, which is weird. It means that op arguments are doubling as
+  generators and generated values. Plus, the user sometimes has to typecast
+  these arguments. We saw that in the `CallA` method above.


Instead of type-casting, we should perhaps have a method. That will provide two benefits:

The representation of Generator[T] can hold more state than T.

It will reduce the chance of typos like the user saying byte(x) instead of int(x).

I agree that typecasting is both un-ergonomic and error-prone. I described one approach to a Get method in Option 8 which separates the Generate method and Get method across two different interfaces, but we can maybe merge them into a single interface.

ghemawat · 2023-08-30T16:15:38Z

docs/randomized_testing.md

+- If a workload has many ops that don't re-use types much, this approach is more
+  onerous that defining generator methods.
+- The `Generator[T]` interface is clunky. A value of type `T` generates a value
+  of type `T`, which is weird. It means that op arguments are doubling as


What if we renamed the Generate() method to Init() and have it modify the receiver. E.g.,

func (i *myInt) Init(r *rand.Rand) { *i = myInt(r.Intn(100)) } }

I'm still working on addressing the other comments, but I've been thinking more about how to specify generators. The approach I like the most is a Frankenstein combination of the existing proposals. First, we introduce a handful of interfaces:

type Generator[T any] interface { Generate(r *rand.Rand) T } type Shrinker[T any] interface { Shrink(T) []T } type Formatter[T any] interface { Format(T) string }

Generator[T] generates values of type T.

Shrinker[T] shrinks values, which is used for minimization.

Formatter[T] pretty prints values, which is used for visualization.

I think these interfaces are as simple as possible, without any subtleties or oddities of some of the other proposals.

Next, workload methods have plain argument types:

type Workload struct { ... } func (w *Workload) Foo(ctx context.Context, x int, y string) error { ... } func (w *Workload) Bar(ctx context.Context, z bool) error { ... }

There is no casting, no Get method, and no confusion about a type doubling as its generator. The user then has to define a separate struct with one method for every op in their workload struct. Methods return the Generators used to generate arguments to the ops.

type WorkloadGenerator struct {} func (WorkloadGenerator) Foo() (Generator[int], Generator[string]) { ... } func (WorkloadGenerator) Bar() (Generator[bool]) { ... }

To make implementing these methods easier, we provide a family of general purpose Generators. For example, sim.Int, sim.String, sim.ReadableString, and so on. We can also provide basic functions to build Generators. For example, Pick[T any](xs ...T) Generator[T] returns a Generator that randomly returns the provided elements. Here's how we might implement WorkloadGenerator:

type WorkloadGenerator struct {} func (WorkloadGenerator) Foo() (Generator[int], Generator[string]) { return sim.NegativeInt, sim.Pick("a", "b", "c") } func (WorkloadGenerator) Bar() (Generator[bool]) { return sim.BiasedFlip(0.75) }

Of course, users are free to implement their own custom instances of Generator[T] as well. Moreover, a returned Generator[T] may optionally implement the Shrinker[T] or Formatter[T] interfaces. The general purpose Generators we provide all implement the Shrinker and Formatter interfaces automatically.

If a user finds themselves re-using the same Generator across a bunch of methods, they can use helper functions to avoid some of the repetition. Imagine a key-value store based workload, for example:

func key() Generator[string] { return sim.Pick("a", "b", "c") } func value() Generator[string] { return sim.ReadableString } type StoreGenerator struct {} func (StoreGenerator) Get() Generator[string] { return key() } func (StoreGenerator) Set() (Generator[string], Generator[string]) { return key(), value() } func (StoreGenerator) Swap() (Generator[string], Generator[string]) { return key(), key() } func (StoreGenerator) Delete() Generator[string] { return key() }

The main downside of this approach is that it requires by far the most typing out of any approach. The more I thought about this though, the less it bothered me. I started to think of it similar to how Go implements errors and how code is littered with if err != nil { ... } code. It's more typing, but there's not really any subtleties or gotchas. I started to feel that this approach is the easiest to understand, but the most annoying to type out.

Finally, a small but related point. We've discussed the idea of generators having state. I started to think this is a bad idea and muddies the otherwise simple abstraction of a generator. I think all state should be in the workload, and the generator is left as a stateless and relatively independent entity.

Formatter

can we use the normal "func (...) String() string" API instead?

mwhittaker

Thanks Sanjay!

mwhittaker · 2023-08-30T22:28:19Z

docs/randomized_testing.md

+- If a workload has many ops that don't re-use types much, this approach is more
+  onerous that defining generator methods.
+- The `Generator[T]` interface is clunky. A value of type `T` generates a value
+  of type `T`, which is weird. It means that op arguments are doubling as


I'm still working on addressing the other comments, but I've been thinking more about how to specify generators. The approach I like the most is a Frankenstein combination of the existing proposals. First, we introduce a handful of interfaces:

type Generator[T any] interface { Generate(r *rand.Rand) T } type Shrinker[T any] interface { Shrink(T) []T } type Formatter[T any] interface { Format(T) string }

Generator[T] generates values of type T.

Shrinker[T] shrinks values, which is used for minimization.

Formatter[T] pretty prints values, which is used for visualization.

I think these interfaces are as simple as possible, without any subtleties or oddities of some of the other proposals.

Next, workload methods have plain argument types:

type Workload struct { ... } func (w *Workload) Foo(ctx context.Context, x int, y string) error { ... } func (w *Workload) Bar(ctx context.Context, z bool) error { ... }

There is no casting, no Get method, and no confusion about a type doubling as its generator. The user then has to define a separate struct with one method for every op in their workload struct. Methods return the Generators used to generate arguments to the ops.

type WorkloadGenerator struct {} func (WorkloadGenerator) Foo() (Generator[int], Generator[string]) { ... } func (WorkloadGenerator) Bar() (Generator[bool]) { ... }

To make implementing these methods easier, we provide a family of general purpose Generators. For example, sim.Int, sim.String, sim.ReadableString, and so on. We can also provide basic functions to build Generators. For example, Pick[T any](xs ...T) Generator[T] returns a Generator that randomly returns the provided elements. Here's how we might implement WorkloadGenerator:

type WorkloadGenerator struct {} func (WorkloadGenerator) Foo() (Generator[int], Generator[string]) { return sim.NegativeInt, sim.Pick("a", "b", "c") } func (WorkloadGenerator) Bar() (Generator[bool]) { return sim.BiasedFlip(0.75) }

Of course, users are free to implement their own custom instances of Generator[T] as well. Moreover, a returned Generator[T] may optionally implement the Shrinker[T] or Formatter[T] interfaces. The general purpose Generators we provide all implement the Shrinker and Formatter interfaces automatically.

If a user finds themselves re-using the same Generator across a bunch of methods, they can use helper functions to avoid some of the repetition. Imagine a key-value store based workload, for example:

func key() Generator[string] { return sim.Pick("a", "b", "c") } func value() Generator[string] { return sim.ReadableString } type StoreGenerator struct {} func (StoreGenerator) Get() Generator[string] { return key() } func (StoreGenerator) Set() (Generator[string], Generator[string]) { return key(), value() } func (StoreGenerator) Swap() (Generator[string], Generator[string]) { return key(), key() } func (StoreGenerator) Delete() Generator[string] { return key() }

The main downside of this approach is that it requires by far the most typing out of any approach. The more I thought about this though, the less it bothered me. I started to think of it similar to how Go implements errors and how code is littered with if err != nil { ... } code. It's more typing, but there's not really any subtleties or gotchas. I started to feel that this approach is the easiest to understand, but the most annoying to type out.

Finally, a small but related point. We've discussed the idea of generators having state. I started to think this is a bad idea and muddies the otherwise simple abstraction of a generator. I think all state should be in the workload, and the generator is left as a stateless and relatively independent entity.

mwhittaker

Thanks Sanjay! I addressed the rest of your comments.

docs/randomized_testing.md

mwhittaker · 2023-09-05T18:09:29Z

docs/randomized_testing.md

+        Gen:  func(*rand.Rand) int { ... },
+        Func: func(ctx context.Context, replies map[int]int, x int, a A) error {
+            if y, err := a.A(ctx, x); err == nil {
+                replies[x] = y


mwhittaker · 2023-09-05T18:10:23Z

docs/randomized_testing.md

+    return nil
+}
+
+func (*Workload) GenCallA(r *rand.Rand) int {


Replied to your other comment with an example of this proposal.

docs/randomized_testing.md

mwhittaker · 2023-09-05T18:14:55Z

docs/randomized_testing.md

+- The `Generator[T]` interface is clunky. A value of type `T` generates a value
+  of type `T`, which is weird. It means that op arguments are doubling as
+  generators and generated values. Plus, the user sometimes has to typecast
+  these arguments. We saw that in the `CallA` method above.


I agree that typecasting is both un-ergonomic and error-prone. I described one approach to a Get method in Option 8 which separates the Generate method and Get method across two different interfaces, but we can maybe merge them into a single interface.

mwhittaker · 2023-09-05T18:22:37Z

docs/randomized_testing.md

+
+**Cons.**
+Magic struct tags. Also, it is awkward to fake a component to which we don't
+need a direct reference.


Added this option to the doc. I can work on writing user docs.

mwhittaker · 2023-09-05T21:20:37Z

docs/randomized_testing.md

+- People can easily forget to register a generator for every type.
+- People can register two generators for the same type.
+
+### Decision


I added the decision we discussed on offline. I'll start implementing things, and we can tweak stuff as we go.

See #565 for details.

See ServiceWeaver#565 for details.

Added doc on simulator API.

2e244ce

mwhittaker requested review from rgrandl, spetrovic77 and ghemawat August 25, 2023 17:15

mwhittaker self-assigned this Aug 25, 2023

mwhittaker marked this pull request as ready for review August 25, 2023 17:15

spetrovic77 reviewed Aug 25, 2023

View reviewed changes

Added more options to the testing doc.

0a1412d

mwhittaker force-pushed the sim_api_doc branch from 88a81be to 0a1412d Compare August 30, 2023 00:24

ghemawat reviewed Aug 30, 2023

View reviewed changes

mwhittaker commented Aug 30, 2023

View reviewed changes

mwhittaker added the simulator label Aug 30, 2023

Addressed Sanjay's comments.

6c066eb

mwhittaker commented Sep 5, 2023

View reviewed changes

Added final random testing decision (for now).

29c73bd

mwhittaker commented Sep 5, 2023

View reviewed changes

mwhittaker added a commit that referenced this pull request Sep 6, 2023

Added rough draft of simulator API.

d8b67a7

See #565 for details.

mwhittaker added a commit that referenced this pull request Sep 6, 2023

Added rough draft of simulator API.

9c2f3a9

See #565 for details.

mwhittaker mentioned this pull request Sep 6, 2023

Added rough draft of simulator API. #588

Merged

mwhittaker added a commit that referenced this pull request Sep 8, 2023

Added rough draft of simulator API.

54bcadd

See #565 for details.

mwhittaker added a commit that referenced this pull request Sep 8, 2023

Added rough draft of simulator API. (#588)

047c33f

See #565 for details.

htiennv pushed a commit to htiennv/weaver that referenced this pull request Sep 9, 2023

Added rough draft of simulator API. (ServiceWeaver#588)

30a5ebd

See ServiceWeaver#565 for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added doc on simulator API. #565

Added doc on simulator API. #565

mwhittaker commented Aug 25, 2023

spetrovic77 Aug 25, 2023

mwhittaker Sep 5, 2023

spetrovic77 Aug 25, 2023

mwhittaker Sep 5, 2023

spetrovic77 Aug 25, 2023

mwhittaker Aug 25, 2023 •

edited

Loading

mwhittaker commented Aug 30, 2023

ghemawat left a comment

ghemawat Aug 30, 2023

mwhittaker Sep 5, 2023

ghemawat Aug 30, 2023

mwhittaker Sep 5, 2023

ghemawat Aug 30, 2023

mwhittaker Aug 30, 2023

ghemawat Sep 5, 2023

mwhittaker left a comment

mwhittaker Aug 30, 2023

mwhittaker left a comment

mwhittaker Sep 5, 2023

mwhittaker Sep 5, 2023

mwhittaker Sep 5, 2023

mwhittaker Sep 5, 2023

mwhittaker Sep 5, 2023


		### Decision

		NOTE(mwhittaker): I don't love any of these options. I have a slight preference

Added doc on simulator API. #565

Are you sure you want to change the base?

Added doc on simulator API. #565

Conversation

mwhittaker commented Aug 25, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mwhittaker Aug 25, 2023 • edited Loading

Choose a reason for hiding this comment

mwhittaker commented Aug 30, 2023

ghemawat left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mwhittaker left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mwhittaker left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mwhittaker Aug 25, 2023 •

edited

Loading