Fuzzy tests #61

matklad · 2018-09-08T06:37:32Z

There are several things that the parser self-checks for during parsing:

- It might get stuck during error recovery (code)
- A Marker might be dropped without being completed or abandoned (code)
- It might produce a tree with invalid block structure (code)

We need to fuzz/property test the parser to verify that this does not happen in practice.

To that end, we need to implement:

- fuzzing with arbitrary &str inputs: this will probably be most useful for the lexer
- fuzzing with arbitrary (invalid) Rust code. Arbitrary Rust code might be acquired by taking a sample of real Rust code and applying random edits to it.
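A minimal sketch of the "real code plus random edits" idea, for illustration only. The `XorShift` generator, the `mutate` function, and the `FRAGMENTS` list are made-up names, not anything from the parser crate; a real setup would more likely lean on a fuzzing engine's own mutators.

```rust
/// Tiny xorshift64 PRNG so the sketch needs no external crates.
/// The seed must be non-zero.
struct XorShift(u64);

impl XorShift {
    fn next_u64(&mut self) -> u64 {
        let mut x = self.0;
        x ^= x << 13;
        x ^= x >> 7;
        x ^= x << 17;
        self.0 = x;
        x
    }

    /// Pseudo-random value in `0..n` (`n` must be non-zero).
    fn below(&mut self, n: usize) -> usize {
        (self.next_u64() % n as u64) as usize
    }
}

/// Apply `edits` random insertions or deletions to `src`. Edits always land
/// on char boundaries, so the result is still a valid `&str` input.
fn mutate(src: &str, edits: usize, rng: &mut XorShift) -> String {
    // Hypothetical fragment list: tokens likely to disturb block structure.
    const FRAGMENTS: &[&str] = &["{", "}", "(", ")", "fn ", "match ", ";", "|", "[", "]"];
    let mut text = src.to_string();
    for _ in 0..edits {
        // Valid edit positions: every char start plus the end of the string.
        let mut cuts: Vec<usize> = text.char_indices().map(|(i, _)| i).collect();
        cuts.push(text.len());
        let at = cuts[rng.below(cuts.len())];
        if rng.below(2) == 0 {
            text.insert_str(at, FRAGMENTS[rng.below(FRAGMENTS.len())]);
        } else if at < text.len() {
            text.remove(at); // removes the whole char starting at `at`
        }
    }
    text
}

fn main() {
    let mut rng = XorShift(0x9E37_79B9_7F4A_7C15);
    let sample = "fn main() { match x { 1 => (), _ => () } }";
    println!("{}", mutate(sample, 5, &mut rng));
}
```

Seeding the generator explicitly keeps any crash reproducible: the same seed and sample always yield the same mutated input.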

As described in rust-lang#61, fuzz testing some parts of this would be ~~fun~~ helpful. So, I started with the most trivial fuzzer I could think of: Put random stuff into File::parse and see what happens. To speed things up, I also did cp src/**/*.rs fuzz/corpus/parser/ in the `crates/libsyntax2/` directory (running the fuzzer once will generate the necessary directories).

killercup · 2018-09-08T15:01:56Z

Made a small PR with the most trivial fuzzer: #63. With the rs files from the libsyntax2 crate as corpus it quickly goes up to a coverage of 20199 (probably code paths but I don't know libfuzzer well enough; it's a high number compared to other fuzz targets, though).

Aaaaan it even found a crash! https://gist.github.com/killercup/a0be6701333a635de9dec6cffc89aedb

killercup · 2018-09-08T15:03:05Z

Two other interesting approaches: vegard/prog-fuzz@c80b1a7 and creating edits from commits

matklad · 2018-09-08T15:09:03Z

> Aaaaan it even found a crash!

That's a good sign! Today I've made sure that the parser deals with rust-lang/rust, with fail-tests and all, so this failure must be non-trivial.

I'll write a fix shortly with a description of how to fix these things: I expect this won't be the sole fuzzing error :)

> With the rs files from the libsyntax2 crate as corpus it quickly goes up to a coverage of 20199

As a bonus objective, I think it makes sense to add a `cargo gen-fuzz-corpus path-to-dir-with-sources` command to tools, so that we can easily generate a corpus on CI using libsyntax sources.
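A rough sketch of what such a corpus generator could do, assuming it only needs to collect `.rs` files into one flat directory. The function name `gen_fuzz_corpus` and the default paths are illustrative guesses based on the paths mentioned in this thread, not an existing tool.

```rust
use std::fs;
use std::io;
use std::path::Path;

/// Recursively copy every `.rs` file under `src` into `corpus`, flattening
/// the directory structure. Returns the number of files copied.
/// Assumes `corpus` is not located inside `src`.
fn gen_fuzz_corpus(src: &Path, corpus: &Path) -> io::Result<usize> {
    fs::create_dir_all(corpus)?;
    let mut copied = 0;
    let mut stack = vec![src.to_path_buf()];
    while let Some(dir) = stack.pop() {
        for entry in fs::read_dir(&dir)? {
            let path = entry?.path();
            if path.is_dir() {
                stack.push(path);
            } else if path.extension().map_or(false, |ext| ext == "rs") {
                // Prefix with a counter so same-named files do not collide.
                let name = path.file_name().unwrap().to_string_lossy();
                fs::copy(&path, corpus.join(format!("{copied}-{name}")))?;
                copied += 1;
            }
        }
    }
    Ok(copied)
}

fn main() -> io::Result<()> {
    let mut args = std::env::args().skip(1);
    let src = args.next().unwrap_or_else(|| "src".to_string());
    let corpus = args.next().unwrap_or_else(|| "fuzz/corpus/parser".to_string());
    let n = gen_fuzz_corpus(Path::new(&src), Path::new(&corpus))?;
    println!("copied {n} files into {corpus}");
    Ok(())
}
```

Flattening with a counter prefix is a design shortcut: the fuzzer only cares about file contents, so the original directory layout does not need to survive.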

killercup · 2018-09-08T15:13:49Z

Oh yes it most definitely is a non-trivial failure, it took the fuzzer almost 3 minutes to find – compared to most other projects where the first failures show up after 3 seconds! ;)

This is just the very first stab at things! I think I'll have a few hours to kill waiting at an airport tomorrow and might just integrate this with your tools crate and also throw other fuzzers at it. I've been meaning to make fuzzer definitions more generic anyway. And maybe after I've understood how to use libsyntax2 I can also use it to write a tool to auto-generate fuzzers for all functions that take a &[u8] or &str as input!

matklad · 2018-09-08T15:19:10Z

@killercup is it running the parser with debug-assertions on? One check ({} block validity) is predicated on them. Ideally I think it should use something like --release + debug_assertions?

> And maybe after I've understood how to use libsyntax2 I can also use it to write a tool to auto-generate fuzzers for all functions that take a &[u8] or &str as input!

Yep, I think libsyntax2, even in its current state, might be a good tool for the job (although right now syn might be better?). Like, in theory you could even make an intention out of it, so that you place the cursor over the function in the editor, press ctrl+., and the fuzzer is auto-generated :) That's probably too special-case to be an intention, but it's a fun possibility, and, super long term, custom project-specific intentions are definitely on the roadmap.

63: Add trivial fuzzer for parser r=matklad a=killercup Co-authored-by: Pascal Hertleif

matklad · 2018-09-08T15:21:47Z

Oh, one more useful thing to do would be to add a short note to the readme, explaining how to actually run this thing :)

matklad · 2018-09-08T15:39:27Z

Ah, the delicious taste of dog food: I've copy-pasted your example as a test, and of course the plugin is now half-broken, because the indexing thread panics :D

matklad · 2018-09-08T16:12:58Z

Added a fix with a long explanation of what actually is going on: #64 (second commit msg).

If your fuzzers find more errors like these, you now know how to fix 'em!

killercup · 2018-09-09T08:50:28Z

Wow, a5c333c has a beautiful commit message!

I have indeed found another crash (output). I'll see if I can fix it later this afternoon!

killercup · 2018-09-09T08:57:50Z

> is it running the parser with debug-assertions on?

Totally forgot to answer this one! Yes:

     Running `fuzz/target/x86_64-apple-darwin/debug/parser -artifact_prefix=/Users/pascal/Projekte/tools/libsyntax2/crates/libsyntax2/fuzz/artifacts/parser/ fuzz/artifacts/parser/`

And good point. I'll also run it in release to see if there is an unknown dependency on this somewhere (or something that fails differently without overflow checks)!

Edit: Yes, it also crashes with --release (also atom_pat -> err_recover -> at -> nth). (For added fun, the fuzzer changed a whole bunch of bytes to \x00, so I rcat --quoted the example input to be usable as a rust binary string.)
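For anyone without a quoting tool at hand, here is a small sketch of turning raw fuzzer bytes into a Rust byte-string literal. It is a hand-rolled stand-in built on `std::ascii::escape_default`, not what `rcat --quoted` does internally, and `to_byte_string_literal` is a made-up name.

```rust
/// Render arbitrary bytes as a Rust byte-string literal such as `b"fn\x00"`.
fn to_byte_string_literal(bytes: &[u8]) -> String {
    let mut out = String::from("b\"");
    for &byte in bytes {
        // `escape_default` keeps printable ASCII as-is and turns everything
        // else into `\n`, `\"`, `\\`, `\xNN`, ..., all valid inside `b"..."`.
        out.extend(std::ascii::escape_default(byte).map(char::from));
    }
    out.push('"');
    out
}

fn main() {
    // Illustrative input only, not the actual crashing artifact.
    let input = b"fn f() { match x { \x00\x00 => } }";
    println!("{}", to_byte_string_literal(input));
}
```

The printed literal can be pasted straight into a regression test, which avoids losing the NUL bytes to an editor or terminal.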

matklad · 2018-09-09T09:10:21Z

> I'll also run it in release to see if there is an unknown dependency on this somewhere (or something that fails differently without overflow checks)!

I've refactored code a bit, so that for fuzzing we always do that block check, even in release mode:
ba4a697

killercup · 2018-09-09T09:12:29Z

Cool! When using `cargo fuzz`, it'll also pass `--cfg fuzzing` FYI
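To make the interplay of `debug_assertions` and `--cfg fuzzing` concrete, here is a minimal sketch. It is not the code from ba4a697: `blocks_are_balanced` and `validate_block_structure` are hypothetical names, and the check runs over plain text instead of the parser's real tree.

```rust
/// Stand-in for the real tree check: returns true if every `}` closes an
/// earlier `{` and no block is left open at the end.
fn blocks_are_balanced(text: &str) -> bool {
    let mut depth: usize = 0;
    for c in text.chars() {
        match c {
            '{' => depth += 1,
            '}' => match depth.checked_sub(1) {
                Some(d) => depth = d,
                None => return false, // closing brace without an opener
            },
            _ => {}
        }
    }
    depth == 0
}

/// Run the self-check in debug builds and whenever the crate is compiled
/// with `--cfg fuzzing`, even if that build is `--release`.
#[allow(unexpected_cfgs)] // `fuzzing` is a custom cfg set from the outside
fn validate_block_structure(text: &str) {
    if cfg!(any(debug_assertions, fuzzing)) {
        assert!(blocks_are_balanced(text), "invalid block structure");
    }
}

fn main() {
    validate_block_structure("fn f() { if x { } }");
    println!("block structure ok");
}
```

Using `cfg!` inside an `if` keeps the check compiled and type-checked in every configuration, while the optimizer can still drop it when neither condition holds.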

matklad · 2018-09-09T10:57:08Z

> I have indeed found another crash

BTW, I am seeing the same match_arm_list in the trace (https://gist.github.com/killercup/254c98a31d921972ecc88e2c3c35ecad#file-gistfile1-txt-L28), so I suspect it might be the same crash.

YaLTeR · 2018-09-12T07:42:20Z

Ran the fuzzer and found this: https://gist.github.com/YaLTeR/2d0c734a0cca830e136a59af3891f0b2

EDIT: and another one, although this contains match_arm_list so it might be the same as the other crash in this thread: https://gist.github.com/YaLTeR/16bc702d06a76f8192bcb2e17d9cd0ae

matklad · 2018-09-12T08:28:55Z

Thanks! First one fixed in b6f8037. Let me take a look at the second one...

EDIT: the second one is actually the same. It's not a match_arm_list, it's a slice_pat though. Refactored code a bit to also have pat_list in that case.

DJMcNab · 2018-12-31T12:20:44Z

I think the current action on this is just to run these every few weeks for a little bit to see if there's any code which can trigger a panic. I wonder if there's any neat way to remind us of this automatically - any ideas?

matklad · 2018-12-31T12:31:23Z

> I wonder if there's any neat way to remind us of this automatically - any ideas?

A travis cron job?

matklad · 2018-12-31T12:34:11Z

So yeah, there are two bits I want to add to the current setup to close the issue:

- add a cargo fuzz-tests task (which should trick me into finally running this on my machine :) )
- add an automated way to run fuzzing on CI. A daily cron job to run fuzzing for half an hour seems ideal.
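A sketch of how a task could bound the fuzzing time, assuming cargo-fuzz is installed and the target is called `parser` as elsewhere in this thread. `fuzz_command` is a made-up helper; `-max_total_time` is libFuzzer's option for stopping after a number of seconds.

```rust
use std::process::Command;

/// Build `cargo fuzz run <target> -- -max_total_time=<seconds>`.
/// Everything after `--` is handed to libFuzzer itself.
fn fuzz_command(target: &str, seconds: u64) -> Command {
    let mut cmd = Command::new("cargo");
    cmd.args(["fuzz", "run", target, "--"])
        .arg(format!("-max_total_time={seconds}"));
    cmd
}

fn main() {
    // Half an hour, as suggested for the daily job.
    let cmd = fuzz_command("parser", 30 * 60);
    // Printing keeps the sketch side-effect free; a CI task would instead
    // call `status()` on the command and fail the job on a non-zero exit.
    println!("{:?}", cmd);
}
```

Keeping the time limit as a parameter means the same helper could serve both a short local run and a longer scheduled one.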

A further improvement would be high-level fuzzing, on the ra_analysis level: creating an (in-memory) Rust project and running random diagnostics, completions and edits on it, but that is definitely a separate issue.

393: Add a fuzzing subcommand r=matklad a=DJMcNab Part of #61 (comment). Co-authored-by: DJMcNab

matklad · 2019-01-01T13:26:57Z

Only cron job is left. Here are the docs: https://docs.travis-ci.com/user/cron-jobs/.

memoryruins · 2019-01-28T15:05:44Z

Left it running overnight; no crashes yet :)

#6248625        REDUCE cov: 28690 ft: 163494 corp: 4204/91Kb lim: 110 exec/s: 234 rss: 605Mb L: 9/108 MS: 1 EraseBytes-
#6251047        REDUCE cov: 28690 ft: 163494 corp: 4204/91Kb lim: 110 exec/s: 234 rss: 605Mb L: 67/108 MS: 2 ShuffleBytes-EraseBytes-
#6251189        REDUCE cov: 28690 ft: 163494 corp: 4204/91Kb lim: 110 exec/s: 234 rss: 605Mb L: 37/108 MS: 2 ChangeBinInt-EraseBytes-
#6251406        NEW    cov: 28690 ft: 163503 corp: 4205/91Kb lim: 110 exec/s: 234 rss: 605Mb L: 109/109 MS: 2 InsertByte-CopyPart-
#6253052        REDUCE cov: 28690 ft: 163503 corp: 4205/91Kb lim: 110 exec/s: 234 rss: 605Mb L: 28/109 MS: 1 EraseBytes-

skade · 2020-04-14T15:53:21Z

Hm, travis cron jobs are not really meant for long-running tasks. Is there a maximum time we want to run this?

I would recommend a service such as oss-fuzz:
https://google.github.io/oss-fuzz/
https://google.github.io/oss-fuzz/getting-started/accepting-new-projects/

We'd need to apply, though.

matklad · 2020-04-15T08:38:08Z

> Is there a maximum time we want to run this?

Yeah, that's what I had in mind: limit fuzzing time to, say, five minutes every night. We've recently added some extra time-consuming checks on nightly.

> We'd need to apply, though.

My gut feeling is that, at this stage, fuzzing would add relatively little benefit. So I'd rather avoid integrating with services besides the ones we already have.

matklad · 2020-07-15T18:04:59Z

We have some fuzzing tests set up, but no automated fuzzing on CI. Given the number of known bugs, I don't think it's worthwhile to invest time into this at this point.

matthiaskrgr · 2024-01-06T19:25:08Z

Just for the record, I have fed rust-lang/glacier and rust-lang/rust + around 30% of random files contained in the history of rust-lang/rust into rust-analyzer highlight, and filtered out a couple of crashes:

#16288
#16287
#16286
#16284
#16283
#16282
#16281
#16280
#16278

I can do mutation-based fuzzing based on randomly mutated (using tree-splicer) rust files using https://github.com/matthiaskrgr/icemaker/ if anyone is interested :)

matklad added help wanted E-medium fun A technically challenging issue with high impact labels Sep 8, 2018

killercup mentioned this issue Sep 8, 2018

Add trivial fuzzer for parser #63

Merged

matklad removed the help wanted label Dec 31, 2018

DJMcNab mentioned this issue Dec 31, 2018

Add a fuzzing subcommand #393

Merged

bors bot added a commit that referenced this issue Dec 31, 2018

Merge #393

700b334

matklad closed this as completed Jul 15, 2020

sify21 mentioned this issue Sep 16, 2020

ra_lsp_server keeps hogging one core #2812

Closed
