Full experiment with the new C-specific prompt #370

DonggeLiu · 2024-06-21T07:45:36Z

Running a full experiment with the new C-specific prompt.

DonggeLiu · 2024-06-21T07:46:58Z

@DavidKorczynski: Are there any other changes required to use the C-specific prompt?
Would you recommend updating the benchmarks before the exp?

DonggeLiu · 2024-06-21T08:13:38Z

/gcbrun exp -n dg

DavidKorczynski · 2024-06-21T09:28:08Z

> @DavidKorczynski: Are there any other changes required to use the C-specific prompt?

No

> Would you recommend updating the benchmarks before the exp?

Yeah, I don't think it makes much sense to try this on C++ unless we're testing that it doesn't break the workflow -- I wouldn't expect the generated prompts to produce good C++ harnesses. We can use these benchmarks for pure C projects only: #371

Adds a set of benchmarks that have been extracted for the projects

- clib
- htslib
- croaring
- kamailio
- opensips
- unit
- ntpsec
- bind9
- libyang
- miniz
- mdbtools
- libiec61850

and using the oracles

- far-reach-low-coverage
- low-cov-with-fuzz-keyword
- easy-params-far-reach

Ref: #370

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski · 2024-06-21T12:27:50Z

/gcbrun exp -n dk-9000 -b c-specific

DonggeLiu · 2024-06-24T03:05:23Z

The experiment report with c-benchmarks only:
https://llm-exp.oss-fuzz.com/Result-reports/scheduled/2024-06-23-weekly-all/

~~## The incorrect header error persists.~~
Take bind9 as an example:
The prompt provided the full header path (/src/bind9/lib/dns/include/dns/view.h), but the LLM still used an incorrect path (/src/bind9/lib/isc/include/dns/view.h), which highlights the need to fix this programmatically.
BTW, the LLM generated the correct header with the default prompt.

~~I reckon to avoid regression, we can:~~

~~Only fix the header when compilation fails due to this.~~
~~Fix header programmatically: 1) Parse the wrong header from the error message, 2) find the correct file path, 3) replace the wrong line with the correct path, 4) compile the fuzz target again.~~

We are trying a different solution to this.
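
For reference, steps 1-3 of the struck-through proposal above could be sketched roughly as follows. This is only an illustration, not the approach being adopted and not code from this repository: the helper names are hypothetical, the list of known header paths is assumed to come from elsewhere (for example project introspection data), and the error pattern assumes clang's `fatal error: '<path>' file not found` message. Step 4 (recompiling the fuzz target) is left to the caller.

```python
import re
from pathlib import PurePosixPath

# Hypothetical helpers for illustration only; none of these names exist in the repo.
# Assumes clang's "fatal error: '<path>' file not found" diagnostic format.
_MISSING_HEADER = re.compile(r"fatal error: '([^']+)' file not found")


def parse_missing_header(compiler_stderr):
    """Step 1: return the header path the compiler could not find, or None."""
    match = _MISSING_HEADER.search(compiler_stderr)
    return match.group(1) if match else None


def find_correct_header(missing, known_headers):
    """Step 2: pick the known header sharing the longest path suffix."""
    missing_parts = PurePosixPath(missing).parts
    if not missing_parts:
        return None
    best, best_len = None, 0
    for candidate in known_headers:
        cand_parts = PurePosixPath(candidate).parts
        # The file name itself must match before comparing directories.
        if not cand_parts or cand_parts[-1] != missing_parts[-1]:
            continue
        shared = 0
        for a, b in zip(reversed(missing_parts), reversed(cand_parts)):
            if a != b:
                break
            shared += 1
        if shared > best_len:
            best, best_len = candidate, shared
    return best


def replace_include(source, missing, correct):
    """Step 3: rewrite only the #include lines that name the missing header."""
    fixed = []
    for line in source.splitlines():
        if line.lstrip().startswith("#include") and missing in line:
            line = line.replace(missing, correct)
        fixed.append(line)
    return "\n".join(fixed)
```

To match the "only fix the header when compilation fails due to this" condition, a caller would run these helpers only when `parse_missing_header` returns a path, and would keep the original fuzz target if `find_correct_header` returns `None`.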

## Missing function implementation

Here is an example.

@DavidKorczynski will you address these?

DonggeLiu · 2024-06-24T08:13:32Z

The code-bison-32k LLM failed to generate results for some benchmarks because it is not available in certain regions.
Re-running with the same config as 2024-06-23-weekly-all/ (C benchmarks only), this time using Gemini-1.5:
https://llm-exp.oss-fuzz.com/Result-reports/scheduled/2024-06-24-weekly-all/

DonggeLiu added the Experiment-only label (a PR only to run experiments; do not merge it to main) Jun 21, 2024

DonggeLiu requested a review from DavidKorczynski June 21, 2024 07:45

DavidKorczynski mentioned this pull request Jun 21, 2024

benchmarks: add c-specific set #371

Merged

Use the new C-specific prompt

9add280

DonggeLiu force-pushed the full-exp-c-specific branch from 4339083 to 9add280 Compare June 21, 2024 10:36

This was referenced Jun 22, 2024

core: find where functions are declared in header files ossf/fuzz-introspector#1624

Merged

webapp: limit size of sample references ossf/fuzz-introspector#1626

Merged

Replace all with C-only benchmarks for testing

6dde591

DonggeLiu force-pushed the full-exp-c-specific branch from ded4eb8 to 6dde591 Compare June 22, 2024 23:51

Merge branch 'main' into full-exp-c-specific

2b8d7dd
