Skip to content
This repository has been archived by the owner on Jul 31, 2024. It is now read-only.

Latest commit

 

History

History
15 lines (14 loc) · 672 Bytes

README.md

File metadata and controls

15 lines (14 loc) · 672 Bytes

Dataset

  • We share the dataset split we used in our experiments: data.zip and parallel_functions.zip.
    • Download them by running bash download.sh
  • data.zip contains the AVATAR dataset. Total number of examples in the AVATAR dataset is 9515. sources:
    • CodeForces - 2193
    • AtCoder - 871
    • AIZU - 1043
    • CodeJam - 120
    • GeeksforGeeks - 5019
    • LeetCode - 107
    • ProjectEuler - 162
  • We extract a collection of parallel standalone functions (parallel_functions.zip) from AVATAR.
    • It consists of 3391 examples for training.
    • We use the validation and testing dataset collected from GeeksforGeeks released with TransCoder.