-
Notifications
You must be signed in to change notification settings - Fork 26
Description
Clarify -i issue_id vs. the /work/cvdp_<datapoint>/harness/<issue_id>/ directory structure notation
It seems that there are two different things called "issue_id"?
There's the CLI arg -i / --id which seems to be called the "issue_id", and is a value like -i cvdp_agentic_fixed_arbiter_0001 (per the README example).
However, conversely, according to the README_NON_AGENTIC.md document (and likely others), the directory structure ALSO uses an issue_id:
work/cvdp_[datapoint]/
├── harness/[issue_id]/
│ ├── docs/ # Documentation files
│ ├── rtl/ # Hardware description files
│ ├── verif/ # Verification files
│ ├── rundir/ # Working directory
│ └── src/ # Source files (if applicable)
├── prompts/ # Stored prompts and responses
└── reports/ # Test results and analysis
It seems that the -i argument matches more closely to what's called a "datapoint" into the directory structure docs.
Reconciling this notation, and explaing in what cases the directory structure's issue_id number increments (which is commonly 1) would be very helpful.
I'd propose using "sequence number" to refer to subseqent attempts, in contrast with "ID" which is a reference to an input condition/test case.
I would have thought a new issue_id folder would be created for each k in pass@k (i.e., when k-1 failed, given k>=2), but it seems that's not the case:
./run_samples.py -f example_dataset/cvdp_v1.0.1_example_nonagentic_code_generation_no_commercial_with_solutions.jsonl -l -m gpt-4o-mini -n 5 -k 3 -p work_composite