Skip to content

Docs: Clarify what "issue_id" means #27

@parker-research

Description

@parker-research

Clarify -i issue_id vs. the /work/cvdp_<datapoint>/harness/<issue_id>/ directory structure notation

It seems that there are two different things called "issue_id"?

There's the CLI arg -i / --id which seems to be called the "issue_id", and is a value like -i cvdp_agentic_fixed_arbiter_0001 (per the README example).

However, conversely, according to the README_NON_AGENTIC.md document (and likely others), the directory structure ALSO uses an issue_id:

work/cvdp_[datapoint]/
├── harness/[issue_id]/
│   ├── docs/         # Documentation files
│   ├── rtl/          # Hardware description files  
│   ├── verif/        # Verification files
│   ├── rundir/       # Working directory
│   └── src/          # Source files (if applicable)
├── prompts/          # Stored prompts and responses
└── reports/          # Test results and analysis

It seems that the -i argument matches more closely to what's called a "datapoint" into the directory structure docs.

Reconciling this notation, and explaing in what cases the directory structure's issue_id number increments (which is commonly 1) would be very helpful.

I'd propose using "sequence number" to refer to subseqent attempts, in contrast with "ID" which is a reference to an input condition/test case.

I would have thought a new issue_id folder would be created for each k in pass@k (i.e., when k-1 failed, given k>=2), but it seems that's not the case:

./run_samples.py -f example_dataset/cvdp_v1.0.1_example_nonagentic_code_generation_no_commercial_with_solutions.jsonl -l -m gpt-4o-mini -n 5 -k 3 -p work_composite

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions