This repository is the submission intake for the ABEvalFlow A/B evaluation pipeline. Push a skill folder under submissions/ and the pipeline will automatically validate, build, evaluate, and report on it.
```bash
# 1. Clone this repo
git clone <this-repo-url>
cd skill-submissions

# 2. Copy the sample as a starting point
cp -r submissions/sample-skill submissions/my-skill

# 3. Edit the files for your use case
#    - metadata.yaml         → set your skill name, description, tags
#    - instruction.md        → describe the task the agent must solve
#    - skills/SKILL.md       → write the skill guidance for the agent
#    - tests/test_outputs.py → pytest tests that verify the solution

# 4. Push to trigger evaluation
git add submissions/my-skill/
git commit -m "Submit my-skill for evaluation"
git push
```

Each submission is a folder under submissions/ with the following layout:
```
submissions/<skill-name>/
├── metadata.yaml           ← describes your submission (required)
├── instruction.md          ← the task the agent must solve (required)
├── skills/
│   └── SKILL.md            ← your skill file (required)
├── tests/
│   └── test_outputs.py     ← pytest tests that verify the solution (required; example below)
├── docs/                   ← reference docs for the agent (optional)
└── supportive/             ← mock MCPs, sample data (optional, <50 MB)
```
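As a starting point, here is a minimal sketch of what tests/test_outputs.py could look like. It assumes the agent writes its answer to output/result.txt; that path and the expected contents are hypothetical and should match whatever your instruction.md asks for.

```python
# tests/test_outputs.py — hypothetical example; adapt the path and assertions
# to the task defined in your instruction.md.
from pathlib import Path

OUTPUT_FILE = Path("output/result.txt")  # assumed output location, not prescribed by the pipeline


def test_output_file_exists():
    # The agent should have created the output file while solving the task.
    assert OUTPUT_FILE.exists(), f"expected {OUTPUT_FILE} to be created by the agent"


def test_output_contains_expected_answer():
    # Verify the file contains the answer the task asks for.
    content = OUTPUT_FILE.read_text().strip()
    assert "42" in content, "output does not contain the expected answer"
```

Because the pipeline evaluates both the treatment (with skill) and control (without skill) variants with the same tests, the tests should depend only on the task output, not on whether the skill was available.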
For full details on each file, naming rules, submission modes, and FAQ, see the ABEvalFlow Trigger Guide.
When you push to this repo, the ABEvalFlow pipeline:
- Validates your submission (file structure, metadata schema, test compilation)
- Scaffolds treatment (with skill) and control (without skill) variants
- Builds container images for both variants
- Evaluates each variant over 20 trials using Harbor
- Analyzes pass rates, uplift, and statistical significance (see the sketch below)
- Reports a PASS or FAIL recommendation
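For intuition, here is a rough sketch of the kind of comparison the analysis step performs, assuming 20 trials per variant and a simple two-proportion z-test; the pipeline's actual statistics and thresholds may differ.

```python
# Hypothetical illustration of the treatment-vs-control comparison;
# the real analysis step may use different tests and decision rules.
from statistics import NormalDist


def analyze(treatment_passes: int, control_passes: int, trials: int = 20) -> dict:
    p_treatment = treatment_passes / trials   # pass rate with the skill
    p_control = control_passes / trials       # pass rate without the skill
    uplift = p_treatment - p_control

    # Two-proportion z-test: is the uplift larger than chance would explain?
    pooled = (treatment_passes + control_passes) / (2 * trials)
    se = (pooled * (1 - pooled) * (2 / trials)) ** 0.5
    z = uplift / se if se > 0 else 0.0
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))  # two-sided

    recommendation = "PASS" if uplift > 0 and p_value < 0.05 else "FAIL"
    return {"treatment_rate": p_treatment, "control_rate": p_control,
            "uplift": uplift, "p_value": p_value, "recommendation": recommendation}


print(analyze(treatment_passes=17, control_passes=9))
```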
Results are visible in the OpenShift console under
Pipelines > PipelineRuns in the ab-eval-flow namespace.
- ABEvalFlow — pipeline definitions, scripts, and documentation
- Trigger Guide — detailed submission instructions