Skip to content

Pull requests: vercel-labs/agent-eval

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Kimi] Add Kimi CLI agent harness
#117 opened Apr 20, 2026 by gaojude Collaborator Draft
2 of 4 tasks
Skip missing validation scripts
#92 opened Mar 17, 2026 by gaojude Collaborator Loading…
[wip] add bub agent support
#91 opened Mar 7, 2026 by CorrectRoadH Draft
Add timings for phases
#88 opened Feb 25, 2026 by jeffsee55 Loading…
Add ability to choose which eval --smoke runs
#84 opened Feb 20, 2026 by jeffsee55 Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.