Commit 16b2e00
committed
chore(release): 2.3.0 [skip ci]
# [2.3.0](v2.2.0...v2.3.0) (2026-03-02)
### Bug Fixes
* auto-install deps, python3 symlink, detect full commands in fail_to_pass, language-aware test scripts ([a38497f](a38497f))
* config test race condition with env var mutex ([2963325](2963325))
* expose agent_output and agent_patch in TaskResult and API responses ([348c251](348c251))
* extract_agent_only for /evaluate - no tasks/ dir required ([2b90ee1](2b90ee1))
* filter out apt-get/system commands from install (Basilica blocks syscalls), keep project-level installs ([e5365da](e5365da))
* handle null test_patch from HuggingFace API (deserialize null as empty string) ([492d068](492d068))
* increase clone/install timeout from 180s to 600s ([95cecc3](95cecc3))
* install corepack/yarn/pnpm globally via npm in Dockerfile ([b7183e8](b7183e8))
* normalize repo URL in parse_task (add github.com prefix) ([398a6fd](398a6fd))
* run as root (Basilica blocks sudo), remove sudo prefix logic ([477a433](477a433))
* sudo for apt-get in install commands, add golang/corepack/sudo to Dockerfile ([1aceb88](1aceb88))
* upgrade Go to 1.23 and Node to 20 LTS in Dockerfile ([67ca713](67ca713))
* use :id path params for Axum 0.7 (not {id} which is 0.8) ([5dfa0c1](5dfa0c1))
### Features
* /evaluate endpoint using stored agent + TRUSTED_VALIDATORS whitelist ([b6aee7a](b6aee7a))
* add /code-hash endpoint for code integrity verification ([0a8e01b](0a8e01b))
* add /upload-agent-json endpoint for JSON-based agent upload ([9cfa1da](9cfa1da))
* add POST /submit_tasks endpoint + fix HuggingFace dataset compat ([d92444c](d92444c))
* agent user with sudo for apt-install, run all commands as non-root agent ([e3f574a](e3f574a))
* agent ZIP upload frontend with env vars + SUDO_PASSWORD auth ([3aa5184](3aa5184))
* change default max_concurrent_tasks from 8 to 6, support CONCURRENTLY_TASKS env var ([eaba581](eaba581))
* fat Docker image with all language runtimes (java, rust, pnpm, unzip, etc.) ([3855f2d](3855f2d))
* fetch task definitions from HF repo (workspace.yaml + tests/), remove auto_install hack ([7162a39](7162a39))
* propagate agent_env to run_agent and pass --instruction arg to Python agents ([d922264](d922264))
* swe-bench/swe-forge integration - extend WorkspaceConfig with fail_to_pass/pass_to_pass/install_config/difficulty fields - parse swe-forge workspace.yaml native fields as test script fallback - capture git diff (agent patch) after agent execution - add /dataset endpoint to fetch from HuggingFace CortexLM/swe-forge - wire fail_to_pass/pass_to_pass in dataset entry conversion ([814259e](814259e))1 parent 5a7cad1 commit 16b2e00
1 file changed
Lines changed: 34 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
21 | 55 | | |
22 | 56 | | |
23 | 57 | | |
| |||
0 commit comments