Feat/Self-evolution (experience memory) framework refactor by chenjw · Pull Request #2503 · volcengine/OpenViking

chenjw · 2026-06-08T06:24:36Z

Description

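Parallelize memory import and reduce lock granularity (two-phase merge); the locomo import can be brought down to 15 minutes (locomo 80.78%).
Refactor the self-evolution framework and connect the offline training framework to the rollout service (see: https://github.com/volcengine/OpenViking/discussions/2533).
Refactor the tau2 evaluation scheme on top of the new framework; see benchmark/tau2/train/README.md.
Add a case definition (similar to the plan skill in skillx): memory extraction first extracts a case (the task intent), and only then goes on to extract traj and exp memories.
Change tau2 experience loading to a progressive, skill-style flow: the LLM decides to retrieve cases, the retrieval returns the situation and link of each exp associated with the case, and the model then decides whether to read the full exp text.
Add a progress bar to the training framework and write complete run logs to the result directory (combined with the debug_trace tool, an LLM can use them to analyze the causes of bad cases).
The traj and exp memory templates are still being adjusted, and the experience-memory results are not yet fully stable; this version is submitted first and alignment will continue afterwards.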
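The progressive loading flow described above (retrieve cases, return only each linked exp's situation and link, read the full text on demand) can be sketched roughly as follows. This is a minimal illustration, not the PR's implementation: `Case`, `Experience`, `ExperienceStore`, `search_cases`, `read_experience`, and the keyword-overlap matching are all hypothetical stand-ins.

```python
from dataclasses import dataclass, field


@dataclass
class Experience:
    uri: str        # link the model can follow later
    situation: str  # short summary of when the experience applies
    body: str       # full text, returned only on an explicit read


@dataclass
class Case:
    intent: str  # task intent captured by the case
    experiences: list[Experience] = field(default_factory=list)


class ExperienceStore:
    """Hypothetical in-memory store standing in for the real memory backend."""

    def __init__(self, cases: list[Case]) -> None:
        self._cases = cases
        self._by_uri = {e.uri: e for c in cases for e in c.experiences}

    def search_cases(self, query: str) -> list[dict]:
        """Step 1: return matching cases with exp summaries only, no bodies."""
        words = set(query.lower().split())
        hits = []
        for case in self._cases:
            # Assumption: naive keyword overlap instead of real retrieval.
            if words & set(case.intent.lower().split()):
                hits.append({
                    "intent": case.intent,
                    "experiences": [
                        {"uri": e.uri, "situation": e.situation}
                        for e in case.experiences
                    ],
                })
        return hits

    def read_experience(self, uri: str) -> str:
        """Step 2: the model opts in to reading one exp in full."""
        return self._by_uri[uri].body
```

The point of the two-step shape is that the prompt only grows by the short situation lines until the model asks for a specific exp.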

Related Issue

Type of Change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Refactoring (no functional changes)
Performance improvement
Test update

Changes Made

Testing

I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
I have tested this on the following platforms:
- Linux
- macOS
- Windows

Checklist

My code follows the project's coding style
I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Screenshots (if applicable)

Additional Notes

github-actions · 2026-06-08T06:26:44Z

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 4 🔵🔵🔵🔵⚪
🏅 Score: 70
🧪 PR contains tests
🔒 No security concerns identified
✅ No TODO sections
🔀 Multiple PR themes

Sub-PR theme: Add streaming memory updater and compressor V3
Relevant files:
- openviking/session/memory/streaming_memory_updater.py
- openviking/session/compressor_v3.py
- tests/session/memory/test_streaming_memory_updater.py
- tests/session/test_compressor_v3.py
- tests/integration/test_compressor_v3_case_extraction.py

Sub-PR theme: Add training framework and patch merge context provider
Relevant files:
- openviking/session/train/*/.py
- tests/session/train/*/.py
- tests/session/memory/test_patch_merge_context_provider.py

⚡ Recommended focus areas for review

Missing VLM parameter for training components
The ExperienceGradientEstimator and PatchMergePolicyOptimizer are initialized without the required 'vlm' parameter, which will prevent them from functioning correctly (as seen in the test file where 'vlm' is explicitly provided).

    gradient_estimator=ExperienceGradientEstimator(
        viking_fs=viking_fs,
    ),
    policy_optimizer=PatchMergePolicyOptimizer(
        viking_fs=viking_fs,
        memory_type="experiences",
    ),

Silent exception handling without logging
The MemoryFileUtils.write call is wrapped in a broad exception handler that falls back to operation_after_content without logging the failure, which will make debugging issues difficult.

    )
    except Exception:
        return operation_after_content(op)

Potential incorrect VLM instance retrieval
The code calls get_vlm_instance() on get_openviking_config().vlm, but test files use get_openviking_config().vlm directly as the VLM instance. This could cause an AttributeError if config.vlm is already the instance.

    vlm = get_openviking_config().vlm.get_vlm_instance()
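One way to address the silent-fallback concern is to log the failure before falling back. The sketch below is only an illustration under assumptions: `write_fn` and `fallback_fn` are hypothetical stand-ins for `MemoryFileUtils.write` and `operation_after_content`, whose real signatures are not shown in this PR page.

```python
import logging
from typing import Any, Callable

logger = logging.getLogger(__name__)


def write_with_logged_fallback(
    op: Any,
    write_fn: Callable[[Any], str],
    fallback_fn: Callable[[Any], str],
) -> str:
    """Try the primary write; on failure, log with traceback, then fall back."""
    try:
        return write_fn(op)
    except Exception:
        # logger.exception logs at ERROR level and attaches the traceback,
        # so the fallback stays visible in the run logs.
        logger.exception("memory write failed for %r; using fallback content", op)
        return fallback_fn(op)
```

The fallback behavior is unchanged; the only difference is that the failure leaves a trace.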

github-actions · 2026-06-08T06:29:28Z

PR Code Suggestions ✨

Explore these optional code suggestions:

Category: Possible issue
Impact: Medium
Suggestion: Add missing VLM instance to policy gradient/optimizer

The `ExperienceGradientEstimator` and `PatchMergePolicyOptimizer` are likely missing a required VLM instance (as seen in the test file). Retrieve the VLM from the openviking config and pass it to both initializers.

openviking/session/compressor_v3.py [389-395]

+config = get_openviking_config()
+vlm = config.vlm.get_vlm_instance()
 gradient_estimator=ExperienceGradientEstimator(
     viking_fs=viking_fs,
+    vlm=vlm,
 ),
 policy_optimizer=PatchMergePolicyOptimizer(
     viking_fs=viking_fs,
+    vlm=vlm,
     memory_type="experiences",
 ),

Suggestion importance[1-10]: 7
Why: The test files show that `ExperienceGradientEstimator` and `PatchMergePolicyOptimizer` require a `vlm` parameter for proper operation, and the codebase already uses `get_openviking_config().vlm.get_vlm_instance()` to retrieve the VLM elsewhere, making this a likely missing initialization issue.

Category: General
Impact: Low
Suggestion: Remove unnecessary `field()` usage from non-dataclass

The `SessionCompressorV3` class is using `field()` for class attributes but is not decorated with `@dataclass`. This is misleading because `field()` only works with dataclasses. Remove the `field()` usage since the `__init__` method properly initializes these attributes.

openviking/session/compressor_v3.py [74-80]

 class SessionCompressorV3:
     """Session compressor with lock-free patch-merge user memory extraction."""
     rollout_analyzer: TrajectoryRolloutAnalyzer | Any
-    streaming_trainer_config: StreamingPolicyTrainerConfig = field(
-        default_factory=StreamingPolicyTrainerConfig
-    )
-    streaming_memory_updater_config: StreamingMemoryUpdaterConfig = field(
-        default_factory=StreamingMemoryUpdaterConfig
-    )
+    streaming_trainer_config: StreamingPolicyTrainerConfig
+    streaming_memory_updater_config: StreamingMemoryUpdaterConfig

Suggestion importance[1-10]: 5
Why: The `SessionCompressorV3` class is not a dataclass, so using `field()` for class attributes is misleading and unnecessary, as `__init__` properly initializes them. Removing the `field()` usage improves code clarity.
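The `field()` point can be checked with a small standalone example. The class names here are made up for illustration and are unrelated to `SessionCompressorV3`; the example only shows standard-library `dataclasses` behavior.

```python
from dataclasses import Field, dataclass, field


class PlainConfigHolder:
    # Without @dataclass, field() is never processed: the class attribute
    # is the raw Field object, not a list.
    items: list = field(default_factory=list)


@dataclass
class DataclassConfigHolder:
    # With @dataclass, the generated __init__ calls the factory, so every
    # instance gets its own fresh list.
    items: list = field(default_factory=list)


plain_attr = PlainConfigHolder.items
dc_instance = DataclassConfigHolder()

print(type(plain_attr).__name__)  # the Field class from dataclasses
print(dc_instance.items)          # an empty list
```

In the plain class, any code path that reads the attribute before `__init__` assigns it would see a `Field` object rather than a config instance, which is why the suggestion calls the usage misleading.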

qin-ctx

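This review publishes inline findings only. Verdict: REQUEST_CHANGES. The first three findings are blocking and the last three are non-blocking, but I recommend either addressing all of them or making an explicit trade-off before this large PR is merged.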

qin-ctx

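The verdict of this review is REQUEST_CHANGES.

The overall direction is sound: V3 separates extraction, patch merge, training, and rollout, and reduces lock granularity through streaming batches. I also re-checked the earlier blocking comments, and most of them have been fixed.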

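Follow-up on earlier comments:

The canonical sample id issue in the locomo import is fixed, and a test covering peer wiring has been added.
The patch merge hidden fields now include last_update_trace_id.
The root uri fallback from case to experience has been changed to raise an explicit error.
The no-op merge fast path now compares the rendered content.
Both the code and the docs now state that memory.version is deprecated and v3 is enforced; I suggest filling in the change type and testing notes in the PR template.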
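As a rough illustration of a no-op merge fast path that compares rendered content rather than raw objects, consider the sketch below. The `render` function and its normalization rules (sorted keys, stripped values) are assumptions made for this example; the PR's actual renderer is not shown on this page.

```python
from typing import Callable


def render(memory: dict[str, str]) -> str:
    """Hypothetical renderer: a stable text form with sorted keys."""
    lines = [f"{key}: {value.strip()}" for key, value in sorted(memory.items())]
    return "\n".join(lines)


def merge_or_skip(
    current: dict[str, str],
    patched: dict[str, str],
    write: Callable[[dict[str, str]], None],
) -> bool:
    """Write only when the rendered content differs; return True if written."""
    if render(current) == render(patched):
        return False  # no-op fast path: nothing visible changed
    write(patched)
    return True
```

Comparing the rendered form means two values that differ only in representation (key order or trailing whitespace in this sketch) do not trigger a write.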

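I ran python -m compileall -q openviking benchmark/locomo/vikingbot benchmark/tau2 bot/vikingbot tests/session tests/unit tests/integration on the PR snapshot and found no syntax errors.

The two inline issues already posted need to be fixed first, especially the blocking issue where batch results leak into a single session's archive.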
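The batch-leak concern can be pictured with a small sketch that splits an aggregated batch result by the session that submitted each change, so each session archives only its own entries. `MemoryChange` and `split_by_session` are hypothetical names for illustration, not the PR's types.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class MemoryChange:
    session_id: str  # session (submitter) that produced this change
    uri: str         # memory file the change applies to
    diff: str        # textual diff for that file


def split_by_session(batch: list[MemoryChange]) -> dict[str, list[MemoryChange]]:
    """Group an aggregated batch result so each session sees only its own changes."""
    scoped: dict[str, list[MemoryChange]] = defaultdict(list)
    for change in batch:
        scoped[change.session_id].append(change)
    return dict(scoped)
```

Each session's archive step would then receive `scoped[session_id]` instead of the whole batch aggregate.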

qin-ctx

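This round publishes must-fix items only:

The streaming memory updater's batch aggregate result is still not isolated per submitter/session, which can cause memory diff/context/case data to be written across sessions.
The policy_set metadata for remote rollout carries the OpenViking API key, which creates a risk of credential leakage.

I recommend merging only after these are fixed.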
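For the credential concern, one defensive option is to redact sensitive keys before metadata leaves the process. This is a sketch under assumptions: the key names in `SENSITIVE_KEYS` and the metadata shape are hypothetical, and not placing the key in the metadata at all may well be the better fix.

```python
from typing import Any

# Assumption: illustrative key names, not the PR's actual metadata schema.
SENSITIVE_KEYS = {"api_key", "authorization", "token", "secret", "password"}


def _is_sensitive(key: Any) -> bool:
    lowered = str(key).lower()
    return lowered in SENSITIVE_KEYS or lowered.endswith("_api_key")


def redact_metadata(value: Any) -> Any:
    """Return a copy of nested dict/list metadata with sensitive values masked."""
    if isinstance(value, dict):
        return {
            key: "***" if _is_sensitive(key) else redact_metadata(item)
            for key, item in value.items()
        }
    if isinstance(value, list):
        return [redact_metadata(item) for item in value]
    return value
```

The function builds new containers, so the original metadata (which the local process may still need) is left untouched.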

qin-ctx

Found 1 blocking issue: the case training input is fixed before patch merge, so it may not match the canonical case that is finally persisted.
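The ordering issue can be illustrated with a tiny sketch in which the training input is taken from the store after the merge rather than from the freshly extracted case. The dict-based store, the `merge` callback, and the function name are hypothetical; they only show the ordering, not the PR's code.

```python
from typing import Callable, Optional

Case = dict  # hypothetical: a case is a plain dict with "uri" and "intent"


def merge_then_select_training_case(
    store: dict[str, Case],
    extracted: Case,
    merge: Callable[[Optional[Case], Case], Case],
) -> Case:
    """Persist the merged case first, then train on exactly what was persisted."""
    uri = extracted["uri"]
    canonical = merge(store.get(uri), extracted)
    store[uri] = canonical
    # Return the canonical case, not `extracted`, so the training input
    # cannot diverge from what is in the store.
    return canonical
```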

* fix(server): restore identity resolution in /health endpoint

The /health endpoint was refactored in #2503, which removed the identity resolution logic. This caused the frontend dashboard to show 'Usage/Audit 未初始化' ("Usage/Audit not initialized") because the role field was missing from the response.

Restored the identity resolution so that /health returns account_id, user_id, and role when an API key is provided.

Co-Authored-By: Claude <noreply@anthropic.com>

* test(server): update health endpoint tests for identity resolution

- Test that /health returns identity info when API key is provided
- Test that /health omits identity info when no API key is provided

Co-Authored-By: Claude <noreply@anthropic.com>

---------

Co-authored-by: wugj <wugj@g-bits.com>
Co-authored-by: Claude <noreply@anthropic.com>
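The behavior described in this commit message (identity fields present only when an API key is provided) can be sketched framework-free as below. The function name, the `resolve_identity` callback, and the `"status": "ok"` base payload are assumptions for illustration; the real handler and its response schema are not shown here. How the real endpoint treats an unknown key is also not stated, so this sketch simply omits the identity fields in that case.

```python
from typing import Callable, Optional, TypedDict


class Identity(TypedDict):
    account_id: str
    user_id: str
    role: str


def build_health_payload(
    api_key: Optional[str],
    resolve_identity: Callable[[str], Optional[Identity]],
) -> dict:
    """Return the base health payload, plus identity fields when resolvable."""
    payload: dict = {"status": "ok"}  # assumption: hypothetical base payload
    if api_key:
        identity = resolve_identity(api_key)
        if identity is not None:
            payload.update(identity)
    return payload
```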

chenjw added 14 commits June 5, 2026 22:04

Add trajectory experience learning redesign doc

b6d7450

auto-commit before eval 20260607_043406

3b2ce3b

auto-commit before eval 20260607_044129

5dff792

auto-commit before eval 20260607_123706

e610310

auto-commit before eval 20260607_125514

cc86f08

auto-commit before eval 20260607_133737

a115609

auto-commit before eval 20260607_144649

53b71bd

auto-commit before eval 20260607_154631

b64bdd7

Refine streaming memory train merge pipeline

89a1fdd

Refine session train policy optimization architecture

84b5896

Add VikingMem ARA paper analysis

737c980

Force merge for mixed extraction memory patches

78befe5

auto-commit before eval 20260608_134426

566facf

auto-commit before eval 20260608_142108

ea5d999

github-project-automation Bot added this to OpenViking project Jun 8, 2026

github-project-automation Bot moved this to Backlog in OpenViking project Jun 8, 2026

chenjw marked this pull request as draft June 8, 2026 06:24

github-actions Bot added the Review effort 4/5 label Jun 8, 2026

chenjw added 10 commits June 8, 2026 15:39

auto-commit before eval 20260608_153909

24ad3ae

auto-commit before eval 20260608_154845

a967eb3

auto-commit before eval 20260608_170143

8bd428e

update

40978ac

auto-commit before eval 20260610_232526

27a1dd8

auto-commit before eval 20260611_150946

b640967

auto-commit before eval 20260611_153933

76c446e

auto-commit before eval 20260611_154251

654c824

Fix tau2 reward wrapper call

9474dd3

auto-commit before eval 20260611_193803

f8d832e

chenjw added 4 commits June 28, 2026 17:56

train: finish rollout and memory refactor

4898d8c

memory: refine runtime-visible extraction prompts

8e1b3f7

train: constrain communication memory extraction

1606800

auto-commit before eval 20260629_235623

c8b84c3

chenjw changed the title ~~Feat/memory train~~ Feat/Self-evolution (experience memory) framework refactor Jun 30, 2026

chenjw marked this pull request as ready for review June 30, 2026 04:28

qin-ctx requested changes Jun 30, 2026

chenjw added 4 commits June 30, 2026 22:36

memory: address training review fixes

7f4ab87

update

c54a4f9

Merge branch 'main' into feat/memory_train

f95250e

update

3efaa9c

qin-ctx reviewed Jul 1, 2026

Comment thread openviking/session/compressor_v3.py

qin-ctx reviewed Jul 1, 2026

Comment thread openviking/message/message.py Outdated

qin-ctx requested changes Jul 1, 2026

chenjw added 3 commits July 1, 2026 13:01

message: reuse part deserializer

46bd303

train: snapshot memory prompt yaml

0848492

prompts: restore memory yaml templates from main

8c08dc5

qin-ctx requested changes Jul 2, 2026

Comment thread openviking/session/compressor_v3.py

Comment thread openviking/session/train/batch_runner.py

chenjw added 4 commits July 2, 2026 12:34

memory: scope streaming update results

75a6bc2

update

e3b00e8

rebase

cc46ae9

update

121a762

qin-ctx requested changes Jul 2, 2026

Comment thread openviking/session/compressor_v3.py Outdated

session: train canonical merged cases

7f1cb09

qin-ctx approved these changes Jul 3, 2026

qin-ctx merged commit fd73dcf into main Jul 3, 2026
10 checks passed

qin-ctx deleted the feat/memory_train branch July 3, 2026 03:27

github-project-automation Bot moved this from Backlog to Done in OpenViking project Jul 3, 2026

dfwgj mentioned this pull request Jul 3, 2026

[Bug]: /health endpoint no longer returns user identity after #2503 refactor #2977

Closed

qin-ctx merged 194 commits into main from feat/memory_train

2 participants
