Improve DMR support by krissetto · Pull Request #2351 · docker/docker-agent

krissetto · 2026-04-08T16:50:52Z

Improve DMR support

provider_opts.context_size sets the engine's context window; max_tokens stays strictly per-request output instead of pulling-double duty (which was confusing).
Structured _configure request mirroring model-runner's BackendConfiguration (context-size, runtime-flags, speculative, llamacpp.reasoning-budget, vllm.{hf-overrides,gpu-memory-utilization}).
thinking_budget routed properly per backend: reasoning-budget for llama.cpp, thinking_token_budget per-request for vLLM, ignored on MLX/SGLang for now.
Fix session-title generation on reasoning models: DMR now honors NoThinking() by sending chat_template_kwargs.enable_thinking=false
clarified in the docs that sampling params belong on the regular model config, not in provider_opts.runtime_flags.

krissetto · 2026-04-20T08:33:35Z

/review

docker-agent

Assessment: 🟡 NEEDS ATTENTION

Two findings in the new code — one medium-severity validation gap and one low-severity clarity issue. All tests pass.

docker-agent · 2026-04-20T08:43:19Z

+// the same rules as model-runner's inference.ParseKeepAlive:
+//   - Go duration strings: "5m", "1h", "30s"
+//   - "0" to unload immediately
+//   - Any negative value ("-1", "-1m") to keep loaded forever


No description provided.

- fixes session title generation - adds 'context_size' provider_opt for DMR usage instead of giving 'max_tokens' double responsibility to avoid confusion - improved thinking budget support and fix for NoThinking() - improves how flags are sent to the DMR model/runtime configuration endpoint - clarify docs on sampling/runtime params Signed-off-by: Christopher Petito <chrisjpetito@gmail.com> Assisted-By: docker-agent

krissetto force-pushed the improve-dmr-support branch 3 times, most recently from a6a876c to d2eac5f Compare April 17, 2026 17:11

krissetto marked this pull request as ready for review April 17, 2026 17:30

krissetto requested a review from a team as a code owner April 17, 2026 17:30

docker-agent bot reviewed Apr 20, 2026

View reviewed changes

krissetto force-pushed the improve-dmr-support branch from a8837ee to d289608 Compare April 20, 2026 09:24

dgageot approved these changes Apr 20, 2026

View reviewed changes

dgageot merged commit aa8d245 into docker:main Apr 20, 2026
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve DMR support#2351

Improve DMR support#2351
dgageot merged 1 commit intodocker:mainfrom
krissetto:improve-dmr-support

krissetto commented Apr 8, 2026 •

edited

Loading

Uh oh!

krissetto commented Apr 20, 2026

Uh oh!

docker-agent bot left a comment

Uh oh!

docker-agent bot Apr 20, 2026

Uh oh!

krissetto Apr 20, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

krissetto commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!