docs: update evaluation and compaction documentation by aheritier · Pull Request #3044 · docker/docker-agent

aheritier · 2026-06-10T04:06:57Z

Documentation updates

This PR updates documentation to reflect two recently merged code changes.

Changes

| Commit | Source PR | What changed |
|---|---|---|
| First commit | #3029 | Document custom base image behavior for `--base-image` eval flag |
| Second commit | #3042 | Note that compaction budgets scale with `context_size` for small windows |

Details

docs/features/evaluation/index.md — Added explanation of how the eval harness handles custom base images: the docker-agent binary is injected from docker/docker-agent:edge at build time and the base image's entrypoint is overridden. Users should provide only the runtime environment in their base image.

docs/providers/dmr/index.md (or troubleshooting doc) — Added note that auto-compaction scales summary and keep-tail budgets proportionally to provider_opts.context_size, so small context windows (e.g. 8k local models) no longer lose session history during compaction.

PRs reviewed and found up to date

| Source PR | Reason |
|---|---|
| #3036 | Docs shipped in the same PR (secrets guide updated) |
| #3032 | Removed catalog server IDs not referenced in docs |
| #3035 | Internal security fix, no user-facing behavior change |
| #3031 | Internal SSRF hardening, `allow_private_ips` semantics unchanged |
| #3033 | Go dependency bumps |
| #3039 | Go dependency bump |
| #3005 | Go dependency bumps |
| #3038 | Already a docs PR (CHANGELOG) |
| #3028 | Already a docs PR (`--session-read-only`) |

The eval harness copies the docker-agent binary from docker/docker-agent:edge into custom base images at build time and overrides their entrypoint with its own /run.sh wrapper. Users need to know their base image's entrypoint will be replaced and that the base image should only provide the runtime environment. Ref: #3029

…windows After the fix in #3042, the summary and keep-tail token budgets used during session compaction scale proportionally to provider_opts.context_size instead of using absolute 16k/20k constants. Small-context-window models (≤ ~16k) no longer have their history wiped during compaction. Ref: #3042

docker-agent

Assessment: 🟢 APPROVE

Documentation-only PR. All factual claims were verified against the source code:

docs/features/evaluation/index.md: The new ### Custom Base Images section accurately describes the eval harness behavior. The binary is copied from docker/docker-agent:edge in the Dockerfile template, the entrypoint is overridden unconditionally, and the anchor link #custom-base-images resolves correctly.
docs/providers/dmr/index.md: The compaction scaling claim is accurate. summaryTokenBudget = min(16000, contextLimit/4) and keepTokenBudget = min(20000, contextLimit/5) both scale proportionally with context_size until they reach their 16k and 20k caps, confirming that small context windows (e.g. 8k) are handled correctly.

No bugs or inaccuracies found in the added documentation.
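For reference, the two formulas quoted in the review can be worked through with a small Go sketch. The function names below mirror the identifiers in the review, but the surrounding code (`minInt`, the constants, `main`) is illustrative only and is not taken from the repository; 8192 is used as a stand-in for an "8k" window.

```go
package main

import "fmt"

// Caps taken from the formulas quoted in the review.
const (
	maxSummaryTokens = 16000
	maxKeepTokens    = 20000
)

// minInt returns the smaller of two ints.
func minInt(a, b int) int {
	if a < b {
		return a
	}
	return b
}

// summaryTokenBudget computes min(16000, contextLimit/4).
func summaryTokenBudget(contextLimit int) int {
	return minInt(maxSummaryTokens, contextLimit/4)
}

// keepTokenBudget computes min(20000, contextLimit/5).
func keepTokenBudget(contextLimit int) int {
	return minInt(maxKeepTokens, contextLimit/5)
}

func main() {
	for _, limit := range []int{8192, 16384, 128000} {
		fmt.Printf("context=%d summary=%d keep=%d\n",
			limit, summaryTokenBudget(limit), keepTokenBudget(limit))
	}
	// context=8192 summary=2048 keep=1638
	// context=16384 summary=4096 keep=3276
	// context=128000 summary=16000 keep=20000
}
```

With these formulas, the summary budget reaches its cap at a context limit of 64000 tokens and the keep-tail budget at 100000 tokens; below those sizes both budgets shrink with the window.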

aheritier added 2 commits June 10, 2026 04:06

aheritier requested a review from a team as a code owner June 10, 2026 04:06

aheritier added the kind/docs label (Documentation-only changes) Jun 10, 2026

docker-agent reviewed Jun 10, 2026

aheritier added the area/agent (For work that has to do with the general agent loop/agentic features of the app) and area/providers/docker-model-runner (Docker Model Runner (DMR) local inference) labels Jun 10, 2026

dgageot approved these changes Jun 10, 2026

dgageot merged commit 7da2be1 into main Jun 10, 2026
12 checks passed

dgageot merged 2 commits into main from docs/auto-update
