fix(optimization): handle None metric scores in LocalEvalSampler#5415

Open
JesserHamdaoui wants to merge 4 commits into google:main from JesserHamdaoui:fix/5403-LocalEvalSampler-TypeError

Conversation

@JesserHamdaoui

Fixes #5403


Summary

When running adk optimize, if a metric evaluation fails (e.g., due to a transient API error, missing rubrics, or a malformed LLM-judge response that raises JSONDecodeError), local_eval_service.py gracefully catches the exception and returns an EvaluationResult with a None score and NOT_EVALUATED status.

However, LocalEvalSampler._extract_eval_data then unconditionally rounds this value, raising TypeError: type NoneType doesn't define __round__ method and crashing the entire optimization loop rather than safely skipping or reporting the failed case.
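The failure mode is easy to reproduce in isolation; Python's built-in round has no fallback for None:

```python
# Minimal reproduction of the crash: round() cannot handle None,
# which is exactly what a NOT_EVALUATED metric result carries.
score = None  # stand-in for eval_metric_result.score after a failed eval

try:
    rounded = round(score, 2)
except TypeError as e:
    print(e)  # type NoneType doesn't define __round__ method
```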

Changes

  • google/adk/optimization/local_eval_sampler.py: Guarded the metric score rounding step in _extract_eval_data.
    • Before: "score": round(eval_metric_result.score, 2)
    • After: "score": round(eval_metric_result.score, 2) if eval_metric_result.score is not None else None
    • This preserves the None value in the diagnostic trace data for failed evals.
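A minimal sketch of the guarded extraction. EvalMetricResult here is an illustrative stand-in with only the fields relevant to the fix, and the dict keys other than "score" are assumptions, not the real _extract_eval_data output:

```python
from dataclasses import dataclass
from typing import Optional

# Illustrative stand-in for ADK's eval metric result; only the fields
# relevant to the fix are modeled here.
@dataclass
class EvalMetricResult:
    metric_name: str
    score: Optional[float]
    eval_status: str  # e.g. "PASSED", "FAILED", "NOT_EVALUATED"

def extract_eval_data(result: EvalMetricResult) -> dict:
    # The fix: only round when a score was actually produced.
    return {
        "metric": result.metric_name,
        "score": round(result.score, 2) if result.score is not None else None,
        "status": result.eval_status,
    }

print(extract_eval_data(EvalMetricResult("tool_use", 0.7512, "PASSED")))
# {'metric': 'tool_use', 'score': 0.75, 'status': 'PASSED'}
print(extract_eval_data(EvalMetricResult("tool_use", None, "NOT_EVALUATED")))
# {'metric': 'tool_use', 'score': None, 'status': 'NOT_EVALUATED'}
```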

Huge shoutout to the issue author @msteiner-google for the detailed bug report, root cause analysis, and for suggesting the fix!


Motivation

Optimization loops can run for a long time and make dozens of LLM calls. If a single evaluation case fails due to an intermittent network issue or a temporary rate limit, the NOT_EVALUATED status is the correct fallback. Crashing the entire adk optimize run because of a missing None check wastes compute, time, and API quotas. By preserving None, the optimizer can safely continue and log that the metric did not produce a score.


Test plan

Unit Tests:

  • Added test_extract_eval_data_preserves_none_metric_score in tests/unittests/optimization/local_eval_sampler_test.py to verify that _extract_eval_data preserves "score": None and retains the proper NOT_EVALUATED status without throwing a TypeError.
  • Ran targeted test with uv run pytest tests/unittests/optimization/local_eval_sampler_test.py::test_extract_eval_data_preserves_none_metric_score -q (Result: 1 passed).

Manual Reproduction & Verification:

  • Simulated the failure: Created a local script to intentionally trigger the bug by forcing a None score during the evaluation step.
  • Verified the fix: Ran the simulation against the updated code. Before the fix, the script consistently crashed with TypeError: type NoneType doesn't define __round__ method. After applying the fix in this PR, the optimizer safely handled the None scores and ran to completion without crashing.

Used the hello_world example from the provided samples and followed the optimization documentation, then added a patch_and_run.py file in my local environment to force the eval failure:

# Simulated the issue by triggering an eval failure to force None scores
# and verifying the optimizer handles it gracefully.
import asyncio
import os

# ADK imports (LocalEvalSampler, LocalEvalSamplerConfig, EvalConfig,
# LocalEvalSetsManager, the GEPA optimizer classes) and the hello_world
# `agent` module are imported as in the optimization docs; omitted here.

sampler_config = LocalEvalSamplerConfig(
    eval_config=EvalConfig(
        criteria={"rubric_based_tool_use_quality_v1": 0.75}  # or a metric missing rubrics
    ),
    app_name="hello_world",
    train_eval_set="train_eval_set",
)
sampler = LocalEvalSampler(
    sampler_config,
    LocalEvalSetsManager(agents_dir=os.path.dirname(os.getcwd())),
)

opt_config = GEPARootAgentPromptOptimizerConfig(max_metric_calls=5)
optimizer = GEPARootAgentPromptOptimizer(config=opt_config)

# Before PR: crashes with TypeError on None. After PR: runs successfully.
result = asyncio.run(optimizer.optimize(agent.root_agent, sampler))

@JesserHamdaoui JesserHamdaoui changed the title Fix/5403 local eval sampler type error fix(optimization): handle None metric scores in LocalEvalSampler Apr 20, 2026
@adk-bot adk-bot added the eval [Component] This issue is related to evaluation label Apr 20, 2026
@rohityan rohityan self-assigned this Apr 20, 2026
@rohityan
Collaborator

Hi @JesserHamdaoui, thank you for your contribution! We appreciate you taking the time to submit this pull request. Your PR has been received by the team and is currently under review. We will provide feedback as soon as we have an update to share.

@rohityan rohityan added the needs review [Status] The PR/issue is awaiting review from the maintainer label Apr 24, 2026
@rohityan
Collaborator

Hi @DeanChensj, can you please review this?


Labels

eval [Component] This issue is related to evaluation needs review [Status] The PR/issue is awaiting review from the maintainer

Projects

None yet

Development

Successfully merging this pull request may close these issues.

TypeError in LocalEvalSampler when metric evaluation fails

4 participants