fix: remove references to deprecated Bedrock Sonnet 4 model ID#146
fix: remove references to deprecated Bedrock Sonnet 4 model ID#146daniel-pittman merged 5 commits intodevelopfrom
Conversation
AWS deprecated anthropic.claude-sonnet-4-20250514-v1:0 (Legacy state April 14 2026, end-of-life October 14 2026). Replace all occurrences in the four targeted AETHER example YAML files with the current inference profile ID us.anthropic.claude-sonnet-4-6, consistent with production. Files updated: - bili/aether/config/examples/demo_research_analysis.yaml (2 refs) - bili/aether/config/examples/thesis_consensus_network.yaml - bili/aether/config/examples/thesis_hierarchical_voting.yaml - bili/aether/config/examples/thesis_deliberative_consensus.yaml Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…configs Update demo_code_review.yaml (lines 58, 75) and thesis_supervisor_hub.yaml (line 46) to use the correct us.anthropic.claude-sonnet-4-6 inference profile identifier, replacing the deprecated claude-sonnet-4-20250514-v1:0. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Replace all remaining references to the deprecated `us.anthropic.claude-sonnet-4-20250514-v1:0` (and the bare `anthropic.claude-sonnet-4-20250514-v1:0`) with the canonical inference-profile ID `us.anthropic.claude-sonnet-4-6` across: - bili/iris/config/llm_config.py — runtime model registry entry - bili/aegis/evaluator/evaluator_config.py — PRIMARY_EVALUATOR_MODEL constant - bili/aegis/suites/cross_model/run_cross_model_suite.py — docstring table + MODEL_MATRIX entry - bili/aegis/docs/testing-injection.md — Tier 3 primary model reference Also includes formatter-only changes (black/isort/autoflake) on unrelated files touched during the pre-commit formatting pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
Claude finished @daniel-pittman's task in 1m 47s —— View job Claude Code Review — PR #146
SummaryRecommendation: ✅ Approve — the deprecation cleanup is mechanically clean and complete. One semantic concern worth flagging before merge (display names vs. actual model version); not a blocker. Verification
Observations1. The old ID
This matters most for AEGIS 2. ℹ️ Model-ID format convention in The registry now mixes ID formats for Anthropic Bedrock models:
Both forms are valid cross-region inference profile identifiers and AWS recommends the short alias going forward, so this is a stylistic observation, not a defect. Worth noting in case there's an opinion on registry consistency for a future pass. 3. ✅ YAML quoting preserved Each AETHER example preserved its original quoting style (quoted in File-by-file
Test-plan checklist (from your PR body)
· Branch |
Update all human-readable display strings that still said "Claude Sonnet 4" to "Claude Sonnet 4.6", consistent with the us.anthropic.claude-sonnet-4-6 model ID used throughout. Covers llm_config.py, evaluator_config.py (constant + module docstring), run_cross_model_suite.py MODEL_MATRIX, and the attack_page.py UI caption. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|
Claude finished @daniel-pittman's task in 1m 28s —— View job Claude Code Review — PR #146 (follow-up on
|
| File | Line | Change |
|---|---|---|
bili/iris/config/llm_config.py |
297 | model_name → "Anthropic Claude Sonnet 4.6" ✅ |
bili/aegis/evaluator/evaluator_config.py |
3, 30 | module docstring + PRIMARY_EVALUATOR_MODEL_DISPLAY ✅ |
bili/aegis/suites/cross_model/run_cross_model_suite.py |
157 | MODEL_MATRIX display name ✅ |
bili/aether/ui/attack_page.py |
311 | cross_model suite caption ✅ (nice extra catch — I missed this one) |
Benchmark CSVs / JSON result files produced by run_cross_model_suite.py and SemanticEvaluator will now label runs as "Claude Sonnet 4.6 (Bedrock)", which keeps historical-vs-current comparisons honest.
🟡 One straggler — security-testing-quickstart.md
grep -E "Sonnet 4([^.]|$)" finds two unversioned references in bili/aegis/docs/security-testing-quickstart.md that describe the same judge LLM whose display name was just bumped to 4.6:
- Line 134:
judge LLM (Claude Sonnet 4 on Bedrock by default, with Gemini 2.5 Flash as fallback). - Line 341:
The judge uses Claude Sonnet 4 on Bedrock as its primary model (temperature 0.0 for
For consistency with evaluator_config.py's module docstring (which was updated), both should read Claude Sonnet 4.6 on Bedrock. Not a blocker — the PRIMARY_EVALUATOR_MODEL constant is the source of truth at runtime — but easy to sweep up.
Verification
git grep "claude-sonnet-4-20250514"→ 0 matches (deprecated ID fully removed)git grep "us.anthropic.claude-sonnet-4-6"→ 13 matches across the 10 listed files (matches the 13-line delta reported bygit diff --stat)git grep -E "Sonnet 4([^.]|$)"→ 2 matches, both in the doc file called out abovegit log --oneline origin/develop..HEAD→ 4 commits, all on-topic
Previous observations still standing
- Model-ID format mix in
llm_config.py(short alias...sonnet-4-6vs. dated/versioned...-20250219-v1:0for older Anthropic Bedrock entries) remains stylistic and out of scope. - Test-plan items ⏳ AEGIS suite run + manual Bedrock console invocation — cannot be verified from a static review.
--- · Branch
Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
|
Claude finished @daniel-pittman's task in 58s —— View job Claude Code Review — PR #146 (follow-up on
|
| Line | Change |
|---|---|
| 134 | judge LLM (Claude Sonnet 4 on Bedrock …) → Claude Sonnet 4.6 on Bedrock ✅ |
| 341 | The judge uses Claude Sonnet 4 on Bedrock … → Claude Sonnet 4.6 on Bedrock ✅ |
Both now match the source-of-truth PRIMARY_EVALUATOR_MODEL_DISPLAY constant in bili/aegis/evaluator/evaluator_config.py:30. Scope is minimal — 1 file, 4 lines, no other churn.
Final verification
| Check | Result |
|---|---|
git grep "claude-sonnet-4-20250514" |
0 matches — deprecated ID fully removed |
git grep "us.anthropic.claude-sonnet-4-6" |
10 files (matches PR body inventory) |
git grep -E "Sonnet 4([^.]|$)" |
0 matches — no stale unversioned "Sonnet 4" display strings anywhere in the repo |
git diff origin/develop...HEAD --stat |
12 files, 20 insertions / 20 deletions — clean, no formatter leakage |
Commit history on branch
6c2c4a8 docs: bump Sonnet 4 -> 4.6 in security-testing-quickstart
470daf7 fix: update Sonnet 4 display names to reflect 4.6 inference profile
bb657aa fix: replace remaining deprecated Sonnet 4 model IDs and run formatters
553be67 fix: replace deprecated Bedrock model ID in remaining AETHER example configs
cb7fea4 fix: replace deprecated Bedrock model ID in AETHER example configs
Five well-scoped commits, each reviewed in a prior round — no remaining concerns.
Observations still standing (non-blocking)
- Model-ID format mix in
llm_config.py(short alias…sonnet-4-6vs. dated/versioned…-20250219-v1:0for older Anthropic Bedrock entries) — stylistic only, AWS supports both. Out of scope for this PR. - Test-plan items ⏳ AEGIS 1539-test suite run + manual Bedrock console invocation — cannot be verified from static review; relies on author confirmation.
· · Branch
Summary
anthropic.claude-sonnet-4-20250514-v1:0(and its inference-profile aliasus.anthropic.claude-sonnet-4-20250514-v1:0) with the canonical cross-region inference profile IDus.anthropic.claude-sonnet-4-6across 10 files in bili-core.us.anthropic.claude-sonnet-4-6; this PR brings bili-core into alignment.Why
AWS issued a deprecation notice for
anthropic.claude-sonnet-4-20250514-v1:0with the following timeline:Leaving stale model IDs in config files, evaluator defaults, and documentation creates a silent failure path once end-of-life is reached.
Changes
Spread across 3 commits:
cb7fea4— AETHER example configs (4 files)bili/aether/config/examples/demo_research_analysis.yamlbili/aether/config/examples/thesis_consensus_network.yamlbili/aether/config/examples/thesis_hierarchical_voting.yamlbili/aether/config/examples/thesis_deliberative_consensus.yaml553be67— AETHER example configs (2 files)bili/aether/config/examples/demo_code_review.yamlbili/aether/config/examples/thesis_supervisor_hub.yamlbb657aa— IRIS registry, AEGIS evaluator/suite/docs (4 files)bili/iris/config/llm_config.py— runtime model registry entrybili/aegis/evaluator/evaluator_config.py—PRIMARY_EVALUATOR_MODELconstantbili/aegis/suites/cross_model/run_cross_model_suite.py— docstring table +MODEL_MATRIXentrybili/aegis/docs/testing-injection.md— Tier 3 primary model referenceTest plan
anthropic.claude-sonnet-4-20250514-v1:0no longer appears anywhere in the repo (git grep "claude-sonnet-4-20250514")us.anthropic.claude-sonnet-4-6is used inllm_config.py,evaluator_config.py,run_cross_model_suite.py, all 6 AETHER example YAMLs, andtesting-injection.mdlangchain_coreoutside Docker and are unrelated to this change)us.anthropic.claude-sonnet-4-6manually in AWS Bedrock console to confirm the inference profile is active🤖 Generated with Claude Code