diff --git a/docs/docs/pr_benchmark/index.md b/docs/docs/pr_benchmark/index.md index 3e93b548..aa226291 100644 --- a/docs/docs/pr_benchmark/index.md +++ b/docs/docs/pr_benchmark/index.md @@ -83,7 +83,7 @@ A list of the models used for generating the baseline suggestions, and example r 41.7 - Claude-sonnet-4-5 + Claude-sonnet-4.5 2025-09-29 40.7 @@ -188,7 +188,7 @@ weaknesses: - **False positives / speculative fixes:** In several cases it flags non-issues (style, performance, redundant code) or supplies debatable “improvements”, lowering precision and sometimes breaching the “critical bugs only” rule. - **Inconsistent error coverage:** For certain domains (build scripts, schema files, test code) it either returns an empty list when real regressions exist or proposes cosmetic edits, indicating gaps in specialised knowledge. -### Claude-sonnet-4-5 +### Claude-sonnet-4.5 Final score: **40.7**