diff --git a/docs/docs/pr_benchmark/index.md b/docs/docs/pr_benchmark/index.md
index 3e93b548..aa226291 100644
--- a/docs/docs/pr_benchmark/index.md
+++ b/docs/docs/pr_benchmark/index.md
@@ -83,7 +83,7 @@ A list of the models used for generating the baseline suggestions, and example r
41.7 |
- | Claude-sonnet-4-5 |
+ | Claude-sonnet-4.5 |
2025-09-29 |
|
40.7 |
@@ -188,7 +188,7 @@ weaknesses:
- **False positives / speculative fixes:** In several cases it flags non-issues (style, performance, redundant code) or supplies debatable “improvements”, lowering precision and sometimes breaching the “critical bugs only” rule.
- **Inconsistent error coverage:** For certain domains (build scripts, schema files, test code) it either returns an empty list when real regressions exist or proposes cosmetic edits, indicating gaps in specialised knowledge.
-### Claude-sonnet-4-5
+### Claude-sonnet-4.5
Final score: **40.7**