mirror of
https://github.com/qodo-ai/pr-agent.git
synced 2025-12-12 02:45:18 +00:00
feat: enhance documentation with programming languages section and clarify dataset details
This commit is contained in:
parent
0a3d655912
commit
08da41c929
1 changed files with 13 additions and 1 deletions
|
|
@ -3,7 +3,7 @@
|
||||||
## Methodology
|
## Methodology
|
||||||
|
|
||||||
Qodo Merge PR Benchmark evaluates and compares the performance of Large Language Models (LLMs) in analyzing pull request code and providing meaningful code suggestions.
|
Qodo Merge PR Benchmark evaluates and compares the performance of Large Language Models (LLMs) in analyzing pull request code and providing meaningful code suggestions.
|
||||||
Our diverse dataset contains 400 pull requests from over 100 repositories, spanning various programming languages and frameworks to reflect real-world scenarios.
|
Our diverse dataset contains 400 pull requests from over 100 repositories, spanning multiple [programming languages](#programming-languages) to reflect real-world scenarios.
|
||||||
|
|
||||||
- For each pull request, we have pre-generated suggestions from eleven different top-performing models using the Qodo Merge `improve` tool. The prompt for response generation can be found [here](https://github.com/qodo-ai/pr-agent/blob/main/pr_agent/settings/code_suggestions/pr_code_suggestions_prompts_not_decoupled.toml).
|
- For each pull request, we have pre-generated suggestions from eleven different top-performing models using the Qodo Merge `improve` tool. The prompt for response generation can be found [here](https://github.com/qodo-ai/pr-agent/blob/main/pr_agent/settings/code_suggestions/pr_code_suggestions_prompts_not_decoupled.toml).
|
||||||
|
|
||||||
|
|
@ -340,3 +340,15 @@ The following models were used for generating the benchmark baseline:
|
||||||
(11) o4-mini_medium
|
(11) o4-mini_medium
|
||||||
```
|
```
|
||||||
|
|
||||||
|
### Programming Languages
|
||||||
|
|
||||||
|
The PR benchmark dataset includes pull requests containing code in the following programming languages:
|
||||||
|
|
||||||
|
```markdown
|
||||||
|
["Python", "JavaScript", "TypeScript", "Java", "CSharp", "PHP", "C++", "Go", "Rust", "Swift", "Kotlin", "Ruby", "Dart", "Scala"
|
||||||
|
```
|
||||||
|
|
||||||
|
Pull requests may also include non-code files such as `YAML`, `JSON`, `Markdown`, `Dockerfile` ,`Shell`, etc.
|
||||||
|
The benchmarked models should also analyze these files, as they commonly appear in real-world pull requests.
|
||||||
|
|
||||||
|
|
||||||
|
|
|
||||||
Loading…
Reference in a new issue