feat: Add SWE-bench benchmarking integration (#415) #670

Open
erkinalp wants to merge 1 commit into stitionai:main from erkinalp:devin/1734544793-add-swebench-benchmarking

Conversation

@erkinalp

Implements SWE-bench benchmarking integration for evaluating Devika's performance.

Link to Devin run: https://app.devin.ai/sessions/121045305ac0458bbdf2566092dbc1b2

Fixes #415

Co-Authored-By: Erkin Alp Güney <erkinalp9035@gmail.com>

Development

Successfully merging this pull request may close these issues.

Benchmark on SWE-Bench

2 participants