Quantitative benchmark of basic LLM code editing skill.| aider
Benchmarking GPT-3.5 and GPT-4 code editing skill using a new code editing benchmark suite based on the Exercism python exercises.| aider
Quantitative benchmarks of LLM code editing skill.| aider