Quantitative benchmark of basic LLM code editing skill. | aider
o1 scores the top result on aider’s new, more challenging multi-language coding benchmark. | aider