Compare the performance of open-source Large Language Models using multiple benchmarks like IFEval, BBH, MATH, GPQA, MUSR, and MMLU-PRO. Filter results in real-time and vote on your favorite models.| huggingface.co
We’re on a journey to advance and democratize artificial intelligence through open source and open science.| huggingface.co
We’re on a journey to advance and democratize artificial intelligence through open source and open science.| huggingface.co