I received my PhD from UC Berkeley where I was advised by| people.eecs.berkeley.edu
National Institute for Research in Digital Science and Technology| www.inria.fr
Compare the performance of open-source Large Language Models using multiple benchmarks like IFEval, BBH, MATH, GPQA, MUSR, and MMLU-PRO. Filter results in real-time and see community votes for comp...| huggingface.co
Learn more about the only AI benchmark that measures AGI progress.| ARC Prize
How we can use AI for as a "partner in thought", losing faith in long context windows for improved reasoning, and why we should stop anthropomorphizing LLMs| www.latent.space
Scaling Llama3 beyond 1M context window with ~perfect utilization, the difference between ALiBi and RoPE, how to use GPT-4 to create synthetic data for your context extension finetunes, and more!| www.latent.space