olmOCR 2, our latest document OCR model, achieves state-of-the-art performance for English-language digitized print documents.| Ai2 Blog
SamudrACE couples 3D models of both the ocean and the atmosphere, giving it a deep understanding of global Earth system interactions.| Ai2 Blog
We're releasing data that shows which scientific papers our agentic platform for research and discovery, Asta, relies on most when answering questions.| Ai2 Blog
DataVoyager is our new feature in Asta built to address the challenges scientists face in drilling down into structured datasets.| Ai2 Blog
We explore how Fluid Benchmarking can adapt evaluation items to a language model’s capability level.| Ai2 Blog
We release OLMoASR, a family of open automatic speech recognition (ASR) models trained from scratch on a curated, large-scale dataset.| allenai.org
We announce Asta, our bold initiative to accelerate science through trustworthy, truly open agentic AI.| Ai2 Blog
Introducing AstaBench, a novel AI agents evaluation framework and scientific research benchmark suite.| allenai.org
We find that two simple metrics, signal and noise, reveal key differences in the utility of current LLM benchmarks.| Ai2 Blog
Introducing MoNaCo, a benchmark of highly challenging questions spanning dozens of documents for evaluating large language models.| allenai.org
Tülu 3 is a leading instruction following model family, offering fully open-source data, code, and recipes.| allenai.org