A wonderful release, base models, reasoners, model size scales, and all before LlamaCon.| www.interconnects.ai
The latest reasoning model and what it says about the direction of inference time compute and RL training.| www.interconnects.ai
QWEN CHAT GitHub Hugging Face ModelScope Kaggle DEMO DISCORD Introduction Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general capabilities, etc., when compared to other top-tier models such as DeepSeek-R1, o1, o3-mini, Grok-3, and Gemini-2.5-Pro. Additionally, the small MoE model, Qwen3-30B-A3B, outcompetes QwQ...| Qwen
Where AI is heading, why 2024 felt slow, and shifting priorities of frontier laboratories.| www.interconnects.ai
Yes, ring the true o1 replication bells for DeepSeek R1 🔔🔔🔔. Where we go next.| www.interconnects.ai