MedFuzz tests LLMs by breaking benchmark assumptions, exposing vulnerabilities to bolster real-world accuracy.| Microsoft Research
It’s no longer just about cramming data into bigger and bigger models. I chart the shifts toward inference and compound AI systems and explain what they mean for founders.| ashugarg.substack.com
Some hints about what the next year of AI looks like| www.oneusefulthing.org
A PSA everyone needs. The importance of a wait-and-see attitude when it comes to new models, big and small, open and closed.| www.interconnects.ai