Login
Roast topics
Find topics
Find it!
From:
Dylan Huang Blog
(Uncensored)
subscribe
The Impossible Testing Problem for AI Applications
https://dylanhuang.com/blog/evals
links
backlinks
Roast topics
Find topics
Roast it!
LLMs make traditional testing untenable; enter benchmarks.