The limits of our language-based benchmarks are the limits of our language.| seantrott.substack.com
Humans learn the rules of chess but large-language models fake it| buildcognitiveresonance.substack.com
Beautiful photo. Difficult to capture, this mysterious, squid-shaped interstellar cloud spans nearly three full moons in planet Earth’s sky. Discovered in 2011 by French astro-imager Nicolas Outters, the Squid Nebula’s bipolar shape is distinguished here by the telltale blue emission from doubly ionized oxygen atoms. Though apparently surrounded by the reddish hydrogen emission region Sh2-129, the true distance and nature of the Squid Nebula have been difficult to determine. Still, one in...| Schneier on Security
One of the things I’ve recognized is that we don’t pay enough attention to context. It turns out to be a really important factor in cognition, as our long-term memory interacts with the current context to determine our interpretation. And, as such, makes our interpretations very ’emergent’. Thus, our training needs to ensure that we’re liable to make the right interpretation and so choose the right action. Do we do this well? And can artificial intelligence (AI), specifically genera...| Learnlets
Bonus links| blog.zgp.org
LLM failures to reason, as documented in Apple’s Illusion of Thinking paper, are really only part of a much deeper problem| garymarcus.substack.com