This week, language, language models and replication:

How To Become A Mechanistic Interpretability Researcher: So much great material in here, even if you’re just interested in getting across LLM foundations. From that list, ARENA’s AI Safety course is fantastic - again, even if you are just interested in LLM foundations.

Do Machine Learning Models Memorize or Generalize?: A great explainer on grokking. After hearing it mentioned on Dwarkesh’s podcast episode with Sholto Douglas and Tre...