Note: If you’ll forgive the shameless self-promotion, applications for my MATS stream are open until Sept 12. I help people write a mech interp paper…| www.alignmentforum.org
I am looking for people who want to be supervised by me to write a mech interp paper. Apply here now! Due Aug 29| Neel Nanda
Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms| www.anthropic.com
We investigate the internal mechanisms used by Claude 3.5 Haiku — Anthropic's lightweight production model — in a variety of contexts, using our circuit tracing methodology.| Transformer Circuits