TransformerLens is a Python library for Mechanistic Interpretability. It’s got some great tutorials… but they are all kinda verbose. Here’s a cheatsheet of all the common things you’ll want from the library. Click the links for more details.

Setup

Creating a model

Models have very similar arguments and methods to Torch modules. Full list of […]