Giles Thomas's blog: Practical insights on AI, startups, and software development, drawn from 30 years of building technology and 20 years of blogging. | www.gilesthomas.com
What actually goes on inside an LLM to make it calculate probabilities for the next token? | Giles' Blog
Archive of Giles Thomas’s blog posts from September 2025. Insights on AI, startups, and software development, plus occasional personal reflections. | www.gilesthomas.com
Posts in the 'TIL deep dives' category on Giles Thomas’s blog. Insights on AI, startups, software development, and technical projects, drawn from 30 years of experience. | www.gilesthomas.com
Finally getting to the end of chapter 3 of Raschka’s LLM book! This time it’s multi-head attention: what it is, how it works, and why the code does what it does. | Giles' Blog
Why dropout is kind of like the mandatory vacation policies financial firms have | Giles' Blog
Adding dropout to the LLM's training is pretty simple, though it does raise one interesting question | Giles' Blog