What actually goes on inside an LLM to make it calculate probabilities for the next token?| Giles' Blog