The attention mechanism lets us merge a variable-length sequence of vectors into a single fixed-size context vector. What if we could use this mechanism to replace recurrence entirely for sequence modeling? This blog post covers the Transformer architecture, which does exactly that.
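As a quick refresher before diving in, here is a minimal sketch of that merging step (names and the scaled dot-product scoring are illustrative assumptions, not code from this post): a query scores every element of the sequence, and the softmax-weighted sum of the values produces one context vector whose size does not depend on the sequence length.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    e = np.exp(x - np.max(x, axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def attention_pool(query, keys, values):
    """Merge a variable-length sequence into one fixed-size context vector.

    query:  (d,)     -- what we are looking for
    keys:   (T, d)   -- one key per sequence element (T can vary)
    values: (T, d_v) -- one value per sequence element
    returns (d_v,)   -- fixed-size context vector, regardless of T
    """
    scores = keys @ query / np.sqrt(keys.shape[-1])  # (T,) similarity scores
    weights = softmax(scores)                        # (T,) attention weights, sum to 1
    return weights @ values                          # (d_v,) weighted average of values

# The output shape is the same whether the sequence has 5 or 50 elements.
ctx_short = attention_pool(np.random.randn(8), np.random.randn(5, 8), np.random.randn(5, 16))
ctx_long  = attention_pool(np.random.randn(8), np.random.randn(50, 8), np.random.randn(50, 16))
print(ctx_short.shape, ctx_long.shape)  # (16,) (16,)
```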