A from-scratch implementation of Llama 4 LLM, a mixture-of-experts model, using PyTorch code.| Daily Dose of Data Science