Let the LLM 'contemplate' before answering | Maharshi's blog
Understand elementwise operations with tensors and shape broadcasting | Maharshi's blog
Understand what exactly tensors are, and how you too can implement them from scratch in C | Maharshi's blog
Grow as a person by embracing the multifaceted aspects present within yourself and the world | Maharshi's blog
Attention powers “transformers” - the seemingly complex architecture behind large language models (LLMs) like ChatGPT. But what does attention even mean? | Maharshi's blog
Learning CUDA by optimizing matrix-vector multiplication (SGEMV) for cuBLAS-like performance | Maharshi's blog
Read my blog posts, about my projects, and basically my every thought | Maharshi's blog
Learning CUDA by optimizing softmax that beats PyTorch | Maharshi's blog