A step-by-step tutorial on deploying an ML model using AWS Lambda, GitHub Actions, and API Gateway, with Streamlit providing a UI to access the model API.| shreyansh26.github.io
A short general introduction to Federated Learning (FL) for folks interested in privacy-preserving machine learning (PPML).| shreyansh26.github.io
Just a quick tutorial on setting up a small-scale deployment for your ML or DL model| shreyansh26.github.io
Understanding FlashAttention, the most efficient exact attention implementation out there, which optimizes for both memory requirements and wall-clock time.| shreyansh26.github.io