We’re on a journey to advance and democratize artificial intelligence through open source and open science.| huggingface.co
Last week a guy called Evan Miller tweeted out a blog post claiming to have discovered a flaw in the attention mechanism used by transformers today: The phrasing was sensationalist, and many people…| Data Science Castnet
Here, I provide an in-depth analysis of GPUs for deep learning/machine learning and explain what is the best GPU for your use-case and budget.| Tim Dettmers
How to choose an advisor? Is school prestige important? How important are peers? I answer these questions to help you find the right grad school.| Tim Dettmers
In this guide I analyse hardware from CPU to SSD and their impact on performance for deep learning so that you can choose the hardware that you really need.| Tim Dettmers
Here I develop a theoretical model of TPUs vs GPUs for transformers as used by BERT and show that current GPUs are about 32% to 54% slower for this task.| Tim Dettmers
In this blog post, I discuss thought experiments and relational graphs models to find out how to assign credit to contributions in deep learning.| Tim Dettmers