Here, I provide an in-depth analysis of GPUs for deep learning/machine learning and explain which GPU is best for your use case and budget.| Tim Dettmers
How to choose an advisor? Is school prestige important? How important are peers? I answer these questions to help you find the right grad school.| Tim Dettmers
In this guide, I analyse hardware from CPU to SSD and its impact on deep learning performance so that you can choose the hardware you really need.| Tim Dettmers
Here I develop a theoretical model of TPUs vs GPUs for transformers as used by BERT and show that current GPUs are about 32% to 54% slower for this task.| Tim Dettmers
In this blog post, I discuss thought experiments and relational graph models to find out how to assign credit to contributions in deep learning.| Tim Dettmers
When I attended NAACL, I wanted to do a little test. I had two pitches for my LLM.int8() paper. One pitch is about how I use advanced quantization methods to achieve transformer inference at scale with no performance degradation, which makes large models more accessible. The other pitch talks about emergent outliers in transformers and how […]| Tim Dettmers