Login
From:
Windows On Theory
(Uncensored)
subscribe
Emergent abilities and grokking: Fundamental, Mirage, or both? – Windows On Theory
https://windowsontheory.org/2023/12/22/emergent-abilities-and-grokking-fundamental-mirage-or-both/
links
backlinks
Tagged with:
philosophizing
ml theory seminar
One of the lessons we have seen in language modeling is the power of scale. The original GPT paper of Radford et al. noted that at some point during training, the model “acquired” the ability to do…
Roast topics
Find topics
Find it!