From: Apple Machine Learning Research
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization - Apple Machine Learning Research
https://machinelearning.apple.com/research/scaling-smart
This paper was accepted at the Efficient Natural Language and Speech Processing (ENLSP) Workshop at NeurIPS 2024. The pre-training phase of…