Login
Roast topics
Find topics
Find it!
From:
Brian Fitzgerald Blog
(Uncensored)
subscribe
Tiny Agents - Training Small LLMs to Use Tools with DPO and Synthetic Data
https://brianfitzgerald.xyz/tool-use-dpo
links
backlinks
Roast topics
Find topics
Roast it!
TL;DR: I've created a synthetic dataset for training LLMs to use tools, and am training a model using DPO to improve accuracy. The dataset is available here, and I'll be releasing trained models soon.