Hi everyone, Jerry and Wyatt here from Halluminate (https://halluminate.ai/). We help AI labs train computer use agents with high quality data and RL environments.Training AI agents to use computers, browsers, and software is one of the highest-potential opportunities for AI. To date, however, this capability is still unreliable. The emerging method to improve this is called Reinforcement Learning with Verifiable Rewards (RLVR). However, researchers are currently bottlenecked by a lack of hig...