A joint team of researchers from Apple and Carnegie Mellon University has proposed a new way to improve how large language models follow instructions, showing that a simple checklist system can outperform traditional reward-based training on several benchmarks.