Reasoning models were as big of an improvement as the Transformer, at least on some benchmarks| epochai.substack.com