Topic: [2509.02522] Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR