Topic: [2503.01837v1] Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning