Topic: [2201.03544] The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models