As LLMs become more widely deployed, there is increasing interest in directly optimizing for feedback from end users (e.g., thumbs up) in addition to feedback from paid annotators. However, training to maximize human feedback creates a perverse incentive structure: the AI may resort to manipulative or deceptive tactics to obtain positive feedback from users who are vulnerable to such strategies. We study this phenomenon by training LLMs with Reinforcement Learning on simulated user feedback...