New paper!🤖 Investigating Reward Tampering Sycophancy To Subterfuge: Investigating Reward Tampering In Language Models