Reward Hacking: Concrete Problems in AI Safety Part 3