[2505.14617] The Hawthorne Effect in Reasoning Models: Evaluating and Steering Test Awareness
This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.
This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.