SimPPL

[2505.14617] The Hawthorne Effect in Reasoning Models: Evaluating and Steering Test Awareness

Source
https://arxiv.org/abs/2505.14617
Tags
llms, huggingface

Permalink: simppl.org/library/item/2505-14617-the-hawthorne-effect-in-reasoning-models-evaluating-and-ste-3ffac3de

This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.