SimPPL

Persona vectors: Monitoring and controlling character traits in language models \\ Anthropic

Source
https://www.anthropic.com/research/persona-vectors
Tags
llmsinterpretability

Permalink: simppl.org/library/item/persona-vectors-monitoring-and-controlling-character-traits-in-languag-97b257fa

This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.