SimPPL

[2506.12349] Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship i...

Source: https://arxiv.org/abs/2506.12349
Tags: llms, huggingface, moderation

Permalink: simppl.org/library/item/2506-12349-information-suppression-in-large-language-models-auditing-q-2be0a809

This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.