SimPPL

[2505.11711] Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Source
https://arxiv.org/abs/2505.11711
Tags
llmshuggingface

Permalink: simppl.org/library/item/2505-11711-reinforcement-learning-finetunes-small-subnetworks-in-large-6da9ec2c

This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.