SimPPL

Locke Cai on X: \"RL for reasoning often rely on verifiers — great for math, but tricky for creative writing or open-e...

Source
https://x.com/couplefire12/status/1999169943267381265?s=12
Tags
llmstwitter

Permalink: simppl.org/library/item/locke-cai-on-x-rl-for-reasoning-often-rely-on-verifiers-great-for-math-46b70554

This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.