Shizhe Diao on X: "✨Introducing ProfBench. LLM eval shouldn’t be limited to math/code/short QA. Real work is: read pr...
This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.