SimPPL

Shizhe Diao on X: "✨Introducing ProfBench. LLM eval shouldn’t be limited to math/code/short QA. Real work is: read pr...

Source
https://x.com/shizhediao/status/2005316926021709849
Tags
agents, llms, evaluation, training

Permalink: simppl.org/library/item/shizhe-diao-on-x-introducing-profbench-llm-eval-shouldn-t-be-limited-t-6838ffcb

This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.