SimPPL

Crazy random line in the Cursor RL blog post saying they're collecting RL data from real users, updating the checkpoi...

Nathan Lambert flags Cursor's RL blog: collecting RL data from real users and updating checkpoints every 90-120 minutes — unthinkable a year ago.

Source
https://www.linkedin.com/posts/natolambert_crazy-random-line-in-the-cursor-rl-blog-post-activity-7372097746402676737-vupu
Tags
coding-agentstrainingrlhf

Permalink: simppl.org/library/item/crazy-random-line-in-the-cursor-rl-blog-post-saying-theyre-collecting--01d96558

This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.