SimPPL

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | Nathan Lambert

Source
https://www.linkedin.com/posts/natolambert_deepseek-r1-incentivizing-reasoning-capability-activity-7338403302356733952-VLWf
Tags
llms, training

Permalink: simppl.org/library/item/deepseek-r1-incentivizing-reasoning-capability-in-llms-via-reinforceme-bfa25cb9

This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.