SimPPL

Seunghyun Seo on X: "Inspired by this thread, I'd like to share my slides on training horizon scaling. Lately, lots o...

Seunghyun Seo's slides on training-horizon scaling, focusing on the role of weight decay (not just learning rate) when extending the training horizon.

Source
https://x.com/seunghyunseo7/status/2006363639037788460?s=46
Tags
training, twitter

Permalink: simppl.org/library/item/seunghyun-seo-on-x-inspired-by-this-thread-id-like-to-share-my-slides--3e524a9c

This is a SimPPL canonical link to a reading shared in our newsletter. Browse the rest at simppl.org/library.