Hacker News new | past | comments | ask | show | jobs | submit login

The mention of AdamW is brief, but in his defense he includes a link that gives a gloss of it: "An updated overview of recent gradient descent algorithms" [https://johnchenresearch.github.io/demon/].



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: