Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
squiggleblaz
34 days ago
|
parent
|
context
|
favorite
| on:
Recent AI model progress feels mostly like bullshi...
Reinforcement learning, maximise rewards? They work because rabbits like carrots. What does an LLM want? Haven't we already committed the fundamental error when we're saying we're using reinforcement learning and they want rewards?
Consider applying for YC's Summer 2025 batch! Applications are open till May 13
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: