Hacker News new | past | comments | ask | show | jobs | submit login

you mean fine tuning that feels like SFT but is different (since you can't use that with reasoning models) built around the DeepSeek class of models?



I just want to fine tune deepseek v3 chat but it’s not possible or easy for regular consumers





Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: