Hacker News new | past | comments | ask | show | jobs | submit login

Very cool! Thanks for sharing.

Rather than an offline model, why not use an online, continuously relearning model like a Multi-Armed Bandit to do the re-ranking?




We're completely on board with you for reinforcement learning, however we wanted to start with something simpler to build the tool faster. RL is one the plate however!




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: