Rather than an offline model, why not use an online, continuously relearning model, such as a multi-armed bandit, to do the re-ranking?