
An issue I've seen in several RAG implementations is the assumption that the target documents, however cleverly they're chunked, will make good search keys for incoming queries. Unless the incoming search text looks semantically like the documents you're searching over (which in general it doesn't), you'll get bad hits. On a recent project we saw a big improvement in retrieval relevance when we separated the search keys from the returned values (the chunked documents) and used an LM to generate appropriate keys, which were then embedded. "Appropriate" here means "sentences like what a user might type if they were expecting this chunk back".
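A minimal sketch of the indexing side in Python, assuming an OpenAI-style client; the model names, the prompt, and the key-store layout are illustrative assumptions, not the exact setup from the project:

    from openai import OpenAI

    client = OpenAI()

    def make_keys(chunk_text, n=3):
        # Illustrative prompt: ask the LM for questions a user might
        # type when they are expecting this chunk back.
        prompt = (
            f"Write {n} short questions a user might ask that this "
            f"passage answers, one per line:\n\n{chunk_text}"
        )
        resp = client.chat.completions.create(
            model="gpt-4o-mini",  # assumed model; swap in your own
            messages=[{"role": "user", "content": prompt}],
        )
        lines = resp.choices[0].message.content.splitlines()
        return [q.strip() for q in lines if q.strip()]

    def index_chunk(chunk_id, chunk_text, store):
        # Embed the generated keys, not the chunk itself; each key
        # maps back to the chunk that should be returned on a hit.
        keys = make_keys(chunk_text)
        embs = client.embeddings.create(
            model="text-embedding-3-small", input=keys
        )
        for key, item in zip(keys, embs.data):
            store.append(
                {"key": key, "embedding": item.embedding, "chunk_id": chunk_id}
            )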

Interesting! So you basically got an LM to rephrase the search phrases/keys into the style of the target documents, then used those in the RAG pipeline? Did you do an initial search first to narrow down the documents?


IIUC, they're doing a sort of Q&A generation for each document chunk: they ask an LLM to "play the user role and write a question that this chunk would answer". They then embed those questions, match live user queries against the questions first, and maybe re-rank on the document chunks retrieved.
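If that reading is right, the query-time side would look roughly like this sketch (cosine similarity over question embeddings in a store shaped like the one above; dedup because several questions can point at the same chunk; the re-rank step is left as a comment):

    import numpy as np

    def retrieve(query, store, client, top_k=5):
        # Embed the live query and match it against the generated
        # questions, not against the chunks themselves.
        q = np.asarray(
            client.embeddings.create(
                model="text-embedding-3-small", input=[query]
            ).data[0].embedding
        )
        q = q / np.linalg.norm(q)

        scored = []
        for item in store:
            k = np.asarray(item["embedding"])
            sim = float(q @ (k / np.linalg.norm(k)))
            scored.append((sim, item["chunk_id"]))
        scored.sort(key=lambda t: t[0], reverse=True)

        # Several generated questions can point at the same chunk,
        # so deduplicate before returning.
        seen, hits = set(), []
        for _, chunk_id in scored:
            if chunk_id not in seen:
                seen.add(chunk_id)
                hits.append(chunk_id)
            if len(hits) == top_k:
                break
        return hits  # optionally re-rank these chunks before answering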
