
Even if it was made up, it's still a serious issue.

I thought this looked familiar - it's a reskin of a fantastic full-stack tutorial from a few months back:

https://youtu.be/RkwbGuL-dzo?si=mWnIfWnmFLKIwIaD


It's not a reskin, I developed it on my own. It's just that these workflow builders are built using libraries such as React Flow, as you can see in the bottom-right corner of the flow builder's canvas. Also, my application is built on Svelte, and I'm using Svelte Flow, which is a fork of React Flow itself.


IMHO, if someone made something while following along with a tutorial, I do think they should be free to turn THAT software they made (with a guide) into a product. The presumption is that the tutorial isn't just handing you a finished product that you sell with no changes. (I do see there's access to the source code in the description.)

Skimming the video and this site, I'm not noticing a ton of differences other than some minor styling changes. Maybe the author could mention the changes they've made? If they're significant, IMO, it's fine.


Yikes. I was wondering where all of these exact replicas were coming from. Thanks for sharing.


An issue I've seen in several RAG implementations is assuming that the target documents, however cleverly they're chunked, will be good search keys for incoming queries. Unless your incoming search text looks semantically like the documents you're searching over (not the case in general), you'll get bad hits. On a recent project, we saw a big improvement in retrieval relevance when we separated the search keys from the returned values (the chunked documents) and used an LM to generate appropriate keys, which were then embedded. "Appropriate" here means sentences like what the user might input if they're expecting this chunk back.
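
For concreteness, a minimal sketch of the index-time side in Python. llm_complete() and embed() are hypothetical stand-ins for whatever LLM and embedding model you're using, and chunks is assumed to be your list of chunked documents:

    # Generate "user-style" keys per chunk, embed the keys, and keep
    # the chunk itself as the returned value.
    # llm_complete() and embed() are placeholders, not real APIs.

    def generate_keys(chunk: str, n: int = 3) -> list[str]:
        prompt = (
            f"Write {n} short questions a user might ask if they "
            f"expected this passage as the answer:\n\n{chunk}"
        )
        return llm_complete(prompt).splitlines()[:n]

    index = []  # list of (key_embedding, chunk) pairs
    for chunk in chunks:  # chunks = your chunked documents
        for key in generate_keys(chunk):
            index.append((embed(key), chunk))

Note the same chunk can appear under several keys; that's deliberate, since different users phrase the same need differently.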


Interesting! So you basically got an LM to rephrase the search phrase/keys into the style of the target documents, then used that in the RAG pipeline? Did you do an initial search first to limit the documents?


IIUC, they're generating a sort of Q/A pair for each document chunk: they ask an LLM to "play the user role and ask a question that would be answered by this chunk". They then embed those questions, match live user queries against those questions first, and then maybe re-rank on the retrieved document chunks.
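
Something like this at query time (a sketch under the same assumptions as the index-time snippet above: embed() is a placeholder embedding call, and index holds (question_embedding, chunk) pairs built offline):

    # Embed the live query, score it against the generated-question
    # embeddings, and return the chunks those questions point back to
    # (deduped, best match first).

    import numpy as np

    def cosine(a, b) -> float:
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

    def retrieve(query: str, index, k: int = 5) -> list[str]:
        q = embed(query)
        ranked = sorted(index, key=lambda kv: cosine(q, kv[0]), reverse=True)
        seen, out = set(), []
        for _, chunk in ranked:
            if chunk not in seen:  # dedupe chunks, keep rank order
                seen.add(chunk)
                out.append(chunk)
            if len(out) == k:
                break
        return out

A re-ranking pass over the returned chunks would slot in after retrieve(), if you want one.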


Many of those editing steps could be streamlined and/or straight-up automated, so that estimate will come way down over time.

