Ask HN: What RAG setup gets me 95% of the way there?
6 points by uptownfunk 4 days ago
Alternatively, what is your RAG setup?
softwaredoug 2 days ago
Couldn’t put it better than this post: https://www.linkedin.com/posts/softwaredoug_absolutely-kille...
cmcollier 3 days ago
This will get you the first 80%:
* Any solid search engine (BM25 + embeddings and HNSW)
* Any API to a model (GPT-3.5, GPT-4, Claude, etc.)
* Some middleware to call search, then build the prompt
Then the remaining:
* Create an eval dataset, then tune the search and the prompt as needed
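For anyone who wants to see the shape of that list in code, here is a minimal sketch in Python. It is not cmcollier's implementation, just one way to read the bullets. Assumptions: the search engine is replaced by a tiny in-memory BM25 index (the embeddings + HNSW half is left out), the model API call is omitted so the middleware stops at building the prompt, and every name here (BM25Index, build_prompt, hit_rate, DOCS, EVAL_SET) is hypothetical, as is the toy corpus.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-alphanumeric characters."""
    return re.findall(r"[a-z0-9]+", text.lower())


class BM25Index:
    """Toy in-memory BM25 index; a stand-in for a real search engine."""

    def __init__(self, docs, k1=1.5, b=0.75):
        self.docs = docs
        self.k1, self.b = k1, b
        self.tokens = [tokenize(d) for d in docs]
        self.tfs = [Counter(t) for t in self.tokens]
        self.avg_len = sum(len(t) for t in self.tokens) / len(docs)
        # Document frequency: in how many docs each term appears.
        df = Counter(term for t in self.tokens for term in set(t))
        n = len(docs)
        self.idf = {t: math.log(1 + (n - f + 0.5) / (f + 0.5)) for t, f in df.items()}

    def search(self, query, k=3):
        """Return indices of up to k matching documents, best first."""
        q_terms = tokenize(query)
        scored = []
        for i, tf in enumerate(self.tfs):
            norm = self.k1 * (1 - self.b + self.b * len(self.tokens[i]) / self.avg_len)
            score = sum(
                self.idf[t] * tf[t] * (self.k1 + 1) / (tf[t] + norm)
                for t in q_terms
                if t in tf
            )
            scored.append((score, i))
        ranked = sorted(scored, key=lambda p: (-p[0], p[1]))
        return [i for score, i in ranked[:k] if score > 0]


def build_prompt(index, question, k=3):
    """Middleware step: call search, then assemble the prompt for the model."""
    hits = index.search(question, k)
    context = "\n".join(f"[{n + 1}] {index.docs[i]}" for n, i in enumerate(hits))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


def hit_rate(index, eval_set, k=3):
    """Eval step: fraction of questions whose expected doc is in the top k."""
    hits = sum(1 for question, doc_id in eval_set if doc_id in index.search(question, k))
    return hits / len(eval_set)


# Toy corpus and eval set, made up for illustration only.
DOCS = [
    "pgvector adds vector similarity search to Postgres.",
    "BM25 ranks documents by term frequency and inverse document frequency.",
    "HNSW is a graph index for approximate nearest neighbor search.",
]
EVAL_SET = [
    ("What does pgvector add to Postgres?", 0),
    ("How does BM25 rank documents?", 1),
    ("What kind of index is HNSW?", 2),
]

index = BM25Index(DOCS)
print(build_prompt(index, "What does pgvector add to Postgres?", k=1))
print("hit rate @1:", hit_rate(index, EVAL_SET, k=1))
```

The string from build_prompt is what you would send to whichever model API you picked. The hit_rate number is the thing to watch while tuning: change the tokenizer, k, or the prompt template, rerun the eval set, and keep whatever moves it.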
If I did it I would try this first: https://github.com/pgvector/pgvector
Try dify (https://github.com/langgenius/dify)
https://bionic-gpt.com Just sign up and start adding data.