Note on Building Search-Based RAG Using Claude, Datasette and Val.Town via Simon Willison

RAG is often implemented using vector search against embeddings, but there’s an alternative approach where you turn the user’s question into some full-text search queries, run those against a traditional search engine, then feed the results back into an LLM and ask it to use them to answer the question.

FROM:
Simon Willison
Building Search-Based RAG Using Claude, Datasette and Val.Town

This is considerably easier to reason about than RAG (retrieval-augmented generation) built on vector search over embeddings, and it can produce high-quality results with a relatively simple implementation.

It’s often much easier to bake FTS (full-text search) into an existing site than to build an embedding-search pipeline.
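
The three steps in the quote (question to search queries, queries to a search engine, results back to the LLM) can be sketched roughly as follows. This is a minimal illustration, not the code from the original post: it assumes a Python build whose bundled SQLite includes the FTS5 extension, and `llm` is a hypothetical callable that takes a prompt string and returns text, standing in for whatever model API is actually used. The table name, prompts, and helper names are all made up for the example.

```python
import re
import sqlite3
from typing import Callable, List, Tuple


def build_index(docs: List[Tuple[str, str]]) -> sqlite3.Connection:
    """Create an in-memory FTS5 index over (title, body) pairs."""
    conn = sqlite3.connect(":memory:")
    # Assumption: the SQLite library bundled with Python was compiled with FTS5.
    conn.execute("CREATE VIRTUAL TABLE docs USING fts5(title, body)")
    conn.executemany("INSERT INTO docs (title, body) VALUES (?, ?)", docs)
    return conn


def to_match_expr(query: str) -> str:
    """Turn free text into a safe FTS5 expression: quoted terms joined by OR."""
    terms = re.findall(r"\w+", query)
    return " OR ".join(f'"{term}"' for term in terms)


def search(conn: sqlite3.Connection, queries: List[str], limit: int = 3) -> List[Tuple[str, str]]:
    """Run each query against the index and return de-duplicated hits."""
    seen = set()
    results = []
    for query in queries:
        expr = to_match_expr(query)
        if not expr:
            continue  # nothing searchable in this query
        rows = conn.execute(
            "SELECT rowid, title, body FROM docs WHERE docs MATCH ? ORDER BY rank LIMIT ?",
            (expr, limit),
        )
        for rowid, title, body in rows:
            if rowid not in seen:
                seen.add(rowid)
                results.append((title, body))
    return results


def answer(question: str, conn: sqlite3.Connection, llm: Callable[[str], str]) -> str:
    """Search-based RAG: the LLM writes queries, FTS retrieves, the LLM answers."""
    # Step 1: ask the model (hypothetical callable) for search queries, one per line.
    raw = llm(f"Write up to three short full-text search queries, one per line, for: {question}")
    queries = [line.strip() for line in raw.splitlines() if line.strip()]
    # Step 2: run those queries against the traditional search index.
    hits = search(conn, queries)
    # Step 3: feed the results back and ask for an answer grounded in them.
    context = "\n\n".join(f"{title}\n{body}" for title, body in hits)
    return llm(
        "Answer the question using only this context.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
```

Because every intermediate value here is plain text (the generated queries, the matched rows, the final prompt), each step can be logged and inspected, which is much of what makes this approach easier to reason about than an embedding pipeline.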

Reference

Notes
search, llm
Building Search-Based RAG Using Claude, Datasette and Val.Town
Simon Willison
2024, July 02, Tuesday
Permalink to 2024.NTE.112


Referenced By

Using an LLM and RAG to Wring Insights From My Posts
