Basic Retrieval Augmented Generation

This repository contains a minimal implementation of retrieval-augmented generation (RAG) that uses a Python list of tuples as the vector database and runs embedding and inference locally through Ollama. Walk through it step by step in the Jupyter notebook, or run simple_rag.py from the command line.
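As a rough sketch of the idea, the list-of-tuples store and the retrieval step might look like the following. This is not the code in simple_rag.py: `toy_embed`, `build_db`, and `retrieve` are hypothetical names, and the word-count embedding is only a stand-in so the example runs without an Ollama server. In practice the `embed` argument would wrap a call to an Ollama embedding model.

```python
import math
from collections import Counter


def toy_embed(text, vocab):
    """Stand-in embedding (assumption): word counts over a fixed vocabulary.

    A real setup would return a dense vector from an embedding model instead.
    """
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in vocab]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as unrelated to everything
    return dot / (norm_a * norm_b)


def build_db(chunks, embed):
    """The 'vector database': a plain list of (chunk, embedding) tuples."""
    return [(chunk, embed(chunk)) for chunk in chunks]


def retrieve(query, db, embed, top_n=2):
    """Score the query against every stored chunk and return the best top_n."""
    query_vec = embed(query)
    scored = [(chunk, cosine_similarity(query_vec, vec)) for chunk, vec in db]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Made-up example data, not taken from cat-facts.txt.
chunks = [
    "cats sleep most of the day",
    "a cat can jump very high",
    "dogs bark at strangers",
]
vocab = sorted({word for chunk in chunks for word in chunk.split()})


def embed(text):
    return toy_embed(text, vocab)


db = build_db(chunks, embed)
results = retrieve("how high can a cat jump", db, embed, top_n=1)
```

The retrieved chunks would then be pasted into the prompt sent to the language model. Note that `retrieve` loops over the whole list on every query, which keeps the code short but scales linearly with the number of chunks.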

It has several limitations that would have to be overcome in production:

- Vector similarity is computed against every element of the database on every query, which would be prohibitively slow at scale.
- Input data is not persisted: the database lives only in memory and is rebuilt on each run.
- Vector similarity is an imperfect relevance metric and needs to be improved for truly useful results, for example by using knowledge graphs (which will be our next example).
- Local inference is useful only in a few edge cases, such as highly secretive clients who are willing to accept slow response times in order to keep full control of their data.
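To illustrate the persistence limitation, one simple option is to write the list of tuples to a JSON file and reload it on the next run. This is only a sketch under assumptions: `save_db` and `load_db` are hypothetical helpers that are not part of this repository, and a production system would more likely use a dedicated vector store.

```python
import json


def save_db(db, path):
    """Write a list of (chunk, embedding) tuples to a JSON file."""
    records = [{"chunk": chunk, "embedding": list(vec)} for chunk, vec in db]
    with open(path, "w", encoding="utf-8") as f:
        json.dump(records, f)


def load_db(path):
    """Read the JSON file back into a list of (chunk, embedding) tuples."""
    with open(path, encoding="utf-8") as f:
        records = json.load(f)
    return [(record["chunk"], record["embedding"]) for record in records]
```

With helpers like these, the embeddings would only need to be computed once per dataset rather than once per run.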

Files:

- .gitignore
- README.md
- cat-facts.txt
- simple_rag.py
- test.ipynb


sibeliu/simple_rag
