RLAMA is a powerful AI-driven question-answering tool for your documents, seamlessly integrating with your local Ollama models. It enables you to create, manage, and interact with Retrieval-Augmented Generation (RAG) systems tailored to your documentation needs.
Prerequisites:
- Ollama installed and running

To install RLAMA:
curl -fsSL https://raw.githubusercontent.com/dontizi/rlama/main/install.sh | sh
RLAMA is built with:
- Core Language: Go (chosen for performance, cross-platform compatibility, and single binary distribution)
- CLI Framework: Cobra (for command-line interface structure)
- LLM Integration: Ollama API (for embeddings and completions)
- Storage: Local filesystem-based storage (JSON files for simplicity and portability)
- Vector Search: Custom implementation of cosine similarity for embedding retrieval
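Since the vector search is a custom implementation rather than an external vector database, it helps to see what the core primitive looks like. The following is a minimal Go sketch of cosine similarity, illustrative only and not RLAMA's actual pkg/vector code:

```go
package vector

import "math"

// CosineSimilarity returns the cosine of the angle between two
// equal-length vectors: close to 1.0 for near-identical direction,
// 0.0 for orthogonal vectors. Higher means more similar.
func CosineSimilarity(a, b []float64) float64 {
	var dot, normA, normB float64
	for i := range a {
		dot += a[i] * b[i]
		normA += a[i] * a[i]
		normB += b[i] * b[i]
	}
	if normA == 0 || normB == 0 {
		return 0 // guard against zero vectors
	}
	return dot / (math.Sqrt(normA) * math.Sqrt(normB))
}
```

A brute-force scan with this function over every stored embedding is simple and fast enough at the scale of a local document folder, which fits the project's goal of avoiding external dependencies.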
RLAMA follows a clean architecture pattern with clear separation of concerns:
rlama/
├── cmd/                         # CLI commands (using Cobra)
│   ├── root.go                  # Base command
│   ├── rag.go                   # Create RAG systems
│   ├── run.go                   # Query RAG systems
│   └── ...
├── internal/
│   ├── client/                  # External API clients
│   │   └── ollama_client.go     # Ollama API integration
│   ├── domain/                  # Core domain models
│   │   ├── rag.go               # RAG system entity
│   │   └── document.go          # Document entity
│   ├── repository/              # Data persistence
│   │   └── rag_repository.go    # Handles saving/loading RAGs
│   └── service/                 # Business logic
│       ├── rag_service.go       # RAG operations
│       ├── document_loader.go   # Document processing
│       └── embedding_service.go # Vector embeddings
└── pkg/                         # Shared utilities
    └── vector/                  # Vector operations
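The domain/ entities are the heart of this layout. As a hypothetical sketch (field names are illustrative, not taken from the actual rag.go and document.go), the two core models might look like this, with JSON tags reflecting the filesystem-based storage:

```go
package domain

import "time"

// Document is one indexed file: its extracted plain text plus the
// embedding vector generated for it by Ollama.
type Document struct {
	ID        string    `json:"id"`
	Path      string    `json:"path"`
	Content   string    `json:"content"`
	Embedding []float64 `json:"embedding"`
}

// RagSystem groups the documents indexed under one RAG name,
// together with the Ollama model used to build and query it.
type RagSystem struct {
	Name      string     `json:"name"`
	ModelName string     `json:"model_name"`
	CreatedAt time.Time  `json:"created_at"`
	Documents []Document `json:"documents"`
}
```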
1. Document Processing: Documents are loaded from the file system, parsed based on their type, and converted to plain text.
2. Embedding Generation: Document text is sent to Ollama to generate vector embeddings.
3. Storage: The RAG system (documents + embeddings) is stored in the user's home directory (~/.rlama).
4. Query Process: When a user asks a question, it is converted to an embedding and compared against the stored document embeddings, and the most relevant content is retrieved.
5. Response Generation: The retrieved content and the question are sent to Ollama to generate a contextually informed response (see the code sketch after the diagram below).
┌─────────────┐     ┌─────────────┐     ┌─────────────┐
│  Documents  │────>│  Document   │────>│  Embedding  │
│   (Input)   │     │ Processing  │     │ Generation  │
└─────────────┘     └─────────────┘     └─────────────┘
                                               │
                                               ▼
┌─────────────┐     ┌─────────────┐     ┌─────────────┐
│    Query    │────>│   Vector    │<────│ Vector Store│
│  Response   │     │   Search    │     │ (RAG System)│
└─────────────┘     └─────────────┘     └─────────────┘
       ▲                   │
       │                   ▼
┌─────────────┐     ┌─────────────┐
│   Ollama    │<────│   Context   │
│     LLM     │     │  Building   │
└─────────────┘     └─────────────┘
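To make the query path (steps 4 and 5) concrete, here is a simplified Go sketch. It calls Ollama's /api/embeddings and /api/generate HTTP endpoints and reuses the Document struct and CosineSimilarity function from the sketches above; the embed and answer helpers are illustrative, not RLAMA's actual functions:

```go
package rag

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
	"sort"
)

const ollamaURL = "http://localhost:11434"

// embed asks Ollama for the embedding vector of a piece of text
// via the /api/embeddings endpoint.
func embed(model, text string) ([]float64, error) {
	body, _ := json.Marshal(map[string]string{"model": model, "prompt": text})
	resp, err := http.Post(ollamaURL+"/api/embeddings", "application/json", bytes.NewReader(body))
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()
	var out struct {
		Embedding []float64 `json:"embedding"`
	}
	return out.Embedding, json.NewDecoder(resp.Body).Decode(&out)
}

// answer embeds the question, ranks the stored documents by cosine
// similarity, and sends the best match plus the question to the LLM
// via /api/generate.
func answer(model, question string, docs []Document) (string, error) {
	q, err := embed(model, question)
	if err != nil {
		return "", err
	}
	// Rank documents: most similar embedding first.
	sort.Slice(docs, func(i, j int) bool {
		return CosineSimilarity(docs[i].Embedding, q) > CosineSimilarity(docs[j].Embedding, q)
	})
	// Top-1 retrieval for brevity; a real system would take the top k.
	prompt := fmt.Sprintf("Context:\n%s\n\nQuestion: %s", docs[0].Content, question)

	body, _ := json.Marshal(map[string]any{"model": model, "prompt": prompt, "stream": false})
	resp, err := http.Post(ollamaURL+"/api/generate", "application/json", bytes.NewReader(body))
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	var out struct {
		Response string `json:"response"`
	}
	return out.Response, json.NewDecoder(resp.Body).Decode(&out)
}
```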
RLAMA is designed to be lightweight and portable, focusing on providing RAG capabilities with minimal dependencies. The entire system runs locally, with the only external dependency being Ollama for LLM capabilities.
You can get help on all commands by using:
rlama --help
These flags can be used with any command:
--host string Ollama host (default: localhost)
--port string Ollama port (default: 11434)
Creates a new RAG system by indexing all documents in the specified folder.
rlama rag [model] [rag-name] [folder-path]
Parameters:
- model: Name of the Ollama model to use (e.g., llama3, mistral, gemma).
- rag-name: Unique name to identify your RAG system.
- folder-path: Path to the folder containing your documents.
Example:
rlama rag llama3 documentation ./docs
Starts an interactive session to interact with an existing RAG system.
rlama run [rag-name]
Parameters:
- rag-name: Name of the RAG system to use.
Example:
rlama run documentation
> How do I install the project?
> What are the main features?
> exit
Displays a list of all available RAG systems.
rlama list
Permanently deletes a RAG system and all its indexed documents.
rlama delete [rag-name] [--force/-f]
Parameters:
- rag-name: Name of the RAG system to delete.
- --force or -f: (Optional) Delete without asking for confirmation.
Example:
rlama delete old-project
Or to delete without confirmation:
rlama delete old-project --force
Checks if a new version of RLAMA is available and installs it.
rlama update [--force/-f]
Options:
- --force or -f: (Optional) Update without asking for confirmation.
Displays the current version of RLAMA.
rlama --version
or
rlama -v
To uninstall RLAMA:
If you installed via go install:
rlama uninstall
RLAMA stores its data in ~/.rlama. To remove it:
rm -rf ~/.rlama
RLAMA supports many file formats:
- Text: .txt, .md, .html, .json, .csv, .yaml, .yml, .xml
- Code: .go, .py, .js, .java, .c, .cpp, .h, .rb, .php, .rs, .swift, .kt
- Documents: .pdf, .docx, .doc, .rtf, .odt, .pptx, .ppt, .xlsx, .xls, .epub

Installing dependencies via install_deps.sh is recommended to improve support for certain formats.
If you encounter connection errors to Ollama:
- Check that Ollama is running.
- By default, Ollama must be accessible at http://localhost:11434, or at the host and port specified by the OLLAMA_HOST environment variable.
- If your Ollama instance is running on a different host or port, use the --host and --port flags:
  rlama --host 192.168.1.100 --port 8000 list
  rlama --host my-ollama-server --port 11434 run my-rag
- Check Ollama logs for potential errors.
If you encounter problems with certain formats:
- Install dependencies via ./scripts/install_deps.sh.
- Verify that your system has the required tools (pdftotext, tesseract, etc.).
If the answers are not relevant:
- Check that the documents are properly indexed with rlama list.
- Make sure the content of the documents is properly extracted.
- Try rephrasing your question more precisely.
For any other issues, please open an issue on the GitHub repository providing:
- The exact command used.
- The complete output of the command.
- Your operating system and architecture.
- The RLAMA version (rlama --version).
RLAMA provides multiple ways to connect to your Ollama instance:
- Command-line flags (highest priority):
  rlama --host 192.168.1.100 --port 8080 run my-rag
- Environment variable:
  # Format: "host:port" or just "host"
  export OLLAMA_HOST=remote-server:8080
  rlama run my-rag
- Default values (used if no other method is specified):
  - Host: localhost
  - Port: 11434
The precedence order is: command-line flags > environment variable > default values.
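For illustration, a resolution routine implementing that precedence might look like the following Go sketch (a hypothetical helper, not RLAMA's actual code):

```go
package config

import (
	"os"
	"strings"
)

// resolveOllamaAddress applies the precedence described above:
// command-line flags beat OLLAMA_HOST, which beats the defaults.
// flagHost and flagPort are empty when the flags were not passed.
func resolveOllamaAddress(flagHost, flagPort string) (host, port string) {
	host, port = "localhost", "11434" // defaults

	// Environment variable: "host:port" or just "host".
	if env := os.Getenv("OLLAMA_HOST"); env != "" {
		if h, p, ok := strings.Cut(env, ":"); ok {
			host, port = h, p
		} else {
			host = env
		}
	}

	// Command-line flags have the highest priority.
	if flagHost != "" {
		host = flagHost
	}
	if flagPort != "" {
		port = flagPort
	}
	return host, port
}
```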