
Local_RAG_Agent

A local Retrieval-Augmented Generation (RAG) agent that loads a .txt or .csv file, embeds its content, retrieves relevant context, and answers user questions using a local language model and vector search.

Completed Personal Project

The Challenge

Learners, developers, and researchers often need to extract structured answers from unstructured text files without uploading sensitive data to the cloud. Many existing solutions depend on hosted AI services, incur costs, or expose data externally.

The goal of this project is to provide a self-hosted RAG agent that can answer contextual questions about local data files without sending the data to third-party services or requiring internet access.

The Solution

File Ingestion: The agent accepts .txt or .csv files as input.

Text Splitting: It splits the contents into manageable chunks for semantic processing.

Embedding: Each chunk is converted into a vector representation using a local embedding model.

Vector Store: These embeddings are stored in ChromaDB for fast similarity search.

Retrieval: When a user asks a question, the vector store retrieves the most relevant chunks.

Generation: A local LLM (via Ollama) uses the retrieved context to generate precise answers.

This pipeline ensures that answers are grounded in the source text, not hallucinated.
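The end-to-end flow above can be sketched in plain Python. The sentence splitter and bag-of-words "embedding" below are toy stand-ins for the real chunker and the Ollama embedding model, chosen so the example runs with no dependencies:

```python
import math
import re
from collections import Counter

def split_text(text):
    """Toy splitter: one sentence per chunk. The real agent's splitter
    preserves more context (e.g. overlapping windows)."""
    return [s.strip() for s in text.split(".") if s.strip()]

def embed(text):
    """Toy bag-of-words 'embedding': word -> count. The real agent calls
    a local embedding model via Ollama instead."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

# Index: embed every chunk and keep (vector, chunk) pairs -- the role ChromaDB plays.
corpus = ("Paris is the capital of France. "
          "The Eiffel Tower is in Paris. "
          "Tokyo is the capital of Japan.")
index = [(embed(chunk), chunk) for chunk in split_text(corpus)]

# Retrieve: rank chunks by similarity to the question; the best match becomes
# the context a local LLM would answer from.
question = "What is the capital of Japan?"
best_chunk = max(index, key=lambda pair: cosine(embed(question), pair[0]))[1]
```

The retrieved chunk, not the model's memory, is what grounds the final answer.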

Key Features

Answers questions based on local .txt or .csv data.

Uses a local LLM and embeddings (no cloud APIs required).

Performs semantic search over text with a vector database.

Built with LangChain Expression Language (LCEL) for composable workflows.

Compact and easy to set up with minimal dependencies.
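The LCEL style mentioned above composes pipeline stages with the | operator. The `Step` class below is a toy imitation of that idea, not the LangChain API (real chains compose runnables from langchain_core):

```python
class Step:
    """Toy stand-in for an LCEL runnable: wraps a function and supports
    composition with |, so step_a | step_b runs a, then feeds b."""
    def __init__(self, fn):
        self.fn = fn

    def __or__(self, other):
        return Step(lambda x: other.fn(self.fn(x)))

    def invoke(self, x):
        return self.fn(x)

# Stages mirroring the agent's pipeline: retrieve context, build a prompt,
# and "generate" with a stub LLM (the real agent calls Ollama here).
retrieve = Step(lambda q: {"question": q, "context": "stored chunk text"})
build_prompt = Step(lambda d: f"Context: {d['context']}\nQuestion: {d['question']}")
stub_llm = Step(lambda prompt: f"Answer based on: {prompt.splitlines()[0]}")

chain = retrieve | build_prompt | stub_llm
result = chain.invoke("What does the file say?")
```

Because each stage is a plain function behind a uniform interface, stages can be swapped (a different retriever, a different model) without touching the rest of the chain.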

Architecture & Implementation

  1. Input Layer
    User provides a .txt or .csv file containing data to be queried.

  2. Preprocessing
    Text is read and segmented into chunks to preserve context for embedding and retrieval.

  3. Embedding Layer
    Chunks are transformed into numerical vectors using a local embedding model from Ollama.

  4. Vector Storage
    These vectors are stored in ChromaDB for similarity search.

  5. Retrieval & Generation
    Upon a user query, the system retrieves the top-matching chunks from ChromaDB, and a local LLM (via Ollama) generates an answer based on that context.

This pipeline keeps processing and inference fully local and private.
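Step 5 can be sketched as a top-k similarity search followed by prompt assembly. The vectors below are hand-written stand-ins for real embeddings (a real run would query ChromaDB with Ollama-generated vectors):

```python
import heapq

def top_k(query_vec, index, k=2):
    """Return the k chunks whose vectors score highest against the query
    vector by dot product (stand-in for a ChromaDB similarity query)."""
    def score(vec):
        return sum(a * b for a, b in zip(query_vec, vec))
    ranked = heapq.nlargest(k, index, key=lambda item: score(item[0]))
    return [chunk for _, chunk in ranked]

# Hand-written 3-d vectors standing in for real embeddings.
index = [
    ([0.9, 0.1, 0.0], "Chunk about invoices"),
    ([0.1, 0.8, 0.1], "Chunk about shipping"),
    ([0.0, 0.2, 0.9], "Chunk about refunds"),
]

query_vec = [0.1, 0.1, 0.95]  # pretend-embedding of "How do refunds work?"
context = "\n".join(top_k(query_vec, index, k=2))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: How do refunds work?"
```

The assembled prompt is what the local LLM sees, which is why the answer stays tied to the retrieved text.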

Technologies Used

Python, LangChain Expression Language (LCEL), Ollama, ChromaDB, local LLM models

Challenges & Learnings

Efficient Chunking: Splitting text into appropriate segments for embeddings without losing critical context.

Local Model Performance: Running embedding and response models locally requires careful model choice to balance speed and memory usage.

Grounded Responses: Ensuring the agent retrieves truly relevant text chunks so the generated answers stay factual and tied to the source.
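The chunking trade-off can be seen in a simple sliding-window splitter: overlap duplicates text at chunk boundaries so that sentences straddling a cut are not lost, at the cost of some redundancy. Parameter values here are illustrative, not the agent's actual settings:

```python
def chunk_with_overlap(text, chunk_size=50, overlap=10):
    """Sliding-window splitter: each chunk shares `overlap` characters
    with the previous one, so boundary context is duplicated rather than
    cut. Larger overlap = more redundancy but safer retrieval."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

sample = "ABCDEFGHIJ" * 12  # 120 characters
chunks = chunk_with_overlap(sample, chunk_size=50, overlap=10)
```

Tuning chunk_size and overlap against the embedding model's context behavior was one of the main knobs in this project.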

Results & Impact

Enables offline, private question answering from file data.

Demonstrates a minimal yet complete RAG pipeline that requires no paid APIs or internet access.

Useful for building personal knowledge bases, document search tools, or offline assistants.
