Quick Start

This guide gets you from zero to chatting with a codebase in the shortest path possible. For detailed setup, see Installation.

Five-Minute Quickstart

git clone https://github.com/lovesinghal31/codepilot.git
cd codepilot
pnpm install

Start the database and cache infrastructure:

docker compose up -d

Start Ollama and pull the embedding model:

ollama pull nomic-embed-text
ollama pull qwen2.5-coder:3b

Copy the environment files and set minimum required values:

cp .env.example .env
cp .env.api.example .env.api
cp .env.worker.example .env.worker
cp .env.web.example .env.web

At minimum, ensure DATABASE_URL and REDIS_HOST are set correctly in .env.api and .env.worker.

pnpm --filter @repo/db run db:generate
pnpm --filter @repo/db run db:migrate
pnpm dev

The dashboard is now available at http://localhost:3000.

The worker will clone the repository, parse files, generate embeddings, and store vectors. You can monitor progress in the dashboard.

Once ingestion is complete, navigate to the repository page and start asking questions:

CodePilot retrieves relevant code chunks via semantic search and generates context-aware answers using the local LLM.

When you connect a repository, CodePilot performs the following pipeline:

Clone — The worker clones the repo using a GitHub installation token via simple-git
Scan — Files are scanned and filtered by language (TypeScript, JavaScript, Markdown, JSON)
Parse — ts-morph performs AST-aware analysis to identify functions, classes, and components
Chunk — Code is split into meaningful chunks (function bodies, class definitions, component trees)
Embed — Each chunk is embedded using nomic-embed-text (768-dimensional vectors)
Store — Vectors are stored in PostgreSQL via pgvector with HNSW indexing for fast retrieval

When you ask a question:

Understand how the monorepo is organized.

Deep dive into the architecture.