clssck 59e89772de refactor: consolidate to PostgreSQL-only backend and modernize stack

Remove legacy storage implementations and deprecated examples:
- Delete FAISS, JSON, Memgraph, Milvus, MongoDB, Nano Vector DB, Neo4j, NetworkX, Qdrant, Redis storage backends
- Remove Kubernetes deployment manifests and installation scripts
- Delete unofficial examples for deprecated backends and offline deployment docs
Streamline core infrastructure:
- Consolidate storage layer to PostgreSQL-only implementation
- Add full-text search caching with FTS cache module
- Implement metrics collection and monitoring pipeline
- Add explain and metrics API routes
Modernize frontend and tooling:
- Switch web UI to Bun with bun.lock, remove npm and pnpm lockfiles
- Update Dockerfile for PostgreSQL-only deployment
- Add Makefile for common development tasks
- Update environment and configuration examples
Enhance evaluation and testing capabilities:
- Add prompt optimization with DSPy and auto-tuning
- Implement ground truth regeneration and variant testing
- Add prompt debugging and response comparison utilities
- Expand test coverage with new integration scenarios
Simplify dependencies and configuration:
- Remove offline-specific requirement files
- Update pyproject.toml with streamlined dependencies
- Add Python version pinning with .python-version
- Create project guidelines in CLAUDE.md and AGENTS.md

2025-12-12 16:28:49 +01:00

4.9 KiB

Raw Blame History

Repository Guidelines

LightRAG is an advanced Retrieval-Augmented Generation (RAG) framework designed to enhance information retrieval and generation through graph-based knowledge representation.

Project Structure & Module Organization

lightrag/: Core Python package with orchestrators (lightrag/lightrag.py), storage adapters in kg/, LLM bindings in llm/, and helpers such as operate.py and utils_*.py.
lightrag-api/: FastAPI service (lightrag_server.py) with routers under routers/ and Gunicorn launcher run_with_gunicorn.py.
lightrag_webui/: React 19 + TypeScript client driven by Bun + Vite; UI components live in src/.
Tests live in tests/ and root-level test_*.py. Working datasets stay in inputs/, rag_storage/, temp/; deployment collateral lives in docs/ and docker-compose.yml.

Build, Test, and Development Commands

python -m venv .venv && source .venv/bin/activate: set up the Python runtime.
pip install -e . / pip install -e .[api]: install the package and API extras in editable mode.
lightrag-server or uvicorn lightrag.api.lightrag_server:app --reload: start the API locally; ensure .env is present.
python -m pytest tests (offline markers apply by default) or python -m pytest tests --run-integration / python test_graph_storage.py: run the full suite, opt into integration coverage, or target an individual script.
ruff check .: lint Python sources before committing.
bun install, bun run dev, bun run build, bun test: manage the web UI workflow (Bun is mandatory).

Coding Style & Naming Conventions

Backend code follow PEP 8 with four-space indentation, annotate functions, and reach for dataclasses when modelling state.
Use lightrag.utils.logger instead of print; respect logger configuration flags.
Extend storage or pipeline abstractions via lightrag.base and keep reusable helpers in the existing utils_*.py.
Python modules remain lowercase with underscores; React components use PascalCase.tsx and hooks-first patterns.
Front-end code should remain in TypeScript with two-space indentation, rely on functional React components with hooks, and follow Tailwind utility style.

Testing Guidelines

Keep pytest additions close to the code you touch (tests/ mirrors feature folders and there are root-level test_*.py helpers); functions must start with test_.
Follow tests/pytest.ini: markers include offline, integration, requires_db, and requires_api, and the suite runs with -m "not integration" by default—pass --run-integration (or set LIGHTRAG_RUN_INTEGRATION=true) when external services are available.
Use the custom CLI toggles from tests/conftest.py: --keep-artifacts/LIGHTRAG_KEEP_ARTIFACTS=true, --stress-test/LIGHTRAG_STRESS_TEST=true, and --test-workers N/LIGHTRAG_TEST_WORKERS to dial up workloads or preserve temp files during investigations.
Export other required LIGHTRAG_* environment variables before running integration or storage tests so adapters can reach configured backends.
For UI updates, pair changes with Vitest specs and run bun test.

Commit & Pull Request Guidelines

Use concise, imperative commit subjects (e.g., Fix lock key normalization) and add body context only when necessary.
PRs should include a summary, operational impact, linked issues, and screenshots or API samples for user-facing work.
Verify ruff check ., python -m pytest, and affected Bun commands succeed before requesting review; note the runs in the PR text.

Security & Configuration Tips

NEVER commit secrets: .env, API keys, passwords, connection strings, tokens
NEVER commit: .env, config.ini, *.pem, *.key, credentials files
Copy env.example → .env and config.ini.example → config.ini for local dev
These files are in .gitignore — keep it that way
Configure storage backends through LIGHTRAG_* variables and validate them with docker-compose services when needed
Treat lightrag.log* as local artefacts; purge sensitive information before sharing logs or outputs
Before committing, run git status to ensure no secrets are staged

Automation & Agent Workflow

Use repo-relative workdir arguments for every shell command and prefer rg/rg --files for searches since they are faster under the CLI harness.
Default edits to ASCII, rely on apply_patch for single-file changes, and only add concise comments that aid comprehension of complex logic.
Honor existing local modifications; never revert or discard user changes (especially via git reset --hard) unless explicitly asked.
Follow the planning tool guidance: skip it for trivial fixes, but provide multi-step plans for non-trivial work and keep the plan updated as steps progress.
Validate changes by running the relevant ruff/pytest/bun test commands whenever feasible, and describe any unrun checks with follow-up guidance.

4.9 KiB Raw Blame History

Repository Guidelines

Project Structure & Module Organization

Build, Test, and Development Commands

Coding Style & Naming Conventions

Testing Guidelines

Commit & Pull Request Guidelines

Security & Configuration Tips

Automation & Agent Workflow

4.9 KiB

Raw Blame History