tests: stabilize integration tests + skip external services; fix multi-tenant API behavior and idempotency (#4 )

* feat: Implement multi-tenant architecture with tenant and knowledge base models

- Added data models for tenants, knowledge bases, and related configurations.
- Introduced role and permission management for users in the multi-tenant system.
- Created a service layer for managing tenants and knowledge bases, including CRUD operations.
- Developed a tenant-aware instance manager for LightRAG with caching and isolation features.
- Added a migration script to transition existing workspace-based deployments to the new multi-tenant architecture.

* chore: ignore lightrag/api/webui/assets/ directory

* chore: stop tracking lightrag/api/webui/assets (ignore in .gitignore)

* feat: Initialize LightRAG Multi-Tenant Stack with PostgreSQL

- Added README.md for project overview, setup instructions, and architecture details.
- Created docker-compose.yml to define services: PostgreSQL, Redis, LightRAG API, and Web UI.
- Introduced env.example for environment variable configuration.
- Implemented init-postgres.sql for PostgreSQL schema initialization with multi-tenant support.
- Added reproduce_issue.py for testing default tenant access via API.

* feat: Enhance TenantSelector and update related components for improved multi-tenant support

* feat: Enhance testing capabilities and update documentation

- Updated Makefile to include new test commands for various modes (compatibility, isolation, multi-tenant, security, coverage, and dry-run).
- Modified API health check endpoint in Makefile to reflect new port configuration.
- Updated QUICK_START.md and README.md to reflect changes in service URLs and ports.
- Added environment variables for testing modes in env.example.
- Introduced run_all_tests.sh script to automate testing across different modes.
- Created conftest.py for pytest configuration, including database fixtures and mock services.
- Implemented database helper functions for streamlined database operations in tests.
- Added test collection hooks to skip tests based on the current MULTITENANT_MODE.

* feat: Implement multi-tenant support with demo mode enabled by default

- Added multi-tenant configuration to the environment and Docker setup.
- Created pre-configured demo tenants (acme-corp and techstart) for testing.
- Updated API endpoints to support tenant-specific data access.
- Enhanced Makefile commands for better service management and database operations.
- Introduced user-tenant membership system with role-based access control.
- Added comprehensive documentation for multi-tenant setup and usage.
- Fixed issues with document visibility in multi-tenant environments.
- Implemented necessary database migrations for user memberships and legacy support.

* feat(audit): Add final audit report for multi-tenant implementation

- Documented overall assessment, architecture overview, test results, security findings, and recommendations.
- Included detailed findings on critical security issues and architectural concerns.

fix(security): Implement security fixes based on audit findings

- Removed global RAG fallback and enforced strict tenant context.
- Configured super-admin access and required user authentication for tenant access.
- Cleared localStorage on logout and improved error handling in WebUI.

chore(logs): Create task logs for audit and security fixes implementation

- Documented actions, decisions, and next steps for both audit and security fixes.
- Summarized test results and remaining recommendations.

chore(scripts): Enhance development stack management scripts

- Added scripts for cleaning, starting, and stopping the development stack.
- Improved output messages and ensured graceful shutdown of services.

feat(starter): Initialize PostgreSQL with AGE extension support

- Created initialization scripts for PostgreSQL extensions including uuid-ossp, vector, and AGE.
- Ensured successful installation and verification of extensions.

* feat: Implement auto-select for first tenant and KB on initial load in WebUI

- Removed WEBUI_INITIAL_STATE_FIX.md as the issue is resolved.
- Added useTenantInitialization hook to automatically select the first available tenant and KB on app load.
- Integrated the new hook into the Root component of the WebUI.
- Updated RetrievalTesting component to ensure a KB is selected before allowing user interaction.
- Created end-to-end tests for multi-tenant isolation and real service interactions.
- Added scripts for starting, stopping, and cleaning the development stack.
- Enhanced API and tenant routes to support tenant-specific pipeline status initialization.
- Updated constants for backend URL to reflect the correct port.
- Improved error handling and logging in various components.

* feat: Add multi-tenant support with enhanced E2E testing scripts and client functionality

* update client

* Add integration and unit tests for multi-tenant API, models, security, and storage

- Implement integration tests for tenant and knowledge base management endpoints in `test_tenant_api_routes.py`.
- Create unit tests for tenant isolation, model validation, and role permissions in `test_tenant_models.py`.
- Add security tests to enforce role-based permissions and context validation in `test_tenant_security.py`.
- Develop tests for tenant-aware storage operations and context isolation in `test_tenant_storage_phase3.py`.

* feat(e2e): Implement OpenAI model support and database reset functionality

* Add comprehensive test suite for gpt-5-nano compatibility

- Introduced tests for parameter normalization, embeddings, and entity extraction.
- Implemented direct API testing for gpt-5-nano.
- Validated .env configuration loading and OpenAI API connectivity.
- Analyzed reasoning token overhead with various token limits.
- Documented test procedures and expected outcomes in README files.
- Ensured all tests pass for production readiness.

* kg(postgres_impl): ensure AGE extension is loaded in session and configure graph initialization

* dev: add hybrid dev helper scripts, Makefile, docker-compose.dev-db and local development docs

* feat(dev): add dev helper scripts and local development documentation for hybrid setup

* feat(multi-tenant): add detailed specifications and logs for multi-tenant improvements, including UX, backend handling, and ingestion pipeline

* feat(migration): add generated tenant/kb columns, indexes, triggers; drop unused tables; update schema and docs

* test(backward-compat): adapt tests to new StorageNameSpace/TenantService APIs (use concrete dummy storages)

* chore: multi-tenant and UX updates — docs, webui, storage, tenant service adjustments

* tests: stabilize integration tests + skip external services; fix multi-tenant API behavior and idempotency

- gpt5_nano_compatibility: add pytest-asyncio markers, skip when OPENAI key missing, prevent module-level asyncio.run collection, add conftest
- Ollama tests: add server availability check and skip markers; avoid pytest collection warnings by renaming helper classes
- Graph storage tests: rename interactive test functions to avoid pytest collection
- Document & Tenant routes: support external_ids for idempotency; ensure HTTPExceptions are re-raised
- LightRAG core: support external_ids in apipeline_enqueue_documents and idempotent logic
- Tests updated to match API changes (tenant routes & document routes)
- Add logs and scripts for inspection and audit

2025-12-04 16:04:21 +08:00

13 KiB

Raw Blame History

LightRAG Multi-Tenant Makefile - Complete Usage Guide

Overview

This Makefile provides a comprehensive, user-friendly interface for managing the LightRAG multi-tenant stack with PostgreSQL backend. All commands are color-coded, well-documented, and designed for both developers and operations teams.

Design Philosophy

✅ User-Friendly

Clear, descriptive help messages with examples
Organized by functional groups
Visual feedback with colors and symbols
No cryptic abbreviations

✅ Safe Operations

Confirmation prompts for destructive operations
Clear warnings for data loss
Incremental operations (setup → up → init-db)
Easy to understand state management

✅ Comprehensive

Service control (start, stop, restart)
Database operations (backup, restore, reset)
Monitoring and health checks
Testing and debugging utilities

✅ Production-Ready

Resource limits enforced
Health checks for all services
Proper error handling
Logging and monitoring

Command Structure

Color Coding

🔵 BLUE - Headers, sections, informational messages
🟢 GREEN - Success messages, completion status
🟡 YELLOW - Warnings, important notes, suggestions
🔴 RED - Errors, critical warnings, dangerous operations

Symbol Usage

🔧 Setup commands
🚀 Start/Launch commands
🛑 Stop/Down commands
🔄 Restart commands
📋 View/List commands
🗄️ Database commands
🧪 Testing commands
🧹 Cleanup commands

Command Categories

1. Initial Setup (Run Once)

make setup

What it does:

Checks if .env file exists
Creates .env from env.template.example if needed
Provides instructions for configuration

When to use: First time only, before make up

What to do next:

# Edit .env with your settings
nano .env

# Then start services
make up

2. Service Control Commands

Start All Services

make up

What it does:

Starts PostgreSQL
Starts Redis
Starts LightRAG API
Starts WebUI
Waits for health checks
Shows endpoints and next steps

Output includes:

Service endpoints (http://localhost:3000, etc.)
Confirmation of successful startup
Suggestions for next steps

Typical flow:

make up
make init-db    # Once services are ready
make status     # Verify everything is running
make api-health # Check API is responding

Stop All Services

make down

What it does:

Stops all containers
Preserves data and volumes
Removes network

Important: Data is NOT deleted. Use make reset for full cleanup.

Restart Services

make restart

Equivalent to:

make down
sleep 2
make up

3. Logging Commands

View All Logs

make logs

Shows:

Real-time logs from all services
Press Ctrl+C to exit

Useful for:

Monitoring startup
Debugging issues
Verifying operations

View Service-Specific Logs

make logs-api        # LightRAG API only
make logs-db         # PostgreSQL only
make logs-webui      # WebUI only

Quick debugging:

# See API errors
make logs-api | grep -i error

# See database messages
make logs-db | grep -i postgres

# Follow logs from multiple services
make logs | grep "lightrag-api\|postgres"

4. Database Management Commands

Initialize Database

make init-db

Creates:

Database: lightrag_multitenant
Schema tables (documents, entities, relations, etc.)
Sample data (acme-corp, techstart tenants)
Indexes for performance
Sample knowledge bases

Run after: make up completes (wait 30 seconds)

Idempotent: Safe to run multiple times (uses ON CONFLICT)

Connect to Database Shell

make db-shell

Useful SQL commands:

-- List all tables
\dt

-- Show documents table structure
\d documents

-- Count documents by tenant
SELECT tenant_id, COUNT(*) FROM documents GROUP BY tenant_id;

-- Check all tenants
SELECT * FROM tenants;

-- Check knowledge bases
SELECT * FROM knowledge_bases;

-- Exit shell
\q

Backup Database

make db-backup

Creates:

Directory: ./backups/ (if not exists)
File: lightrag_backup_YYYYMMDD_HHMMSS.sql

How to restore:

make db-restore  # Restores latest backup

# Or manually restore specific backup
psql -U lightrag -d lightrag_multitenant < ./backups/lightrag_backup_20251120_143022.sql

Restore Database

make db-restore

Restores:

Latest backup from ./backups/ directory
Overwrites current database

Warning: Current data will be lost

Reset Database

make db-reset

⚠️ WARNING - This deletes all data!

What it does:

Asks for confirmation (requires typing 'yes')
Drops the database
Creates new database with fresh schema
Creates sample tenants and knowledge bases

When to use:

Starting fresh
Clearing test data
Troubleshooting schema issues

5. Health & Status Commands

Check All Service Status

make status

Shows:

Running containers
Port mappings
Service health status

Example output:

CONTAINER ID   IMAGE                    STATUS              PORTS
a1b2c3d4e5f6   postgres:15             Up 2 minutes        5432/tcp
f6e5d4c3b2a1   redis:7                 Up 2 minutes        6379/tcp
b2c3d4e5f6a1   lightrag-api            Up 2 minutes        9621/tcp
c3d4e5f6a1b2   lightrag-webui          Up 2 minutes        3000/tcp

Check API Health

make api-health

Tests:

HTTP request to /health endpoint
Parses JSON response
Shows API status

Expected response:

{
  "status": "healthy",
  "timestamp": "2025-11-20T14:30:22.123Z",
  "services": {
    "database": "connected",
    "redis": "connected",
    "embedding": "ready"
  }
}

Typical workflow:

make up
sleep 30
make api-health  # Check if API is ready

6. Testing Commands

Run All Multi-Tenant Tests

make test

Runs:

Multi-tenant backend tests
Isolation verification
Data integrity checks

Run Tenant Isolation Tests

make test-isolation

Specifically tests:

Tenant data isolation
Cross-tenant access prevention
Tenant context enforcement

Manual test example:

# Insert for tenant A
curl -X POST http://localhost:9621/api/v1/insert \
  -H "X-Tenant-Id: acme-corp" \
  -H "X-KB-Id: kb-prod" \
  -d '{"document": "Acme data"}'

# Query as tenant A (should work)
curl "http://localhost:9621/api/v1/query" \
  -H "X-Tenant-Id: acme-corp" \
  -H "X-KB-Id: kb-prod"

# Query as tenant B (should NOT return acme data)
curl "http://localhost:9621/api/v1/query" \
  -H "X-Tenant-Id: techstart" \
  -H "X-KB-Id: kb-main"

7. Cleanup & Maintenance Commands

Clean Up Docker Resources

make clean

Removes:

Stopped containers
Dangling images
Dangling volumes

Safe: Only cleans up unused resources, preserves running services

Regular maintenance:

# After stopping services
make down
make clean

# Then restart
make up

Full System Reset

make reset

⚠️ WARNING - Complete data loss!

This command:

Shows scary warning message
Asks for confirmation ('RESET' in all caps)
Stops all containers
Removes all containers
Deletes all volumes (including database)
Removes networks

When to use:

Complete fresh start needed
Migration to new environment
Troubleshooting major issues

Recover from accidental reset:

# From backup
make db-restore

# Or manually
psql -U lightrag -d lightrag_multitenant < ./backups/lightrag_backup_*.sql

System Prune

make prune

Removes:

Unused containers
Unused images
Unused volumes
Unused networks

More aggressive than make clean

8. Utility Commands

Display Docker Compose Configuration

make view-compose

Shows:

Full docker-compose.yml content
Useful for understanding service setup
Good for documentation

List Running Services

make ps

Alias for make status - shows docker-compose ps

Workflow Examples

Fresh Start Workflow

# 1. Clone and navigate
cd starter

# 2. Initial setup
make setup

# 3. Edit configuration
nano .env  # Add API keys, adjust settings

# 4. Start services
make up

# 5. Wait and initialize
sleep 10
make init-db

# 6. Verify health
make api-health
make status

# 7. Access application
# Open http://localhost:3000 in browser

Development Workflow

# Morning: Start services
make up

# During day: Monitor logs
make logs

# Check specific issues
make logs-api

# Before making changes: Backup
make db-backup

# After changes: Verify
make api-health

# Evening: Clean shutdown
make down

Troubleshooting Workflow

# 1. Check status
make status

# 2. View logs
make logs-api

# 3. Check health
make api-health

# 4. If API is stuck
docker compose -p lightrag-multitenant restart lightrag-api

# 5. If database is stuck
make db-shell
# SELECT 1;
# \q

# 6. Last resort: Full reset
make down
make reset
make setup
make up
make init-db

Backup & Restore Workflow

# Before major change
make db-backup

# Make changes...

# If problems, restore
make db-restore

# Verify restored data
make db-shell
# SELECT count(*) FROM documents;
# \q

Performance Tuning

Database Optimization

# Connect to database
make db-shell

# Check query performance
EXPLAIN SELECT * FROM documents WHERE tenant_id='acme-corp' AND kb_id='kb-prod';

# Should show: Index Scan (not Seq Scan)

# Update statistics
ANALYZE;

# Check table sizes
SELECT schemaname, tablename, pg_size_pretty(pg_total_relation_size(schemaname||'.'||tablename)) 
FROM pg_tables 
ORDER BY pg_total_relation_size(schemaname||'.'||tablename) DESC;

Memory Usage

# Check Docker memory usage
docker stats

# Adjust in docker-compose.yml if needed
# deploy:
#   resources:
#     limits:
#       memory: 8G  # Increase from 4G

Environment Variable Reference

Key variables in .env:

# Server
PORT=9621
WEBUI_PORT=3000

# Database
POSTGRES_USER=lightrag
POSTGRES_PASSWORD=secure_password
POSTGRES_DATABASE=lightrag_multitenant

# LLM
LLM_BINDING=openai
LLM_MODEL=gpt-4o
LLM_BINDING_API_KEY=sk-...

# Embedding
EMBEDDING_BINDING=ollama
EMBEDDING_MODEL=bge-m3:latest

Change any variable and restart services:

nano .env
make restart

Troubleshooting Matrix

Issue	Check	Solution
Services won't start	`make status`	`make logs` for error details
API not responding	`make api-health`	Wait longer, check `make logs-api`
Database errors	`make db-shell`	Run `make init-db` or `make db-reset`
Port already in use	`lsof -i :9621`	Change `PORT` in `.env`, restart
Memory issues	`docker stats`	Increase resource limits
Data loss	Have backup	`make db-restore` from backup

Color Legend

Color	Meaning	Example
🔵 Blue	Headers, info	Section titles
🟢 Green	Success	"✓ Services started"
🟡 Yellow	Warning	"⚠️ This deletes data"
🔴 Red	Error/Critical	"✗ Connection refused"

Exit Codes

# Command succeeded
make status
echo $?  # 0

# Command failed
make up
echo $?  # non-zero (1, 2, etc.)

Tips & Tricks

View logs with grep

# Show only errors
make logs | grep -i error

# Show only tenant-related logs
make logs | grep -i tenant

# Show last 50 lines
make logs | tail -50

Fast database exports

# Export to CSV
make db-shell
\COPY (SELECT * FROM documents WHERE tenant_id='acme-corp') TO 'export.csv' CSV HEADER;
\q

# Import from CSV
make db-shell
\COPY documents FROM 'export.csv' CSV HEADER;
\q

Monitor in real-time

# Split terminal or use tmux
make logs-api &
make logs-db &
make logs-webui &

Advanced Usage

Custom SQL Scripts

# Run SQL file against database
docker compose -p lightrag-multitenant exec -T postgres psql -U lightrag -d lightrag_multitenant -f script.sql

# Interactive SQL
make db-shell < script.sql

Backup Automation

# Schedule daily backups with cron
0 2 * * * cd /path/to/starter && make db-backup

# Or with Docker
docker compose exec -T postgres pg_dump -U lightrag lightrag_multitenant > backup.sql

Environment Switching

# Development environment
cp .env.development .env
make restart

# Production environment
cp .env.production .env
make restart

Summary

This Makefile is designed to be:

Intuitive - Clear command names matching their purpose
Safe - Confirmations for destructive operations
Observable - Color output and progress messages
Complete - Covers all common operations
Documented - Built-in help and examples

For additional help:

make help

Last Updated: November 20, 2025
Makefile Version: 1.0
Status: Production Ready

13 KiB Raw Blame History

LightRAG Multi-Tenant Makefile - Complete Usage Guide

Overview

Design Philosophy

Command Structure

Color Coding

Symbol Usage

Command Categories

1. Initial Setup (Run Once)

2. Service Control Commands

Start All Services

Stop All Services

Restart Services

3. Logging Commands

View All Logs

View Service-Specific Logs

4. Database Management Commands

Initialize Database

Connect to Database Shell

Backup Database

Restore Database

Reset Database

5. Health & Status Commands

Check All Service Status

Check API Health

6. Testing Commands

Run All Multi-Tenant Tests

Run Tenant Isolation Tests

7. Cleanup & Maintenance Commands

Clean Up Docker Resources

Full System Reset

System Prune

8. Utility Commands

Display Docker Compose Configuration

List Running Services

Workflow Examples

Fresh Start Workflow

Development Workflow

Troubleshooting Workflow

Backup & Restore Workflow

Performance Tuning

Database Optimization

Memory Usage

Environment Variable Reference

Troubleshooting Matrix

Color Legend

Exit Codes

Tips & Tricks

View logs with grep

Fast database exports

Monitor in real-time

Advanced Usage

Custom SQL Scripts

Backup Automation

Environment Switching

Summary

13 KiB

Raw Blame History