Clément THOMAS 62b2a71dda feat(api): add multi-workspace server support for multi-tenant deployments

Enable a single LightRAG server instance to serve multiple isolated workspaces
via HTTP header-based routing. This allows multi-tenant SaaS deployments where
each tenant's data is completely isolated.

Key features:
- Header-based workspace routing (LIGHTRAG-WORKSPACE, X-Workspace-ID fallback)
- Process-local pool of LightRAG instances with LRU eviction
- FastAPI dependency (get_rag) for workspace resolution per request
- Full backward compatibility - existing deployments work unchanged
- Strict multi-tenant mode option (LIGHTRAG_ALLOW_DEFAULT_WORKSPACE=false)
- Configurable pool size (LIGHTRAG_MAX_WORKSPACES_IN_POOL)
- Graceful shutdown with workspace finalization

Configuration:
- LIGHTRAG_DEFAULT_WORKSPACE: Default workspace (falls back to WORKSPACE)
- LIGHTRAG_ALLOW_DEFAULT_WORKSPACE: Require explicit header when false
- LIGHTRAG_MAX_WORKSPACES_IN_POOL: Max concurrent workspace instances (default: 50)

Files:
- New: lightrag/api/workspace_manager.py (core multi-workspace module)
- New: tests/test_multi_workspace_server.py (17 unit tests)
- New: render.yaml (Render deployment blueprint)
- Modified: All route files to use get_rag dependency
- Updated: README.md, env.example with documentation

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-12-01 12:07:22 +01:00

5.8 KiB

Raw Blame History

Data Model: Multi-Workspace Server Support

Date: 2025-12-01 Feature: 001-multi-workspace-server

Overview

This feature introduces server-level workspace management without adding new persistent data models. The data model focuses on runtime entities that manage workspace instances.

Entities

WorkspaceInstance

Represents a running LightRAG instance serving requests for a specific workspace.

Attribute	Type	Description
`workspace_id`	`str`	Unique identifier for the workspace (validated, 1-64 chars)
`rag_instance`	`LightRAG`	The initialized LightRAG object
`created_at`	`datetime`	When the instance was first created
`last_accessed_at`	`datetime`	When the instance was last used (for LRU)
`status`	`enum`	`initializing`, `ready`, `finalizing`, `error`

Validation Rules:

workspace_id must match: ^[a-zA-Z0-9][a-zA-Z0-9_-]{0,63}$
workspace_id must not be empty string (use explicit default workspace)

State Transitions:

┌─────────────┐     ┌───────┐     ┌────────────┐
│ initializing│ ──► │ ready │ ──► │ finalizing │
└─────────────┘     └───────┘     └────────────┘
      │                 │
      ▼                 ▼
  ┌───────┐         ┌───────┐
  │ error │         │ error │
  └───────┘         └───────┘

WorkspacePool

Collection managing active WorkspaceInstance objects.

Attribute	Type	Description
`max_size`	`int`	Maximum concurrent instances (from config)
`instances`	`dict[str, WorkspaceInstance]`	Active instances by workspace_id
`lru_order`	`list[str]`	Workspace IDs ordered by last access
`lock`	`asyncio.Lock`	Protects concurrent access

Invariants:

len(instances) <= max_size
set(lru_order) == set(instances.keys())
Only one instance per workspace_id

Operations:

Operation	Description	Complexity
`get(workspace_id)`	Get or create instance, updates LRU	O(1) amortized
`evict_lru()`	Remove least recently used instance	O(1)
`finalize_all()`	Clean shutdown of all instances	O(n)

WorkspaceConfig

Configuration for multi-workspace behavior (runtime, not persisted).

Attribute	Type	Default	Description
`default_workspace`	`str`	`""`	Workspace when no header present
`allow_default_workspace`	`bool`	`true`	Allow requests without header
`max_workspaces_in_pool`	`int`	`50`	Pool size limit

Sources (in priority order):

Environment variables (LIGHTRAG_DEFAULT_WORKSPACE, etc.)
Existing WORKSPACE env var (backward compatibility)
Hardcoded defaults

Relationships

┌─────────────────┐
│ WorkspaceConfig │
└────────┬────────┘
         │ configures
         ▼
┌─────────────────┐       contains        ┌───────────────────┐
│  WorkspacePool  │◄─────────────────────►│ WorkspaceInstance │
└─────────────────┘                       └───────────────────┘
         │                                         │
         │ validates workspace_id                  │ wraps
         ▼                                         ▼
┌─────────────────┐                       ┌───────────────────┐
│ HTTP Request    │                       │ LightRAG (core)   │
│ (workspace hdr) │                       │                   │
└─────────────────┘                       └───────────────────┘

Data Flow

Request Processing

1. HTTP Request arrives
   │
2. Extract workspace from headers
   │  ├─ LIGHTRAG-WORKSPACE header (primary)
   │  └─ X-Workspace-ID header (fallback)
   │
3. If no header:
   │  ├─ allow_default_workspace=true → use default_workspace
   │  └─ allow_default_workspace=false → return 400
   │
4. Validate workspace_id format
   │  └─ Invalid → return 400
   │
5. WorkspacePool.get(workspace_id)
   │  ├─ Instance exists → update LRU, return instance
   │  └─ Instance missing:
   │       ├─ Pool full → evict LRU instance
   │       └─ Create new instance, initialize, add to pool
   │
6. Route handler receives LightRAG instance
   │
7. Process request using instance
   │
8. Return response

Instance Lifecycle

1. First request for workspace arrives
   │
2. WorkspacePool creates WorkspaceInstance
   │  status: initializing
   │
3. LightRAG object created with workspace parameter
   │
4. await rag.initialize_storages()
   │
5. Instance status → ready
   │  Added to pool and LRU list
   │
6. Instance serves requests...
   │  last_accessed_at updated on each access
   │
7. Pool reaches max_size, this instance is LRU
   │
8. Instance status → finalizing
   │
9. await rag.finalize_storages()
   │
10. Instance removed from pool

No Persistent Schema Changes

This feature does not modify:

Storage schemas (KV, vector, graph)
Database tables
File formats

Workspace isolation at the data layer is already handled by the LightRAG core using namespace prefixing.

5.8 KiB Raw Blame History