History

Taddeus 748ded40fb MLO-446: Adds API key authentication support to LightRAG client (#12 ) * Adds LightRAG API key support to deployment and secrets Introduces a new environment variable for the LightRAG API key sourced from secrets to enable authenticated access. Updates Helm values and templates to include LightRAG API key management alongside the existing OpenAI key, improving configuration consistency and security. Relates to MLO-339 * Adds optional API key authentication support to LightRAG client Enables passing custom headers, including an API key from environment variables, to all LightRAG HTTP requests for authentication. Improves security by allowing authenticated access without breaking existing unauthenticated usage. Relates to MLO-446 * Adds basic user authentication support to Helm deployment Introduces configurable user accounts and token secret in values and templates to enable authentication. Generates an encoded authentication string from account data stored in secrets and exposes relevant environment variables in the deployment only when authentication is enabled and configured. This enhancement allows secure management of multiple user credentials and token secrets, improving the deployment's security and flexibility. Relates to MLO-446 * Adds support for external secret references in PostgreSQL auth Introduces parameters to allow PostgreSQL credentials to be sourced from existing Kubernetes secrets instead of inline passwords. Improves security and flexibility by enabling integration with external secret management without changing deployment structure. Relates to MLO-446 * Streamline deployment docs and remove preset environment configs Consolidates deployment instructions by removing separate dev and prod values files and related workflows, encouraging users to customize a single values file instead. Simplifies the README to focus on flexible chart deployment without environment-specific templates or variable substitution, improving maintainability and clarity. * Adds Helm packaging and publishing Makefile for LightRAG Introduces a Makefile to automate Helm chart packaging, versioning, and publishing to a container registry. Uses git tags or user-defined versions for chart versioning with sanitization. Ensures streamlined CI/CD by handling dependencies, packaging, registry login, and cleanup, simplifying release workflows. Relates to MLO-446		2025-10-29 14:31:56 +02:00
..
databases	feat: change sh script mode	2025-05-19 23:07:24 +08:00
lightrag	Adds support for pod labels, tolerations, affinity, and node selectors (#7 )	2025-07-09 09:58:30 +03:00
lightrag-minimal	MLO-446: Adds API key authentication support to LightRAG client (#12 )	2025-10-29 14:31:56 +02:00
install_lightrag.sh	feat: update script	2025-05-20 14:14:51 +08:00
install_lightrag_dev.sh	feat: update script	2025-05-20 14:14:51 +08:00
README-zh.md	feat: update README	2025-05-20 22:58:33 +08:00
README.md	feat: update README	2025-05-20 22:58:33 +08:00
uninstall_lightrag.sh	feat: add helm chart	2025-05-19 23:06:11 +08:00
uninstall_lightrag_dev.sh	feat: change sh script mode	2025-05-19 23:07:24 +08:00

README.md

LightRAG Helm Chart

This is the Helm chart for LightRAG, used to deploy LightRAG services on a Kubernetes cluster.

There are two recommended deployment methods for LightRAG:

Lightweight Deployment: Using built-in lightweight storage, suitable for testing and small-scale usage
Production Deployment: Using external databases (such as PostgreSQL and Neo4J), suitable for production environments and large-scale usage

If you'd like a video walkthrough of the deployment process, feel free to check out this optional video tutorial on YouTube. It might help clarify some steps for those who prefer visual guidance.

Prerequisites

Make sure the following tools are installed and configured:

Kubernetes cluster
- A running Kubernetes cluster is required.
- For local development or demos you can use Minikube (needs ≥ 2 CPUs, ≥ 4 GB RAM, and Docker/VM-driver support).
- Any standard cloud or on-premises Kubernetes cluster (EKS, GKE, AKS, etc.) also works.
kubectl
- The Kubernetes command-line tool for managing your cluster.
- Follow the official guide: Install and Set Up kubectl.
Helm (v3.x+)
- Kubernetes package manager used to install LightRAG.
- Install it via the official instructions: Installing Helm.

Lightweight Deployment (No External Databases Required)

This deployment option uses built-in lightweight storage components that are perfect for testing, demos, or small-scale usage scenarios. No external database configuration is required.

You can deploy LightRAG using either the provided convenience script or direct Helm commands. Both methods configure the same environment variables defined in the lightrag/values.yaml file.

Using the convenience script (recommended):

export OPENAI_API_BASE=<YOUR_OPENAI_API_BASE>
export OPENAI_API_KEY=<YOUR_OPENAI_API_KEY>
bash ./install_lightrag_dev.sh

Or using Helm directly:

# You can override any env param you want
helm upgrade --install lightrag ./lightrag \
  --namespace rag \
  --set-string env.LIGHTRAG_KV_STORAGE=JsonKVStorage \
  --set-string env.LIGHTRAG_VECTOR_STORAGE=NanoVectorDBStorage \
  --set-string env.LIGHTRAG_GRAPH_STORAGE=NetworkXStorage \
  --set-string env.LIGHTRAG_DOC_STATUS_STORAGE=JsonDocStatusStorage \
  --set-string env.LLM_BINDING=openai \
  --set-string env.LLM_MODEL=gpt-4o-mini \
  --set-string env.LLM_BINDING_HOST=$OPENAI_API_BASE \
  --set-string env.LLM_BINDING_API_KEY=$OPENAI_API_KEY \
  --set-string env.EMBEDDING_BINDING=openai \
  --set-string env.EMBEDDING_MODEL=text-embedding-ada-002 \
  --set-string env.EMBEDDING_DIM=1536 \
  --set-string env.EMBEDDING_BINDING_API_KEY=$OPENAI_API_KEY

Accessing the application:

# 1. Run this port-forward command in your terminal:
kubectl --namespace rag port-forward svc/lightrag-dev 9621:9621

# 2. While the command is running, open your browser and navigate to:
# http://localhost:9621

Production Deployment (Using External Databases)

1. Install Databases

You can skip this step if you've already prepared databases. Detailed information can be found in: README.md.

We recommend KubeBlocks for database deployment. KubeBlocks is a cloud-native database operator that makes it easy to run any database on Kubernetes at production scale.

First, install KubeBlocks and KubeBlocks-Addons (skip if already installed):

bash ./databases/01-prepare.sh

Then install the required databases. By default, this will install PostgreSQL and Neo4J, but you can modify 00-config.sh to select different databases based on your needs:

bash ./databases/02-install-database.sh

Verify that the clusters are up and running:

kubectl get clusters -n rag
# Expected output:
# NAME            CLUSTER-DEFINITION   TERMINATION-POLICY   STATUS     AGE
# neo4j-cluster                        Delete               Running    39s
# pg-cluster      postgresql           Delete               Running    42s

kubectl get po -n rag
# Expected output:
# NAME                      READY   STATUS    RESTARTS   AGE
# neo4j-cluster-neo4j-0     1/1     Running   0          58s
# pg-cluster-postgresql-0   4/4     Running   0          59s
# pg-cluster-postgresql-1   4/4     Running   0          59s

2. Install LightRAG

LightRAG and its databases are deployed within the same Kubernetes cluster, making configuration straightforward. The installation script automatically retrieves all database connection information from KubeBlocks, eliminating the need to manually set database credentials:

export OPENAI_API_BASE=<YOUR_OPENAI_API_BASE>
export OPENAI_API_KEY=<YOUR_OPENAI_API_KEY>
bash ./install_lightrag.sh

Accessing the application:

# 1. Run this port-forward command in your terminal:
kubectl --namespace rag port-forward svc/lightrag 9621:9621

# 2. While the command is running, open your browser and navigate to:
# http://localhost:9621

Configuration

Modifying Resource Configuration

You can configure LightRAG's resource usage by modifying the values.yaml file:

replicaCount: 1  # Number of replicas, can be increased as needed

resources:
  limits:
    cpu: 1000m    # CPU limit, can be adjusted as needed
    memory: 2Gi   # Memory limit, can be adjusted as needed
  requests:
    cpu: 500m     # CPU request, can be adjusted as needed
    memory: 1Gi   # Memory request, can be adjusted as needed

Modifying Persistent Storage

persistence:
  enabled: true
  ragStorage:
    size: 10Gi    # RAG storage size, can be adjusted as needed
  inputs:
    size: 5Gi     # Input data storage size, can be adjusted as needed

Configuring Environment Variables

The env section in the values.yaml file contains all environment configurations for LightRAG, similar to a .env file. When using helm upgrade or helm install commands, you can override these with the --set flag.

env:
  HOST: 0.0.0.0
  PORT: 9621
  WEBUI_TITLE: Graph RAG Engine
  WEBUI_DESCRIPTION: Simple and Fast Graph Based RAG System

  # LLM Configuration
  LLM_BINDING: openai            # LLM service provider
  LLM_MODEL: gpt-4o-mini         # LLM model
  LLM_BINDING_HOST:              # API base URL (optional)
  LLM_BINDING_API_KEY:           # API key

  # Embedding Configuration
  EMBEDDING_BINDING: openai                 # Embedding service provider
  EMBEDDING_MODEL: text-embedding-ada-002   # Embedding model
  EMBEDDING_DIM: 1536                       # Embedding dimension
  EMBEDDING_BINDING_API_KEY:                # API key

  # Storage Configuration
  LIGHTRAG_KV_STORAGE: PGKVStorage              # Key-value storage type
  LIGHTRAG_VECTOR_STORAGE: PGVectorStorage      # Vector storage type
  LIGHTRAG_GRAPH_STORAGE: Neo4JStorage          # Graph storage type
  LIGHTRAG_DOC_STATUS_STORAGE: PGDocStatusStorage  # Document status storage type

Notes

Ensure all necessary environment variables (API keys and database passwords) are set before deployment
For security reasons, it's recommended to pass sensitive information using environment variables rather than writing them directly in scripts or values files
Lightweight deployment is suitable for testing and small-scale usage, but data persistence and performance may be limited
Production deployment (PostgreSQL + Neo4J) is recommended for production environments and large-scale usage
For more customized configurations, please refer to the official LightRAG documentation