LightRAG

Author	SHA1	Message	Date
Raphaël MANSUY	833a27fc2e	cherry-pick `90f341d6`	2025-12-04 19:19:22 +08:00
Raphaël MANSUY	300039cbc8	cherry-pick `ecea9399`	2025-12-04 19:19:21 +08:00
Raphaël MANSUY	5f51abb88f	cherry-pick `1d2f534f`	2025-12-04 19:19:05 +08:00
Raphaël MANSUY	df761099d5	cherry-pick `d803df94`	2025-12-04 19:19:01 +08:00
Raphaël MANSUY	0fec29d64f	cherry-pick `451257ae`	2025-12-04 19:19:01 +08:00
Raphaël MANSUY	7fa4f883a6	cherry-pick `8af8bd80`	2025-12-04 19:18:40 +08:00
Raphaël MANSUY	af3b2cf118	cherry-pick `0b3d3150`	2025-12-04 19:14:29 +08:00
Raphaël MANSUY	9fdc964e40	cherry-pick `97a9dfca`	2025-12-04 19:14:28 +08:00
Raphaël MANSUY	d266d00f3e	cherry-pick `1d07ff7f`	2025-12-04 19:14:28 +08:00
yangdx	87561f8b28	Remove manual initialize_pipeline_status() calls across codebase - Auto-init pipeline status in storages - Remove redundant import statements - Simplify initialization pattern - Update docs and examples (cherry picked from commit `cdd53ee875`)	2025-12-04 19:11:17 +08:00
yangdx	26602f3e20	Update postgreSQL docker image link (cherry picked from commit `1e415cff95`)	2025-12-04 19:09:06 +08:00
yangdx	db508954d1	Add uv package manager support to installation docs (cherry picked from commit `7bc6ccea19`)	2025-12-04 19:09:04 +08:00
Anush008	e86aa091f4	refactor: Qdrant Multi-tenancy (Include staged) Signed-off-by: Anush008 <anushshetty90@gmail.com> (cherry picked from commit `8584980e3a`)	2025-12-04 19:09:01 +08:00
yangdx	b0bdbb5839	Add offline deployment support with cache management and layered deps • Add tiktoken cache downloader CLI • Add layered offline dependencies • Add offline requirements files • Add offline deployment guide (cherry picked from commit `a5c05f1b92`)	2025-12-04 19:07:09 +08:00
yangdx	c2c6ac3a45	Add AGENTS.md documentation section for AI coding agent guidance (cherry picked from commit `1bf802eebf`)	2025-12-04 19:05:56 +08:00
Raphael MANSUY	fe9b8ec02a	tests: stabilize integration tests + skip external services; fix multi-tenant API behavior and idempotency (#4 ) * feat: Implement multi-tenant architecture with tenant and knowledge base models - Added data models for tenants, knowledge bases, and related configurations. - Introduced role and permission management for users in the multi-tenant system. - Created a service layer for managing tenants and knowledge bases, including CRUD operations. - Developed a tenant-aware instance manager for LightRAG with caching and isolation features. - Added a migration script to transition existing workspace-based deployments to the new multi-tenant architecture. * chore: ignore lightrag/api/webui/assets/ directory * chore: stop tracking lightrag/api/webui/assets (ignore in .gitignore) * feat: Initialize LightRAG Multi-Tenant Stack with PostgreSQL - Added README.md for project overview, setup instructions, and architecture details. - Created docker-compose.yml to define services: PostgreSQL, Redis, LightRAG API, and Web UI. - Introduced env.example for environment variable configuration. - Implemented init-postgres.sql for PostgreSQL schema initialization with multi-tenant support. - Added reproduce_issue.py for testing default tenant access via API. * feat: Enhance TenantSelector and update related components for improved multi-tenant support * feat: Enhance testing capabilities and update documentation - Updated Makefile to include new test commands for various modes (compatibility, isolation, multi-tenant, security, coverage, and dry-run). - Modified API health check endpoint in Makefile to reflect new port configuration. - Updated QUICK_START.md and README.md to reflect changes in service URLs and ports. - Added environment variables for testing modes in env.example. - Introduced run_all_tests.sh script to automate testing across different modes. - Created conftest.py for pytest configuration, including database fixtures and mock services. 
- Implemented database helper functions for streamlined database operations in tests. - Added test collection hooks to skip tests based on the current MULTITENANT_MODE. * feat: Implement multi-tenant support with demo mode enabled by default - Added multi-tenant configuration to the environment and Docker setup. - Created pre-configured demo tenants (acme-corp and techstart) for testing. - Updated API endpoints to support tenant-specific data access. - Enhanced Makefile commands for better service management and database operations. - Introduced user-tenant membership system with role-based access control. - Added comprehensive documentation for multi-tenant setup and usage. - Fixed issues with document visibility in multi-tenant environments. - Implemented necessary database migrations for user memberships and legacy support. * feat(audit): Add final audit report for multi-tenant implementation - Documented overall assessment, architecture overview, test results, security findings, and recommendations. - Included detailed findings on critical security issues and architectural concerns. fix(security): Implement security fixes based on audit findings - Removed global RAG fallback and enforced strict tenant context. - Configured super-admin access and required user authentication for tenant access. - Cleared localStorage on logout and improved error handling in WebUI. chore(logs): Create task logs for audit and security fixes implementation - Documented actions, decisions, and next steps for both audit and security fixes. - Summarized test results and remaining recommendations. chore(scripts): Enhance development stack management scripts - Added scripts for cleaning, starting, and stopping the development stack. - Improved output messages and ensured graceful shutdown of services. feat(starter): Initialize PostgreSQL with AGE extension support - Created initialization scripts for PostgreSQL extensions including uuid-ossp, vector, and AGE. 
- Ensured successful installation and verification of extensions. * feat: Implement auto-select for first tenant and KB on initial load in WebUI - Removed WEBUI_INITIAL_STATE_FIX.md as the issue is resolved. - Added useTenantInitialization hook to automatically select the first available tenant and KB on app load. - Integrated the new hook into the Root component of the WebUI. - Updated RetrievalTesting component to ensure a KB is selected before allowing user interaction. - Created end-to-end tests for multi-tenant isolation and real service interactions. - Added scripts for starting, stopping, and cleaning the development stack. - Enhanced API and tenant routes to support tenant-specific pipeline status initialization. - Updated constants for backend URL to reflect the correct port. - Improved error handling and logging in various components. * feat: Add multi-tenant support with enhanced E2E testing scripts and client functionality * update client * Add integration and unit tests for multi-tenant API, models, security, and storage - Implement integration tests for tenant and knowledge base management endpoints in `test_tenant_api_routes.py`. - Create unit tests for tenant isolation, model validation, and role permissions in `test_tenant_models.py`. - Add security tests to enforce role-based permissions and context validation in `test_tenant_security.py`. - Develop tests for tenant-aware storage operations and context isolation in `test_tenant_storage_phase3.py`. * feat(e2e): Implement OpenAI model support and database reset functionality * Add comprehensive test suite for gpt-5-nano compatibility - Introduced tests for parameter normalization, embeddings, and entity extraction. - Implemented direct API testing for gpt-5-nano. - Validated .env configuration loading and OpenAI API connectivity. - Analyzed reasoning token overhead with various token limits. - Documented test procedures and expected outcomes in README files. 
- Ensured all tests pass for production readiness. * kg(postgres_impl): ensure AGE extension is loaded in session and configure graph initialization * dev: add hybrid dev helper scripts, Makefile, docker-compose.dev-db and local development docs * feat(dev): add dev helper scripts and local development documentation for hybrid setup * feat(multi-tenant): add detailed specifications and logs for multi-tenant improvements, including UX, backend handling, and ingestion pipeline * feat(migration): add generated tenant/kb columns, indexes, triggers; drop unused tables; update schema and docs * test(backward-compat): adapt tests to new StorageNameSpace/TenantService APIs (use concrete dummy storages) * chore: multi-tenant and UX updates — docs, webui, storage, tenant service adjustments * tests: stabilize integration tests + skip external services; fix multi-tenant API behavior and idempotency - gpt5_nano_compatibility: add pytest-asyncio markers, skip when OPENAI key missing, prevent module-level asyncio.run collection, add conftest - Ollama tests: add server availability check and skip markers; avoid pytest collection warnings by renaming helper classes - Graph storage tests: rename interactive test functions to avoid pytest collection - Document & Tenant routes: support external_ids for idempotency; ensure HTTPExceptions are re-raised - LightRAG core: support external_ids in apipeline_enqueue_documents and idempotent logic - Tests updated to match API changes (tenant routes & document routes) - Add logs and scripts for inspection and audit	2025-12-04 16:04:21 +08:00
yangdx	2ce6a022ac	Fix documentation for user_prompt parameter in QueryParam	2025-09-27 23:41:17 +08:00
yangdx	699ca3ba00	Remove deprecated `history_turns` and `ids` parameters from query API endpoint • Update QueryParam documentation • Mark history_turns as deprecated • Clean up splash screen display • Clarify conversation_history usage	2025-09-25 04:58:57 +08:00
yangdx	7b371309dd	Update README	2025-09-15 12:31:39 +08:00
yangdx	4e751e0653	refac: Enhance extraction with improved prompts and parser - Prompts: Restructured prompts with clearer steps and quality guidelines. Simplified the relationship tuple by removing `relationship_strength` - Model: Updated default entity types to be more comprehensive and consistently capitalized (e.g., `Location`, `Product`)	2025-08-31 22:24:11 +08:00
yangdx	de2daf6565	refac: Rename summary_max_tokens to summary_context_size, comprehensive parameter validation for summary configuration - Update algorithm logic in operate.py for better token management - Fix health endpoint to use correct parameter names	2025-08-26 01:35:50 +08:00
yangdx	49ea9a79a7	Update rerank doc in README	2025-08-23 23:06:10 +08:00
yangdx	16a1ef1178	Update summary_max_tokens default from 10k to 30k tokens	2025-08-21 23:16:07 +08:00
yangdx	8c6b5f4a3a	Update README	2025-08-21 18:14:27 +08:00
yangdx	62cdc7d7eb	Update documentation with LLM selection guidelines and API improvements	2025-08-21 13:59:14 +08:00
yangdx	0e67ead8fa	Rename MAX_TOKENS to SUMMARY_MAX_TOKENS for clarity	2025-08-21 10:15:20 +08:00
yangdx	d5e8f1e860	Update default query parameters for better performance - Increase chunk_top_k from 10 to 20 - Reduce max_entity_tokens to 6000 - Reduce max_relation_tokens to 8000 - Update web UI default values - Fix max_total_tokens to 30000	2025-08-18 19:32:11 +08:00
yangdx	dc7a6e1c5b	Update README	2025-08-16 06:15:27 +08:00
yangdx	0b5c708660	Update storage implementation documentation - Add detailed storage type descriptions - Remove Chroma from vector storage options - Include recommended PostgreSQL version - Add Memgraph to graph storage options - Update performance comparison notes	2025-08-05 18:03:51 +08:00
yangdx	32af45ff46	refactor: improve JSON parsing reliability with json-repair library Replace regex-based JSON extraction with json-repair for better handling of malformed LLM responses. Remove deprecated JSON parsing utilities and clean up keyword_extraction parameter across LLM providers. - Remove locate_json_string_body_from_string() and convert_response_to_json() - Use json-repair.loads() in extract_keywords_only() for robust parsing - Clean up LLM interfaces and remove unused parameters - Add json-repair dependency	2025-08-01 19:36:20 +08:00
yangdx	3c530b21b6	Update README	2025-07-31 13:00:09 +08:00
yangdx	c6bd9f0329	Disable conversation history by default - Set default history_turns to 0 - Mark history_turns as deprecated - Remove history_turns from example - Update documentation comments	2025-07-31 12:28:42 +08:00
yangdx	aba46213a7	Update README	2025-07-30 13:13:59 +08:00
yangdx	9923821d75	refactor: Remove deprecated `max_token_size` from embedding configuration This parameter is no longer used. Its removal simplifies the API and clarifies that token length management is handled by upstream text chunking logic rather than the embedding wrapper.	2025-07-29 10:49:35 +08:00
yangdx	598eecd06d	Refactor: Rename llm_model_max_token_size to summary_max_tokens This commit renames the parameter 'llm_model_max_token_size' to 'summary_max_tokens' for better clarity, as it specifically controls the token limit for entity relation summaries.	2025-07-28 00:49:08 +08:00
Ákos Lukács	f115661e16	Fix "A Simple Program" example in README.md The example should use ainsert and aquery. Fixes #1723	2025-07-22 14:37:15 +02:00
yangdx	80f7e37168	Fix default workspace name for PostgreSQL AGE graph storage	2025-07-16 19:16:22 +08:00
yangdx	1c53c5c764	Update README.md	2025-07-16 11:10:56 +08:00
yangdx	47341d3a71	Merge branch 'main' into rerank	2025-07-15 16:12:33 +08:00
yangdx	e8e1f6ab56	feat: centralize environment variable defaults in constants.py	2025-07-15 16:11:50 +08:00
yangdx	ccc2a20071	feat: remove deprecated MAX_TOKEN_SUMMARY parameter to prevent LLM output truncation - Remove MAX_TOKEN_SUMMARY parameter and related configurations - Eliminate forced token-based truncation in entity/relationship descriptions - Switch to fragment-count based summarization logic using FORCE_LLM_SUMMARY_ON_MERGE - Update FORCE_LLM_SUMMARY_ON_MERGE default from 6 to 4 for better summarization - Clean up documentation, environment examples, and API display code - Preserve backward compatibility by graceful parameter removal This change resolves issues where LLMs were forcibly truncating entity relationship descriptions mid-sentence, leading to incomplete and potentially inaccurate knowledge graph content. The new approach allows LLMs to generate complete descriptions while still providing summarization when multiple fragments need to be merged. Breaking Change: None - parameter removal is backward compatible Fixes: Entity relationship description truncation issues	2025-07-15 12:26:33 +08:00
zrguo	7c882313bb	remove chunk_rerank_top_k	2025-07-15 11:52:34 +08:00
zrguo	4e425b1b59	Revert "update from main" This reverts commit `1d0376d6a9`.	2025-07-14 16:29:00 +08:00
zrguo	1d0376d6a9	update from main	2025-07-14 16:27:49 +08:00
zrguo	c9cbd2d3e0	Merge branch 'main' into rerank	2025-07-14 16:24:29 +08:00
zrguo	ef2115d437	Update token limit	2025-07-14 15:53:48 +08:00
yangdx	b03bb48e24	feat: Refine summary logic and add dedicated Ollama num_ctx config - Refactor the trigger condition for LLM-based summarization of entities and relations. Instead of relying on character length, the summary is now triggered when the number of merged description fragments exceeds a configured threshold. This provides a more robust and logical condition for consolidation. - Introduce the `OLLAMA_NUM_CTX` environment variable to explicitly configure the context window size (`num_ctx`) for Ollama models. This decouples the model's context length from the `MAX_TOKENS` parameter, which is now specifically used to limit input for summary generation, making the configuration clearer and more flexible. - Updated `README` files, `env.example`, and default values to reflect these changes.	2025-07-14 01:55:04 +08:00
yangdx	9aa2ed0837	Merge branch 'main' into rerank	2025-07-09 15:33:39 +08:00
yangdx	e457374224	Fix linting	2025-07-09 15:33:05 +08:00
yangdx	bfa0844ecb	Update README	2025-07-09 15:17:05 +08:00

343 commits