LightRAG

Author	SHA1	Message	Date
Raphaël MANSUY	ed73def994	fix: sync core modules with upstream for compatibility	2025-12-04 19:10:46 +08:00
yangdx	7ce3680ca5	Add retry decorators to Neo4j read operations for resilience (cherry picked from commit `7aaa51cda9`)	2025-12-04 19:09:08 +08:00
yangdx	00d51f9dba	Fix dimension type comparison in Milvus vector field validation • Convert dimensions to int for comparison • Handle string vs int type mismatches (cherry picked from commit `0fa9a2eee3`)	2025-12-04 19:09:08 +08:00
yangdx	0594a5a049	Update pymilvus dependency from 2.5.2 to >=2.6.2 (cherry picked from commit `baab992431`)	2025-12-04 19:09:07 +08:00
yangdx	de011c99a4	Add CASCADE to AGE extension creation in PostgreSQL implementation - Add CASCADE option to CREATE EXTENSION - Ensure dependencies are installed - Fix potential AGE setup issues (cherry picked from commit `d6019c82af`)	2025-12-04 19:09:07 +08:00
yangdx	bd93f13012	Refactor: Extract retry decorator to reduce code duplication in Neo4J storage • Define READ_RETRY_EXCEPTIONS constant • Create reusable READ_RETRY decorator • Replace 11 duplicate retry decorators • Improve code maintainability • Add missing retry to edge_degrees_batch (cherry picked from commit `8c4d7a00ad`)	2025-12-04 19:09:07 +08:00
copilot-swe-agent[bot]	b28a701532	Improve edge case handling for max_tokens=1 Co-authored-by: netbrah <162479981+netbrah@users.noreply.github.com> (cherry picked from commit `8835fc244a`)	2025-12-04 19:09:07 +08:00
wmsnp	ae5cd9262b	fix: add logger to configure_vchordrq() and format code (cherry picked from commit `f4bf5d279c`)	2025-12-04 19:09:06 +08:00
wmsnp	3954bb6579	feat(postgres_impl): add vchordrq vector index support and unify vector index creation logic (cherry picked from commit `d07023c962`)	2025-12-04 19:09:06 +08:00
yangdx	1cbe0ba885	Reduce log level and improve workspace mismatch message clarity • Change warning to info level • Simplify workspace mismatch wording (cherry picked from commit `6cef8df159`)	2025-12-04 19:09:06 +08:00
yangdx	0ac858d3e2	fix(postgres): allow vchordrq.epsilon config when probes is empty Previously, configure_vchordrq would fail silently when probes was empty (the default), preventing epsilon from being configured. Now each parameter is handled independently with conditional execution, and configuration errors fail-fast instead of being swallowed. This fixes the documented epsilon setting being impossible to use in the default configuration. (cherry picked from commit `3096f844fb`)	2025-12-04 19:09:06 +08:00
yangdx	5bd1320a1d	Refactor storage classes to use namespace instead of final_namespace (cherry picked from commit `fd486bc922`)	2025-12-04 19:09:06 +08:00
yangdx	ed46d375fb	Auto-initialize pipeline status in LightRAG.initialize_storages() • Remove manual initialize_pipeline_status calls • Auto-init in initialize_storages method • Update error messages for clarity • Warn on workspace conflicts (cherry picked from commit `e22ac52ebc`)	2025-12-04 19:09:05 +08:00
yangdx	961c87a6e5	Standardize empty workspace handling from "_" to "" across storage * Unify empty workspace behavior by changing workspace from "_" to "" * Fixed incorrect empty workspace detection in get_all_update_flags_status() (cherry picked from commit `d54d0d55d9`)	2025-12-04 19:09:05 +08:00
yangdx	6b0c0ef815	Refactor namespace lock to support reusable async context manager • Add NamespaceLock class wrapper • Fix lock re-entrance issues • Enable concurrent lock usage • Fresh context per async with block • Update get_namespace_lock API (cherry picked from commit `7deb9a64b9`)	2025-12-04 19:09:05 +08:00
yangdx	708f80f43d	Add _default_workspace to shared storage finalization - Add _default_workspace to global vars - Set _default_workspace to None on cleanup - Ensure complete resource cleanup - Fix missing workspace finalization (cherry picked from commit `6d6716e9f8`)	2025-12-04 19:09:05 +08:00
yangdx	67007ed9a6	Improve LightRAG initialization checker tool with better usage docs • Add workspace param to get_namespace_data • Update docstring with proper usage example • Simplify demo to show correct workflow • Remove confusing before/after comparison • Clarify tool should run after init (cherry picked from commit `393f880311`)	2025-12-04 19:09:05 +08:00
yangdx	dcf88a8273	Refactor exception handling in MemgraphStorage label methods (cherry picked from commit `4401f86f07`)	2025-12-04 19:09:04 +08:00
yangdx	ed79218550	Optimize JSON write with fast/slow path to reduce memory usage - Fast path for clean data (no sanitization) - Slow path sanitizes during encoding - Reload shared memory after sanitization - Custom encoder avoids deep copies - Comprehensive test coverage (cherry picked from commit `777c987371`)	2025-12-04 19:09:04 +08:00
yangdx	7632805cd0	Add concurrency warning for JsonKVStorage in cleanup tool (cherry picked from commit `913fa1e415`)	2025-12-04 19:09:04 +08:00
yangdx	db508954d1	Add uv package manager support to installation docs (cherry picked from commit `7bc6ccea19`)	2025-12-04 19:09:04 +08:00
yangdx	1daf35a77d	Refactor storage selection UI with dynamic numbering and inline prompts • Remove standalone get_user_choice method • Add dynamic sequential numbering • Inline choice validation logic • Remove redundant storage type prints • Improve excluded storage handling (cherry picked from commit `e95b02fb55`)	2025-12-04 19:09:03 +08:00
yangdx	fa5510e6f6	Fix deadlock in JSON cache migration and prevent same storage selection - Snapshot JSON data before yielding batches - Release lock during batch processing - Exclude source type from target selection - Add detailed docstring for lock behavior - Filter available storage types properly (cherry picked from commit `5be04263b2`)	2025-12-04 19:09:03 +08:00
yangdx	5a5e583b9c	Improve storage config validation and add config.ini fallback support • Add MongoDB env requirements • Support config.ini fallback • Warn on missing env vars • Check available storage count • Show config source info (cherry picked from commit `1a91bcdb5f`)	2025-12-04 19:09:03 +08:00
yangdx	7896c42fba	Restructure semaphore control to manage entire evaluation pipeline • Move rag_semaphore to wrap full function • Increase RAG concurrency to 2x eval limit • Prevent memory buildup from slow evals • Keep eval_semaphore for RAGAS control (cherry picked from commit `e5abe9dd3d`)	2025-12-04 19:09:02 +08:00
yangdx	c459caed26	Implement two-stage pipeline for RAG evaluation with separate semaphores • Split RAG gen and eval stages • Add rag_semaphore for stage 1 • Add eval_semaphore for stage 2 • Improve concurrency control • Update connection pool limits (cherry picked from commit `83715a3ac1`)	2025-12-04 19:09:02 +08:00
ben moussa anouar	dd425e5513	Update lightrag/evaluation/eval_rag_quality.py for launguage Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> (cherry picked from commit `98f0464a31`)	2025-12-04 19:09:02 +08:00
yangdx	407a2c2ecd	Remove redundant shutdown message from gunicorn (cherry picked from commit `6d4a55100e`)	2025-12-04 19:09:02 +08:00
yangdx	df2c24264f	Improve entity merge logging by removing redundant message and fixing typo (cherry picked from commit `9a8742da59`)	2025-12-04 19:09:02 +08:00
yangdx	8c7b0017df	Remove enable_logging parameter from get_data_init_lock call in MilvusVectorDBStorage (cherry picked from commit `0692175c7b`)	2025-12-04 19:09:01 +08:00
Anush008	e86aa091f4	refactor: Qdrant Multi-tenancy (Include staged) Signed-off-by: Anush008 <anushshetty90@gmail.com> (cherry picked from commit `8584980e3a`)	2025-12-04 19:09:01 +08:00
yangdx	a42222d7f9	Resolve lock leakage issue during user cancellation handling • Change default log level to INFO • Force enable error logging output • Add lock cleanup rollback protection • Handle LLM cache persistence errors • Fix async task exception handling (cherry picked from commit `a9ec15e669`)	2025-12-04 19:09:01 +08:00
yangdx	8b6fdef965	Optimize PostgreSQL graph queries to avoid Cypher overhead and complexity • Replace Cypher with native SQL queries • Fix O(N²) to O(E) performance issue • Add error handling for parse failures • Use direct table access pattern • Eliminate Cartesian product joins (cherry picked from commit `a97e5dad4c`)	2025-12-04 19:09:01 +08:00
yangdx	e4be3549c3	Improve entity identifier truncation warning message format (cherry picked from commit `00aa5e53a7`)	2025-12-04 19:09:00 +08:00
Yasiru Rangana	8a72135a32	Optimize PostgreSQL initialization performance - Batch index existence checks into single query (16+ queries -> 1 query) - Batch timestamp column checks into single query (8 queries -> 1 query) - Batch field length checks into single query (5 queries -> 1 query) Performance improvement: ~70-80% faster initialization (35s -> 5-10s) Key optimizations: 1. check_tables(): Use ANY($1) to check all indexes at once 2. _migrate_timestamp_columns(): Batch all column type checks 3. _migrate_field_lengths(): Batch all field definition checks All changes are backward compatible with no schema or API changes. Reduces database round-trips by batching information_schema queries. (cherry picked from commit `2f22336ace`)	2025-12-04 19:09:00 +08:00
Lucky Verma	12ebc9f2a9	Refactor SQL queries and improve input handling in PGKVStorage and PGDocStatusStorage (cherry picked from commit `917e41aa78`)	2025-12-04 19:09:00 +08:00
yangdx	e19a4be0af	Preserve ordering in get_by_ids methods across all storage implementations - Fix result ordering in vector stores - Update KV storage get_by_ids methods - Maintain order in doc status storage - Return None for missing IDs (cherry picked from commit `9be22dd666`)	2025-12-04 19:08:58 +08:00
yangdx	17106225dd	Add PostgreSQL connection retry mechanism with comprehensive error handling • Implement connection retry with backoff • Add transient error detection • Pool management with timeout guards (cherry picked from commit `e758204ab2`)	2025-12-04 19:08:58 +08:00
yangdx	60a695539a	Refactor PostgreSQL retry config to use centralized configuration • Move retry config to ClientManager • Remove env var parsing from PostgreSQLDB • Add config params to test setup (cherry picked from commit `b3ed264707`)	2025-12-04 19:08:57 +08:00
yangdx	c6433edb23	Make PostgreSQL statement_cache_size configuration optional • Remove forced int conversion • Allow None values for cache size • Add conditional parameter setting (cherry picked from commit `f2c0b41e78`)	2025-12-04 19:08:57 +08:00
kevinnkansah	c8c73ab114	fix: renamed PostGreSQL options env variable and allowed LRU cache to be an optional env variable (cherry picked from commit `22a7b482c5`)	2025-12-04 19:08:56 +08:00
kevinnkansah	7ce46bacb6	feat: add options for PostGres connection (cherry picked from commit `108cdbe133`)	2025-12-04 19:08:56 +08:00
yangdx	6de4bb9113	Fix logging message formatting (cherry picked from commit `e0fd31a60d`)	2025-12-04 19:08:46 +08:00
Lucky Verma	80dcbc696a	Refactor SQL queries and improve input handling in PGKVStorage and PGDocStatusStorage (cherry picked from commit `917e41aa78`)	2025-12-04 19:08:41 +08:00
yangdx	b0bdbb5839	Add offline deployment support with cache management and layered deps • Add tiktoken cache downloader CLI • Add layered offline dependencies • Add offline requirements files • Add offline deployment guide (cherry picked from commit `a5c05f1b92`)	2025-12-04 19:07:09 +08:00
yangdx	770fd64c70	Preserve ordering in get_by_ids methods across all storage implementations - Fix result ordering in vector stores - Update KV storage get_by_ids methods - Maintain order in doc status storage - Return None for missing IDs (cherry picked from commit `9be22dd666`)	2025-12-04 19:06:54 +08:00
yangdx	de2713ca93	Add PostgreSQL connection retry mechanism with comprehensive error handling • Implement connection retry with backoff • Add transient error detection • Pool management with timeout guards (cherry picked from commit `e758204ab2`)	2025-12-04 19:06:30 +08:00
yangdx	39ad057384	Refactor PostgreSQL retry config to use centralized configuration • Move retry config to ClientManager • Remove env var parsing from PostgreSQLDB • Add config params to test setup (cherry picked from commit `b3ed264707`)	2025-12-04 19:06:06 +08:00
yangdx	e0e228673c	Make PostgreSQL statement_cache_size configuration optional • Remove forced int conversion • Allow None values for cache size • Add conditional parameter setting (cherry picked from commit `f2c0b41e78`)	2025-12-04 19:05:45 +08:00
Aleks Vujić	742e6958fe	Fixed typo in log message when creating new graph file (cherry picked from commit `dd8f44e621`)	2025-12-04 19:05:35 +08:00

1 2 3 4 5 ...

3393 commits