LightRAG

Author	SHA1	Message	Date
chengjie	27de78113d	style: apply code formatting to pass pre-commit checks - Split long function calls across multiple lines - Split long function definitions across multiple lines - Add blank line after docstring in test function These changes are purely formatting to comply with the project's linting standards (black/ruff). No functional changes.	2025-11-11 00:10:54 +08:00
chengjie	5d31412bd7	feat: add workspace isolation support to unified lock functions Why this change is needed: The current locking system uses global locks shared across all users and workspaces, causing blocking issues in multi-tenant scenarios. When one tenant performs document indexing, all other tenants are blocked waiting for the same global lock. This severely limits the system's ability to serve multiple users concurrently. How it solves it: - Add optional `workspace` parameter to 5 lock functions - Implement lazy creation of workspace-specific locks with proper synchronization - Store workspace locks in new `_sync_locks` dictionary - Support both multi-process (RLock) and single-process (asyncio.Lock) modes - Empty workspace parameter uses global lock for backward compatibility - Extract common logic into `_get_workspace_lock()` to eliminate duplication Impact: - Enables concurrent operations across different workspaces - Foundation for PR2 (pipeline status isolation) - Zero impact on existing code (all parameters optional with defaults) - Each workspace now has independent lock instances - Thread-safe lazy creation using _registry_guard in multiprocess mode - Automatic creation of async_locks for workspace locks in multiprocess mode Code Quality Improvements (Linus review feedback): - Fixed race condition: lazy creation protected by _registry_guard - Eliminated code duplication: common logic extracted to _get_workspace_lock() - Added async_lock support: workspace locks now have companion async_locks - Handles None workspace parameter gracefully - Clear separation of concerns: one function handles all workspace logic Testing: - 17 new test cases covering: - Basic functionality and naming - Workspace isolation and independence - Backward compatibility with empty workspace - Concurrent operations (3 workspaces in parallel) - Performance (1000 workspace lock creation <2s) - Edge cases (special characters, unicode, long names) - All existing tests pass (21/21 excluding env issues) - Verified lock serialization within workspace - Verified lock independence across workspaces Files modified: - lightrag/kg/shared_storage.py: refactored lock functions + synchronization - tests/test_workspace_locks.py: comprehensive test suite	2025-11-10 22:51:49 +08:00
yangdx	e8f5f57ec7	Update qdrant-client minimum version from 1.7.0 to 1.11.0 • Bump qdrant-client to >=1.11.0 • Update pyproject.toml dependency • Update requirements files • Sync uv.lock with new version • Maintain <2.0.0 upper bound	2025-11-10 11:54:48 +08:00
yangdx	913fa1e415	Add concurrency warning for JsonKVStorage in cleanup tool	2025-11-09 23:04:04 +08:00
yangdx	1f9d0735c3	Bump API version to 0253	2025-11-09 14:42:22 +08:00
Daniel.y	3110ca518b	Merge pull request #2335 from danielaskdd/llm-cache-cleanup Feat: Add LLM Query Cache Cleanup Tool	2025-11-09 14:27:58 +08:00
yangdx	37b7118901	Fix table alignment and add validation for empty cleanup selections	2025-11-09 14:17:56 +08:00
yangdx	1485cb82e9	Add LLM query cache cleanup tool for KV storage backends - Interactive cleanup workflow - Supports all KV storage types - Batch deletion with progress - Comprehensive error reporting - Preserves workspace isolation	2025-11-09 13:37:33 +08:00
Daniel.y	8859eaade7	Merge pull request #2334 from danielaskdd/hotfix-opena-streaming HotFix: Restore OpenAI Streaming Response & Refactor keyword_extraction Parameter	2025-11-09 12:25:20 +08:00
yangdx	2f16065256	Refactor keyword_extraction from kwargs to explicit parameter • Add keyword_extraction param to functions • Remove kwargs.pop() calls • Update function signatures • Improve parameter documentation • Make parameter handling consistent	2025-11-09 12:02:17 +08:00
yangdx	88ab73f6ae	HotFix: Restore streaming response in OpenAI LLM The stream and timeout parameters were moved from **kwargs to explicit parameters in a previous commit, but were not being passed to the OpenAI API, causing streaming responses to fail and fall back to non-streaming mode.Fixes the issue where stream=True was being silently ignored, resulting in unexpected non-streaming behavior.	2025-11-09 11:52:26 +08:00
yangdx	c12bc372dc	Update README	2025-11-09 04:35:41 +08:00
yangdx	7bc6ccea19	Add uv package manager support to installation docs	2025-11-09 04:31:07 +08:00
yangdx	80f2e691fc	Remove redundant i18n import triggered the Vite “dynamic + static import” warning	2025-11-09 02:48:11 +08:00
yangdx	1334b3d896	Update uv.lock	2025-11-09 02:32:30 +08:00
yangdx	754d2ad297	Add documentation for LLM cache migration between storage types	2025-11-09 00:41:07 +08:00
Daniel.y	8adf3180d6	Merge pull request #2330 from danielaskdd/llm-cache-migrate Feat: Add LLM Cache Migration Tool	2025-11-09 00:12:32 +08:00
yangdx	a75efb06dc	Fix: prevent source data corruption by target upsert function • Prevent mutations bugs by using copy() when storing cache values • Protect filtered cache data and ensure batch data isolation	2025-11-09 00:02:19 +08:00
yangdx	987bc09cab	Update LLM cache migration docs and improve UX prompts	2025-11-08 23:48:19 +08:00
yangdx	1a91bcdb5f	Improve storage config validation and add config.ini fallback support • Add MongoDB env requirements • Support config.ini fallback • Warn on missing env vars • Check available storage count • Show config source info	2025-11-08 22:48:49 +08:00
yangdx	57ee7d5ac8	Merge branch 'main' into llm-cache-migrate	2025-11-08 22:15:46 +08:00
Daniel.y	85bb98b307	Merge pull request #2331 from danielaskdd/gemini-retry Fix Gemini driver retry mechanism	2025-11-08 22:14:56 +08:00
yangdx	3d9de5ed03	feat: improve Gemini client error handling and retry logic • Add google-api-core dependency • Add specific exception handling • Create InvalidResponseError class • Update retry decorators • Fix empty response handling	2025-11-08 22:10:09 +08:00
yangdx	1864b28242	Add colored output formatting to migration confirmation display	2025-11-08 21:16:41 +08:00
yangdx	e95b02fb55	Refactor storage selection UI with dynamic numbering and inline prompts • Remove standalone get_user_choice method • Add dynamic sequential numbering • Inline choice validation logic • Remove redundant storage type prints • Improve excluded storage handling	2025-11-08 20:42:27 +08:00
yangdx	b72632e4d4	Add async generator lock management rule to cline extension	2025-11-08 20:03:59 +08:00
yangdx	5be04263b2	Fix deadlock in JSON cache migration and prevent same storage selection - Snapshot JSON data before yielding batches - Release lock during batch processing - Exclude source type from target selection - Add detailed docstring for lock behavior - Filter available storage types properly	2025-11-08 19:58:36 +08:00
yangdx	6b9f13c792	Enhance LLM cache migration tool with streaming and improved UX - Add streaming migration for memory efficiency - Implement graceful exit with Enter/0 - Add progress indicators for counting - Optimize batch processing by storage type - Update docs with new progress displays	2025-11-08 19:38:00 +08:00
yangdx	d0d31e9262	Improve LLM cache migration tool configuration and messaging	2025-11-08 18:52:33 +08:00
yangdx	6fc54d3625	Move LLM cache migration tool to lightrag.tools module - Relocated tool to proper package structure - Updated import paths and documentation - Added shared storage initialization - Fixed module path resolution - Updated usage instructions	2025-11-08 18:33:13 +08:00
yangdx	0f2c0de8df	Fix linting	2025-11-08 18:16:03 +08:00
yangdx	55274dde59	Add LLM cache migration tool for KV storage backends - Supports JSON/Redis/PostgreSQL/MongoDB - Batch migration with error tracking - Workspace-aware data transfer - Memory-efficient pagination - Comprehensive migration reporting	2025-11-08 17:57:22 +08:00
yangdx	cf732dbfc6	Bump core version to 1.4.9.9 and API to 0252	2025-11-08 11:27:26 +08:00
Daniel.y	29a349f25b	Merge pull request #2329 from danielaskdd/gemini-embedding Feat: Add Gemini Embedding Support to LightRAG	2025-11-08 04:10:52 +08:00
yangdx	a624a9508a	Add Gemini to APIs requiring embedding dimension parameter	2025-11-08 03:54:50 +08:00
yangdx	de4ed73652	Add Gemini embedding support - Implement gemini_embed function - Add gemini to embedding binding choices - Add L2 normalization for dims < 3072	2025-11-08 03:34:30 +08:00
Daniel.y	f4492d48dc	Merge pull request #2328 from HKUDS/apply-dim-to-embedding-call Feat: Add Optional Embedding Dimension Parameter Control with Jina API Compliance	2025-11-08 02:10:08 +08:00
yangdx	f83ea3394e	Add section header comment for Gemini binding options	2025-11-08 02:07:31 +08:00
yangdx	0b2a15c452	Centralize embedding_send_dim config through args instead of env var	2025-11-08 01:52:23 +08:00
yangdx	03cc6262c4	Prohibit direct access to internal functions of EmbeddingFunc. • Fix similarity search error in query stage • Remove redundant null checks • Improve log readability	2025-11-08 01:43:36 +08:00
yangdx	ffeeae4208	refactor: simplify jina embedding dimension handling	2025-11-07 22:09:57 +08:00
yangdx	9cee5a63df	Merge branch 'main' into apply-dim-to-embedding-call	2025-11-07 22:06:04 +08:00
yangdx	01b07b2be5	Refactor Jina embedding dimension by changing param to optional with default	2025-11-07 22:04:34 +08:00
Daniel.y	d536257308	Merge pull request #2327 from huangbhan/patch-1 Fix spelling errors in the "使用PostgreSQL存储" section of README-zh.md	2025-11-07 21:32:05 +08:00
yangdx	d95efcb9ad	Fix linting	2025-11-07 21:27:54 +08:00
yangdx	ce28f30ca6	Add embedding_dim parameter support to embedding functions • Pass embedding_dim to jina_embed call • Pass embedding_dim to openai_embed call	2025-11-07 21:23:59 +08:00
yangdx	c14f25b7f8	Add mandatory dimension parameter handling for Jina API compliance	2025-11-07 21:08:34 +08:00
yangdx	d8a6355e41	Merge branch 'main' into apply-dim-to-embedding-call	2025-11-07 20:48:22 +08:00
yangdx	33a1482f7f	Add optional embedding dimension parameter control via env var * Add EMBEDDING_SEND_DIM environment variable * Update Jina/OpenAI embed functions * Add send_dimensions to EmbeddingFunc * Auto-inject embedding_dim when enabled * Add parameter validation warnings	2025-11-07 20:46:40 +08:00
domices	5c0ced6e4a	Fix spelling errors in the "使用PostgreSQL存储" section of README-zh.md	2025-11-07 17:48:41 +08:00

1 2 3 4 5 ...

5608 commits