LightRAG

Author	SHA1	Message	Date
anouarbm	c9e1c6c1c2	fix(api): change content field to list in query responses BREAKING CHANGE: content field is now List[str] instead of str - Add ReferenceItem Pydantic model for type safety - Update /query and /query/stream to return content as list - Update OpenAPI schema and examples - Add migration guide to API README - Fix RAGAS evaluation to handle list format Addresses PR #2297 feedback. Tested with RAGAS: 97.37% score.	2025-11-03 04:57:08 +01:00
anouarbm	9d69e8d776	fix(api): Change content field from string to list in query responses BREAKING CHANGE: The `content` field in query response references is now an array of strings instead of a concatenated string. This preserves individual chunk boundaries when a single file has multiple chunks. Changes: - Update QueryResponse Pydantic model to accept List[str] for content - Modify query_text endpoint to return content as list (query_routes.py:425) - Modify query_text_stream endpoint to support chunk content enrichment - Update OpenAPI schema and examples to reflect array structure - Update API README with breaking change notice and migration guide - Fix RAGAS evaluation to flatten chunk content lists	2025-11-03 04:37:09 +01:00
anouarbm	0b5e3f9dc4	Use logger in RAG evaluation and optimize reference content joins	2025-11-02 18:43:53 +01:00
anouarbm	963ad4c637	docs: Add documentation and examples for include_chunk_content parameter Added comprehensive documentation for the new include_chunk_content parameter that enables retrieval of actual chunk text content in API responses. Documentation Updates: - Added "Include Chunk Content in References" section to API README - Explained use cases: RAG evaluation, debugging, citations, transparency - Provided JSON request/response examples - Clarified parameter interaction with include_references OpenAPI/Swagger Examples: - Added "Response with chunk content" example to /query endpoint - Shows complete reference structure with content field - Demonstrates realistic chunk text content This makes the feature discoverable through: 1. API documentation (README.md) 2. Interactive Swagger UI (http://localhost:9621/docs) 3. Code examples for developers	2025-11-02 17:53:27 +01:00
anouarbm	0bbef9814e	Optimize RAGAS evaluation with parallel execution and chunk content enrichment Added efficient RAG evaluation system with optimized API calls and comprehensive benchmarking. Key Features: - Single API call per evaluation (2x faster than before) - Parallel evaluation based on MAX_ASYNC environment variable - Chunk content enrichment in /query endpoint responses - Comprehensive benchmark statistics (moyennes) - NaN-safe metric calculations API Changes: - Added include_chunk_content parameter to QueryRequest (backward compatible) - /query endpoint enriches references with actual chunk content when requested - No breaking changes - default behavior unchanged Evaluation Improvements: - Parallel execution using asyncio.Semaphore (respects MAX_ASYNC) - Shared HTTP client with connection pooling - Proper timeout handling (3min connect, 5min read) - Debug output for context retrieval verification - Benchmark statistics with averages, min/max scores Results: - Moyenne RAGAS Score: 0.9772 - Perfect Faithfulness: 1.0000 - Perfect Context Recall: 1.0000 - Perfect Context Precision: 1.0000 - Excellent Answer Relevance: 0.9087	2025-11-02 17:39:43 +01:00
yangdx	61b57cbb5d	Add PDF decryption support for password-protected files • Add PDF_DECRYPT_PASSWORD env variable • Check encryption status before reading • Handle decrypt errors gracefully • Log detailed error messages • Support both encrypted/plain PDFs	2025-11-01 15:01:17 +08:00
yangdx	c46c1b26a9	Add pycryptodome dependency for PDF encryption support	2025-10-31 01:49:42 +08:00
yangdx	5155edd8d2	feat: Improve entity merge and edit UX - API: The `graph/entity/edit` endpoint now returns a detailed `operation_summary` for better client-side handling of update, rename, and merge outcomes. - Web UI: Added an "auto-merge on rename" option. The UI now gracefully handles merge success, partial failures (update OK, merge fail), and other errors with specific user feedback.	2025-10-27 23:42:08 +08:00
yangdx	97034f06e3	Add allow_merge parameter to entity update API endpoint	2025-10-27 14:30:27 +08:00
yangdx	6015e8bc68	Refactor graph utils to use unified persistence callback - Add _persist_graph_updates function - Remove duplicate callback functions	2025-10-26 20:20:16 +08:00
yangdx	bf1897a67e	Normalize entity order for undirected graph consistency • Normalize entity pairs for storage • Update API docs for undirected edges	2025-10-26 15:53:31 +08:00
Daniel.y	c82485d94d	Merge pull request #2253 from Mobious/main Allow users to provide keywords with QueryRequest	2025-10-25 11:26:54 +08:00
yangdx	78ad8873b8	Add cancellation check in delete loop	2025-10-24 14:47:20 +08:00
yangdx	743aefc655	Add pipeline cancellation feature for graceful processing termination • Add cancel_pipeline API endpoint • Implement PipelineCancelledException • Add cancellation checks in main loop • Handle task cancellation gracefully • Mark cancelled docs as FAILED	2025-10-24 14:08:12 +08:00
Mobious	f24a261613	Allow users to provide keywords with QueryRequest	2025-10-23 12:53:19 -10:00
yangdx	8dc23eeff2	Fix RayAnything compatible problem • Use "preprocessed" to indicate multimodal processing is required • Update DocProcessingStatus to process status convertion automatically • Remove multimodal_processed from DocStatus enum value • Update UI filter logic	2025-10-22 20:15:29 +08:00
yangdx	162370b6e6	Add optional LLM cache deletion when deleting documents • Add delete_llm_cache parameter to API • Collect cache IDs from text chunks • Delete cache after graph operations • Update UI with new checkbox option • Add i18n translations for cache option	2025-10-22 12:19:23 +08:00
yangdx	dc62c78f98	Add entity/relation chunk tracking with configurable source ID limits - Add entity_chunks & relation_chunks storage - Implement KEEP/FIFO limit strategies - Update env.example with new settings - Add migration for chunk tracking data - Support all KV storage	2025-10-20 15:24:15 +08:00
yangdx	130b4959dc	Add PREPROCESSED (multimodal_processed) status for multimodal document processing • Add DocStatus.PREPROCESSED enum value • Update API routes and response models • Add preprocessed filter in web UI • Update localization files • Handle preprocessed status in deletion	2025-10-14 14:02:05 +08:00
yangdx	12facac506	Enhance graph API endpoints with detailed docs and field validation - Remove redundant README section - Add Pydantic field validation - Expand endpoint docstrings - Include request/response examples - Document merge operation benefits	2025-10-10 12:49:00 +08:00
yangdx	85d1a563b3	Merge branch 'adminunblinded/main'	2025-10-10 12:31:47 +08:00
NeelM0906	b7c77396a0	Fix entity/relation creation endpoints to properly update vector stores - Changed create_entity to use rag.acreate_entity() instead of direct graph manipulation - Changed create_relation to use rag.acreate_relation() instead of direct graph manipulation - This ensures vector embeddings are created and entities/relations are searchable - Adds proper concurrency locks and metadata population	2025-10-09 17:02:17 -04:00
NeelM0906	f6d1fb98ac	Fix Linting errors	2025-10-09 16:52:22 -04:00
NeelM0906	9f44e89de7	Add knowledge graph manipulation endpoints Added three new REST API endpoints for direct knowledge graph manipulation: - POST /graph/entity/create: Create new entities in the knowledge graph - POST /graph/relation/create: Create relationships between entities - POST /graph/entities/merge: Merge duplicate/misspelled entities while preserving relationships The merge endpoint is particularly useful for consolidating entities discovered after document processing, fixing spelling errors, and cleaning up the knowledge graph. All relationships from source entities are transferred to the target entity, with intelligent handling of duplicate relationships. Updated API documentation in lightrag/api/README.md with usage examples for all three endpoints.	2025-10-08 15:59:47 -04:00
Jon	cf2a024e37	feat: Add endpoint and UI to retry failed documents Add a new `/documents/reprocess_failed` API endpoint and corresponding UI button to retry processing of failed and pending documents. This addresses a common recovery scenario when document processing fails due to server crashes, network errors, or LLM service outages. Backend changes: - Add ReprocessResponse model with status, message, and track_id fields - Add POST /documents/reprocess_failed endpoint that triggers background reprocessing of FAILED, PENDING, and interrupted PROCESSING documents - Reuses existing apipeline_process_enqueue_documents for consistency - Includes comprehensive docstring and logging for observability Frontend changes: - Add TypeScript types and API function for the new endpoint - Add retry handler with intelligent polling (fast refresh → normal) - Add "Retry Failed" button in Documents page toolbar - Button disabled when pipeline is busy to prevent duplicate operations - Complete i18n support (English and Chinese translations) This feature provides a convenient way to recover from processing failures without requiring a full filesystem rescan.	2025-10-04 16:46:29 -04:00
yangdx	83d99e1424	fix(OllamaAPI): Add validation to ensure last message is from user role • Validate last message role is "user" • Raise 400 error for invalid role • Improve API request validation • Prevent invalid message sequences	2025-10-01 20:48:37 +08:00
yangdx	df43afc89b	Relax conversation history role validation requirements • Remove strict role value checking • Allow any non-empty string roles	2025-09-29 13:10:15 +08:00
yangdx	7cba458f22	Limit deprecated documents endpoint to 1000 records with fair distribution	2025-09-28 11:18:10 +08:00
yangdx	91be53ffd2	Fix linting	2025-09-27 22:36:38 +08:00
yangdx	e0ac05db90	Simplify query route documentation and clarify conversation history	2025-09-27 22:36:16 +08:00
yangdx	f66a0aad8b	Update query streaming endpoint docs to clarify behavior	2025-09-27 22:27:49 +08:00
yangdx	e7948df541	Fix linting	2025-09-27 15:13:07 +08:00
yangdx	1766cddd6c	Fix mode parameter serialization error in Ollama chat API • Use mode.value for API requests • Add debug logging in aquery_llm	2025-09-27 15:11:51 +08:00
yangdx	81caee3498	Enhance query API with streaming control and comprehensive documentation - Add stream parameter to QueryRequest - Support non-streaming in /query/stream - Add detailed OpenAPI response schemas - Expand endpoint documentation - Include usage examples and error handling	2025-09-27 11:53:31 +08:00
yangdx	a528213210	Fix logging filter logic • Fix boolean operator precedence in filter • Consolidate GET/POST condition logic	2025-09-26 19:42:33 +08:00
yangdx	3ba06478a8	fix http log message order for streaming respond - Move aquery_llm call outside generator - Execute query before stream starts	2025-09-26 19:27:44 +08:00
yangdx	8cd4139cbf	refactor: fix double query problem by add aquery_llm function for consistent response handling - Add new aquery_llm/query_llm methods providing structured responses - Consolidate /query and /query/stream endpoints to use unified aquery_llm - Optimize cache handling by moving cache checks before LLM calls	2025-09-26 19:05:03 +08:00
yangdx	b848ca49e6	Fix linting	2025-09-25 16:22:00 +08:00
yangdx	b08b8a6a6a	Add reference list support to query API endpoints with unified result handling • Add include_references param to QueryRequest • Extend QueryResponse with references field • Create unified QueryResult data structures • Refactor kg_query and naive_query functions • Update streaming to send references first	2025-09-25 16:21:42 +08:00
yangdx	699ca3ba00	Remove deprecated `history_turns` and `ids` parameters from query API endpoint • Update QueryParam documentation • Mark history_turns as deprecated • Clean up splash screen display • Clarify conversation_history usage	2025-09-25 04:58:57 +08:00
yangdx	5eb4a4b799	feat: simplify citations, add reference merging, and restructure API response format	2025-09-24 14:30:10 +08:00
yangdx	2adb8efdc7	Add duplicate document detection and skip processed files in scanning - Add get_doc_by_file_path to all storages - Skip processed files in scan operation - Check duplicates in upload endpoints - Check duplicates in text insert APIs - Return status info in duplicate responses	2025-09-23 17:30:54 +08:00
yangdx	26c9ba4cb5	Make graph label methods required in BaseGraphStorage interface • Remove fallback compatibility code • Add get_popular_labels to ABC • Add search_labels to ABC • Enforce consistent implementation • Clean up error handling paths	2025-09-20 12:40:36 +08:00
yangdx	9db8f2fce5	feat: Add popular labels and search APIs with history management - Add popular/search label endpoints - Implement SearchHistoryManager utility - Replace client-side with server search - Add graph data version tracking - Update UI for better label discovery	2025-09-20 02:03:47 +08:00
yangdx	8f6287e27e	Add path traversal security validation for file deletion operations • Add validate_file_path_security function • Prevent path traversal attacks • Validate file paths before deletion • Check both input and enqueued dirs • Log security violations	2025-09-17 01:12:44 +08:00
yangdx	c0d5abba6b	Fix linting	2025-09-15 02:59:21 +08:00
yangdx	b1c8206346	Add aquery_data endpoint for structured retrieval without LLM generation - Add QueryDataResponse model - Implement /query/data endpoint - Add aquery_data method to LightRAG - Return entities, relationships, chunks	2025-09-15 02:15:14 +08:00
yangdx	17d665c9f3	Limit history messages to latest 1000 entries with truncation indicator • Limit history to 1000 latest messages • Add truncation message when needed • Show count of truncated messages • Update API documentation • Prevent memory issues with large logs	2025-09-05 12:31:36 +08:00
yangdx	25b5d176cd	Fix label selection with leading/trailing whitespace • Fix AsyncSelect value trimming issue • Preserve whitespace in label display • Use safe keys for command items • Add GraphControl dependency fix • Add debug logging for graph labels	2025-08-31 02:54:39 +08:00
yangdx	3d5e6226a9	Refactored `rerank_example` file to utilize the updated rerank function.	2025-08-23 22:51:41 +08:00

1 2 3 4 5

217 commits