LightRAG

History

Saswat 6872f085d1 feat: Enhance document processing with page tracking and reference validation - Added optional page tracking fields (start_page, end_page, pages) to TextChunkSchema. - Updated LightRAG class to handle page metadata during document processing. - Implemented validation for LLM responses to ensure only valid reference IDs are used. - Enhanced chunking functions to include page data for better context management. - Improved reference generation to include page ranges for citations. - Added PDF extraction methods to capture page-level data using PyPDF2 and Docling.		2025-10-09 17:38:43 +05:30
..
deprecated	style: ruff-format	2025-08-29 21:09:14 -07:00
__init__.py	Remove deprecated storage	2025-08-06 00:02:50 +08:00
faiss_impl.py	perf: add optional query_embedding parameter to avoid redundant embedding calls	2025-08-29 18:15:45 +08:00
json_doc_status_impl.py	Add duplicate document detection and skip processed files in scanning	2025-09-23 17:30:54 +08:00
json_kv_impl.py	Merge upstream/main and resolve conflicts	2025-08-21 16:56:11 +00:00
memgraph_impl.py	Add label search and popularity methods to MemgraphStorage	2025-09-20 12:38:04 +08:00
milvus_impl.py	perf: add optional query_embedding parameter to avoid redundant embedding calls	2025-08-29 18:15:45 +08:00
mongo_impl.py	Add duplicate document detection and skip processed files in scanning	2025-09-23 17:30:54 +08:00
nano_vector_db_impl.py	perf: add optional query_embedding parameter to avoid redundant embedding calls	2025-08-29 18:15:45 +08:00
neo4j_impl.py	Fix Neo4J index creation to check state instead of analyzer	2025-09-20 23:51:50 +08:00
networkx_impl.py	Fixed typo in log message when creating new graph file	2025-10-07 14:30:05 +02:00
postgres_impl.py	feat: Enhance document processing with page tracking and reference validation	2025-10-09 17:38:43 +05:30
qdrant_impl.py	Fix linting	2025-09-12 17:00:53 +08:00
redis_impl.py	Add duplicate document detection and skip processed files in scanning	2025-09-23 17:30:54 +08:00
shared_storage.py	Rename allow_create to first_initialization for clarity	2025-08-23 02:34:39 +08:00