- Added optional page tracking fields (start_page, end_page, pages) to TextChunkSchema. - Updated LightRAG class to handle page metadata during document processing. - Implemented validation for LLM responses to ensure only valid reference IDs are used. - Enhanced chunking functions to include page data for better context management. - Improved reference generation to include page ranges for citations. - Added PDF extraction methods to capture page-level data using PyPDF2 and Docling. |
||
|---|---|---|
| .. | ||
| deprecated | ||
| __init__.py | ||
| faiss_impl.py | ||
| json_doc_status_impl.py | ||
| json_kv_impl.py | ||
| memgraph_impl.py | ||
| milvus_impl.py | ||
| mongo_impl.py | ||
| nano_vector_db_impl.py | ||
| neo4j_impl.py | ||
| networkx_impl.py | ||
| postgres_impl.py | ||
| qdrant_impl.py | ||
| redis_impl.py | ||
| shared_storage.py | ||