- Added optional page tracking fields (start_page, end_page, pages) to TextChunkSchema. - Updated LightRAG class to handle page metadata during document processing. - Implemented validation for LLM responses to ensure only valid reference IDs are used. - Enhanced chunking functions to include page data for better context management. - Improved reference generation to include page ranges for citations. - Added PDF extraction methods to capture page-level data using PyPDF2 and Docling. |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| document_routes.py | ||
| graph_routes.py | ||
| ollama_api.py | ||
| query_routes.py | ||