- Added optional page tracking fields (start_page, end_page, pages) to TextChunkSchema.
- Updated LightRAG class to handle page metadata during document processing.
- Implemented validation for LLM responses to ensure only valid reference IDs are used.
- Enhanced chunking functions to include page data for better context management.
- Improved reference generation to include page ranges for citations.
- Added PDF extraction methods to capture page-level data using PyPDF2 and Docling.

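As a rough illustration of the changes described in the commit message, the sketch below shows how optional page fields on a chunk could feed a citation page range, and how reference IDs in an LLM response could be checked against a known set. It is not the repository's code: `TextChunk`, `format_page_range`, and `invalid_reference_ids` are hypothetical names, and both the `[n]` citation syntax and the `p.`/`pp.` output format are assumptions made for the example. Only the field names `start_page`, `end_page`, and `pages` come from the commit message.

```python
import re
from typing import List, Optional, Set, TypedDict


class TextChunk(TypedDict, total=False):
    """Hypothetical stand-in for TextChunkSchema (not the real definition)."""
    content: str
    # The three optional page fields named in the commit message.
    start_page: Optional[int]
    end_page: Optional[int]
    pages: Optional[List[int]]


def format_page_range(chunk: TextChunk) -> str:
    """Build a citation page label; the 'p.'/'pp.' format is an assumption."""
    start = chunk.get("start_page")
    end = chunk.get("end_page")
    if start is None:
        # Chunk has no page metadata (for example, a plain-text source).
        return ""
    if end is None or end == start:
        return f"p. {start}"
    return f"pp. {start}-{end}"


def invalid_reference_ids(response: str, valid_ids: Set[str]) -> List[str]:
    """Return cited IDs, assumed to look like [n], that are not in valid_ids."""
    unknown: List[str] = []
    for ref_id in re.findall(r"\[(\d+)\]", response):
        # Keep first-seen order and report each unknown ID only once.
        if ref_id not in valid_ids and ref_id not in unknown:
            unknown.append(ref_id)
    return unknown


if __name__ == "__main__":
    chunk: TextChunk = {"content": "example text", "start_page": 3, "end_page": 5}
    print(format_page_range(chunk))
    print(invalid_reference_ids("See [1] and [4].", {"1", "2"}))
```

Keeping the page fields optional means chunks from sources without pagination can still be represented, and the formatter simply returns an empty label for them.
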
||
|---|---|---|
| api | ||
| kg | ||
| llm | ||
| tools | ||
| __init__.py | ||
| base.py | ||
| constants.py | ||
| exceptions.py | ||
| lightrag.py | ||
| namespace.py | ||
| operate.py | ||
| prompt.py | ||
| rerank.py | ||
| types.py | ||
| utils.py | ||
| utils_graph.py | ||