LightRAG/lightrag/kg
Magic_yuan 650b8e38b7 feat(lightrag): Add document status tracking and checkpoint support
功能(lightrag): 添加文档状态跟踪和断点续传支持

- Add DocStatus enum and DocProcessingStatus class for document processing state management
- 添加 DocStatus 枚举和 DocProcessingStatus 类用于文档处理状态管理

- Implement JsonDocStatusStorage for persistent status storage
- 实现 JsonDocStatusStorage 用于持久化状态存储

- Add document-level deduplication in batch processing
- 在批处理中添加文档级别的去重功能

- Add checkpoint support in ainsert method for resumable document processing
- 在 ainsert 方法中添加断点续传支持,实现可恢复的文档处理

- Add status query methods for monitoring processing progress
- 添加状态查询方法用于监控处理进度

- Update LightRAG initialization to support document status tracking
- 更新 LightRAG 初始化以支持文档状态跟踪
2024-12-28 00:11:25 +08:00
..
__init__.py Linting 2024-11-11 10:45:22 +08:00
age_impl.py feat(lightrag): Add document status tracking and checkpoint support 2024-12-28 00:11:25 +08:00
chroma_impl.py feat: Add ChromaDB integration for vector storage 2024-12-10 16:23:05 +01:00
gremlin_impl.py GremlinStorage: fixes and patch to support other Gremlin compatible BD. Tested on ArcadeDB with Gremlin plugin. The main change is using "entity_name" vertex property instead of the label as a node_id since different implementations have different restrictions on label names. 2024-12-23 16:16:17 +01:00
milvus_impl.py Update __version__ 2024-12-13 20:15:49 +08:00
mongo_impl.py Add MongoDB as KV storage 2024-12-05 13:57:43 +08:00
neo4j_impl.py Added embedding_func argument in Neo4JStorage class's __init__ method in neo4j implementation 2024-12-02 02:44:47 +05:30
oracle_impl.py Update oracle_impl.py 2024-12-06 11:06:20 +08:00
tidb_impl.py support TiDBGraphStorage 2024-12-18 10:57:33 +08:00