Commit graph

56 commits

Author SHA1 Message Date
Magic_yuan
650b8e38b7 feat(lightrag): Add document status tracking and checkpoint support
功能(lightrag): 添加文档状态跟踪和断点续传支持

- Add DocStatus enum and DocProcessingStatus class for document processing state management
- 添加 DocStatus 枚举和 DocProcessingStatus 类用于文档处理状态管理

- Implement JsonDocStatusStorage for persistent status storage
- 实现 JsonDocStatusStorage 用于持久化状态存储

- Add document-level deduplication in batch processing
- 在批处理中添加文档级别的去重功能

- Add checkpoint support in ainsert method for resumable document processing
- 在 ainsert 方法中添加断点续传支持,实现可恢复的文档处理

- Add status query methods for monitoring processing progress
- 添加状态查询方法用于监控处理进度

- Update LightRAG initialization to support document status tracking
- 更新 LightRAG 初始化以支持文档状态跟踪
2024-12-28 00:11:25 +08:00
Samuel Chan
42b3690e71 Fix the async issue while running on Windows 2024-12-27 12:01:50 +08:00
Alex Potapenko
848b3f6e33 GremlinStorage: fixes and patch to support other Gremlin compatible BD. Tested on ArcadeDB with Gremlin plugin. The main change is using "entity_name" vertex property instead of the label as a node_id since different implementations have different restrictions on label names. 2024-12-23 16:16:17 +01:00
Alex Potapenko
016d9f572d GremlinStorage: fix linting error, use asyncio.gather in get_node_edges() 2024-12-20 09:57:35 +01:00
Alex Potapenko
6f71293c83 Add Gremlin graph storage 2024-12-19 17:47:42 +01:00
Weaxs
344d8f277b support TiDBGraphStorage 2024-12-18 10:57:33 +08:00
Alex Potapenko
a44aceab1e AGEStorage: fix has_node() and has_edge() 2024-12-14 13:37:34 +01:00
Alex Potapenko
25d6939ca3 AGEStorage: all queries as multiline strings for consistency 2024-12-14 12:49:39 +01:00
Alex Potapenko
05c73bcfd1 AGEStorage: fix debug logging, and some cosmetics 2024-12-14 12:47:29 +01:00
Alex Potapenko
eb2e6d0d87 AGEStorage: fix has_edge(): AGE always returns field names in lower case 2024-12-14 12:24:45 +01:00
Alex Potapenko
7564841450 Add Apache AGE graph storage 2024-12-13 20:41:38 +01:00
LarFii
b7a2d336e6 Update __version__ 2024-12-13 20:15:49 +08:00
billvsme
a788c78197 keep tqdm_async work 2024-12-13 16:48:22 +08:00
billvsme
cf0278ca43 The asyncio.as_completed() function does not guarantee that the results are ordered 2024-12-13 15:42:41 +08:00
Weaxs
288985eab4 pre-commit fix tidb 2024-12-12 10:22:31 +08:00
Weaxs
8ef5a6b8cd support TiDB: add TiDBKVStorage, TiDBVectorDBStorage 2024-12-11 16:23:50 +08:00
Pankaj Kaushal
ca788463cc feat: Add ChromaDB integration for vector storage
- Implemented `ChromaVectorDBStorage` class in `lightrag/kg/chroma_impl.py` to support ChromaDB as a vector storage backend.
- Updated `lightrag.py` to include `ChromaVectorDBStorage` in the storage class mapping.
- Added a test script `test_chromadb.py` to demonstrate the usage of ChromaDB with LightRAG, including configuration for embedding functions and ChromaDB connection settings.
- fix lazy import function to support package context for dynamic class loading.
  288d4b8355
2024-12-10 16:23:05 +01:00
Suroy
979f30d20b Update oracle_impl.py
Fixed typing error in python3.9
2024-12-06 11:06:20 +08:00
partoneplay
d8ba7c57f3 Add MongoDB as KV storage 2024-12-05 13:57:43 +08:00
partoneplay
052322b213 Add Milvus as vector storage 2024-12-05 08:48:41 +08:00
AdityaKalraShorthillsAI
7734b81911 Added embedding_func argument in Neo4JStorage class's __init__ method in neo4j implementation 2024-12-02 02:44:47 +05:30
jin
adfaa75c00 Update oracle_impl.py 2024-11-25 14:21:01 +08:00
jin
776ba2f2ce Update oracle_impl.py 2024-11-25 14:15:10 +08:00
jin
26ae240c65 Logic Optimization 2024-11-25 13:40:38 +08:00
jin
1dbe803521 Merge branch 'main' of https://github.com/jin38324/LightRAG 2024-11-25 13:32:33 +08:00
jin
89c2de54a2 Optimization logic 2024-11-25 13:29:55 +08:00
lzd
9bf140d7cf 删除Neo4JStorage.has_edge中定义的没有用到的同步close函数 2024-11-22 13:59:14 +08:00
LarFii
ce7f524174 Update 2024-11-19 16:52:26 +08:00
zrguo
eec08661a2 Merge pull request #286 from Sucran/main
fix neo4j bug
2024-11-19 15:25:49 +08:00
tmuife
e5f2aa3a30 use pre-commit reformat 2024-11-18 13:52:49 +08:00
tmuife
1123ccfbd0 change the type of binding parameters in Oracle23AI 2024-11-18 10:00:06 +08:00
Richard
6bdf693b85 fix neo4j bug 2024-11-15 13:11:43 +08:00
jin
662303f605 use oracle bind variables to avoid error 2024-11-15 12:57:01 +08:00
jin
41599897fb fix pre commit 2024-11-12 13:32:40 +08:00
jin
cc45d802a6 Update oracle_impl.py 2024-11-12 13:23:03 +08:00
jin
f4ee85034f Update oracle_impl.py 2024-11-12 13:03:03 +08:00
jin
77123be2a1 Update oracle_impl.py 2024-11-12 12:02:24 +08:00
jin
8bc5d4efff add Oracle support 2024-11-12 09:59:12 +08:00
jin
8a7fb9761f Update oracle_impl.py 2024-11-11 15:32:30 +08:00
jin
10e4c595c5 Update oracle_impl.py 2024-11-11 15:24:31 +08:00
jin
0b6b0064d6 Merge branch 'main' of https://github.com/jin38324/LightRAG 2024-11-11 15:21:37 +08:00
LarFii
d0c1844264 Linting 2024-11-11 10:45:22 +08:00
jin
594470ab56 Oracle Database support
Add oracle 23ai database as the KV/vector/graph storage
2024-11-08 14:58:41 +08:00
Ken Wiltshire
3d5d083f42 fix event loop conflict 2024-11-06 11:18:14 -05:00
Ken Wiltshire
7af90f0129 linting errors fixed 2024-11-03 07:17:02 -05:00
Ken Wiltshire
8bd5d9b5b2 using neo4j async 2024-11-02 18:35:07 -04:00
Ken Wiltshire
465c9a13d9 cleaning code for pull 2024-11-01 16:29:36 -04:00
Ken Wiltshire
8547dd4941 cleaning code for pull 2024-11-01 16:13:39 -04:00
Ken Wiltshire
f375620992 cleaning code for pull 2024-11-01 16:11:19 -04:00
Ken Wiltshire
13940c1726 env var naming conflict, qualified by prefix 2024-11-01 14:24:17 -04:00