Magic_yuan
650b8e38b7
feat(lightrag): Add document status tracking and checkpoint support
...
功能(lightrag): 添加文档状态跟踪和断点续传支持
- Add DocStatus enum and DocProcessingStatus class for document processing state management
- 添加 DocStatus 枚举和 DocProcessingStatus 类用于文档处理状态管理
- Implement JsonDocStatusStorage for persistent status storage
- 实现 JsonDocStatusStorage 用于持久化状态存储
- Add document-level deduplication in batch processing
- 在批处理中添加文档级别的去重功能
- Add checkpoint support in ainsert method for resumable document processing
- 在 ainsert 方法中添加断点续传支持,实现可恢复的文档处理
- Add status query methods for monitoring processing progress
- 添加状态查询方法用于监控处理进度
- Update LightRAG initialization to support document status tracking
- 更新 LightRAG 初始化以支持文档状态跟踪
2024-12-28 00:11:25 +08:00
Samuel Chan
42b3690e71
Fix the async issue while running on Windows
2024-12-27 12:01:50 +08:00
Alex Potapenko
848b3f6e33
GremlinStorage: fixes and patch to support other Gremlin compatible BD. Tested on ArcadeDB with Gremlin plugin. The main change is using "entity_name" vertex property instead of the label as a node_id since different implementations have different restrictions on label names.
2024-12-23 16:16:17 +01:00
Alex Potapenko
016d9f572d
GremlinStorage: fix linting error, use asyncio.gather in get_node_edges()
2024-12-20 09:57:35 +01:00
Alex Potapenko
6f71293c83
Add Gremlin graph storage
2024-12-19 17:47:42 +01:00
Weaxs
344d8f277b
support TiDBGraphStorage
2024-12-18 10:57:33 +08:00
Alex Potapenko
a44aceab1e
AGEStorage: fix has_node() and has_edge()
2024-12-14 13:37:34 +01:00
Alex Potapenko
25d6939ca3
AGEStorage: all queries as multiline strings for consistency
2024-12-14 12:49:39 +01:00
Alex Potapenko
05c73bcfd1
AGEStorage: fix debug logging, and some cosmetics
2024-12-14 12:47:29 +01:00
Alex Potapenko
eb2e6d0d87
AGEStorage: fix has_edge(): AGE always returns field names in lower case
2024-12-14 12:24:45 +01:00
Alex Potapenko
7564841450
Add Apache AGE graph storage
2024-12-13 20:41:38 +01:00
LarFii
b7a2d336e6
Update __version__
2024-12-13 20:15:49 +08:00
billvsme
a788c78197
keep tqdm_async work
2024-12-13 16:48:22 +08:00
billvsme
cf0278ca43
The asyncio.as_completed() function does not guarantee that the results are ordered
2024-12-13 15:42:41 +08:00
Weaxs
288985eab4
pre-commit fix tidb
2024-12-12 10:22:31 +08:00
Weaxs
8ef5a6b8cd
support TiDB: add TiDBKVStorage, TiDBVectorDBStorage
2024-12-11 16:23:50 +08:00
Pankaj Kaushal
ca788463cc
feat: Add ChromaDB integration for vector storage
...
- Implemented `ChromaVectorDBStorage` class in `lightrag/kg/chroma_impl.py` to support ChromaDB as a vector storage backend.
- Updated `lightrag.py` to include `ChromaVectorDBStorage` in the storage class mapping.
- Added a test script `test_chromadb.py` to demonstrate the usage of ChromaDB with LightRAG, including configuration for embedding functions and ChromaDB connection settings.
- fix lazy import function to support package context for dynamic class loading.
288d4b8355
2024-12-10 16:23:05 +01:00
Suroy
979f30d20b
Update oracle_impl.py
...
Fixed typing error in python3.9
2024-12-06 11:06:20 +08:00
partoneplay
d8ba7c57f3
Add MongoDB as KV storage
2024-12-05 13:57:43 +08:00
partoneplay
052322b213
Add Milvus as vector storage
2024-12-05 08:48:41 +08:00
AdityaKalraShorthillsAI
7734b81911
Added embedding_func argument in Neo4JStorage class's __init__ method in neo4j implementation
2024-12-02 02:44:47 +05:30
jin
adfaa75c00
Update oracle_impl.py
2024-11-25 14:21:01 +08:00
jin
776ba2f2ce
Update oracle_impl.py
2024-11-25 14:15:10 +08:00
jin
26ae240c65
Logic Optimization
2024-11-25 13:40:38 +08:00
jin
1dbe803521
Merge branch 'main' of https://github.com/jin38324/LightRAG
2024-11-25 13:32:33 +08:00
jin
89c2de54a2
Optimization logic
2024-11-25 13:29:55 +08:00
lzd
9bf140d7cf
删除Neo4JStorage.has_edge中定义的没有用到的同步close函数
2024-11-22 13:59:14 +08:00
LarFii
ce7f524174
Update
2024-11-19 16:52:26 +08:00
zrguo
eec08661a2
Merge pull request #286 from Sucran/main
...
fix neo4j bug
2024-11-19 15:25:49 +08:00
tmuife
e5f2aa3a30
use pre-commit reformat
2024-11-18 13:52:49 +08:00
tmuife
1123ccfbd0
change the type of binding parameters in Oracle23AI
2024-11-18 10:00:06 +08:00
Richard
6bdf693b85
fix neo4j bug
2024-11-15 13:11:43 +08:00
jin
662303f605
use oracle bind variables to avoid error
2024-11-15 12:57:01 +08:00
jin
41599897fb
fix pre commit
2024-11-12 13:32:40 +08:00
jin
cc45d802a6
Update oracle_impl.py
2024-11-12 13:23:03 +08:00
jin
f4ee85034f
Update oracle_impl.py
2024-11-12 13:03:03 +08:00
jin
77123be2a1
Update oracle_impl.py
2024-11-12 12:02:24 +08:00
jin
8bc5d4efff
add Oracle support
2024-11-12 09:59:12 +08:00
jin
8a7fb9761f
Update oracle_impl.py
2024-11-11 15:32:30 +08:00
jin
10e4c595c5
Update oracle_impl.py
2024-11-11 15:24:31 +08:00
jin
0b6b0064d6
Merge branch 'main' of https://github.com/jin38324/LightRAG
2024-11-11 15:21:37 +08:00
LarFii
d0c1844264
Linting
2024-11-11 10:45:22 +08:00
jin
594470ab56
Oracle Database support
...
Add oracle 23ai database as the KV/vector/graph storage
2024-11-08 14:58:41 +08:00
Ken Wiltshire
3d5d083f42
fix event loop conflict
2024-11-06 11:18:14 -05:00
Ken Wiltshire
7af90f0129
linting errors fixed
2024-11-03 07:17:02 -05:00
Ken Wiltshire
8bd5d9b5b2
using neo4j async
2024-11-02 18:35:07 -04:00
Ken Wiltshire
465c9a13d9
cleaning code for pull
2024-11-01 16:29:36 -04:00
Ken Wiltshire
8547dd4941
cleaning code for pull
2024-11-01 16:13:39 -04:00
Ken Wiltshire
f375620992
cleaning code for pull
2024-11-01 16:11:19 -04:00
Ken Wiltshire
13940c1726
env var naming conflict, qualified by prefix
2024-11-01 14:24:17 -04:00