Commit graph

154 commits

Author SHA1 Message Date
yangdx
7a89916bab Add method to retrieve in-progress documents in DocStatusStorage
• Add get_processing_docs() abstract method
• Override get_processing_docs() in PG storage
• Method retrieves docs with PROCESSING status
• Keep consistent with existing status methods
2025-02-13 01:27:27 +08:00
yangdx
7c7cac1cfd fix: remove unnecessary param binding, use direct workspace string interpolation 2025-02-13 00:39:40 +08:00
yangdx
3372af7c3d refactor: remove injected db field from PGDocStatusStorage, it must be injected after object is created 2025-02-12 22:54:22 +08:00
yangdx
7b79427097 refactor: improve database initialization by centralizing db instance injection
- Move db configs to separate methods
- Remove db field defaults in storage classes
- Add _initialize_database_if_needed method
- Inject db instances during initialization
- Clean up storage implementation code
2025-02-12 22:25:34 +08:00
yangdx
cf61bed62c Reorganize env config sections, add data store config to env file.
• Add namespace prefix config option
• Make AGE graph name optional
• Update env variable requirements
• Add comments for deprecated options
2025-02-12 21:48:48 +08:00
yangdx
fc0f522ed5 Merge branch 'main' into select-datastore-in-api-server 2025-02-12 09:49:18 +08:00
zrguo
18acb4a2b1 fix linting error 2025-02-11 22:16:35 +08:00
zrguo
a272c7b3f7
Merge pull request #741 from Brenon28/fix-update-doc-status-postgresql
fix(postgres): update document status with partial update instead of …
2025-02-11 22:14:52 +08:00
yangdx
8a56a5ea6c fix: Add content column to doc status and fix SQL parameter indexing
• Add content column to doc status table
• Fix SQL param index in get_by_status query
• Update insert SQL to include content field
2025-02-11 16:11:15 +08:00
zrguo
5ffbb548ad Fix linting error 2025-02-11 13:32:24 +08:00
zrguo
2d2ed19095 Fix cache bugs 2025-02-11 13:28:18 +08:00
yangdx
c5c606f491 Inject TiDB同LightRAG storage when needed 2025-02-11 04:27:45 +08:00
yangdx
a4cf7e66d3 Inject oracle db to LightRag storage class when needed 2025-02-11 03:54:54 +08:00
yangdx
8cfca5a141 Fix linting 2025-02-11 03:29:40 +08:00
yangdx
56c1792767 feat optimize storage configuration and environment variables
* add storage type compatibility validation table
* add enviroment variables check for storage
* modify storage init to get setting from confing.ini and env
2025-02-11 00:55:52 +08:00
Brenon
4723e9b535 fix(postgres): update document status with partial update instead of full upsert 2025-02-10 15:05:44 +03:00
zrguo
ad8349aff5
Merge pull request #738 from ArnoChenFx/qdrant-backend
add qdrant backend, enable MongoGraphStorage based on config
2025-02-10 13:55:15 +08:00
Yannick Stephan
9f6621454e fixed ids 2025-02-09 21:24:13 +01:00
Yannick Stephan
068924b59f updated type 2025-02-09 21:12:39 +01:00
Yannick Stephan
62115b836f cleaned set 2025-02-09 20:03:15 +01:00
Yannick Stephan
6480ddee5d cleaned code 2025-02-09 19:51:05 +01:00
Yannick Stephan
7d63898015 fixed bugs 2025-02-09 19:21:49 +01:00
ArnoChen
c858e4a4e6 add qdrant backend 2025-02-10 00:57:28 +08:00
Yannick Stephan
93717e6705 cleaned code 2025-02-09 15:36:01 +01:00
Yannick Stephan
7116bd1872 cleaned code 2025-02-09 15:24:30 +01:00
Yannick Stephan
82481ecf28 cleaned code 2025-02-09 14:55:52 +01:00
Yannick Stephan
4cce14e65e cleaned import 2025-02-09 11:24:08 +01:00
Yannick Stephan
31fe96d74a cleaned optional not used 2025-02-09 10:33:15 +01:00
Yannick Stephan
50c7f26262 cleanup code 2025-02-08 23:58:15 +01:00
Yannick Stephan
5a082a0052 cleaned code 2025-02-08 23:20:37 +01:00
Yannick Stephan
cff415d91f implemented method and cleaned the mess 2025-02-08 23:18:12 +01:00
Yannick Stephan
fe3050adce updated json kv 2025-02-08 23:02:40 +01:00
ArnoChen
f5bf6a4af8 use namespace as neo4j database name
format

fix
2025-02-08 20:06:18 +08:00
ArnoChen
3f845e9e53 better handling of namespace 2025-02-08 16:05:59 +08:00
ArnoChen
f974bf39bb format
format
2025-02-08 13:53:00 +08:00
ArnoChen
88d691deb9 add namespace prefix to storage namespaces 2025-02-08 13:53:00 +08:00
zrguo
3211816da8 Reapply "fix mongo impl" 2025-02-05 02:48:12 +08:00
zrguo
1c2545b38a Revert "Merge pull request #708 from ArnoChenFx/fix-mongo-impl"
This reverts commit ad2a82e5e4, reversing
changes made to 7b238dbc3e.
2025-02-05 02:36:38 +08:00
ArnoChen
c1ff66db98 fix mongo impl 2025-02-05 01:52:33 +08:00
chenjingyang
6e79bef321 Fix get_by_id DB query ressult is empty array 2025-02-04 17:09:34 +08:00
zrguo
c07b5522fe
Merge pull request #695 from ShanGor/main
Fix the bug from main stream that using doc['status'] and improve Apache AGE performance
2025-02-02 18:27:11 +08:00
Samuel Chan
02ac96ff8e - Fix the bug from main stream that using doc['status']
- Improve the performance of Apache AGE.
- Revise the README.md for Apache AGE indexing.
2025-02-02 18:20:32 +08:00
yangdx
a0710e9d65 Merge branch 'main' into fix-extract-entity-concurrent-problem 2025-02-01 14:45:42 +08:00
yangdx
635d4fd9e4 Add lock to protect file write operations in NanoVectorDBStorage
- Introduce asyncio.Lock for save operations
- Ensure thread-safe file writes
2025-02-01 10:36:25 +08:00
yangdx
6a326e2783 Revert "Refactor embedding functions and add async query limit"
This reverts commit 21481dba8f.
2025-02-01 10:36:25 +08:00
Gurjot Singh
2894e8faf2 Fix linting errors 2025-01-31 19:05:47 +05:30
Gurjot Singh
8a624e198a Add faiss integration for storage 2025-01-31 19:00:36 +05:30
yangdx
389f4ee872 Shorten log message for cosine similarity threshold. 2025-01-31 15:33:41 +08:00
yangdx
21481dba8f Refactor embedding functions and add async query limit
- Separate insert/query embedding funcs
- Add query-specific async limit
- Update storage classes to use new funcs
- Protect vector DB save with lock
- Improve config handling for thresholds
2025-01-31 15:00:56 +08:00
yangdx
06647438b2 Refactor threshold handling to use environment variables and global config settings for oracle, postgres and tidb 2025-01-29 23:47:57 +08:00