Commit graph

2716 commits

Author SHA1 Message Date
xuewei
77a42a37ca Fix Milvus DataNotMatchException lint-and-format 2025-07-16 22:07:39 +08:00
xuewei
e0bb6093b0 Fix Milvus DataNotMatchException 2025-07-16 21:59:17 +08:00
yangdx
bd340fece6 Fix timestamp column migration comment typos
- Correct timezone-related comments
- Fix typo in debug log message
- Update migration success message
- Maintain same migration logic
2025-07-16 14:27:52 +08:00
yangdx
2bf0d397ed Update webui assets 2025-07-16 10:18:51 +08:00
yangdx
e4f62de727 Bump api version to 0184 2025-07-16 04:57:46 +08:00
yangdx
500e940f75 Remove max token summary display from splash screen 2025-07-16 04:55:32 +08:00
yangdx
8103b200db Set DEFAULT_HISTORY_TURNS to 0 2025-07-16 02:20:27 +08:00
yangdx
0adb5f2595 Update webui assets 2025-07-16 01:39:48 +08:00
yangdx
45d38fa083 Fix JSON error logging in Redis storage implementations 2025-07-16 01:35:07 +08:00
Daniel.y
b44c8d46a5
Merge pull request #1782 from HKUDS/rerank
Refactor the token control system
2025-07-16 00:23:25 +08:00
yangdx
6e084bfae1 Increase default related chunk number from 5 to 10 2025-07-16 00:22:34 +08:00
yangdx
b1276a079f Fix linting 2025-07-15 23:57:24 +08:00
yangdx
5f7cb437e8 Centralize query parameters into LightRAG class
This commit refactors query parameter management by consolidating settings like `top_k`, token limits, and thresholds into the `LightRAG` class, and consistently sourcing parameters from a single location.
2025-07-15 23:56:49 +08:00
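One way to picture the consolidation described in 5f7cb437e8 is a single object that owns the defaults, with per-query values overriding them only when set. The sketch below is illustrative only: `EngineDefaults`, `QueryOverrides`, `resolve`, the field names other than `top_k`, and all numeric values are hypothetical and are not taken from the LightRAG code base.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass(frozen=True)
class EngineDefaults:
    """Hypothetical single source of truth for query settings."""
    top_k: int = 40                  # illustrative value, not the project's default
    max_total_tokens: int = 32000    # illustrative value
    cosine_threshold: float = 0.2    # illustrative value


@dataclass(frozen=True)
class QueryOverrides:
    """Hypothetical per-query settings; None means 'use the engine default'."""
    top_k: Optional[int] = None
    max_total_tokens: Optional[int] = None
    cosine_threshold: Optional[float] = None


def resolve(defaults: EngineDefaults, overrides: QueryOverrides) -> EngineDefaults:
    """Return effective settings: each override wins only if it is not None."""
    values = {}
    for f in fields(defaults):
        override = getattr(overrides, f.name)
        values[f.name] = getattr(defaults, f.name) if override is None else override
    return EngineDefaults(**values)


effective = resolve(EngineDefaults(), QueryOverrides(top_k=5))
print(effective.top_k, effective.max_total_tokens)
```

Keeping the defaults in one place means a changed default only has to be edited once, which appears to be the motivation stated in the commit message.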
zrguo
3ead0489b8 Remove "rank", "weight", "keywords" 2025-07-15 21:47:33 +08:00
zrguo
1541034816 Add DEFAULT_RELATED_CHUNK_NUMBER 2025-07-15 21:35:12 +08:00
yangdx
089346f8df Bump api version to 0183 2025-07-15 19:52:50 +08:00
zrguo
42f1fd60f4 Update operate.py 2025-07-15 18:59:52 +08:00
zrguo
29e82723e6 Update operate.py 2025-07-15 18:57:57 +08:00
yangdx
93b25a65d5 Update webui assets 2025-07-15 18:10:00 +08:00
yangdx
bdcb280731 Merge branch 'main' into rerank 2025-07-15 18:09:01 +08:00
yangdx
6d66cde4ac Reorder query settings in web UI 2025-07-15 18:06:00 +08:00
zrguo
e254c3dd81 Update openai.py 2025-07-15 17:30:30 +08:00
yangdx
661a41f9eb Update webui assets 2025-07-15 17:25:39 +08:00
yangdx
1927cb2685 Fix linting 2025-07-15 17:24:57 +08:00
yangdx
47341d3a71 Merge branch 'main' into rerank 2025-07-15 16:12:33 +08:00
yangdx
e8e1f6ab56 feat: centralize environment variable defaults in constants.py 2025-07-15 16:11:50 +08:00
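A minimal sketch of what centralizing environment variable defaults in a constants module can look like. The constant names `DEFAULT_RELATED_CHUNK_NUMBER` and `DEFAULT_HISTORY_TURNS` and their values (10 and 0) come from other entries in this log; the helper `env_int` and the environment variable names `RELATED_CHUNK_NUMBER` and `HISTORY_TURNS` are assumptions made for illustration, not the project's actual API.

```python
import os
from typing import Mapping, Optional

# Defaults kept in one module (values as reported elsewhere in this log).
DEFAULT_RELATED_CHUNK_NUMBER = 10
DEFAULT_HISTORY_TURNS = 0


def env_int(name: str, default: int,
            environ: Optional[Mapping[str, str]] = None) -> int:
    """Read an integer setting, falling back to the default when the
    variable is unset, empty, or not a valid integer.

    Hypothetical helper; `environ` is injectable to make testing easy.
    """
    env = os.environ if environ is None else environ
    raw = env.get(name)
    if raw is None or raw.strip() == "":
        return default
    try:
        return int(raw)
    except ValueError:
        return default


# The variable names below are assumed for the example.
related_chunks = env_int("RELATED_CHUNK_NUMBER", DEFAULT_RELATED_CHUNK_NUMBER)
history_turns = env_int("HISTORY_TURNS", DEFAULT_HISTORY_TURNS)
```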
Daniel.y
6d1260aafa
Merge pull request #1766 from HKUDS/fix-memgraph-max-nodes-issue
Fix Memgraph get_knowledge_graph issues
2025-07-15 16:07:04 +08:00
zrguo
91d0f65476 Update QueryParam 2025-07-15 14:21:58 +08:00
yangdx
3da9f8aab4 Fix logging output condition in shared_storage.py. Early return if logging disabled 2025-07-15 13:38:05 +08:00
yangdx
ccc2a20071 feat: remove deprecated MAX_TOKEN_SUMMARY parameter to prevent LLM output truncation
- Remove MAX_TOKEN_SUMMARY parameter and related configurations
- Eliminate forced token-based truncation in entity/relationship descriptions
- Switch to fragment-count based summarization logic using FORCE_LLM_SUMMARY_ON_MERGE
- Update FORCE_LLM_SUMMARY_ON_MERGE default from 6 to 4 for better summarization
- Clean up documentation, environment examples, and API display code
- Preserve backward compatibility by graceful parameter removal

This change resolves issues where LLMs were forcibly truncating entity relationship
descriptions mid-sentence, leading to incomplete and potentially inaccurate knowledge
graph content. The new approach allows LLMs to generate complete descriptions while
still providing summarization when multiple fragments need to be merged.

Breaking Change: None - parameter removal is backward compatible
Fixes: Entity relationship description truncation issues
2025-07-15 12:26:33 +08:00
zrguo
9a9f0f2463 Update rerank_example & readme 2025-07-15 12:17:27 +08:00
zrguo
7c882313bb remove chunk_rerank_top_k 2025-07-15 11:52:34 +08:00
zrguo
86a0a4872e Update operate.py 2025-07-15 10:56:48 +08:00
DavIvek
2914b21b34 remove unused query parameter 2025-07-14 16:25:58 +02:00
DavIvek
9beb2456ec update subgraph query comment 2025-07-14 16:25:17 +02:00
DavIvek
45815f1eae remove redundant UNWIND 2025-07-14 15:39:39 +02:00
DavIvek
593ce552af run pre-commit 2025-07-14 14:26:39 +02:00
DavIvek
f961f1aa7d remove fallback query 2025-07-14 14:26:23 +02:00
DavIvek
81c93f6950 don't use MAGE procedure 2025-07-14 14:16:20 +02:00
zrguo
7edf087baa Update operate.py 2025-07-14 18:43:22 +08:00
yangdx
9afe578fe7 Update webui assets 2025-07-14 17:56:51 +08:00
zrguo
c2da2fbe12 build 2025-07-14 17:19:28 +08:00
zrguo
bbd91d3a18 Update operate.py 2025-07-14 16:37:25 +08:00
zrguo
4e425b1b59 Revert "update from main"
This reverts commit 1d0376d6a9.
2025-07-14 16:29:00 +08:00
zrguo
1d0376d6a9 update from main 2025-07-14 16:27:49 +08:00
zrguo
c9cbd2d3e0 Merge branch 'main' into rerank 2025-07-14 16:24:29 +08:00
zrguo
ef2115d437 Update token limit 2025-07-14 15:53:48 +08:00
yangdx
7e988158a9 Fix: Resolve timezone handling problem in PostgreSQL storage
- Changed timestamp columns to naive UTC
- Added datetime formatting utilities
- Updated SQL templates for timestamp extraction
- Simplified timestamp migration logic
2025-07-14 04:12:52 +08:00
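The "naive UTC" convention mentioned in 7e988158a9 can be illustrated as follows: timezone-aware values are converted to UTC and the offset is dropped before they are stored. This is a sketch of the general technique only; the function name `to_naive_utc` is hypothetical, and treating already-naive values as UTC is an assumption, not something the commit message states.

```python
from datetime import datetime, timedelta, timezone


def to_naive_utc(dt: datetime) -> datetime:
    """Convert a datetime to naive UTC (no tzinfo attached).

    Assumption: values that are already naive are taken to be UTC.
    """
    if dt.tzinfo is None:
        return dt
    return dt.astimezone(timezone.utc).replace(tzinfo=None)


# The commit's own timestamp, 2025-07-14 04:12:52 +08:00, as an example.
local = datetime(2025, 7, 14, 4, 12, 52, tzinfo=timezone(timedelta(hours=8)))
print(to_naive_utc(local))  # 2025-07-13 20:12:52
```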
yangdx
b03bb48e24 feat: Refine summary logic and add dedicated Ollama num_ctx config
- Refactor the trigger condition for LLM-based summarization of entities and relations. Instead of relying on character length, the summary is now triggered when the number of merged description fragments exceeds a configured threshold. This provides a more robust and logical condition for consolidation.
- Introduce the `OLLAMA_NUM_CTX` environment variable to explicitly configure the context window size (`num_ctx`) for Ollama models. This decouples the model's context length from the `MAX_TOKENS` parameter, which is now specifically used to limit input for summary generation, making the configuration clearer and more flexible.
- Updated `README` files, `env.example`, and default values to reflect these changes.
2025-07-14 01:55:04 +08:00
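The trigger condition described in b03bb48e24 (summarize when the number of merged description fragments exceeds a threshold, rather than when the text gets long) can be sketched as below. The function name `merge_descriptions`, its parameters, and the newline separator are hypothetical. The strict `>` comparison follows the word "exceeds" in the message; whether the real check is inclusive is not stated in this log.

```python
from typing import Callable, Sequence


def merge_descriptions(
    fragments: Sequence[str],
    summarize: Callable[[Sequence[str]], str],
    threshold: int,
    separator: str = "\n",
) -> str:
    """Join description fragments, calling the summarizer only when the
    fragment count exceeds the threshold (character length is ignored)."""
    if len(fragments) > threshold:
        return summarize(fragments)
    return separator.join(fragments)


def fake_summarize(parts: Sequence[str]) -> str:
    # Stand-in for an LLM call, used here only so the example runs offline.
    return f"summary of {len(parts)} fragments"


print(merge_descriptions(["a", "b"], fake_summarize, threshold=2))
print(merge_descriptions(["a", "b", "c"], fake_summarize, threshold=2))
```

Counting fragments instead of characters keeps a single long description from being summarized on its own; the summarizer runs only when several fragments have accumulated.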
yangdx
e8b3dfcf90 Bump api version to 0182 2025-07-14 00:29:48 +08:00