Commit graph

3294 commits

Author SHA1 Message Date
yangdx
e9503ee6ae Merge branch 'patch-1' into citation-optimization 2025-09-24 18:23:29 +08:00
yangdx
f99c4a3738 Fix graph truncation logic for depth-limited traversals
• Only set truncated flag for node limit
• Keep depth limit info logging
• Improve log message clarity
• Fix false truncation detection
2025-09-24 18:03:11 +08:00
yangdx
ac26f3a2f2 Refactor citation format from file paths to numbered document titles
• Change citation format to [n] style
• Reduce max citations from 6 to 5
• Add reference tracking instructions
• Simplify citation merge logic
• Remove inline citation requirements
2025-09-24 14:30:53 +08:00
yangdx
5eb4a4b799 feat: simplify citations, add reference merging, and restructure API response format 2025-09-24 14:30:10 +08:00
SASon
b3cc0127d9
Fix typo in output language instruction 2025-09-24 13:22:35 +09:00
SASon
746d4c576d
Fix typo in output language instruction
from Oputput to Output
2025-09-24 13:17:37 +09:00
yangdx
18968c6b6b Update webui assets 2025-09-23 23:46:17 +08:00
yangdx
8a49f968b2 Update webui assets 2025-09-23 23:45:12 +08:00
yangdx
3591cb69b5 Update webui assets 2025-09-23 23:16:26 +08:00
yangdx
6a599680ec Update webui assets 2025-09-23 22:41:43 +08:00
yangdx
c9c691384b Update webui assets 2025-09-23 22:33:47 +08:00
yangdx
faa65d39c3 Update webui assets 2025-09-23 22:28:54 +08:00
yangdx
9dd1790b5c Add "Creature" entity type and reorganize type mappings
- Add Creature to default entity types
- Map animals/beings to creature type
2025-09-23 21:58:33 +08:00
yangdx
2adb8efdc7 Add duplicate document detection and skip processed files in scanning
- Add get_doc_by_file_path to all storages
- Skip processed files in scan operation
- Check duplicates in upload endpoints
- Check duplicates in text insert APIs
- Return status info in duplicate responses
2025-09-23 17:30:54 +08:00
yangdx
6b953fa53d Remove auto-scan-at-startup feature and related documentation
• Remove --auto-scan-at-startup arg
• Delete auto scan docs sections
• Remove startup scanning logic
2025-09-23 16:24:53 +08:00
yangdx
fc15e9f142 Update webui assets 2025-09-23 13:25:11 +08:00
yangdx
8093bfb5a3 Update webui assets 2025-09-23 13:12:00 +08:00
yangdx
367f3df038 Fix log message 2025-09-23 11:25:55 +08:00
yangdx
a4442a8613 Optimize log message 2025-09-23 11:22:14 +08:00
yangdx
86186c0c85 Update log message 2025-09-23 11:08:33 +08:00
yangdx
55774bf394 Mark get_all_labels method as deprecated in BaseGraphStorage 2025-09-22 18:24:10 +08:00
yangdx
6b3a341977 Increase default PostgreSQL max connections from 20 to 50 2025-09-22 18:11:28 +08:00
yangdx
20802b1ee6 Update webui assets 2025-09-22 15:16:23 +08:00
yangdx
0268768ecd Update webassets and bump API version to 0227 2025-09-22 13:03:27 +08:00
yangdx
5fa92cbf99 Improve citation quality and reduce reference limits in prompts
- Reduce max citations from 8 to 6
- Require direct fact referencing
- Clarify relevance prioritization
2025-09-22 10:53:03 +08:00
yangdx
b13209e241 Update webui assets 2025-09-22 03:04:06 +08:00
yangdx
0fcb5af333 Update webui assets and bump API version to 0227 2025-09-22 02:02:42 +08:00
yangdx
8826d2f892 Optimize prompt instruction for citation format 2025-09-22 01:04:57 +08:00
yangdx
cff6029508 Ensure COT tags are properly closed in all stream termination scenarios
- Add COT closure after stream completion
- Handle COT in exception scenarios
- Add final safety check in finally block
- Prevent unclosed thinking tags
- Log COT closure failures
2025-09-22 00:09:27 +08:00
yangdx
6e2eab5c23 Add ID fields to entities, relations, and chunks in raw data query results 2025-09-21 23:31:35 +08:00
yangdx
2f06f851c3 Enhance citation format with merged references and clearer guidelines
- Increase max references from 5 to 8
- Merge citations by file_path
- Remove inline citations from body
- Add reference section examples
- Update citation prefixes (KG→EN, RE)
2025-09-21 22:48:48 +08:00
yangdx
18e886d7e9 Improve context item identification with meaningful IDs
- Add EN prefix to entitie IDs
- Add RE prefix to relation IDs
-Add DC prefix chunk IDs
- Enhance traceability across contexts
2025-09-21 20:19:14 +08:00
yangdx
f88c2fbdff Refactor citation format instructions for clarity and consistency 2025-09-21 15:51:31 +08:00
yangdx
8f0fb3c9eb Include user query in prompt returns 2025-09-21 15:24:20 +08:00
yangdx
6eb37e270a Refactor query handling and improve RAG response prompts
- Move user_prompt to query concatenation
- Remove DEFAULT_USER_PROMPT constant
- Enhance prompt clarity and structure
- Standardize citation formatting
- Improve step-by-step instructions
2025-09-21 15:16:24 +08:00
yangdx
523028f8d0 Remove deprecated truncated fields from token truncation return
• Drop truncated_entities field
• Drop truncated_relations field
2025-09-21 11:00:48 +08:00
yangdx
938eced0de Udate webui assets 2025-09-21 01:19:28 +08:00
yangdx
7c463f0fb5 Change entity type formatting from title case to lowercase without spaces 2025-09-21 00:56:56 +08:00
yangdx
c2985429ba Update webui assets 2025-09-21 00:32:06 +08:00
yangdx
040b0c8620 Fix Neo4J index creation to check state instead of analyzer
• Check index state not analyzer
• Skip if index is ONLINE
• Recreate if state not ONLINE
• Simplify recreation logic
2025-09-20 23:51:50 +08:00
yangdx
9e0dabc34d Update webui assets 2025-09-20 23:31:00 +08:00
yangdx
5e0ef7166e Update webui assets and bump api version to 0226 2025-09-20 22:47:45 +08:00
yangdx
5da1df3b19 Fix linting 2025-09-20 15:30:27 +08:00
yangdx
8e2a1fa59e Enhance Neo4j fulltext search with Chinese language support
• Add CJK analyzer for Chinese text
• Auto-detect Chinese characters
• Recreate index if needed
• Separate Chinese/Latin search logic
• Improve fallback for Chinese queries
2025-09-20 15:19:22 +08:00
yangdx
3b502af858 Update webui assets 2025-09-20 14:36:34 +08:00
yangdx
9330ccb14e Fix graph truncation logging to correctly identify truncation cause 2025-09-20 13:33:19 +08:00
yangdx
1dd164a122 Fix graph truncation detection for depth-limited BFS
- Track unexplored neighbors at max depth
- Improve truncation flag accuracy
2025-09-20 13:12:25 +08:00
yangdx
b897eedaef Update webui assets and bump API version to 0225 2025-09-20 12:41:52 +08:00
yangdx
26c9ba4cb5 Make graph label methods required in BaseGraphStorage interface
• Remove fallback compatibility code
• Add get_popular_labels to ABC
• Add search_labels to ABC
• Enforce consistent implementation
• Clean up error handling paths
2025-09-20 12:40:36 +08:00
yangdx
3296bcb553 Add high-performance label search methods to PostgreSQL graph storage
- Add get_popular_labels() method
- Add search_labels() with fuzzy matching
- Use native SQL for better performance
- Include proper scoring and ranking
2025-09-20 12:39:53 +08:00