Commit graph

3283 commits

Author SHA1 Message Date
yangdx
faa65d39c3 Update webui assets 2025-09-23 22:28:54 +08:00
yangdx
9dd1790b5c Add "Creature" entity type and reorganize type mappings
- Add Creature to default entity types
- Map animals/beings to creature type
2025-09-23 21:58:33 +08:00
yangdx
2adb8efdc7 Add duplicate document detection and skip processed files in scanning
- Add get_doc_by_file_path to all storages
- Skip processed files in scan operation
- Check duplicates in upload endpoints
- Check duplicates in text insert APIs
- Return status info in duplicate responses
2025-09-23 17:30:54 +08:00
yangdx
6b953fa53d Remove auto-scan-at-startup feature and related documentation
• Remove --auto-scan-at-startup arg
• Delete auto scan docs sections
• Remove startup scanning logic
2025-09-23 16:24:53 +08:00
yangdx
fc15e9f142 Update webui assets 2025-09-23 13:25:11 +08:00
yangdx
8093bfb5a3 Update webui assets 2025-09-23 13:12:00 +08:00
yangdx
367f3df038 Fix log message 2025-09-23 11:25:55 +08:00
yangdx
a4442a8613 Optimize log message 2025-09-23 11:22:14 +08:00
yangdx
86186c0c85 Update log message 2025-09-23 11:08:33 +08:00
yangdx
55774bf394 Mark get_all_labels method as deprecated in BaseGraphStorage 2025-09-22 18:24:10 +08:00
yangdx
6b3a341977 Increase default PostgreSQL max connections from 20 to 50 2025-09-22 18:11:28 +08:00
yangdx
20802b1ee6 Update webui assets 2025-09-22 15:16:23 +08:00
yangdx
0268768ecd Update webui assets and bump API version to 0227 2025-09-22 13:03:27 +08:00
yangdx
5fa92cbf99 Improve citation quality and reduce reference limits in prompts
- Reduce max citations from 8 to 6
- Require direct fact referencing
- Clarify relevance prioritization
2025-09-22 10:53:03 +08:00
yangdx
b13209e241 Update webui assets 2025-09-22 03:04:06 +08:00
yangdx
0fcb5af333 Update webui assets and bump API version to 0227 2025-09-22 02:02:42 +08:00
yangdx
8826d2f892 Optimize prompt instruction for citation format 2025-09-22 01:04:57 +08:00
yangdx
cff6029508 Ensure COT tags are properly closed in all stream termination scenarios
- Add COT closure after stream completion
- Handle COT in exception scenarios
- Add final safety check in finally block
- Prevent unclosed thinking tags
- Log COT closure failures
2025-09-22 00:09:27 +08:00
yangdx
6e2eab5c23 Add ID fields to entities, relations, and chunks in raw data query results 2025-09-21 23:31:35 +08:00
yangdx
2f06f851c3 Enhance citation format with merged references and clearer guidelines
- Increase max references from 5 to 8
- Merge citations by file_path
- Remove inline citations from body
- Add reference section examples
- Update citation prefixes (KG→EN, RE)
2025-09-21 22:48:48 +08:00
yangdx
18e886d7e9 Improve context item identification with meaningful IDs
- Add EN prefix to entity IDs
- Add RE prefix to relation IDs
- Add DC prefix to chunk IDs
- Enhance traceability across contexts
2025-09-21 20:19:14 +08:00
yangdx
f88c2fbdff Refactor citation format instructions for clarity and consistency 2025-09-21 15:51:31 +08:00
yangdx
8f0fb3c9eb Include user query in prompt returns 2025-09-21 15:24:20 +08:00
yangdx
6eb37e270a Refactor query handling and improve RAG response prompts
- Move user_prompt to query concatenation
- Remove DEFAULT_USER_PROMPT constant
- Enhance prompt clarity and structure
- Standardize citation formatting
- Improve step-by-step instructions
2025-09-21 15:16:24 +08:00
yangdx
523028f8d0 Remove deprecated truncated fields from token truncation return
• Drop truncated_entities field
• Drop truncated_relations field
2025-09-21 11:00:48 +08:00
yangdx
938eced0de Update webui assets 2025-09-21 01:19:28 +08:00
yangdx
7c463f0fb5 Change entity type formatting from title case to lowercase without spaces 2025-09-21 00:56:56 +08:00
yangdx
c2985429ba Update webui assets 2025-09-21 00:32:06 +08:00
yangdx
040b0c8620 Fix Neo4j index creation to check state instead of analyzer
• Check index state not analyzer
• Skip if index is ONLINE
• Recreate if state not ONLINE
• Simplify recreation logic
2025-09-20 23:51:50 +08:00
yangdx
9e0dabc34d Update webui assets 2025-09-20 23:31:00 +08:00
yangdx
5e0ef7166e Update webui assets and bump API version to 0226 2025-09-20 22:47:45 +08:00
yangdx
5da1df3b19 Fix linting 2025-09-20 15:30:27 +08:00
yangdx
8e2a1fa59e Enhance Neo4j fulltext search with Chinese language support
• Add CJK analyzer for Chinese text
• Auto-detect Chinese characters
• Recreate index if needed
• Separate Chinese/Latin search logic
• Improve fallback for Chinese queries
2025-09-20 15:19:22 +08:00
yangdx
3b502af858 Update webui assets 2025-09-20 14:36:34 +08:00
yangdx
9330ccb14e Fix graph truncation logging to correctly identify truncation cause 2025-09-20 13:33:19 +08:00
yangdx
1dd164a122 Fix graph truncation detection for depth-limited BFS
- Track unexplored neighbors at max depth
- Improve truncation flag accuracy
2025-09-20 13:12:25 +08:00
yangdx
b897eedaef Update webui assets and bump API version to 0225 2025-09-20 12:41:52 +08:00
yangdx
26c9ba4cb5 Make graph label methods required in BaseGraphStorage interface
• Remove fallback compatibility code
• Add get_popular_labels to ABC
• Add search_labels to ABC
• Enforce consistent implementation
• Clean up error handling paths
2025-09-20 12:40:36 +08:00
yangdx
3296bcb553 Add high-performance label search methods to PostgreSQL graph storage
- Add get_popular_labels() method
- Add search_labels() with fuzzy matching
- Use native SQL for better performance
- Include proper scoring and ranking
2025-09-20 12:39:53 +08:00
yangdx
6f85bd6b19 Add workspace-aware MongoDB indexing and Atlas Search support
• Add workspace attribute to storage classes
• Use workspace-specific index names
• Implement Atlas Search with fallbacks
• Add entity search and popular labels
• Improve index migration strategy
2025-09-20 12:38:41 +08:00
yangdx
223397a247 Add label search and popularity methods to MemgraphStorage
• Get popular labels by node degree
• Search labels with fuzzy matching
• Sort by relevance and connection count
2025-09-20 12:38:04 +08:00
yangdx
e14cee69a3 Fix Neo4j typo and add fulltext search with performance optimizations
- Fix NEO4J_DATABASE typo in env.example
- Add fulltext index for entity searches
- Implement get_popular_labels method
- Add search_labels with fuzzy matching
- Simplify B-Tree index creation logic
2025-09-20 12:37:13 +08:00
yangdx
9db8f2fce5 feat: Add popular labels and search APIs with history management
- Add popular/search label endpoints
- Implement SearchHistoryManager utility
- Replace client-side with server search
- Add graph data version tracking
- Update UI for better label discovery
2025-09-20 02:03:47 +08:00
yangdx
361ea5b069 Update webui assets 2025-09-19 15:17:27 +08:00
yangdx
89a4471ae1 Bump core version to v1.4.9 2025-09-17 02:57:28 +08:00
yangdx
77569ddea2 Add chunk key to entity extraction logging output 2025-09-17 02:21:11 +08:00
yangdx
fdf8b176ad Update webui assets 2025-09-17 02:05:26 +08:00
yangdx
dac156ac8e Update webui assets 2025-09-17 01:53:26 +08:00
yangdx
983fe31af5 Bump API version and improve tooltip text wrapping in DocumentManager
- Update API version to 0224
- Add word-break: break-all to tooltip
- Improve pre tag text wrapping
- Enhance tooltip readability
2025-09-17 01:47:40 +08:00
yangdx
8f6287e27e Add path traversal security validation for file deletion operations
• Add validate_file_path_security function
• Prevent path traversal attacks
• Validate file paths before deletion
• Check both input and enqueued dirs
• Log security violations
2025-09-17 01:12:44 +08:00
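
Note on 8f6287e27e: the commit message names a validate_file_path_security function but does not show its body. The sketch below is only an illustration of the general path traversal check such a function might perform. The helper name is_path_inside and its parameters are hypothetical and are not taken from the repository.

```python
from pathlib import Path


def is_path_inside(base_dir: str, file_name: str) -> bool:
    """Return True only if file_name resolves to a location strictly inside base_dir.

    Hypothetical helper for illustration; not the repository's implementation.
    """
    base = Path(base_dir).resolve()
    # Joining and then resolving collapses ".." segments and follows symlinks.
    # An absolute file_name replaces base entirely, so it is caught below too.
    candidate = (base / file_name).resolve()
    try:
        candidate.relative_to(base)
    except ValueError:
        # The resolved path escapes base_dir.
        return False
    # Refuse the directory itself so a delete can never target the whole folder.
    return candidate != base
```

A caller would run this check against each directory it is allowed to delete from (the commit mentions both the input and the enqueued directories) and refuse the operation, logging the attempt, when the check fails.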