Commit graph

3154 commits

Author SHA1 Message Date
yangdx
077d9be5d7 Add Deepseek Style Chain of Thought (CoT) Support for OpenAI Compatible LLM providers
- Add enable_cot parameter to all LLM APIs
- Implement CoT for OpenAI with <think> tags
- Log warnings for unsupported providers
- Enable CoT in query operations
- Handle streaming and non-streaming CoT
2025-09-09 22:34:36 +08:00
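The streaming case in the commit above can be pictured with a small sketch. It is not the repository's implementation: the function name `stream_with_cot` and the `(reasoning, content)` delta shape are assumptions made for illustration; only the `enable_cot` flag and the `<think>` tags come from the commit message.

```python
def stream_with_cot(deltas, enable_cot=True):
    """Yield text chunks, wrapping reasoning deltas in <think>...</think>.

    `deltas` is a hypothetical iterable of (reasoning, content) pairs in
    which either element may be None or empty.
    """
    in_think = False
    for reasoning, content in deltas:
        if reasoning and enable_cot:
            if not in_think:
                # First reasoning chunk: open the tag once.
                yield "<think>"
                in_think = True
            yield reasoning
        if content:
            if in_think:
                # Reasoning is over as soon as answer text arrives.
                yield "</think>"
                in_think = False
            yield content
    if in_think:
        # Stream ended while still reasoning: close the tag.
        yield "</think>"
```

With `enable_cot=False` the reasoning deltas are dropped and only the answer text is emitted.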
yangdx
09abb656b8 Improve log message formatting for better readability 2025-09-09 17:41:09 +08:00
Daniel.y
f064b950fc
Merge pull request #2027 from Matt23-star/main
Refactor: PostgreSQL
2025-09-09 15:12:35 +08:00
Daniel.y
92058187f7
Merge pull request #2082 from danielaskdd/prompt-optimization
Prompt Optimization: remove angle brackets from entity and relationship output formats
2025-09-09 12:11:39 +08:00
yangdx
564850aa9d Update webui assets and bump api version to 0218 2025-09-09 11:41:02 +08:00
yangdx
06db511f3b Remove angle brackets from entity and relationship output formats 2025-09-09 09:21:23 +08:00
yangdx
6157318408 Update webui assets and bump api to 0217 2025-09-08 23:37:34 +08:00
yangdx
3059089e7d Fix logging order in pipeline history trimming 2025-09-08 23:00:44 +08:00
yangdx
8d53ef7ff0 Increase default Gunicorn worker timeout from 210 to 300 seconds 2025-09-08 20:03:21 +08:00
yangdx
56df4ac4b1 Update webui assets and bump api version to 0216 2025-09-08 19:36:49 +08:00
yangdx
de4fe8bc7d Improve uvicorn workers warning message clarity 2025-09-08 16:05:51 +08:00
yangdx
9437df83cc Add memory management for pipeline history messages
- Trim history at 10k messages
- Keep latest 5k messages
- Prevent memory growth
- Add logging for trim events
2025-09-08 15:56:35 +08:00
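A minimal sketch of the trimming rule described in the commit above, assuming the history is a plain Python list. The helper name `trim_history` and the choice to trim only once the length exceeds the threshold are assumptions; the 10k and 5k figures come from the commit message.

```python
import logging

logger = logging.getLogger(__name__)


def trim_history(messages, max_len=10_000, keep=5_000):
    """Trim `messages` in place once it grows past `max_len`.

    Keeps the newest `keep` entries and returns how many were dropped.
    """
    if len(messages) <= max_len:
        return 0
    removed = len(messages) - keep
    # Delete from the front so the latest messages survive.
    del messages[:removed]
    logger.info("Trimmed %d history messages, kept latest %d", removed, keep)
    return removed
```

Trimming down to half the threshold rather than to the threshold itself means the trim runs once per 5k new messages instead of on every append.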
yangdx
d218f15a62 Refactor entity extraction with system prompts and output limits
- Add system/user prompt separation
- Set max tokens to fix endless output
- Improve extraction error logging
- Update cache type from extract to summary
2025-09-08 15:20:45 +08:00
yangdx
451f488f72 Add debug logging for client configs in OpenAI LLM function 2025-09-07 02:29:37 +08:00
yangdx
c87eb2cfcf Increase timeout buffers for async function calls
• Extend execution timeout buffer to 150s
• Extend task duration buffer to 180s
• Account for low-level retry delays
• Improve health check phase handling
• Reduce timeout-related failures
2025-09-06 23:56:24 +08:00
yangdx
725db3b240 Fix linting in the prompt 2025-09-06 11:16:49 +08:00
yangdx
219a08b7c9 Restore completion_delimiter 2025-09-06 11:13:37 +08:00
yangdx
528d04a0e4 Update prompt template delimiters 2025-09-06 10:35:06 +08:00
yangdx
5446815008 Refactor entity extraction prompts and remove completion delimiter.
- Remove `completion_delimiter` from prompts
- Update input/output format markers
2025-09-06 09:13:51 +08:00
yangdx
0a62f02e84 Improve edge logging format and exception prefixes 2025-09-06 08:35:52 +08:00
yangdx
387d817fc2 Remove trailing colons from queue names in function wrappers 2025-09-06 00:53:05 +08:00
yangdx
e4dd83ee3d Change tooltips font to a smaller one 2025-09-05 22:14:28 +08:00
yangdx
c77e045fb3 Update webui assets and bump api version to 0215 2025-09-05 22:05:52 +08:00
yangdx
6be462511f Add error prefixing for better debugging context in async operations
* Add create_prefixed_exception utility
* Prefix entity processing errors
* Prefix relationship processing errors
* Prefix chunk extraction progress info
* Maintain original exception chains
2025-09-05 21:28:00 +08:00
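The idea of prefixing an exception while keeping the original chain can be sketched as follows. The commit names a `create_prefixed_exception` utility, but its signature is not shown in this log, so the helper below uses a different, hypothetical name and an assumed signature.

```python
def prefix_exception(original, prefix):
    """Return a new exception whose message is '<prefix>: <original message>'.

    Hypothetical helper: tries to keep the original exception type and falls
    back to RuntimeError when that type cannot be built from one string.
    """
    message = f"{prefix}: {original}"
    try:
        return type(original)(message)
    except Exception:
        return RuntimeError(message)


def process_chunk(chunk_id):
    try:
        raise ValueError("bad tuple")  # stand-in for a real processing error
    except ValueError as exc:
        # 'from exc' keeps the original exception reachable via __cause__.
        raise prefix_exception(exc, f"chunk {chunk_id}") from exc
```

Raising with `from exc` is what preserves the chain: the traceback shows both the prefixed error and the original one.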
yangdx
385668dec5 Fix malformed tuple delimiters in extraction result processing 2025-09-05 17:14:42 +08:00
yangdx
a1df76a4ea Optimize LLM/embedding config caching to reduce repeated parsing overhead
• Add LLMConfigCache class for smart caching
• Pre-process OpenAI/Ollama configurations
• Create optimized function factories
• Reduce redundant option parsing calls
2025-09-05 16:36:08 +08:00
yangdx
17d665c9f3 Limit history messages to latest 1000 entries with truncation indicator
• Limit history to 1000 latest messages
• Add truncation message when needed
• Show count of truncated messages
• Update API documentation
• Prevent memory issues with large logs
2025-09-05 12:31:36 +08:00
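The behaviour listed in the commit above might look roughly like this. The function name `latest_history` and the wording of the truncation notice are assumptions; only the 1000-entry limit and the presence of a count come from the commit message.

```python
def latest_history(messages, limit=1000):
    """Return at most `limit` newest messages, preceded by a truncation notice."""
    if len(messages) <= limit:
        return list(messages)
    truncated = len(messages) - limit
    # Hypothetical notice format; it reports how many entries were omitted.
    notice = f"[Truncated: {truncated} earlier messages omitted]"
    return [notice] + list(messages[-limit:])
```

The input list is left untouched, so this only bounds what is returned to a caller, not what is stored.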
yangdx
be3f0ebbe5 Simplify entity extraction prompt instructions and remove delimiter 2025-09-04 23:42:11 +08:00
yangdx
3f56c6820c Reorder language and completion delimiter instructions in prompt 2025-09-04 23:05:16 +08:00
yangdx
2c551cb5db Add support for Chinese book title marks in normalize_extracted_info 2025-09-04 18:51:57 +08:00
yangdx
f19cce16be Fix incorrect variable name in NetworkXStorage file path
- Fix working_dir -> workspace_dir typo
- Correct GraphML file path generation
2025-09-04 18:31:53 +08:00
yangdx
50adf64fab Fix linting in prompt 2025-09-04 15:22:36 +08:00
yangdx
94114df995 Improve prompt clarity and structure 2025-09-04 14:53:27 +08:00
yangdx
83b54975a2 fix: resolve "Task exception was never retrieved" warnings in async task handling
- Handle multiple simultaneous exceptions correctly
- Maintain fast-fail behavior while ensuring proper exception cleanup to
  prevent asyncio warnings
2025-09-04 12:40:41 +08:00
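One common way to get fail-fast behaviour without the "Task exception was never retrieved" warning is sketched below. It is a generic asyncio pattern, not necessarily the approach taken in this commit, and `run_fail_fast` is a hypothetical name.

```python
import asyncio


async def run_fail_fast(coros):
    """Run coroutines concurrently; stop at the first failure and re-raise it."""
    tasks = [asyncio.ensure_future(c) for c in coros]
    if not tasks:
        return []
    # Return as soon as any task raises (or all finish).
    _, pending = await asyncio.wait(tasks, return_when=asyncio.FIRST_EXCEPTION)
    for task in pending:
        task.cancel()
    # Awaiting every task with return_exceptions=True retrieves all
    # exceptions, including ones raised simultaneously, so none is left
    # unretrieved when the tasks are garbage collected.
    results = await asyncio.gather(*tasks, return_exceptions=True)
    for result in results:
        # CancelledError is a BaseException, so cancelled tasks are skipped.
        if isinstance(result, Exception):
            raise result
    return results
```

If two tasks fail in the same loop iteration, both exceptions are consumed by `gather` and the first one in input order is raised.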
yangdx
c903b14849 Bump API version to 0214 and update env.example 2025-09-04 12:04:50 +08:00
yangdx
de972f6222 Rename method for clarity and improve code readability
- Rename _process_entity_relation_graph to _process_extract_entities
2025-09-04 11:48:31 +08:00
yangdx
9b516a8a53 Hot Fix: Preserve whitespace chars in text sanitization
• Keep \t, \n, \r in control char removal
2025-09-04 10:58:29 +08:00
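A sketch of control-character removal that leaves `\t`, `\n`, and `\r` alone, as the hot fix describes. The function name and the exact character range are assumptions; the repository's sanitizer may cover a different set.

```python
import re

# C0 control characters and DEL, excluding tab (\x09), LF (\x0a), CR (\x0d).
_CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\x7f]")


def strip_control_chars(text):
    """Remove control characters but preserve tabs and line breaks."""
    return _CONTROL_CHARS.sub("", text)
```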
yangdx
7b35657e32 Refactor entity extraction prompt formatting and clarity
- Remove quotes from tuple format strings
- Simplify relationship extraction text
- Add relationships to quality guidelines
2025-09-04 10:47:57 +08:00
yangdx
a25ce7f078 Fix linting 2025-09-03 21:58:30 +08:00
yangdx
7ef2f0dff6 Add VDB error handling with retries for data consistency
- Add safe_vdb_operation_with_exception util
- Wrap VDB ops in entity/relationship code
- Ensure exceptions propagate on failure
- Add retry logic with configurable delays
2025-09-03 21:15:09 +08:00
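The retry-then-propagate behaviour listed above can be illustrated with a short sketch. The commit names a `safe_vdb_operation_with_exception` utility whose signature is not shown here, so the helper below is a hypothetical stand-in with assumed parameters.

```python
import asyncio


async def run_with_retries(operation, description, max_retries=3, retry_delay=0.2):
    """Await `operation()` up to `max_retries` times, then raise.

    `operation` is a zero-argument callable returning an awaitable, for
    example a lambda wrapping a vector-database upsert.
    """
    for attempt in range(1, max_retries + 1):
        try:
            return await operation()
        except Exception as exc:
            if attempt == max_retries:
                # Propagate instead of swallowing, so the caller can abort
                # and avoid leaving graph and vector data inconsistent.
                raise RuntimeError(
                    f"{description} failed after {max_retries} attempts: {exc}"
                ) from exc
            await asyncio.sleep(retry_delay)
```

The important property is the final `raise`: a failed write is reported to the caller rather than silently ignored.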
yangdx
0b07c022d6 Update webui assets and bump api version to 0213 2025-09-03 12:51:08 +08:00
yangdx
78abb397bf Reorder entity types and add Document type to extraction 2025-09-03 12:44:40 +08:00
yangdx
95c08cc7dc Improve entity extraction prompt clarity by replacing pronouns with specific nouns 2025-09-03 12:35:52 +08:00
yangdx
c86f863fa4 feat: optimize entity extraction for smaller LLMs
Simplify entity relationship extraction process to improve compatibility
and performance with smaller, less capable language models.

Changes:
- Remove iterative gleaning loop with LLM-based continuation decisions
- Simplify to single gleaning pass when entity_extract_max_gleaning > 0
- Streamline entity extraction prompts with clearer instructions
- Add explicit completion delimiter signals in all examples
2025-09-03 10:33:01 +08:00
yangdx
9d81cd724a Fix typo: change "Equiment" to "Equipment" in entity types 2025-09-02 03:19:31 +08:00
yangdx
476b64c9d4 Update webui assets 2025-09-02 03:03:19 +08:00
yangdx
4e37ff5f2f Bump API version to 0212 2025-09-02 03:02:39 +08:00
yangdx
5b2deccbef Improve text normalization and add entity type capitalization
- Capitalize entity types with .title()
- Add non-breaking space handling
- Add narrow non-breaking space regex
2025-09-02 02:51:41 +08:00
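The normalization steps in the commit above might be sketched like this. Both function names are hypothetical, and replacing the special spaces with a regular space (rather than removing them) is an assumption.

```python
import re

# U+00A0 is the non-breaking space, U+202F the narrow non-breaking space.
_SPECIAL_SPACES = re.compile("[\u00a0\u202f]")


def normalize_text(text):
    """Replace non-breaking spaces with regular spaces and trim the ends."""
    return _SPECIAL_SPACES.sub(" ", text).strip()


def normalize_entity_type(entity_type):
    """Normalize spacing, then capitalize each word with str.title()."""
    return normalize_text(entity_type).title()
```

Note that `str.title()` also lowercases the rest of each word, so an all-caps type such as `NASA` would become `Nasa`.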
yangdx
29f0ecc88c Refactor entity extraction prompts and remove completion delimiter
• Update prompt structure and wording
• Remove deprecated completion delimiter
• Add quality guidelines section
• Improve instruction clarity
• Enhance continue extraction prompt
2025-09-02 02:14:14 +08:00
yangdx
3f8a9abe7e Refactor extraction result processing to reduce code duplication
• Extract shared processing logic
• Add delimiter pattern fixes
• Improve bracket standardization
2025-09-02 01:22:29 +08:00