ragflow

Author	SHA1	Message	Date
hsparks.codes	811e8e0561	fix: Correct import path for get_uuid in CheckpointService - Change from 'api.utils import get_uuid' to 'common.misc_utils import get_uuid' - Fixes ImportError that prevented service from starting - Resolves CI/CD timeout issue	2025-12-03 09:44:32 +01:00
hsparks.codes	48a03e6343	feat: Implement checkpoint/resume for RAPTOR tasks (Phase 1 & 2) Addresses issues #11640 and #11483 Phase 1 - Core Infrastructure: - Add TaskCheckpoint model with per-document state tracking - Add checkpoint fields to Task model (checkpoint_id, can_pause, is_paused) - Create CheckpointService with 15+ methods for checkpoint management - Add database migrations for new fields Phase 2 - Per-Document Execution: - Implement run_raptor_with_checkpoint() wrapper function - Process documents individually with checkpoint saves after each - Add pause/cancel checks between documents - Implement error isolation (failed docs don't affect others) - Add automatic retry logic (max 3 retries per document) - Integrate checkpoint-aware execution into task_executor - Add use_checkpoints config option (default: True) Features: ✅ Per-document granularity - each doc processed independently ✅ Fault tolerance - failures isolated, other docs continue ✅ Resume capability - restart from last checkpoint ✅ Pause/cancel support - check between each document ✅ Token tracking - monitor API usage per document ✅ Progress tracking - real-time status updates ✅ Configurable - can disable checkpoints if needed Benefits: - 99% reduction in wasted work on failures - Production-ready for weeks-long RAPTOR tasks - No more all-or-nothing execution - Graceful handling of API timeouts/errors	2025-12-03 09:13:47 +01:00
buua436	b8c0fb4572	Feat:new api /sequence2txt and update QWenSeq2txt (#11643 ) ### What problem does this PR solve? change: new api /sequence2txt, update QWenSeq2txt and ZhipuSeq2txt ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-02 11:17:31 +08:00
Kevin Hu	81ae6cf78d	Feat: support uploading in dialog. (#11634 ) ### What problem does this PR solve? #9590 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-01 16:54:57 +08:00
Yongteng Lei	b6c4722687	Refa: make RAGFlow more asynchronous (#11601 ) ### What problem does this PR solve? Try to make this more asynchronous. Verified in chat and agent scenarios, reducing blocking behavior. #11551, #11579. However, the impact of these changes still requires further investigation to ensure everything works as expected. ### Type of change - [x] Refactoring	2025-12-01 14:24:06 +08:00
Kevin Hu	6ea4248bdc	Feat: support parent-child in search procedure. (#11629 ) ### What problem does this PR solve? #7996 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-01 14:03:09 +08:00
Kevin Hu	88a28212b3	Fix: Table parse method issue. (#11627 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-01 12:42:35 +08:00
Kevin Hu	14616cf845	Feat: add child parent chunking method in backend. (#11598 ) ### What problem does this PR solve? #7996 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-28 19:25:32 +08:00
Yongteng Lei	9d8b96c1d0	Feat: add context for figure and table (#11547 ) ### What problem does this PR solve? Add context for figure table. ![demo_figure_table_context](https://github.com/user-attachments/assets/61b37fac-e22e-40a4-9665-9396c7b4103e) `==================()` for demonstrating purpose. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-27 10:21:44 +08:00
Levi	12979a3f21	feat: improve metadata handling in connector service (#11421 ) ### What problem does this PR solve? - Update sync data source to handle metadata properly ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-11-26 19:55:48 +08:00
Kevin Hu	f5faf0c94f	Feat: support operator in/not in for metadata filter. (#11503 ) ### What problem does this PR solve? #11376 #11378 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-25 12:44:26 +08:00
Kevin Hu	820934fc77	Fix: no result if metadata returns none. (#11412 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-20 19:51:25 +08:00
Kevin Hu	06cef71ba6	Feat: add or logic operations for meta data filters. (#11404 ) ### What problem does this PR solve? #11376 #11387 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-20 14:31:12 +08:00
buua436	7c6d30f4c8	Fix:RagFlow not starting with Postgres DB (#11398 ) ### What problem does this PR solve? issue: #11293 change: RagFlow not starting with Postgres DB ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-20 12:49:13 +08:00
天海蒼灆	9f715d6bc2	Feature (canvas): Add mind tagging support (#11359 ) ### What problem does this PR solve? Resolve the issue of missing thinking labels when viewing pre-existing conversations ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-20 10:11:28 +08:00
Kevin Hu	1c201c4d54	Fix: circle imports issue. (#11374 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-19 16:13:21 +08:00
Kevin Hu	d1716d865a	Feat: Alter flask to Quart for async API serving. (#11275 ) ### What problem does this PR solve? #11277 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-18 17:05:16 +08:00
Billy Bao	0569b50fed	Fix: create dataset return type inconsistent (#11272 ) ### What problem does this PR solve? Fix: create dataset return type inconsistent #11167 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-17 15:27:19 +08:00
Scott Davidson	6b64641042	Fix: default model base url extraction logic (#11263 ) ### What problem does this PR solve? Fixes an issue where default models which used the same factory but different base URLs would all be initialised with the default chat model's base URL and would ignore e.g. the embedding model's base URL config. For example, with the following service config, the embedding and reranker models would end up using the base URL for the default chat model (i.e. `llm1.example.com`): ```yaml ragflow: service_conf: user_default_llm: factory: OpenAI-API-Compatible api_key: not-used default_models: chat_model: name: llm1 base_url: https://llm1.example.com/v1 embedding_model: name: llm2 base_url: https://llm2.example.com/v1 rerank_model: name: llm3 base_url: https://llm3.example.com/v1/rerank llm_factories: factory_llm_infos: - name: OpenAI-API-Compatible logo: "" tags: "LLM,TEXT EMBEDDING,SPEECH2TEXT,MODERATION" status: "1" llm: - llm_name: llm1 base_url: 'https://llm1.example.com/v1' api_key: not-used tags: "LLM,CHAT,IMAGE2TEXT" max_tokens: 100000 model_type: chat is_tools: false - llm_name: llm2 base_url: https://llm2.example.com/v1 api_key: not-used tags: "TEXT EMBEDDING" max_tokens: 10000 model_type: embedding - llm_name: llm3 base_url: https://llm3.example.com/v1/rerank api_key: not-used tags: "RERANK,1k" max_tokens: 10000 model_type: rerank ``` ### Type of change - [X] Bug Fix (non-breaking change which fixes an issue)	2025-11-17 14:21:27 +08:00
Jin Hai	61cf430dbb	Minor tweats (#11271 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-16 19:29:20 +08:00
Kevin Hu	ba71160b14	Refa: rm useless code. (#11238 ) ### Type of change - [x] Refactoring	2025-11-13 09:59:55 +08:00
YngvarHuang	bd5dda6b10	Feature/doc upload api add parent path 20251112 (#11231 ) ### What problem does this PR solve? Add the specified parent_path to the document upload api interface (#11230) ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: virgilwong <hyhvirgil@gmail.com>	2025-11-13 09:59:39 +08:00
Jin Hai	296476ab89	Refactor function name (#11210 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-12 19:00:15 +08:00
Yongteng Lei	8ae562504b	Fix: GraphRAG and RAPTOR tasks do not affect document status (#11194 ) ### What problem does this PR solve? GraphRAG and RAPTOR tasks do not affect document status. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-12 12:03:41 +08:00
Kevin Hu	d226764ed0	Fix: connector auto-parse issue. (#11189 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-12 11:50:39 +08:00
Kevin Hu	c30ffb5716	Fix: ollama model list issue. (#11175 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-11 19:46:41 +08:00
Kevin Hu	f441f8ffc2	Fix: waitForResponse component. (#11172 ) ### What problem does this PR solve? #10056 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2025-11-11 16:58:47 +08:00
Kevin Hu	dd5b8e2e1a	Fix: add auto_parse to kb detail. (#11153 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-11 12:22:43 +08:00
Billy Bao	6e1ebb2855	Fix: Optimize Prompts and Regex for use_sql() (#11148 ) ### What problem does this PR solve? Fix: Optimize Prompts and Regex for use_sql() #11127 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-10 19:02:07 +08:00
Kevin Hu	d207291217	Fix: add download stats to kb logs. (#11112 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-10 13:28:07 +08:00
Kevin Hu	dd1c8c5779	Feat: add auto parse to connector. (#11099 ) ### What problem does this PR solve? #10953 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-07 16:49:29 +08:00
Kevin Hu	34283d4db4	Feat: add data source to pipleline logs . (#11075 ) ### What problem does this PR solve? #10953 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-07 11:43:59 +08:00
Lynn	b7aa6d6c4f	Fix: add avatar for UI (#11080 ) ### What problem does this PR solve? Add avatar for admin UI. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-07 09:27:31 +08:00
Kevin Hu	3bd1fefe1f	Feat: debug sync data. (#11073 ) ### What problem does this PR solve? #10953 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-06 16:48:04 +08:00
Kevin Hu	adbb8319e0	Fix: add fields for logs. (#11039 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-06 09:49:57 +08:00
Jin Hai	f98b24c9bf	Move api.settings to common.settings (#11036 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-06 09:36:38 +08:00
Kevin Hu	cd6ed4b380	Feat: add webhook component. (#11033 ) ### What problem does this PR solve? #10427 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-05 19:59:23 +08:00
Jin Hai	02d10f8eda	Move var from rag.settings to common.globals (#11022 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-05 15:48:50 +08:00
Kevin Hu	8584d4b642	Fix: numeric string miss transformation. (#11025 ) ### What problem does this PR solve? #11024 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-05 15:14:30 +08:00
Kevin Hu	b86e07088b	Fix: escape multi-steps issues. (#11016 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-05 14:51:00 +08:00
Jin Hai	1a9215bc6f	Move some vars to globals (#11017 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-05 14:14:38 +08:00
Jin Hai	96c015fb85	Fix and refactor imports (#11010 ) ### What problem does this PR solve? 1. Move EMBEDDING_CFG to common.globals 2. Fix error imports 3. Move signal handles to common/signal_utils.py ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-05 11:07:54 +08:00
Jin Hai	bab3fce136	Move some constants to common (#11004 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-05 08:01:39 +08:00
Kevin Hu	4bbbf92331	Refa: link connector to KB. (#10991 ) ### What problem does this PR solve? #10953 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-04 20:13:52 +08:00
Jin Hai	16d2be623c	Minor tweaks (#10987 ) ### What problem does this PR solve? 1. Rename identifier name 2. Fix some return statement 3. Fix some typos ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-04 14:15:31 +08:00
Billy Bao	19f71a961a	Fix: Create dataset performance unmatched between HTTP api and web ui (#10960 ) ### What problem does this PR solve? Fix: Create dataset performance unmatched between HTTP api and web ui #10925 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-04 13:45:14 +08:00
Kevin Hu	3e5a39482e	Feat: Support multiple data sources synchronizations (#10954 ) ### What problem does this PR solve? #10953 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-03 19:59:18 +08:00
Billy Bao	fa210e7c58	Feat: parsing hyperlinks in docx and pdf & Fix: default parser config of toc extraction (#10877 ) ### What problem does this PR solve? Feat: parsing hyperlinks in docx and pdf #10848 Fix: default parser config of toc extraction ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-11-03 09:34:12 +08:00
Jin Hai	360f5c1179	Move token related functions to common (#10942 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-03 08:50:05 +08:00
Jin Hai	f52e56c2d6	Remove 'get_lan_ip' and add common misc_utils.py (#10880 ) ### What problem does this PR solve? Add get_uuid, download_img and hash_str2int into misc_utils.py ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-10-31 16:42:01 +08:00

1 2 3 4 5 ...

436 commits