ragflow/rag/flow
aidan 420c97199a
Feat: Add TCADP parser for PPTX and spreadsheet document types. (#11041)
### What problem does this PR solve?

- Added TCADP Parser configuration fields to PDF, PPT, and spreadsheet
parsing forms
- Implemented support for setting table result type (Markdown/HTML) and
Markdown image response type (URL/Text)
- Updated TCADP Parser to handle return format settings from
configuration or parameters
- Enhanced frontend to dynamically show TCADP options based on selected
parsing method
- Modified backend to pass format parameters when calling TCADP API
- Optimized form default value logic for TCADP configuration items
- Updated multilingual resource files for new configuration options
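
A minimal sketch of how the return-format settings described above might be resolved. This is not the code from the PR: the names `TcadpFormatOptions`, `resolve_format_options`, `table_result_type`, and `markdown_image_response_type` are hypothetical. The defaults (Markdown tables, URL images) and the precedence (explicit parameter over configuration over default) are also assumptions.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical names throughout; the real TCADP parser fields may differ.
TABLE_RESULT_TYPES = {"markdown", "html"}
IMAGE_RESPONSE_TYPES = {"url", "text"}


@dataclass
class TcadpFormatOptions:
    # Assumed defaults, not confirmed by the PR description.
    table_result_type: str = "markdown"
    markdown_image_response_type: str = "url"


def resolve_format_options(
    config: Optional[dict] = None,
    table_result_type: Optional[str] = None,
    markdown_image_response_type: Optional[str] = None,
) -> TcadpFormatOptions:
    """Pick each setting from the explicit parameter, then config, then default."""
    config = config or {}
    defaults = TcadpFormatOptions()

    table = (
        table_result_type
        or config.get("table_result_type")
        or defaults.table_result_type
    ).lower()
    image = (
        markdown_image_response_type
        or config.get("markdown_image_response_type")
        or defaults.markdown_image_response_type
    ).lower()

    # Reject values outside the two options named in the PR description.
    if table not in TABLE_RESULT_TYPES:
        raise ValueError(f"unsupported table result type: {table!r}")
    if image not in IMAGE_RESPONSE_TYPES:
        raise ValueError(f"unsupported image response type: {image!r}")

    return TcadpFormatOptions(table, image)
```

Under these assumptions, `resolve_format_options({"table_result_type": "html"})` would yield HTML tables with URL image references, and an explicit keyword argument would override the configured value.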

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-11-20 10:08:42 +08:00

| Name | Last commit | Date |
| --- | --- | --- |
| extractor | Feat: Use data pipeline to visualize the parsing configuration of the knowledge base (#10423) | 2025-10-09 12:36:19 +08:00 |
| hierarchical_merger | Fix: missing file formats in hierarchical_manager (#11129) | 2025-11-10 13:27:22 +08:00 |
| parser | Feat: Add TCADP parser for PPTX and spreadsheet document types. (#11041) | 2025-11-20 10:08:42 +08:00 |
| splitter | Move api.settings to common.settings (#11036) | 2025-11-06 09:36:38 +08:00 |
| tests | Move api.settings to common.settings (#11036) | 2025-11-06 09:36:38 +08:00 |
| tokenizer | Move api.settings to common.settings (#11036) | 2025-11-06 09:36:38 +08:00 |
| `__init__.py` | Feat: refine dataflow and initialize dataflow app (#9952) | 2025-09-05 18:50:46 +08:00 |
| base.py | Move 'timeout' to common folder (#10983) | 2025-11-04 11:51:12 +08:00 |
| file.py | Feat: Use data pipeline to visualize the parsing configuration of the knowledge base (#10423) | 2025-10-09 12:36:19 +08:00 |
| pipeline.py | Feat: GraphRAG handle cancel gracefully (#11061) | 2025-11-06 16:12:20 +08:00 |