ragflow/rag/app
Billy Bao fa210e7c58
Feat: parsing hyperlinks in docx and pdf & Fix: default parser config of toc extraction (#10877)
### What problem does this PR solve?

Feat: parsing hyperlinks in docx and pdf #10848
Fix: default parser config of toc extraction

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2025-11-03 09:34:12 +08:00
..
__init__.py Update comments (#4569) 2025-01-21 20:52:28 +08:00
audio.py Refa: OpenAI whisper-1 (#9552) 2025-08-19 16:41:18 +08:00
book.py Feat:Vision Model Image Enhancement in Manual/Paper/Book/One chunker (#10640) 2025-10-21 09:36:27 +08:00
email.py Feat: Use data pipeline to visualize the parsing configuration of the knowledge base (#10423) 2025-10-09 12:36:19 +08:00
laws.py Fix: law parser (#10897) 2025-10-30 19:00:11 +08:00
manual.py Move token related functions to common (#10942) 2025-11-03 08:50:05 +08:00
naive.py Feat: parsing hyperlinks in docx and pdf & Fix: default parser config of toc extraction (#10877) 2025-11-03 09:34:12 +08:00
one.py Feat:Vision Model Image Enhancement in Manual/Paper/Book/One chunker (#10640) 2025-10-21 09:36:27 +08:00
paper.py Feat:Vision Model Image Enhancement in Manual/Paper/Book/One chunker (#10640) 2025-10-21 09:36:27 +08:00
picture.py Refactor: rename rmSpace to remove_redundant_spaces (#10796) 2025-10-28 09:46:32 +08:00
presentation.py Fix: PlainParser using fix in presentation (#9239) 2025-08-05 17:48:18 +08:00
qa.py Refactor: rename rmSpace to remove_redundant_spaces (#10796) 2025-10-28 09:46:32 +08:00
resume.py Refactor: rename rmSpace to remove_redundant_spaces (#10796) 2025-10-28 09:46:32 +08:00
table.py Fix: parsing excel with chartsheet & Clamp begin to a minimum of 0 to prevent negative indexing (#10819) 2025-10-28 09:40:37 +08:00
tag.py Fix typos: retrievaler -> retriever (#10372) 2025-10-10 09:17:36 +08:00