ragflow/deepdoc/parser
Stephen Hu 79482ff672
Refa: Improve ppt_parser better handle list (#6162)
### What problem does this PR solve?
This pull request (PR) incorporates codes for parsing PPTX files, aiming
to more precisely depict text in list formats (hint list by .).

### Type of change

- [ ] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [x] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
2025-03-17 17:02:39 +08:00
..
resume Fix:when start with source code not in docker env report 'UnicodeDec… (#5802) 2025-03-10 11:22:06 +08:00
__init__.py Update comments (#4569) 2025-01-21 20:52:28 +08:00
docx_parser.py Update comments (#4569) 2025-01-21 20:52:28 +08:00
excel_parser.py Feat: add CSV file parsing support (#5989) 2025-03-12 19:20:50 +08:00
html_parser.py Update comments (#4569) 2025-01-21 20:52:28 +08:00
json_parser.py Update comments (#4569) 2025-01-21 20:52:28 +08:00
markdown_parser.py Feat:Optimize the table extraction logic in the Markdown parser: (#5663) 2025-03-07 17:02:35 +08:00
pdf_parser.py Refa: PARALLEL_DEVICES is a static parameter. (#6168) 2025-03-17 16:49:54 +08:00
ppt_parser.py Refa: Improve ppt_parser better handle list (#6162) 2025-03-17 17:02:39 +08:00
txt_parser.py Fix: delimiter issue. (#5720) 2025-03-06 17:51:22 +08:00
utils.py Update comments (#4569) 2025-01-21 20:52:28 +08:00