ragflow/deepdoc/parser
Jason Lee ebdd71ce68
fix: When parsing the bold content in PDF, the result is duplicated. (#1729)
### What problem does this PR solve?

_fix: When parsing the bold content in PDF, the result is duplicated._

the detail: [When using OCR to recognize Chinese titles, the structure
appears to be
duplicated](https://github.com/infiniflow/ragflow/issues/1718)

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-07-29 09:43:05 +08:00
..
resume Update readme and add license (#1018) 2024-06-01 16:24:10 +08:00
__init__.py Support table for markdown file in general parser (#1278) 2024-06-27 14:38:35 +08:00
docx_parser.py fix bug of ragflowdocxpparser (#1642) 2024-07-23 09:25:32 +08:00
excel_parser.py Update readme and add license (#1018) 2024-06-01 16:24:10 +08:00
html_parser.py fix create dialog bug (#982) 2024-05-30 09:25:05 +08:00
json_parser.py feat: support json file (#1217) 2024-06-21 10:42:29 +08:00
markdown_parser.py Support table for markdown file in general parser (#1278) 2024-06-27 14:38:35 +08:00
pdf_parser.py fix: When parsing the bold content in PDF, the result is duplicated. (#1729) 2024-07-29 09:43:05 +08:00
ppt_parser.py fix generate error (#1590) 2024-07-18 14:33:30 +08:00