ragflow/deepdoc/parser
ly0303521 101b8ff813
fix chunk method "Table" losing content when the Excel file has multiple sheets (#4123)

### What problem does this PR solve?
Discussed in https://github.com/infiniflow/ragflow/pull/4102
- In excel_parser.py, `total` is meant to be the total number of rows in
the Excel file, but the function returns during the first iteration,
which leads to a wrong `to_page`.
- In table.py, when the Excel file has multiple sheets, it is divided
into multiple parts, each of size 3000. `data` may be empty for a part,
because its rows were already recorded in the previous iteration.
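
The two issues above can be illustrated with a minimal, self-contained sketch. It does not reproduce the actual code in excel_parser.py or table.py: the workbook is modeled as a plain dict of sheet name to rows, and the names `total_rows_buggy`, `total_rows_fixed`, and `rows_in_window` are hypothetical, chosen only for this illustration.

```python
from typing import Dict, List

Row = List[object]
# Hypothetical stand-in for a workbook: sheet name -> list of rows.
Workbook = Dict[str, List[Row]]


def total_rows_buggy(workbook: Workbook) -> int:
    """Count rows, but return inside the loop (the kind of bug described above)."""
    total = 0
    for rows in workbook.values():
        total += len(rows)
        return total  # exits during the first iteration: only one sheet is counted
    return total


def total_rows_fixed(workbook: Workbook) -> int:
    """Count rows across all sheets, returning only after the loop ends."""
    total = 0
    for rows in workbook.values():
        total += len(rows)
    return total


def rows_in_window(workbook: Workbook, start: int, end: int) -> Dict[str, List[Row]]:
    """Collect, per sheet, the rows whose global index falls in [start, end).

    Sheets that contribute nothing to this window are skipped, so an empty
    `data` never reaches the downstream chunking step.
    """
    out: Dict[str, List[Row]] = {}
    offset = 0  # global index of the current sheet's first row
    for name, rows in workbook.items():
        lo = max(start - offset, 0)
        hi = min(end - offset, len(rows))
        data = rows[lo:hi] if lo < hi else []
        offset += len(rows)
        if not data:
            continue  # already consumed by an earlier window, or not reached yet
        out[name] = data
    return out
```

With a two-sheet workbook, the buggy count stops after the first sheet, so any page range derived from it is too small. The window helper shows why a part can legitimately have no rows for a given sheet and must skip it rather than process empty `data`.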
### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-12-19 17:30:26 +08:00
resume Fix errors detected by Ruff (#3918) 2024-12-08 14:21:12 +08:00
__init__.py Fix errors detected by Ruff (#3918) 2024-12-08 14:21:12 +08:00
docx_parser.py Edit chunk shall update instead of insert it (#3709) 2024-11-28 13:00:38 +08:00
excel_parser.py fix chunk method "Table" losing content when the Excel file has multi… (#4123) 2024-12-19 17:30:26 +08:00
html_parser.py Fix errors detected by Ruff (#3918) 2024-12-08 14:21:12 +08:00
json_parser.py Fix json file parse (#4004) 2024-12-12 20:34:46 +08:00
markdown_parser.py Support table for markdown file in general parser (#1278) 2024-06-27 14:38:35 +08:00
pdf_parser.py Added static check at PR CI (#3921) 2024-12-08 21:23:51 +08:00
ppt_parser.py Format file format from Windows/dos to Unix (#1949) 2024-08-15 09:17:36 +08:00
txt_parser.py Fix errors detected by Ruff (#3918) 2024-12-08 14:21:12 +08:00
utils.py rename get_txt to get_text (#2649) 2024-09-29 12:47:09 +08:00