ragflow/rag
ly0303521 101b8ff813
fix chunk method "Table" losing content when the Excel file has multiple sheets (#4123)

### What problem does this PR solve?
discussed in https://github.com/infiniflow/ragflow/pull/4102
- In excel_parser.py, `total` is meant to be the total number of rows in the Excel file,
but the function returned during the first iteration, which led to a wrong `to_page`.
- In table.py, when the Excel file has multiple sheets, it is divided into
multiple parts, each of size 3000. `data` may be empty for a part
because its rows were already recorded in the previous iteration.
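
The two issues above can be illustrated with a minimal sketch. It is not the code from excel_parser.py or table.py: the workbook is modeled as a plain dict of sheet name to rows, and the names `row_count_buggy`, `row_count_fixed`, and `iter_parts` are hypothetical helpers introduced only for this illustration.

```python
from typing import Dict, Iterator, List

# Hypothetical stand-in for a workbook: sheet name -> list of rows.
Workbook = Dict[str, List[list]]


def row_count_buggy(workbook: Workbook) -> int:
    """Returns inside the loop, so only the first sheet is counted."""
    total = 0
    for rows in workbook.values():
        total += len(rows)
        return total  # bug: exits during the first iteration
    return total


def row_count_fixed(workbook: Workbook) -> int:
    """Accumulates rows over every sheet before returning."""
    total = 0
    for rows in workbook.values():
        total += len(rows)
    return total


def iter_parts(rows: list, total: int, part_size: int = 3000) -> Iterator[list]:
    """Yield consecutive parts of `rows`, skipping parts that turn out empty."""
    for start in range(0, total, part_size):
        data = rows[start:start + part_size]
        if not data:
            # Nothing left for this part; its rows were already emitted.
            continue
        yield data


workbook = {"sheet1": [[1], [1]], "sheet2": [[2], [2], [2]]}
print(row_count_buggy(workbook), row_count_fixed(workbook))
print(list(iter_parts(list(range(5)), total=9, part_size=3)))
```

With two sheets of 2 and 3 rows, the buggy count sees only the first sheet, while the fixed count covers both. In `iter_parts`, a `total` larger than the number of remaining rows produces an empty slice, which is skipped instead of being processed as a part.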
### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-12-19 17:30:26 +08:00
..
app fix chunk method "Table" losing content when the Excel file has multi… (#4123) 2024-12-19 17:30:26 +08:00
llm Make fast embed and default embed mutually exclusive. (#4121) 2024-12-19 17:27:09 +08:00
nlp Refactor trie load and construct (#4083) 2024-12-18 12:52:56 +08:00
res Format file format from Windows/dos to Unix (#1949) 2024-08-15 09:17:36 +08:00
svr Cache the result from llm for graphrag and raptor (#4051) 2024-12-17 09:48:03 +08:00
utils Fixed infinity exception SCORE() / SCORE_FACTORS() requires Fusion or MATCH TEXT or MATCH TENSOR (#4110) 2024-12-19 13:49:36 +08:00
__init__.py Add back beartype (#3967) 2024-12-10 18:43:43 +08:00
benchmark.py Refactor embedding batch_size (#3825) 2024-12-03 16:22:39 +08:00
raptor.py Cache the result from llm for graphrag and raptor (#4051) 2024-12-17 09:48:03 +08:00
settings.py Update progress info and start welcome info (#3768) 2024-11-30 18:48:06 +08:00