ragflow/rag
Tuan Le 6b1221d2f6
Fix parser_config access for layout_recognize in presentation.py (#8492)
### What problem does this PR solve?
This PR addresses an issue in the presentation parser where the
`layout_recognize` configuration was incorrectly retrieved from
`kwargs.get("layout_recognize", "DeepDOC")`. Instead, it should be
sourced from the `parser_config` parameter, specifically
`parser_config.get("layout_recognize", "DeepDOC")`.

Because of this mismatch, the parser could fall back to the default "DeepDOC"
layout recognizer and ignore any alternative recognition method specified in
the parser configuration. As a result, PDF documents might be parsed with the
wrong recognition engine.

The fix ensures the presentation parser consistently uses the
`layout_recognize` setting from `parser_config`, aligning with the
configuration access patterns used elsewhere in the codebase.
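
The difference between the two lookups can be sketched as follows. This is a minimal illustration, not the actual code in `presentation.py`: the function names `pick_recognizer_before` and `pick_recognizer_after` are hypothetical, and `"Plain Text"` is only an example of a non-default value. It assumes the setting is passed inside `parser_config` and not as a top-level keyword argument.

```python
def pick_recognizer_before(parser_config=None, **kwargs):
    # Pre-fix lookup (hypothetical helper): reads from kwargs, where the
    # setting is assumed to be absent, so the default is always returned.
    return kwargs.get("layout_recognize", "DeepDOC")


def pick_recognizer_after(parser_config=None, **kwargs):
    # Post-fix lookup (hypothetical helper): reads from parser_config,
    # treating a missing config as an empty dict.
    parser_config = parser_config or {}
    return parser_config.get("layout_recognize", "DeepDOC")


# Example value only; any non-default recognizer name shows the same effect.
config = {"layout_recognize": "Plain Text"}

before = pick_recognizer_before(parser_config=config)
after = pick_recognizer_after(parser_config=config)

print(before)  # DeepDOC (configured value ignored)
print(after)   # Plain Text (configured value respected)
```

Under these assumptions, the pre-fix lookup silently ignores the configured value, while the post-fix lookup returns it and still falls back to "DeepDOC" when nothing is configured.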

### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2025-06-26 11:54:43 +08:00

| Name | Last commit | Date |
| --- | --- | --- |
| app | Fix parser_config access for layout_recognize in presentation.py (#8492) | 2025-06-26 11:54:43 +08:00 |
| llm | fix the error 'Unknown field for GenerationConfig: max_tokens' when u… (#8473) | 2025-06-25 16:23:35 +08:00 |
| nlp | Refa: improve GraphRAG similarity sensitivity to numeric differences (#8479) | 2025-06-25 16:20:59 +08:00 |
| res | Update synonym dictionary file (#7997) | 2025-06-03 09:41:53 +08:00 |
| svr | Fix: some cases Task return but not set progress (#8469) | 2025-06-25 09:58:55 +08:00 |
| utils | Fix: Database Query Vulnerable to Injection Attacks in rag/utils/opendal_conn.py (#8408) | 2025-06-23 14:54:25 +08:00 |
| __init__.py | Update comments (#4569) | 2025-01-21 20:52:28 +08:00 |
| benchmark.py | Refactor embedding batch_size (#3825) | 2024-12-03 16:22:39 +08:00 |
| prompts.py | Fix: rank feature score should be greater than 0. (#8416) | 2025-06-23 14:10:13 +08:00 |
| raptor.py | Fix task_limiter in raptor.py (#8124) | 2025-06-09 10:18:03 +08:00 |
| settings.py | Feat: make document parsing and embedding batch sizes configurable via environment variables (#8266) | 2025-06-16 13:40:47 +08:00 |