ragflow/rag
Yongteng Lei 2677617f93
Feat: supports MinerU http-client/server method (#10961)
### What problem does this PR solve?

Add support for MinerU http-client/server method.

To use MinerU with vLLM server:

1. Set up a vLLM server running MinerU:
   ```bash
   mineru-vllm-server --port 30000
   ```

2. Configure the following environment variables:
- `MINERU_EXECUTABLE=/ragflow/uv_tools/.venv/bin/mineru` (or the path to
your MinerU executable)
   - `MINERU_BACKEND="vlm-http-client"`
   - `MINERU_SERVER_URL="http://your-vllm-server-ip:30000"`

3. Follow the standard MinerU setup steps as described above.

With this configuration, RAGFlow will connect to your vLLM server to
perform document parsing, which can significantly improve parsing
performance for complex documents while reducing the resource
requirements on your RAGFlow server.



![1](https://github.com/user-attachments/assets/46624a0c-0f3b-423e-ace8-81801e97a27d)

![2](https://github.com/user-attachments/assets/66ccc004-a598-47d4-93cb-fe176834f83b)


### Type of change

- [x] New Feature (non-breaking change which adds functionality)
- [x] Documentation Update

---------

Co-authored-by: writinwaters <cai.keith@gmail.com>
2025-11-04 16:03:30 +08:00
..
app Feat: supports MinerU http-client/server method (#10961) 2025-11-04 16:03:30 +08:00
flow Feat: supports MinerU http-client/server method (#10961) 2025-11-04 16:03:30 +08:00
llm Refactor log utils (#10973) 2025-11-03 20:25:02 +08:00
nlp Move token related functions to common (#10942) 2025-11-03 08:50:05 +08:00
prompts Fix:output_structure in agent (#10907) 2025-11-03 09:39:53 +08:00
res Fix: prio synonym match than wordnet for english (#10762) 2025-10-27 09:32:55 +08:00
svr Minor tweaks (#10987) 2025-11-04 14:15:31 +08:00
utils Fix: elasticsearch connection hardcoded (#10975) 2025-11-04 10:59:35 +08:00
__init__.py Update comments (#4569) 2025-01-21 20:52:28 +08:00
benchmark.py Remove 'get_lan_ip' and add common misc_utils.py (#10880) 2025-10-31 16:42:01 +08:00
raptor.py Move 'timeout' to common folder (#10983) 2025-11-04 11:51:12 +08:00
settings.py Introduce common/config_utils.py (#10968) 2025-11-03 17:25:06 +08:00