ragflow/deepdoc/vision
Debug Doctor 3e19044dee
Feat: add OCR's muti-gpus and parallel processing support (#5972)
### What problem does this PR solve?

Add multi-GPU and parallel processing support for OCR.

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

@yuzhichang I've tried to address the comments in #5697. OCR jobs can
now run on both CPU and GPU. (By the way, I ran into a
"Generate embedding error" issue, #5954, which might be caused by my
outdated GPUs; I'm not sure.) Please review this and share any suggestions.
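
For readers unfamiliar with the idea, here is a minimal sketch of spreading OCR work across several devices in parallel. It is not the code in `ocr.py`: `fake_ocr`, `ocr_shard`, and `parallel_ocr` are hypothetical names, the OCR call is a placeholder string function, and the round-robin split is only one possible assignment strategy.

```python
from concurrent.futures import ThreadPoolExecutor


def fake_ocr(device_id, page):
    # Hypothetical stand-in for a real OCR call bound to one device.
    return f"device{device_id}:{page}"


def ocr_shard(device_id, indexed_pages):
    # One worker handles every page assigned to its device, sequentially.
    return [(idx, fake_ocr(device_id, page)) for idx, page in indexed_pages]


def parallel_ocr(pages, device_count):
    """Split pages round-robin across devices and OCR the shards in parallel."""
    n = max(1, device_count)  # fall back to a single worker (e.g. CPU only)
    shards = [[] for _ in range(n)]
    for idx, page in enumerate(pages):
        shards[idx % n].append((idx, page))

    # One thread per device; each thread processes its own shard.
    with ThreadPoolExecutor(max_workers=n) as pool:
        shard_results = list(pool.map(ocr_shard, range(n), shards))

    # Restore the original page order.
    ordered = [None] * len(pages)
    for shard in shard_results:
        for idx, text in shard:
            ordered[idx] = text
    return ordered


if __name__ == "__main__":
    print(parallel_ocr(["page1", "page2", "page3"], device_count=2))
```

Keeping one worker per device avoids two threads contending for the same device, at the cost of uneven load when pages differ a lot in size.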

GPU:

![gpu_ocr](https://github.com/user-attachments/assets/0ee2ecfb-a665-4e50-8bc7-15941b9cd80e)

![smi](https://github.com/user-attachments/assets/a2312f8c-cf24-443d-bf89-bec50503546d)

CPU:

![cpu_ocr](https://github.com/user-attachments/assets/1ba6bb0b-94df-41ea-be79-790096da4bf1)
2025-03-17 11:58:40 +08:00

| File | Last commit | Date |
| --- | --- | --- |
| `__init__.py` | Made task_executor async to speedup parsing (#5530) | 2025-03-03 18:59:49 +08:00 |
| `layout_recognizer.py` | Fix: optimize OCR garbage identification to reduce unnecessary filtering (#6027) | 2025-03-13 18:48:32 +08:00 |
| `ocr.py` | Feat: add OCR's muti-gpus and parallel processing support (#5972) | 2025-03-17 11:58:40 +08:00 |
| `operators.py` | refactor: no need to inherit in python3 clean the code (#5659) | 2025-03-05 18:03:53 +08:00 |
| `postprocess.py` | refactor: no need to inherit in python3 clean the code (#5659) | 2025-03-05 18:03:53 +08:00 |
| `recognizer.py` | refactor: no need to inherit in python3 clean the code (#5659) | 2025-03-05 18:03:53 +08:00 |
| `seeit.py` | Update comments (#4569) | 2025-01-21 20:52:28 +08:00 |
| `t_ocr.py` | Feat: add OCR's muti-gpus and parallel processing support (#5972) | 2025-03-17 11:58:40 +08:00 |
| `t_recognizer.py` | Update comments (#4569) | 2025-01-21 20:52:28 +08:00 |
| `table_structure_recognizer.py` | Optimized Recognizer.sort_X_firstly and Recognizer.sort_Y_firstly (#5182) | 2025-02-20 15:41:12 +08:00 |