No description
Find a file
2024-10-29 15:44:41 +08:00
.github/workflows Manually reformatted files 2024-10-25 13:32:25 +05:30
examples Fix lint issue 2024-10-28 17:05:38 +02:00
lightrag Fix lint issue 2024-10-28 17:05:38 +02:00
reproduce fix Step_3_openai_compatible.py 2024-10-25 19:25:26 +08:00
.gitignore Manually reformatted files 2024-10-25 13:32:25 +05:30
.pre-commit-config.yaml chore: added pre-commit-hooks and ruff formatting for commit-hooks 2024-10-19 09:43:17 +05:30
LICENSE update 2024-10-08 10:38:50 +08:00
README.md Update README.md 2024-10-29 15:44:41 +08:00
requirements.txt support lmdeploy backend 2024-10-26 16:11:15 +08:00
setup.py Refactor code formatting in lightrag_api_openai_compatible_demo.py 2024-10-26 16:12:10 +08:00

Quick start

  • install textract
pip install textract

*example

import textract
# 指定要提取文本的文件路径
file_path = 'path/to/your/file.pdf'
# 从文件中提取文本
text_content = textract.process(file_path)
# 打印提取的文本
print(text_content.decode('utf-8'))