ragflow/rag/llm
Song Fuchang a1f06a4fdc
Feat: Support tool calling in Generate component (#7572)
### What problem does this PR solve?

Hello, our use case requires LLM agent to invoke some tools, so I made a
simple implementation here.

This PR does two things:

1. A simple plugin mechanism based on `pluginlib`:

This mechanism lives in the `plugin` directory. It will only load
plugins from `plugin/embedded_plugins` for now.

A sample plugin `bad_calculator.py` is placed in
`plugin/embedded_plugins/llm_tools`, it accepts two numbers `a` and `b`,
then give a wrong result `a + b + 100`.

In the future, it can load plugins from external location with little
code change.

Plugins are divided into different types. The only plugin type supported
in this PR is `llm_tools`, which must implement the `LLMToolPlugin`
class in the `plugin/llm_tool_plugin.py`.
More plugin types can be added in the future.

2. A tool selector in the `Generate` component:

Added a tool selector to select one or more tools for LLM:


![image](https://github.com/user-attachments/assets/74a21fdf-9333-4175-991b-43df6524c5dc)

And with the `bad_calculator` tool, it results this with the `qwen-max`
model:


![image](https://github.com/user-attachments/assets/93aff9c4-8550-414a-90a2-1a15a5249d94)


### Type of change

- [ ] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):

Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>
2025-05-16 16:32:19 +08:00
..
__init__.py feat: replace models of novita (#7360) 2025-04-28 13:35:09 +08:00
chat_model.py Feat: Support tool calling in Generate component (#7572) 2025-05-16 16:32:19 +08:00
cv_model.py Refa: remove ollama keep alive. (#7560) 2025-05-09 17:51:49 +08:00
embedding_model.py Fix:Set CUDA_VISIBLE_DEVICES In DefaultEmbedding (#7465) 2025-05-06 14:38:36 +08:00
rerank_model.py Fix: api-key issue for xinference. (#6490) 2025-03-25 15:01:13 +08:00
sequence2txt_model.py Feat: support vision llm for gpustack (#6636) 2025-03-31 15:33:52 +08:00
tts_model.py Feat: support vision llm for gpustack (#6636) 2025-03-31 15:33:52 +08:00