- Add new aquery_llm/query_llm methods providing structured responses - Consolidate /query and /query/stream endpoints to use unified aquery_llm - Optimize cache handling by moving cache checks before LLM calls |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| document_routes.py | ||
| graph_routes.py | ||
| ollama_api.py | ||
| query_routes.py | ||