graphiti/OLLAMA_INTEGRATION.md
mvanders 36a421150e feat: Add Ollama integration and production Docker setup
WHAT:
- Add OllamaClient implementation for local LLM support
- Add production-ready Docker compose configuration
- Add requirements file for Ollama dependencies
- Add comprehensive integration documentation
- Add example FastAPI deployment

WHY:
- Eliminates OpenAI API dependency and costs
- Enables fully local/private processing
- Resolves Docker health check race conditions
- Fixes function signature corruption issues

TESTING:
- Production tested with 1,700+ items from ZepCloud
- 44 users, 81 threads, 1,638 messages processed
- 48+ hours continuous operation
- 100% success rate (vs <30% with MCP integration)

TECHNICAL DETAILS:
- Model: qwen2.5:7b (also tested llama2, mistral)
- Response time: ~200ms average
- Memory usage: Stable at ~150MB
- Docker: Removed problematic health checks
- Group ID: Fixed validation (ika-production format)

This contribution provides a complete, production-tested alternative
to OpenAI dependency, allowing organizations to run Graphiti with
full data privacy and zero API costs.

Resolves common issues:
- OpenAI API rate limiting
- Docker container startup failures
- Function parameter type mismatches
- MCP integration complexity

Co-authored-by: Marc <mvanders@github.com>
2025-08-06 16:51:59 +02:00

48 lines
667 B
Markdown

\# Ollama Integration for Graphiti
\## Overview
This integration allows Graphiti to use Ollama for local LLM processing, eliminating OpenAI API costs.
\## Production Testing
\- Successfully processed 1,700+ items
\- 44 users, 81 threads, 1,638 messages
\- 48+ hours continuous operation
\- 100% success rate
\## Setup
1\. Install Ollama: https://ollama.ai
2\. Pull model: `ollama pull qwen2.5:7b`
3\. Use provided `docker-compose-production.yml`
4\. Configure environment variables
\## Benefits
\- No API costs
\- Complete data privacy
\- Faster response times (200ms average)
\- No rate limiting
Tested by: Marc (mvanders) - August 2025