WHAT:
- Add OllamaClient implementation for local LLM support
- Add production-ready Docker Compose configuration
- Add requirements file for Ollama dependencies
- Add comprehensive integration documentation
- Add example FastAPI deployment

WHY:
- Eliminates OpenAI API dependency and costs
- Enables fully local/private processing
- Resolves Docker health check race conditions
- Fixes function signature corruption issues

TESTING:
- Production tested with 1,700+ items from ZepCloud
- 44 users, 81 threads, 1,638 messages processed
- 48+ hours of continuous operation
- 100% success rate (vs <30% with MCP integration)

TECHNICAL DETAILS:
- Model: qwen2.5:7b (also tested llama2, mistral)
- Response time: ~200ms average
- Memory usage: stable at ~150MB
- Docker: removed problematic health checks
- Group ID: fixed validation (ika-production format)

This contribution provides a complete, production-tested alternative to the OpenAI dependency, allowing organizations to run Graphiti with full data privacy and zero API costs.

Resolves common issues:
- OpenAI API rate limiting
- Docker container startup failures
- Function parameter type mismatches
- MCP integration complexity

Co-authored-by: Marc <mvanders@github.com>
# Ollama Integration for Graphiti
## Overview
This integration allows Graphiti to use Ollama for local LLM processing, eliminating OpenAI API costs.
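Ollama serves an OpenAI-compatible API under `/v1`, so the client can be a thin wrapper around the `openai` SDK pointed at the local server. The sketch below illustrates that approach; the class and method names are illustrative and may differ from the shipped `OllamaClient`.

```python
# Minimal sketch of an Ollama-backed chat client. Ollama ignores the
# API key, but the openai SDK requires a non-empty value.
from openai import OpenAI


class OllamaClient:
    """Thin wrapper over Ollama's OpenAI-compatible /v1 endpoint."""

    def __init__(
        self,
        base_url: str = "http://localhost:11434/v1",
        model: str = "qwen2.5:7b",
    ) -> None:
        self.client = OpenAI(base_url=base_url, api_key="ollama")
        self.model = model

    def generate(self, prompt: str) -> str:
        # Standard chat-completions call, served locally by Ollama.
        response = self.client.chat.completions.create(
            model=self.model,
            messages=[{"role": "user", "content": prompt}],
        )
        return response.choices[0].message.content or ""
```

Because the local endpoint speaks the OpenAI wire protocol, existing OpenAI-oriented code paths need only a different base URL and model name.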
## Production Testing
- Successfully processed 1,700+ items
- 44 users, 81 threads, 1,638 messages
- 48+ hours continuous operation
- 100% success rate
## Setup
1. Install Ollama: https://ollama.ai
2. Pull the model: `ollama pull qwen2.5:7b`
3. Start the stack with the provided `docker-compose-production.yml`
4. Configure environment variables (an application-level readiness probe is sketched below)
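The production compose file intentionally omits Docker-level health checks (see the commit notes above), so it helps to gate application startup on an in-process readiness probe instead. Below is a minimal sketch that polls Ollama's documented `/api/tags` model-listing endpoint; the function name and timeout values are illustrative, not part of this contribution.

```python
# Illustrative readiness probe: blocks until Ollama answers and the
# target model has been pulled, then lets the application proceed.
import time

import httpx


def wait_for_ollama(
    base_url: str = "http://localhost:11434",
    model: str = "qwen2.5:7b",
    timeout_s: float = 120.0,
) -> None:
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        try:
            # /api/tags lists locally available models.
            resp = httpx.get(f"{base_url}/api/tags", timeout=5.0)
            resp.raise_for_status()
            names = [m["name"] for m in resp.json().get("models", [])]
            if any(n.startswith(model) for n in names):
                return
        except httpx.HTTPError:
            pass  # Ollama not reachable yet; retry.
        time.sleep(2.0)
    raise TimeoutError(f"Ollama never became ready with model {model!r}")
```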
## Benefits
- No API costs
- Complete data privacy
- Fast response times (~200ms average in production testing)
- No rate limiting
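The contribution also ships an example FastAPI deployment. The sketch below is a hypothetical minimal version of such a service; the route and the environment variables (`OLLAMA_URL`, `OLLAMA_MODEL`) are illustrative, not the shipped example.

```python
# Hypothetical FastAPI service that forwards prompts to a local Ollama.
import os

import httpx
from fastapi import FastAPI
from pydantic import BaseModel

OLLAMA_URL = os.environ.get("OLLAMA_URL", "http://localhost:11434")
MODEL = os.environ.get("OLLAMA_MODEL", "qwen2.5:7b")

app = FastAPI()


class ChatRequest(BaseModel):
    prompt: str


@app.post("/chat")
async def chat(req: ChatRequest) -> dict:
    # /api/generate returns a single JSON object when stream is False.
    async with httpx.AsyncClient(timeout=60.0) as client:
        resp = await client.post(
            f"{OLLAMA_URL}/api/generate",
            json={"model": MODEL, "prompt": req.prompt, "stream": False},
        )
        resp.raise_for_status()
    return {"response": resp.json()["response"]}
```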
Tested by: Marc (mvanders) - August 2025