graphiti

History

Daniel Chalef 1a6db24600 Final MMR optimization focused on 1024D vectors with smart dimensionality dispatch This commit delivers a production-ready MMR optimization specifically tailored for Graphiti's primary use case while handling high-dimensional vectors appropriately. ## Performance Improvements for 1024D Vectors - Average 1.16x speedup (13.6% reduction in search latency) - Best performance: 1.31x speedup for 25 candidates (23.5% faster) - Sub-millisecond latency: 0.266ms for 10 candidates, 0.662ms for 25 candidates - Scalable performance: Maintains improvements up to 100 candidates ## Smart Algorithm Dispatch - 1024D vectors: Uses optimized precomputed similarity matrix approach - High-dimensional vectors (≥2048D): Falls back to original algorithm to avoid overhead - Adaptive thresholds: Considers both dataset size and dimensionality for optimal performance ## Key Optimizations for Primary Use Case 1. Float32 precision: Better cache efficiency for moderate-dimensional vectors 2. Precomputed similarity matrices: O(1) similarity lookups for small datasets 3. Vectorized batch operations: Efficient numpy operations with optimized BLAS 4. Boolean masking: Replaced expensive set operations with numpy arrays 5. Smart memory management: Optimal layouts for CPU cache utilization ## Technical Implementation - Memory efficient: All test cases fit in CPU cache (max 0.43MB for 100×1024D) - Cache-conscious: Contiguous float32 arrays improve memory bandwidth - BLAS optimized: Matrix multiplication leverages hardware acceleration - Correctness maintained: All existing tests pass with identical results ## Production Impact - Real-time search: Sub-millisecond performance for typical scenarios - Scalable: Performance improvements across all tested dataset sizes - Robust: Handles edge cases and high-dimensional vectors gracefully - Backward compatible: Drop-in replacement with identical API This optimization transforms MMR from a potential bottleneck into a highly efficient operation for Graphiti's search pipeline, providing significant performance gains for the most common use case (1024D vectors) while maintaining robustness for all scenarios. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>		2025-07-18 12:28:50 -07:00
..
cross_encoder	Bulk ingestion (#698 )	2025-07-10 12:14:49 -04:00
driver	Final MMR optimization focused on 1024D vectors with smart dimensionality dispatch	2025-07-18 12:28:50 -07:00
embedder	save edge update (#721 )	2025-07-14 11:15:38 -04:00
llm_client	save edge update (#721 )	2025-07-14 11:15:38 -04:00
models	save edge update (#721 )	2025-07-14 11:15:38 -04:00
prompts	Bulk updates (#732 )	2025-07-16 02:26:33 -04:00
search	Final MMR optimization focused on 1024D vectors with smart dimensionality dispatch	2025-07-18 12:28:50 -07:00
telemetry	feat: add telemetry with PostHog and update Docker configurations (#633 )	2025-06-27 12:23:30 -07:00
utils	make egg_operations more robust (#737 )	2025-07-16 17:12:20 -04:00
__init__.py	chore: Fix packaging (#38 )	2024-08-25 10:07:50 -07:00
edges.py	Return embeddings option in get_by_uuids (#736 )	2025-07-16 11:09:10 -04:00
errors.py	Add group ID validation and error handling (#618 )	2025-06-24 09:33:54 -07:00
graph_queries.py	update driver (#583 )	2025-06-13 14:12:09 -04:00
graphiti.py	[Bug Fix] Fix the Group ID usage with FalkorDB (#733 )	2025-07-17 12:35:08 -04:00
graphiti_types.py	Add support for falkordb (#575 )	2025-06-13 12:06:57 -04:00
helpers.py	Final MMR optimization focused on 1024D vectors with smart dimensionality dispatch	2025-07-18 12:28:50 -07:00
nodes.py	Return embeddings option in get_by_uuids (#736 )	2025-07-16 11:09:10 -04:00
py.typed	Add py.typed file (#105 )	2024-09-11 08:44:06 -04:00