graphiti

Author	SHA1	Message	Date
Daniel Chalef	590282524a	fix: Improve edge extraction entity ID validation (#968 ) * fix: Improve edge extraction entity ID validation Fixes invalid entity ID references in edge extraction that caused warnings like: "WARNING: source or target node not filled WILL_FIND. source_node_uuid: 23 and target_node_uuid: 3" Changes: - Format ENTITIES list as proper JSON in prompt for better LLM parsing - Clarify field descriptions to reference entity id from ENTITIES list - Add explicit entity ID validation as #1 extraction rule with examples - Improve error logging (removed PII, added entity count and valid range) These changes follow patterns from extract_nodes.py and dedupe_nodes.py where entity referencing works reliably. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * wip * fix: Align fact field naming and add description - Change extraction rule to reference 'fact' instead of 'fact_text' - Add descriptive text for fact field in Edge model * fix: Remove ensure_ascii parameter from to_prompt_json call Align with other to_prompt_json calls that don't use ensure_ascii * fix: Use validated target_node_idx variable consistently Line 190 was using raw edge_data.target_entity_id instead of the validated target_node_idx variable, creating inconsistency with line 189 * fix: Improve edge extraction validation checks - Add explicit check for empty nodes list - Use more explicit 0 <= idx comparison instead of -1 < idx - Prevents nonsensical error message when no entities provided * chore: Restore uv.lock from main branch Previously deleted in commit `7e4464b`, now restored to match main branch state * Update uv.lock --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-02 22:45:11 -07:00
Preston Rasmussen	bec3f02036	filter out falsey values before creating embeddings (#966 ) * filter out falsey values * update * early return	2025-10-02 15:26:51 -04:00
Daniel Chalef	7bd8f8a2f2	chore: Update edge extraction prompt to paraphrase instead of quote (#957 ) * chore: Update edge extraction prompt to paraphrase instead of quote - Changed instruction 5 to request paraphrasing rather than verbatim quoting - Updated string quotes to use double quotes for consistency 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * chore: Format edge_operations.py and update lock file - Minor formatting fix in edge_operations.py list comprehension - Update uv.lock with version bump to 0.21.0rc8 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-01 09:05:04 -07:00
Daniel Chalef	420676faf2	fix: Prevent duplicate edge facts within same episode (#955 ) * fix: Prevent duplicate edge facts within same episode This fixes three related bugs that allowed verbatim duplicate edge facts: 1. Fixed LLM deduplication: Changed related_edges_context to use integer indices instead of UUIDs, matching the EdgeDuplicate model expectations. 2. Fixed batch deduplication: Removed episode skip in dedupe_edges_bulk that prevented comparing edges from the same episode. Added self-comparison guard to prevent edge from comparing against itself. 3. Added fast-path deduplication: Added exact string matching before parallel processing in resolve_extracted_edges to catch within-episode duplicates early, preventing race conditions where concurrent edges can't see each other. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * test: Add tests for edge deduplication fixes Added three tests to verify the edge deduplication fixes: 1. test_dedupe_edges_bulk_deduplicates_within_episode: Verifies that dedupe_edges_bulk now compares edges from the same episode after removing the `if i == j: continue` check. 2. test_resolve_extracted_edge_uses_integer_indices_for_duplicates: Validates that the LLM receives integer indices for duplicate detection and correctly processes returned duplicate_facts. 3. test_resolve_extracted_edges_fast_path_deduplication: Confirms that the fast-path exact string matching deduplicates identical edges before parallel processing, preventing race conditions. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Remove unused variables flagged by ruff - Remove unused loop variable 'j' in bulk_utils.py - Remove unused return value 'edges_by_episode' in test - Replace unused 'edge_uuid' with '_' in test loop 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-01 07:30:30 -07:00
Daniel Chalef	3fcd587276	fix: Add edge type validation based on node labels (#948 ) * fix: Add edge type validation based on node labels - Add DEFAULT_EDGE_NAME constant for 'RELATES_TO' - Implement pre-resolution validation to reset invalid edge names - Add post-resolution validation for LLM-returned fact types - Rename parameter from edge_types to edge_type_candidates for clarity - Add comprehensive tests for validation scenarios This ensures edges conform to edge_type_map constraints and prevents misclassification when edge types don't match node label pairs. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * chore: Bump version to 0.30.0pre4 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-29 16:35:00 -07:00
Preston Rasmussen	d6d4bbdeb7	don't save duplicate edges (#927 ) * don't save duplicate edges * remove build duplicate edges	2025-09-24 17:24:57 -04:00
Preston Rasmussen	c794f8881b	pre5 (#926 )	2025-09-24 16:38:20 -04:00
Preston Rasmussen	36056ad141	Graph quality updates (#922 ) duplicate_of updates	2025-09-23 17:53:39 -04:00
Preston Rasmussen	da71d118db	Embedding fix (#917 ) * embedding fix * pre3 * fixedmake format	2025-09-20 09:00:04 -04:00
Preston Rasmussen	3efe085a92	OpenSearch updates (#906 ) * updates * add uuid filter functionality * update * updates * bump-version * update * fix typo * use async function * update unit tests * update delete * update deletion * async update * update * update * update * update	2025-09-14 01:43:37 -04:00
Preston Rasmussen	0884cc00e5	OpenSearch Integration for Neo4j (#896 ) * move aoss to driver * add indexes * don't save vectors to neo4j with aoss * load embeddings from aoss * add group_id routing * add search filters and similarity search * neptune regression update * update neptune for regression purposes * update index creation with aliasing * regression tested * update version * edits * claude suggestions * cleanup * updates * add embedding dim env var * use cosine sim * updates * updates * remove unused imports * update	2025-09-09 10:51:46 -04:00
Preston Rasmussen	ce1ae30569	Add return to add_triplet (#898 ) * update * add triplet results * Update graphiti_core/graphiti.py Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com> --------- Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>	2025-09-08 15:39:05 -04:00
Preston Rasmussen	7e6d93fa32	add episode bulk search results (#897 ) * add episode bulk search results * update * docstring * update	2025-09-08 14:34:32 -04:00
Preston Rasmussen	1f5a1b890c	cleanup (#894 ) * cleanup * update * remove unused imports	2025-09-05 11:30:46 -04:00
Preston Rasmussen	81d110f944	bump version (#889 ) * bump version * remove unused imports	2025-09-03 14:08:35 -04:00
Preston Rasmussen	1460172568	don't return index labels (#887 ) * don't return index labels * update tests	2025-09-02 12:02:33 -04:00
Preston Rasmussen	da6f3336bb	update-tests (#872 ) * update-tests * unit test update * update tests * update tests * update kuzu query * update * update query * update args * fix bulk episode add * make handling better	2025-08-31 13:19:29 -04:00
Siddhartha Sahu	8802b7db13	Add support for Kuzu as the graph driver (#799 ) * Fix FalkoDB tests * Add support for graph memory using Kuzu * Fix lints * Fix queries * Add tests * Add comments * Add more test coverage * Add mocked tests * Format * Add mocked tests II * Refactor community queries * Add more mocked tests * Refactor tests to always cleanup * Add more mocked tests * Update kuzu * Refactor how filters are built * Add more mocked tests * Refactor and cleanup * Fix tests * Fix lints * Refactor tests * Disable neptune * Fix * Update kuzu version * Update kuzu to latest release * Fix filter * Fix query * Fix Neptune query * Fix bulk queries * Fix lints * Fix deletes * Comments and format * Add Kuzu to the README * Fix bulk queries * Test all fields of nodes and edges * Fix lints * Update search_utils.py --------- Co-authored-by: Preston Rasmussen <109292228+prasmussen15@users.noreply.github.com>	2025-08-27 11:45:21 -04:00
Preston Rasmussen	309159bccb	update migration (#870 ) * update migration * bump version * close driver	2025-08-27 11:13:10 -04:00
Preston Rasmussen	fa9c1696b8	dont create extra search embeddings (#861 ) * dont create extra search embeddings * updates * add missing conditionals * fix * float 0 * null check * more nullchecks * bump version	2025-08-26 11:16:46 -04:00
Preston Rasmussen	0ac7ded4d1	use hnsw indexes (#859 ) * use hnsw indexes * add migration * updates * add group_id validation * updates * add type annotation * updates * update * swap to prerelease	2025-08-25 12:31:35 -04:00
Preston Rasmussen	1edcbaa9e9	Gpt 5 default (#849 ) * gpt-5-mini and gpt-5-nano default * bump version * remove unused imports * linter * update * disable neptune errors while we get a fixture in place * update pyright * revert non-structured completions * fix typo	2025-08-21 12:10:57 -04:00
bechbd	ef56dc779a	Amazon Neptune Support (#793 ) * Rebased Neptune changes based on significant rework done * Updated the README documentation * Fixed linting and formatting * Update README.md Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update graphiti_core/driver/neptune_driver.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update README.md Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Addressed feedback from code review * Updated the README documentation for clarity * Updated the README and neptune_driver based on PR feedback * Update node_db_queries.py --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> Co-authored-by: Preston Rasmussen <109292228+prasmussen15@users.noreply.github.com>	2025-08-20 10:56:03 -04:00
Preston Rasmussen	1c27a3563b	update prompts and support thinking models (#846 ) * update prompts and support thinking models * update * type ignore	2025-08-19 12:31:50 -04:00
Preston Rasmussen	1278f877d8	add bulk delete (#837 ) * add bulk delete * Update graphiti_core/edges.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update graphiti_core/edges.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update graphiti_core/edges.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-08-15 12:15:07 -04:00
Preston Rasmussen	bfe51a0fcd	Null search datetimes (#818 ) * support null operations * update * only provide variables for non-null params * test update	2025-08-12 12:24:37 -04:00
Preston Rasmussen	0c01417e1f	add batch delete capabilities (#813 ) * add batch delete capabilities * use session for delete queries	2025-08-07 15:51:30 -04:00
Preston Rasmussen	f0cc7709bd	test updates (#806 ) * test updates * update	2025-08-05 10:49:44 -04:00
Preston Rasmussen	ab8106cb4f	move summary out of attribute extraction (#792 ) * move summary out of attribute extraction * linter * linter * fix db query	2025-07-31 12:15:21 -04:00
prestonrasmussen	6f3a4f19eb	dedupe prompt update	2025-07-29 18:43:42 -04:00
Preston Rasmussen	17747ff58d	Return reranker scores (#758 ) * add search reranker scores to search output * bump version * updates	2025-07-23 16:05:48 -04:00
prestonrasmussen	c0cae61d52	fulltext query update	2025-07-22 14:07:41 -04:00
Preston Rasmussen	38dd3e8dc3	Edge search updates (#753 ) * update edge fulltext search * update * update	2025-07-22 10:05:58 -04:00
prestonrasmussen	f4dc7e2fba	update max query length	2025-07-21 19:39:31 -04:00
Preston Rasmussen	748464dfa5	Return embeddings option in get_by_uuids (#736 ) * add with_embeddings option * update	2025-07-16 11:09:10 -04:00
Preston Rasmussen	5d45d71259	Bulk updates (#732 ) * updates * update * update * typo * linter	2025-07-16 02:26:33 -04:00
Preston Rasmussen	62df6624d4	bulk utils update (#727 ) * bulk utils update * remove unused imports * edge model type guard	2025-07-15 11:42:08 -04:00
Preston Rasmussen	e56ba1a71c	save edge update (#721 )	2025-07-14 11:15:38 -04:00
Preston Rasmussen	deda803dc5	update search filters (#706 ) * update search filters * toml	2025-07-11 10:53:15 -04:00
Daniel Chalef	aa6e38856a	[REFACTOR][FIX] Move away from DEFAULT_DATABASE environment variable in favour of driver-config support (dc) (#699 ) * fix: remove global DEFAULT_DATABASE usage in favor of driver-specific config Fixes bugs introduced in PR #607. This removes reliance on the global DEFAULT_DATABASE environment variable. It specifies the database within each driver. PR #607 introduced a Neo4j compatability, as the database names are different when attempting to support FalkorDB. This refactor improves compatability across database types and ensures future reliance by isolating the configuraiton to the driver level. * fix: make falkordb support optional This ensures that the the optional dependency and subsequent import is compliant with the graphiti-core project dependencies. * chore: fmt code * chore: undo changes to uv.lock * fix: undo potentially breaking changes to drive interface * fix: ensure a default database of "None" is provided - falling back to internal default * chore: ensure default value exists for session and delete_all_indexes * chore: fix typos and grammar * chore: update package versions and dependencies in uv.lock and bulk_utils.py * docs: update database configuration instructions for Neo4j and FalkorDB Clarified default database names and how to override them in driver constructors. Updated testing requirements to include specific commands for running integration and unit tests. * fix: ensure params defaults to an empty dictionary in Neo4jDriver Updated the execute_query method to initialize params as an empty dictionary if not provided, ensuring compatibility with the database configuration. --------- Co-authored-by: Urmzd <urmzd@dal.ca>	2025-07-10 17:25:39 -04:00
prestonrasmussen	e5a61de931	version bump	2025-07-10 12:15:23 -04:00
Preston Rasmussen	0675ac2b7d	Bulk ingestion (#698 ) * partial * update * update * update * update * updates * updates * update * update	2025-07-10 12:14:49 -04:00
Daniel Chalef	513cfbf7b2	Refactor imports (#675 ) * Refactor imports * Fix: Remove duplicate sentence-transformers dependency from dev requirements * Refactor: Update optional import patterns across various modules for better type checking and error handling * Update CONTRIBUTING.md Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-07-05 08:57:07 -07:00
James.	7ce07942b1	Fix: Add missing name_embedding field to community search queries (#664 ) Enhanced queries in search_utils.py to include 'name_embedding' field in community full-text and similarity search functions.	2025-07-02 11:45:25 -04:00
Daniel Chalef	8213d10d44	migrate to pyright (#646 ) * migrate to pyright * Refactor type checking to use Pyright, update dependencies, and clean up code. - Replaced MyPy with Pyright in configuration files and CI workflows. - Updated `pyproject.toml` and `uv.lock` to reflect new dependencies and versions. - Adjusted type hints and fixed minor code issues across various modules for better compatibility with Pyright. - Added new packages `backoff` and `posthog` to the project dependencies. * Update CI workflows to install all extra dependencies for type checking and unit tests * Update dependencies in uv.lock to replace MyPy with Pyright and add nodeenv package. Adjust type hinting in config.py for compatibility with Pyright.	2025-06-30 12:04:21 -07:00
Daniel Chalef	a7ca777af5	migrate to uv (#634 )	2025-06-27 12:12:49 -07:00

46 commits