graphiti

Author	SHA1	Message	Date
Daniel Chalef	160a8a1310	fix: Prevent duplicate edge facts within same episode This fixes three related bugs that allowed verbatim duplicate edge facts: 1. Fixed LLM deduplication: Changed related_edges_context to use integer indices instead of UUIDs, matching the EdgeDuplicate model expectations. 2. Fixed batch deduplication: Removed episode skip in dedupe_edges_bulk that prevented comparing edges from the same episode. Added self-comparison guard to prevent edge from comparing against itself. 3. Added fast-path deduplication: Added exact string matching before parallel processing in resolve_extracted_edges to catch within-episode duplicates early, preventing race conditions where concurrent edges can't see each other. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-09-30 20:34:17 -07:00
Daniel Chalef	f2c4c97362	Allow Edge extraction to keep discovered edge labels (#950 ) * chore: Update dependencies and enhance edge resolution logic - Add new dependencies: boto3, opensearch-py, and langchain-aws to pyproject.toml. - Modify Graphiti class to handle additional parameters in edge resolution. - Improve edge type handling in deduplication logic by introducing custom edge type names. - Enhance tests for edge resolution to cover new scenarios and ensure correct behavior. This update improves the flexibility and functionality of edge operations while ensuring compatibility with new libraries. * refactor: Clean up test_edge_operations.py and format response returns - Remove unnecessary stubs for opensearchpy module. - Format return values in llm_client.generate_response for consistency. - Enhance readability by ensuring proper indentation and structure in test cases. This refactor improves the clarity and maintainability of the test suite for edge operations. * bump version to 0.30.0pre5 and enhance docstring for resolve_extracted_edge function - Update version in pyproject.toml to 0.30.0pre5. - Add detailed docstring to resolve_extracted_edge function in edge_operations.py, clarifying parameters and return values. This update improves documentation clarity for the edge resolution process.	2025-09-29 21:32:47 -07:00
Daniel Chalef	3fcd587276	fix: Add edge type validation based on node labels (#948 ) * fix: Add edge type validation based on node labels - Add DEFAULT_EDGE_NAME constant for 'RELATES_TO' - Implement pre-resolution validation to reset invalid edge names - Add post-resolution validation for LLM-returned fact types - Rename parameter from edge_types to edge_type_candidates for clarity - Add comprehensive tests for validation scenarios This ensures edges conform to edge_type_map constraints and prevents misclassification when edge types don't match node label pairs. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * chore: Bump version to 0.30.0pre4 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-09-29 16:35:00 -07:00
Daniel Chalef	7c469e8e2b	Improve node deduplication w/ deterministic matching, LLM fallbacks (#929 ) * add repository guidelines and project structure documentation * update neo4j image version and modify test command to disable specific databases * implement deduplication helpers and integrate with node operations * refactor string formatting to use single quotes in node operations * enhance deduplication helpers with UUID indexing and update resolution logic * implement exact fact matching (#931)	2025-09-25 07:13:19 -07:00
Preston Rasmussen	d6d4bbdeb7	don't save duplicate edges (#927 ) * don't save duplicate edges * remove build duplicate edges	2025-09-24 17:24:57 -04:00
Preston Rasmussen	3efe085a92	OpenSearch updates (#906 ) * updates * add uuid filter functionality * update * updates * bump-version * update * fix typo * use async function * update unit tests * update delete * update deletion * async update * update * update * update * update	2025-09-14 01:43:37 -04:00
Siddhartha Sahu	8802b7db13	Add support for Kuzu as the graph driver (#799 ) * Fix FalkorDB tests * Add support for graph memory using Kuzu * Fix lints * Fix queries * Add tests * Add comments * Add more test coverage * Add mocked tests * Format * Add mocked tests II * Refactor community queries * Add more mocked tests * Refactor tests to always cleanup * Add more mocked tests * Update kuzu * Refactor how filters are built * Add more mocked tests * Refactor and cleanup * Fix tests * Fix lints * Refactor tests * Disable neptune * Fix * Update kuzu version * Update kuzu to latest release * Fix filter * Fix query * Fix Neptune query * Fix bulk queries * Fix lints * Fix deletes * Comments and format * Add Kuzu to the README * Fix bulk queries * Test all fields of nodes and edges * Fix lints * Update search_utils.py --------- Co-authored-by: Preston Rasmussen <109292228+prasmussen15@users.noreply.github.com>	2025-08-27 11:45:21 -04:00
bechbd	ef56dc779a	Amazon Neptune Support (#793 ) * Rebased Neptune changes based on significant rework done * Updated the README documentation * Fixed linting and formatting * Update README.md Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update graphiti_core/driver/neptune_driver.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update README.md Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Addressed feedback from code review * Updated the README documentation for clarity * Updated the README and neptune_driver based on PR feedback * Update node_db_queries.py --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> Co-authored-by: Preston Rasmussen <109292228+prasmussen15@users.noreply.github.com>	2025-08-20 10:56:03 -04:00
HUGO SON	ce9ef3ca79	Add support for non-ASCII characters in LLM prompts (#805 ) * Add support for non-ASCII characters in LLM prompts - Add ensure_ascii parameter to Graphiti class (default: True) - Create to_prompt_json helper function for consistent JSON serialization - Update all prompt files to use new helper function - Preserve Korean/Japanese/Chinese characters when ensure_ascii=False - Maintain backward compatibility with existing behavior Fixes issue where non-ASCII characters were escaped as unicode sequences in prompts, making them unreadable in LLM logs and potentially affecting model understanding. * Remove unused json imports after replacing with to_prompt_json helper - Fix ruff lint errors (F401) for unused json imports - All prompt files now use to_prompt_json helper instead of json.dumps - Maintains clean code style and passes lint checks * Fix ensure_ascii propagation to all LLM calls - Add ensure_ascii parameter to maintenance operation functions that were missing it - Update function signatures in node_operations, community_operations, temporal_operations, and edge_operations - Ensure all llm_client.generate_response calls receive proper ensure_ascii context - Fix hardcoded ensure_ascii: True values that prevented non-ASCII character preservation - Maintain backward compatibility with default ensure_ascii=True - Complete the fix for issue #804 ensuring Korean/Japanese/Chinese characters are properly handled in LLM prompts	2025-08-08 11:07:32 -04:00
Preston Rasmussen	ab8106cb4f	move summary out of attribute extraction (#792 ) * move summary out of attribute extraction * linter * linter * fix db query	2025-07-31 12:15:21 -04:00
Preston Rasmussen	19bddb5528	validate pydantic objects (#783 ) * validate pydantic objects * unused imports * linter	2025-07-29 17:54:09 -04:00
Preston Rasmussen	0ac2541b35	make edge_operations more robust (#737 ) update	2025-07-16 17:12:20 -04:00
Preston Rasmussen	62df6624d4	bulk utils update (#727 ) * bulk utils update * remove unused imports * edge model type guard	2025-07-15 11:42:08 -04:00
Daniel Chalef	aa6e38856a	[REFACTOR][FIX] Move away from DEFAULT_DATABASE environment variable in favour of driver-config support (dc) (#699 ) * fix: remove global DEFAULT_DATABASE usage in favor of driver-specific config Fixes bugs introduced in PR #607. This removes reliance on the global DEFAULT_DATABASE environment variable. It specifies the database within each driver. PR #607 introduced a Neo4j compatibility, as the database names are different when attempting to support FalkorDB. This refactor improves compatibility across database types and ensures future reliance by isolating the configuration to the driver level. * fix: make falkordb support optional This ensures that the optional dependency and subsequent import is compliant with the graphiti-core project dependencies. * chore: fmt code * chore: undo changes to uv.lock * fix: undo potentially breaking changes to drive interface * fix: ensure a default database of "None" is provided - falling back to internal default * chore: ensure default value exists for session and delete_all_indexes * chore: fix typos and grammar * chore: update package versions and dependencies in uv.lock and bulk_utils.py * docs: update database configuration instructions for Neo4j and FalkorDB Clarified default database names and how to override them in driver constructors. Updated testing requirements to include specific commands for running integration and unit tests. * fix: ensure params defaults to an empty dictionary in Neo4jDriver Updated the execute_query method to initialize params as an empty dictionary if not provided, ensuring compatibility with the database configuration. --------- Co-authored-by: Urmzd <urmzd@dal.ca>	2025-07-10 17:25:39 -04:00
Preston Rasmussen	0675ac2b7d	Bulk ingestion (#698 ) * partial * update * update * update * update * updates * updates * update * update	2025-07-10 12:14:49 -04:00
Daniel Chalef	8213d10d44	migrate to pyright (#646 ) * migrate to pyright * Refactor type checking to use Pyright, update dependencies, and clean up code. - Replaced MyPy with Pyright in configuration files and CI workflows. - Updated `pyproject.toml` and `uv.lock` to reflect new dependencies and versions. - Adjusted type hints and fixed minor code issues across various modules for better compatibility with Pyright. - Added new packages `backoff` and `posthog` to the project dependencies. * Update CI workflows to install all extra dependencies for type checking and unit tests * Update dependencies in uv.lock to replace MyPy with Pyright and add nodeenv package. Adjust type hinting in config.py for compatibility with Pyright.	2025-06-30 12:04:21 -07:00
Preston Rasmussen	2b0bc21b21	be more explicit about edge type signatures (#600 ) * be more explicit about edge type signatures * bump version * update	2025-06-18 16:01:00 -04:00
Preston Rasmussen	e8bf81fc6b	add IS_DUPLICATE_OF edges (#599 ) * add IS_DUPLICATE_OF edges * cypher query update * robust handling	2025-06-17 11:56:55 -04:00
Preston Rasmussen	14146dc46f	Add support for falkordb (#575 ) * [wip] add support for falkordb * updates * fix-async * progress * fix-issues * rm-date-handler * red-code * rm-uns-try * fix-exm * rm-un-lines * fix-comments * fix-se-utils * fix-falkor-readme * fix-falkor-cosine-score * update-falkor-ver * fix-vec-sim * min-updates * make format * update graph driver abstraction * poetry lock * updates * linter * Update graphiti_core/search/search_utils.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: Dudi Zimberknopf <zimber.dudi@gmail.com> Co-authored-by: Gal Shubeli <galshubeli93@gmail.com> Co-authored-by: Gal Shubeli <124919062+galshubeli@users.noreply.github.com> Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-06-13 12:06:57 -04:00
Preston Rasmussen	ebee09b335	Edge extraction and Node Deduplication updates (#564 ) * update tests * updated fact extraction * optimize node deduplication * linting * Update graphiti_core/utils/maintenance/edge_operations.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-06-06 12:28:52 -04:00
Preston Rasmussen	a9a6ee6bf0	edge operations update (#539 ) * edge operations update * bump version * edge name * update	2025-05-28 16:33:20 -04:00
Preston Rasmussen	5fe2f588a6	Edge type search (#537 ) * add filters * search filter * Update graphiti_core/search/search_utils.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-05-27 13:16:28 -04:00
Pavlo Paliychuk	b295f57e78	fix: update key name in edge attributes context (#531 )	2025-05-27 09:58:51 -04:00
Preston Rasmussen	db7595fe63	Edge types (#501 ) * update entity edge attributes * Adding prompts * extract fact attributes * edge types * edge types no regressions * mypy * mypy update * Update graphiti_core/prompts/dedupe_edges.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update graphiti_core/prompts/dedupe_edges.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * mypy --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-05-19 13:30:56 -04:00
Preston Rasmussen	9422b6f5fb	Node dedupe efficiency (#490 ) * update resolve extracted edge * updated edge resolution * dedupe nodes update * single pass node resolution * updates * mypy updates * Update graphiti_core/prompts/dedupe_nodes.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * remove unused imports * mypy --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-05-15 13:56:33 -04:00
Preston Rasmussen	baebe79731	updates (#463 ) * updates * bump version	2025-05-09 15:00:08 -04:00
Preston Rasmussen	e75feff45e	pre4 (#462 ) * pre4 * update * update	2025-05-08 18:25:22 -04:00
Preston Rasmussen	a5f1f03372	Add episode fix (#460 ) * fix add episode * bump version	2025-05-08 14:04:40 -04:00
prestonrasmussen	8ce9b1e157	fix bugs	2025-05-07 22:46:35 -04:00
Preston Rasmussen	1f2f1eeab5	Size optimizations (#456 ) * memory optimizations for vectors * debugged * unused import * Update graphiti_core/edges.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-05-07 20:08:30 -04:00
Preston Rasmussen	2ffc58b3da	small model fix (#432 ) * updated dedupe nodes operations * updates * Update examples/podcast/podcast_transcript.txt Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * mypy --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-05-02 10:08:25 -04:00
Preston Rasmussen	1193b25fa3	`add_episode()` refactor (#421 ) * temporal updates * update resolve nodes * dedupe edge updates * edge dedupe * extract attributes * update dynamic pydantic model * first pass of extract node attributes * no errors * bug fixes * bug fixes * prompt updates * prompt updates * updates * updates * remove unused imports * update tests based on changes * remove unused import	2025-04-30 12:08:52 -04:00
Preston Rasmussen	0b94e0e603	Bulk embed (#403 ) * add batch embeddings * bulk edge and node embeddings * update embeddings during add_episode * Update graphiti_core/embedder/client.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * mypy --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-04-26 22:09:12 -04:00
Preston Rasmussen	a26b25dc06	Add episode refactor (#399 ) * partial refactor * get relevant nodes refactor * load edges updates * refactor triplets * not there yet * node search update * working refactor * updates * mypy * mypy	2025-04-26 00:24:23 -04:00
Daniel Chalef	0f6ac57dab	chore: update version to 0.9.3 and restructure dependencies (#338 ) * Bump version from 0.9.0 to 0.9.1 in pyproject.toml and update google-genai dependency to >=0.1.0 * Bump version from 0.9.1 to 0.9.2 in pyproject.toml * Update google-genai dependency version to >=0.8.0 in pyproject.toml * loc file * Update pyproject.toml to version 0.9.3, restructure dependencies, and modify author format. Remove outdated Google API key note from README.md. * upgrade poetry and ruff	2025-04-08 20:47:38 -07:00
Preston Rasmussen	0f50b74735	Set max tokens by prompt (#255 ) * set max tokens * update generic openai client * mypy updates * fix: dockerfile --------- Co-authored-by: paulpaliychuk <pavlo.paliychuk.ca@gmail.com>	2025-01-24 10:14:49 -05:00
Preston Rasmussen	00fe87679e	Bounded semaphore - limiting concurrency (#244 ) * WIP * add semaphore * remove unused imports * remove unused imports * lower concurrency limit	2024-12-17 13:08:18 -05:00
Daniel Chalef	445dccc021	refactor: use `utc_now()` for consistent UTC datetime handling (#234 ) * ensure utc timezones * fix: dep cycle --------- Co-authored-by: paulpaliychuk <pavlo.paliychuk.ca@gmail.com>	2024-12-09 10:36:04 -08:00
Daniel Chalef	567a8ab74a	Implement OpenAI Structured Output (#225 ) * implement so * bug fixes and typing * inject schema for non-openai clients * correct datetime format * remove List keyword * Refactor node_operations.py to use updated prompt_library functions * update example	2024-12-05 07:03:18 -08:00
Preston Rasmussen	0fbe5c0704	Pagination for get by group_id (#218 ) * add pagination to subgraphs * update pagination * update LiteralString import * cleanup * cleanup * update embedding dims	2024-12-02 11:17:37 -05:00
Preston Rasmussen	a8a73ec38b	Add episode latency improvements (#214 ) * reformat prompts * update prompts * update * update * update * update * update * mypy	2024-11-13 20:13:06 -05:00
Preston Rasmussen	eba9f40ca2	add reflexion (#212 ) * add reflexion * clean up boolean logic * update conditional * cap reflexion iterations * don't do an extra reflection step	2024-11-13 11:58:56 -05:00
Preston Rasmussen	3199e893ed	add_fact endpoint (#207 ) * add_fact endpoint * bump version * add edge invalidation * update	2024-11-06 09:12:21 -05:00
Preston Rasmussen	6c3b32e620	make broader use of debug logs (#187 )	2024-10-11 16:38:56 -04:00
Preston Rasmussen	e15c872900	Fix edge invalidation (#174 ) * update edge operations * add new tests	2024-10-07 11:45:31 -04:00
Preston Rasmussen	794b705664	Group id fix (#152 ) * node distance and group_ids fixed * get all with no group_id passed * push * push * remove comments * mypy * mypy ids * please mypy * trust * last one	2024-09-24 15:55:30 -04:00
Preston Rasmussen	e398f95612	Mentions reranker (#124 ) * documentation update * update communities * mentions reranker * fix episode edge mentions * get episode mentions * add communities to mentions endpoint * rebase * defaults episodes to empty list * update	2024-09-18 15:44:28 -04:00
Preston Rasmussen	c0a740ff60	Community nodes (#103 ) * add gds * community work * save progress * community updates * e2e communities * troubleshooting * updates * communities * remove unused import	2024-09-11 12:06:35 -04:00
Preston Rasmussen	42fb590606	Add group ids (#89 ) * set and retrieve group ids * update add episode with group id support * add episode and search functional * update bulk * mypy updates * remove unused imports * update unit tests * unit tests * add optional uuid field * format * mypy * ellipsis	2024-09-06 12:33:42 -04:00
Preston Rasmussen	299021173b	Add episode refactor (#85 ) * temp commit while moving * fix name embedding bug * invalidation * format * tests on runner examples * format * ellipsis * ruff * fix * format * minor prompt change	2024-09-05 12:05:44 -04:00

54 commits