graphiti

Author	SHA1	Message	Date
Daniel Chalef	196eb2f077	Remove JSON indentation from prompts to reduce token usage (#985 ) Changes to `to_prompt_json()` helper to default to minified JSON (no indentation) instead of 2-space indentation. This reduces token consumption in LLM prompts while maintaining all necessary information. - Changed default `indent` parameter from `2` to `None` in `prompt_helpers.py` - Updated all prompt modules to remove explicit `indent=2` arguments - Minor code formatting fixes in LLM clients 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-10-06 16:08:43 -07:00
Daniel Chalef	8770012745	Refactor prompt structure: move MESSAGES after instructions (#980 ) * Refactor prompt structure: move MESSAGES after instructions Reordered prompt structure in extract_nodes.py to place MESSAGES section after instructions/guidelines in both extract_attributes and extract_summary functions for improved prompt clarity. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * Add sentence-aware text truncator for entity summaries - Created truncate_at_sentence() utility function that truncates text at sentence boundaries while respecting max character limits - Added MAX_SUMMARY_CHARS constant (250 chars) for entity summaries - Applied truncator to entity summaries in prompts (extract_nodes.py) - Applied truncator to LLM-generated summaries (node_operations.py) - Added comprehensive test suite for truncation logic 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * Clean up formatting in extract_attributes prompt - Remove extra blank lines - Fix indentation of MESSAGES tag 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * Bump version to 0.22.0pre3 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-04 19:06:32 -07:00
Daniel Chalef	896cb4e990	Refactor summary prompts to use character limit and prevent meta-commentary (#979 ) * Refactor summary prompts to use character limit and prevent meta-commentary - Changed summary length constraint from "8 sentences" to "250 characters" for more predictable output - Created reusable summary_instructions snippet in snippets.py with clear BAD/GOOD examples - Added explicit instruction to output only factual content without meta-commentary - Applied consistent formatting across extract_nodes.py and summarize_nodes.py - Bumped version to 0.22.0pre2 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * Add copyright header to snippets.py 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-04 15:44:00 -07:00
Daniel Chalef	8a78633e2f	Enforce shorter summaries with 8 sentence limit (#978 ) * Enforce shorter summaries with 8 sentence limit Replace 250-word limit with 8 sentence limit for node summaries to improve conciseness. Also update prompt system message for summarize_context to better reflect its dual purpose of generating summaries and attributes. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * Update graphiti_core/prompts/summarize_nodes.py Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com> * Bump version to 0.22.0pre1 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * Update graphiti_core/prompts/summarize_nodes.py 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>	2025-10-04 14:37:16 -07:00
Daniel Chalef	590282524a	fix: Improve edge extraction entity ID validation (#968 ) * fix: Improve edge extraction entity ID validation Fixes invalid entity ID references in edge extraction that caused warnings like: "WARNING: source or target node not filled WILL_FIND. source_node_uuid: 23 and target_node_uuid: 3" Changes: - Format ENTITIES list as proper JSON in prompt for better LLM parsing - Clarify field descriptions to reference entity id from ENTITIES list - Add explicit entity ID validation as #1 extraction rule with examples - Improve error logging (removed PII, added entity count and valid range) These changes follow patterns from extract_nodes.py and dedupe_nodes.py where entity referencing works reliably. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * wip * fix: Align fact field naming and add description - Change extraction rule to reference 'fact' instead of 'fact_text' - Add descriptive text for fact field in Edge model * fix: Remove ensure_ascii parameter from to_prompt_json call Align with other to_prompt_json calls that don't use ensure_ascii * fix: Use validated target_node_idx variable consistently Line 190 was using raw edge_data.target_entity_id instead of the validated target_node_idx variable, creating inconsistency with line 189 * fix: Improve edge extraction validation checks - Add explicit check for empty nodes list - Use more explicit 0 <= idx comparison instead of -1 < idx - Prevents nonsensical error message when no entities provided * chore: Restore uv.lock from main branch Previously deleted in commit `7e4464b`, now restored to match main branch state * Update uv.lock --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-02 22:45:11 -07:00
Daniel Chalef	4a307dbf10	Optimize edge deduplication prompt for caching and clarity (#970 ) * Optimize edge deduplication prompt for caching and clarity - Restructure prompt to place invariant instructions at top and dynamic context at bottom for better LLM caching - Change 'id' to 'idx' in edge context lists to avoid confusion with other identifiers - Remove 'fact_type_id' from edge types context as LLM only needs fact_type_name - Remove dynamic range values from prompt instructions (e.g., "range 0-N") - Add debug logging before LLM call to track input sizes - Add validation logging after LLM response to catch invalid idx values - Clarify that duplicate_facts uses EXISTING FACTS idx and contradicted_facts uses INVALIDATION CANDIDATES idx 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * Address terminology consistency and edge case logging - Update Pydantic field descriptions to use 'idx' instead of 'ids' for consistency - Fix debug logging to handle empty list edge case (avoid 'idx 0--1' display) Note on review feedback: - Validation is intentionally non-redundant: warnings provide visibility, list comprehensions ensure robustness - WARNING level is appropriate for LLM output issues (not system errors) - Existing test coverage is sufficient for this defensive logging addition 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-02 17:07:43 -07:00
Daniel Chalef	b28bd92c16	Remove ensure_ascii configuration parameter (#969 ) * Remove ensure_ascii configuration parameter - Changed to_prompt_json default from ensure_ascii=True to False - Removed ensure_ascii parameter from Graphiti.__init__ and GraphitiClients - Removed ensure_ascii from all function signatures and context dictionaries - Removed ensure_ascii from all test files - All JSON serialization now preserves Unicode characters by default 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * format --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-02 15:10:57 -07:00
Daniel Chalef	5ca8b9565c	fix: Improve deduplication ID validation and logging (#965 ) * fix: Improve deduplication ID validation and logging - Add comprehensive logging to verify IDs sent to LLM (sent vs received) - Enhance prompt with explicit ID bounds (0 through N-1) - Add validation warnings for missing and extra IDs from LLM responses - Improve error message clarity for invalid dedupe IDs - Log actual IDs sent to LLM to confirm no index leakage This helps diagnose cases where the LLM returns IDs outside the valid range (e.g., ID 19 when only 0-18 were sent). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Remove redundant logging parameter Address reviewer comment about redundant third parameter in debug log statement. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Address reviewer comments on list slicing and prompt clarity - Fix list slicing bug: change <= to < to avoid gap when exactly 20 elements (previously would skip element 10 when showing 21 elements) - Consolidate redundant prompt phrasing while maintaining clarity (reduced from 3 sentences to 2, keeping essential constraints) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: Remove redundant prompt text to reduce token usage Consolidate 'using these exact IDs (0 through N-1)' with following sentence to eliminate repetition. Changes: - 'using these exact IDs (0 through {N-1}). Do not skip IDs or use IDs outside this range' - 'with IDs 0 through {N-1}. Do not skip or add IDs' Saves ~15 tokens per deduplication call. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-02 12:22:07 -07:00
Jack Ryan	59fcc9545f	fix: Fix typo in JSON entity extraction prompt (#953 ) * fix: Fix typo in JSON entity extraction prompt Change "an entities" to "any entities" in guideline 1 of the extract_json prompt. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * Update graphiti_core/prompts/extract_nodes.py Co-authored-by: Daniel Chalef <131175+danielchalef@users.noreply.github.com> --------- Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Daniel Chalef <131175+danielchalef@users.noreply.github.com>	2025-10-01 11:23:39 -05:00
Daniel Chalef	7bd8f8a2f2	chore: Update edge extraction prompt to paraphrase instead of quote (#957 ) * chore: Update edge extraction prompt to paraphrase instead of quote - Changed instruction 5 to request paraphrasing rather than verbatim quoting - Updated string quotes to use double quotes for consistency 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * chore: Format edge_operations.py and update lock file - Minor formatting fix in edge_operations.py list comprehension - Update uv.lock with version bump to 0.21.0rc8 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-10-01 09:05:04 -07:00
Jack Ryan	f632a8ae9e	Improve JSON entity extraction prompt (#949 ) Add guideline to extract entities from all JSON properties, not just primary fields like name/user. This ensures comprehensive entity extraction while maintaining the existing exclusion of date properties. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-authored-by: Claude <noreply@anthropic.com>	2025-09-30 11:00:14 -04:00
Daniel Chalef	02efeea8e9	Improve node dedup prompts (#942 ) * remove poetry.lock * Improve dedup prompts * normalize string formatting in dedupe_nodes.py to use single quotes	2025-09-28 12:18:06 -07:00
Preston Rasmussen	1c27a3563b	update prompts and support thinking models (#846 ) * update prompts and support thinking models * update * type ignore	2025-08-19 12:31:50 -04:00
Preston Rasmussen	c28bde6b07	fix typo and model selector (#843 ) * fix typo and model selector * bump version	2025-08-18 11:15:45 -04:00
Preston Rasmussen	baa6825708	ensure ascii default to false (#817 )	2025-08-08 11:20:02 -04:00
HUGO SON	ce9ef3ca79	Add support for non-ASCII characters in LLM prompts (#805 ) * Add support for non-ASCII characters in LLM prompts - Add ensure_ascii parameter to Graphiti class (default: True) - Create to_prompt_json helper function for consistent JSON serialization - Update all prompt files to use new helper function - Preserve Korean/Japanese/Chinese characters when ensure_ascii=False - Maintain backward compatibility with existing behavior Fixes issue where non-ASCII characters were escaped as unicode sequences in prompts, making them unreadable in LLM logs and potentially affecting model understanding. * Remove unused json imports after replacing with to_prompt_json helper - Fix ruff lint errors (F401) for unused json imports - All prompt files now use to_prompt_json helper instead of json.dumps - Maintains clean code style and passes lint checks * Fix ensure_ascii propagation to all LLM calls - Add ensure_ascii parameter to maintenance operation functions that were missing it - Update function signatures in node_operations, community_operations, temporal_operations, and edge_operations - Ensure all llm_client.generate_response calls receive proper ensure_ascii context - Fix hardcoded ensure_ascii: True values that prevented non-ASCII character preservation - Maintain backward compatibility with default ensure_ascii=True - Complete the fix for issue #804 ensuring Korean/Japanese/Chinese characters are properly handled in LLM prompts	2025-08-08 11:07:32 -04:00
Preston Rasmussen	ab8106cb4f	move summary out of attribute extraction (#792 ) * move summary out of attribute extraction * linter * linter * fix db query	2025-07-31 12:15:21 -04:00
prestonrasmussen	6f3a4f19eb	dedupe prompt update	2025-07-29 18:43:42 -04:00
Preston Rasmussen	19bddb5528	validate pydantic objects (#783 ) * validate pydantic objects * unused imports * linter	2025-07-29 17:54:09 -04:00
Preston Rasmussen	5d45d71259	Bulk updates (#732 ) * updates * update * update * typo * linter	2025-07-16 02:26:33 -04:00
Preston Rasmussen	0675ac2b7d	Bulk ingestion (#698 ) * partial * update * update * update * update * updates * updates * update * update	2025-07-10 12:14:49 -04:00
prestonrasmussen	c6f7c98598	updates	2025-06-26 15:38:05 -04:00
Preston Rasmussen	2b27353097	Node name bug fix (#622 ) * fixes * fix bugs * change version	2025-06-24 17:13:27 -04:00
Preston Rasmussen	760ca7e90c	Node name bug (#605 ) * prompt update * prompt update * revert quickstart changes	2025-06-18 18:20:28 -04:00
Preston Rasmussen	2b0bc21b21	be more explicit about edge type signatures (#600 ) * be more explicit about edge type signatures * bump version * update	2025-06-18 16:01:00 -04:00
Preston Rasmussen	e8bf81fc6b	add IS_DUPLICATE_OF edges (#599 ) * add IS_DUPLICATE_OF edges * cypher query update * robust handling	2025-06-17 11:56:55 -04:00
Preston Rasmussen	ebee09b335	Edge extraction and Node Deduplication updates (#564 ) * update tests * updated fact extraction * optimize node deduplication * linting * Update graphiti_core/utils/maintenance/edge_operations.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-06-06 12:28:52 -04:00
Preston Rasmussen	db7595fe63	Edge types (#501 ) * update entity edge attributes * Adding prompts * extract fact attributes * edge types * edge types no regressions * mypy * mypy update * Update graphiti_core/prompts/dedupe_edges.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update graphiti_core/prompts/dedupe_edges.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * mypy --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-05-19 13:30:56 -04:00
Preston Rasmussen	9422b6f5fb	Node dedupe efficiency (#490 ) * update resolve extracted edge * updated edge resolution * dedupe nodes update * single pass node resolution * updates * mypy updates * Update graphiti_core/prompts/dedupe_nodes.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * remove unused imports * mypy --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-05-15 13:56:33 -04:00
Preston Rasmussen	fd9969b5a1	Update dedupe prompt (#457 ) * improve dedupe logic * cut summary length * update unit tests	2025-05-07 23:23:31 -04:00
Preston Rasmussen	6b85e92105	Fix empty node name issues (#433 ) * fixes * fix * remove unused imports * format * bump version	2025-05-02 12:16:26 -04:00
Preston Rasmussen	2ffc58b3da	small model fix (#432 ) * updated dedupe nodes operations * updates * Update examples/podcast/podcast_transcript.txt Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * mypy --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-05-02 10:08:25 -04:00
Preston Rasmussen	1193b25fa3	`add_episode()` refactor (#421 ) * temporal updates * update resolve nodes * dedupe edge updates * edge dedupe * extract attributes * update dynamic pydantic model * first pass of extract node attributes * no errors * bug fixes * bug fixes * prompt updates * prompt updates * updates * updates * remove unused imports * update tests based on changes * remove unused import	2025-04-30 12:08:52 -04:00
Preston Rasmussen	f578ee2177	prompt update (#378 )	2025-04-18 00:09:12 -04:00
Preston Rasmussen	5274970be3	reduce entity type attribute hallucinations (#365 ) * reduce entity type attribute hallucinations * reduce entity type attribute hallucinations * reduce entity type attribute hallucinations * mypy fix * mypy fix * mypy fix	2025-04-16 19:09:25 -04:00
Preston Rasmussen	5dce26722e	e2e graph builder eval (#343 ) * add partial eval platform * dedupe updates * add e2e eval * Update graphiti_core/prompts/eval.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * clear all outputs * clear all outputs * squash eval commits * Update tests/evals/data/utils.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * add longmemeval disclaimer * remove gitignore * add copyright headers * add cli * Update tests/evals/data/longmemeval_data/README.md Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * Update tests/evals/eval_e2e_graph_building.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * updates --------- Co-authored-by: jackaldenryan <jackaldenryan@gmail.com> Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-04-12 10:35:22 -04:00
Daniel Chalef	7f7a17c926	chore: update dependencies and refactor type hinting (#339 ) * Bump version from 0.9.0 to 0.9.1 in pyproject.toml and update google-genai dependency to >=0.1.0 * Bump version from 0.9.1 to 0.9.2 in pyproject.toml * Update google-genai dependency version to >=0.8.0 in pyproject.toml * loc file * Update pyproject.toml to version 0.9.3, restructure dependencies, and modify author format. Remove outdated Google API key note from README.md. * upgrade poetry and ruff * Update README.md to include installation instructions for Graphiti with Google Gemini support * fix to deps since peotry doesn't fully implement PEP 735 * Refactor string formatting in various files to use single quotes for consistency and improve readability. This includes updates in agent.ipynb, quickstart.py, multiple prompt files, and ingest.py and retrieve.py modules. * Remove optional dependencies from pyproject.toml to streamline project requirements.	2025-04-09 08:05:26 -07:00
Daniel Chalef	0f6ac57dab	chore: update version to 0.9.3 and restructure dependencies (#338 ) * Bump version from 0.9.0 to 0.9.1 in pyproject.toml and update google-genai dependency to >=0.1.0 * Bump version from 0.9.1 to 0.9.2 in pyproject.toml * Update google-genai dependency version to >=0.8.0 in pyproject.toml * loc file * Update pyproject.toml to version 0.9.3, restructure dependencies, and modify author format. Remove outdated Google API key note from README.md. * upgrade poetry and ruff	2025-04-08 20:47:38 -07:00
Daniel Chalef	9e78890f2e	Gemini support (#324 ) * first cut * Update dependencies and enhance README for optional LLM providers - Bump aiohttp version from 3.11.14 to 3.11.16 - Update yarl version from 1.18.3 to 1.19.0 - Modify pyproject.toml to include optional extras for Anthropic, Groq, and Google Gemini - Revise README.md to reflect new optional LLM provider installation instructions and clarify API key requirements * Remove deprecated packages from poetry.lock and update content hash - Removed cachetools, google-auth, google-genai, pyasn1, pyasn1-modules, rsa, and websockets from the lock file. - Added new extras for anthropic, google-genai, and groq. - Updated content hash to reflect changes. * Refactor import paths for GeminiClient in README and __init__.py - Updated import statement in README.md to reflect the new module structure for GeminiClient. - Removed GeminiClient from the __all__ list in __init__.py as it is no longer directly imported. * Refactor import paths for GeminiEmbedder in README and __init__.py - Updated import statement in README.md to reflect the new module structure for GeminiEmbedder. - Removed GeminiEmbedder and GeminiEmbedderConfig from the __all__ list in __init__.py as they are no longer directly imported.	2025-04-06 09:27:04 -07:00
Preston Rasmussen	04203506d9	fix bug with updating node type attributes (#305 ) fix bug with saving new properties	2025-03-26 12:37:48 -04:00
Preston Rasmussen	f73867e0fa	Entity classification updates (#285 ) * updates * tested * remove unused imports * llm outputs will be dicts rather than pydantic models * removed unused imports	2025-03-05 12:08:11 -05:00
Preston Rasmussen	7f20b21572	Entity attributes in prompts (#284 ) * add node attributes to prompts * tested * attribute update	2025-03-04 16:34:19 -05:00
Preston Rasmussen	6f874730f3	Entity classification updates (#281 ) * node classification updates * update * remove unused code * update	2025-02-27 15:12:50 -05:00
Preston Rasmussen	29a071b2b8	Custom ontology (#262 ) * ontology * extract and save node labels * extract entity type properties * neo4j upgrade needed * add entity types * update typing * update types * updates * Update graphiti_core/utils/maintenance/node_operations.py Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com> * fix warning * mypy updates * update properties * mypy ignore * mypy types * bump version --------- Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2025-02-13 12:17:52 -05:00
Preston Rasmussen	0e45d15462	Add triple update (#263 ) * update add-triplet * test fixes	2025-02-12 12:04:43 -05:00
Preston Rasmussen	7ca6121cde	update summary length (#227 )	2024-12-05 15:51:31 -05:00
Daniel Chalef	567a8ab74a	Implement OpenAI Structured Output (#225 ) * implement so * bug fixes and typing * inject schema for non-openai clients * correct datetime format * remove List keyword * Refactor node_operations.py to use updated prompt_library functions * update example	2024-12-05 07:03:18 -08:00
Preston Rasmussen	427c73d2a8	add unicode escape (#224 ) * add unicode escape * bump version	2024-12-03 11:52:49 -05:00
Preston Rasmussen	a8a73ec38b	Add episode latency improvements (#214 ) * reformat prompts * update prompts * update * update * update * update * update * mypy	2024-11-13 20:13:06 -05:00
Preston Rasmussen	eba9f40ca2	add reflexion (#212 ) * add reflexion * clean up boolean logic * update conditional * cap reflexion iterations * don't do an extra reflection step	2024-11-13 11:58:56 -05:00

1 2

61 commits