graphiti

History

Nate Shumway 0f10fb6bdd Improve semantic equivalence detection in edge deduplication Enhanced the edge deduplication prompts to better recognize semantically equivalent facts that use different phrasings: - Self-referential relationships ("X is a sub-agency of X" = "X is its own sub-agency") - Active vs passive voice ("A awarded contract to B" = "B received contract from A") - Numeric format equivalence ($1M = $1,000,000) - Entity aliases (DoD = Department of Defense) Added integration tests that verify the LLM correctly identifies semantic duplicates with the improved prompts.		2025-12-09 19:01:05 -06:00
..
__init__.py
dedupe_edges.py	Improve semantic equivalence detection in edge deduplication	2025-12-09 19:01:05 -06:00
dedupe_nodes.py	Remove JSON indentation from prompts to reduce token usage (#985 )	2025-10-06 16:08:43 -07:00
eval.py	Remove ensure_ascii configuration parameter (#969 )	2025-10-02 15:10:57 -07:00
extract_edge_dates.py	prompt update (#378 )	2025-04-18 00:09:12 -04:00
extract_edges.py	Remove JSON indentation from prompts to reduce token usage (#985 )	2025-10-06 16:08:43 -07:00
extract_nodes.py	Remove JSON indentation from prompts to reduce token usage (#985 )	2025-10-06 16:08:43 -07:00
invalidate_edges.py	Node dedupe efficiency (#490 )	2025-05-15 13:56:33 -04:00
lib.py
models.py	chore: update version to 0.9.3 and restructure dependencies (#338 )	2025-04-08 20:47:38 -07:00
prompt_helpers.py	Remove JSON indentation from prompts to reduce token usage (#985 )	2025-10-06 16:08:43 -07:00
snippets.py	Refactor summary prompts to use character limit and prevent meta-commentary (#979 )	2025-10-04 15:44:00 -07:00
summarize_nodes.py	Remove JSON indentation from prompts to reduce token usage (#985 )	2025-10-06 16:08:43 -07:00