Enhanced the edge deduplication prompts to better recognize semantically
equivalent facts that use different phrasings:
- Self-referential relationships ("X is a sub-agency of X" = "X is its own sub-agency")
- Active vs passive voice ("A awarded contract to B" = "B received contract from A")
- Numeric format equivalence ($1M = $1,000,000)
- Entity aliases (DoD = Department of Defense)
Added integration tests that verify the LLM correctly identifies semantic
duplicates with the improved prompts.
|
||
|---|---|---|
| .. | ||
| cross_encoder | ||
| driver | ||
| embedder | ||
| evals | ||
| llm_client | ||
| utils | ||
| helpers_test.py | ||
| test_edge_int.py | ||
| test_entity_exclusion_int.py | ||
| test_graphiti_int.py | ||
| test_graphiti_mock.py | ||
| test_node_int.py | ||
| test_text_utils.py | ||