Enhanced the edge deduplication prompts to better recognize semantically
equivalent facts that use different phrasings:
- Self-referential relationships ("X is a sub-agency of X" = "X is its own sub-agency")
- Active vs passive voice ("A awarded contract to B" = "B received contract from A")
- Numeric format equivalence ($1M = $1,000,000)
- Entity aliases (DoD = Department of Defense)
Added integration tests that verify the LLM correctly identifies semantic
duplicates with the improved prompts.
|
||
|---|---|---|
| .. | ||
| test_bulk_utils.py | ||
| test_edge_operations.py | ||
| test_edge_operations_int.py | ||
| test_node_operations.py | ||
| test_temporal_operations_int.py | ||