graphiti/tests
Daniel Chalef 9aee3174bd
Refactor batch deduplication logic to enhance node resolution and track duplicate pairs (#929) (#936)
* Refactor deduplication logic to enhance node resolution and track duplicate pairs (#929)

* Simplify deduplication process in bulk_utils by reusing canonical nodes.
* Update dedup_helpers to store duplicate pairs during resolution.
* Modify node_operations to append duplicate pairs when resolving nodes.
* Add tests to verify deduplication behavior and ensure correct state updates.

* revert to concurrent dedup with fanout and then reconciliation

* add performance note for deduplication loop in bulk_utils

* enhance deduplication logic in bulk_utils to handle missing canonical nodes gracefully

* Update graphiti_core/utils/bulk_utils.py

Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>

* refactor deduplication logic in bulk_utils to use directed union-find for canonical UUID resolution

* implement _build_directed_uuid_map for efficient UUID resolution in bulk_utils

* document directed union-find lookup in bulk_utils for clarity
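
The directed union-find lookup named in the bullets above can be illustrated with a minimal sketch. This is not the code from `graphiti_core/utils/bulk_utils.py`. The function name `build_directed_uuid_map`, the `(duplicate_uuid, canonical_uuid)` pair format, and the path-compression detail are all assumptions made for illustration, and the real `_build_directed_uuid_map` may differ. The idea shown is that each duplicate pair is treated as a directed edge, so the right-hand side of a pair always wins as the canonical root.

```python
from typing import Dict, Iterable, Tuple


def build_directed_uuid_map(pairs: Iterable[Tuple[str, str]]) -> Dict[str, str]:
    """Map every UUID seen in `pairs` to its canonical UUID.

    Hypothetical helper: each pair is assumed to be
    (duplicate_uuid, canonical_uuid), i.e. a directed edge.
    """
    parent: Dict[str, str] = {}

    def find(uuid: str) -> str:
        # Register unseen UUIDs as their own root.
        parent.setdefault(uuid, uuid)
        # Walk up to the root of this UUID's chain.
        root = uuid
        while parent[root] != root:
            root = parent[root]
        # Path compression: point every node on the walk directly at the root.
        while parent[uuid] != root:
            parent[uuid], uuid = root, parent[uuid]
        return root

    for duplicate_uuid, canonical_uuid in pairs:
        duplicate_root = find(duplicate_uuid)
        canonical_root = find(canonical_uuid)
        if duplicate_root != canonical_root:
            # Directed union: the duplicate's root always points at the
            # canonical's root, never the other way around.
            parent[duplicate_root] = canonical_root

    # Resolve every known UUID to its final canonical root.
    return {uuid: find(uuid) for uuid in list(parent)}


if __name__ == "__main__":
    # Chained duplicates: a -> b, b -> c, and d -> c all collapse onto c.
    print(build_directed_uuid_map([("a", "b"), ("b", "c"), ("d", "c")]))
```

Unlike a rank-based union-find, which may choose either root when merging two sets, the directed variant keeps the direction of each pair. This matters when one side of the pair is the node that should survive deduplication.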

---------

Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
2025-09-26 08:40:18 -07:00
cross_encoder Gemini client improvements; Gemini reranker (#645) 2025-06-30 12:55:17 -07:00
driver chore/prepare kuzu integration (#762) 2025-07-29 09:07:34 -04:00
embedder save edge update (#721) 2025-07-14 11:15:38 -04:00
evals add_episode() refactor (#421) 2025-04-30 12:08:52 -04:00
llm_client save edge update (#721) 2025-07-14 11:15:38 -04:00
utils Refactor batch deduplication logic to enhance node resolution and track duplicate pairs (#929) (#936) 2025-09-26 08:40:18 -07:00
helpers_test.py Add support for Kuzu as the graph driver (#799) 2025-08-27 11:45:21 -04:00
test_edge_int.py Improve node deduplication w/ deterministic matching, LLM fallbacks (#929) 2025-09-25 07:13:19 -07:00
test_entity_exclusion_int.py Add support for Kuzu as the graph driver (#799) 2025-08-27 11:45:21 -04:00
test_graphiti_int.py Add support for Kuzu as the graph driver (#799) 2025-08-27 11:45:21 -04:00
test_graphiti_mock.py don't return index labels (#887) 2025-09-02 12:02:33 -04:00
test_node_int.py Improve node deduplication w/ deterministic matching, LLM fallbacks (#929) 2025-09-25 07:13:19 -07:00