cognee/cognee
Igor Ilic 127d9860df
feat: Add dataset database handler info (#1887)

## Description
Add information about which dataset database handler is used for each dataset's database.

## Type of Change
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Pre-submission Checklist
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.


## Summary by CodeRabbit

* **New Features**
  * Datasets now record their assigned vector and graph database handlers,
    allowing per-dataset backend selection.

* **Chores**
  * Database schema expanded to store handler identifiers per dataset.
  * Deletion/cleanup processes now use dataset-level handler info for
    accurate removal across backends.

* **Tests**
  * Tests updated to include and validate the new handler fields in
    dataset creation outputs.
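
As a rough illustration of the idea summarized above, here is a minimal sketch of a per-dataset record that stores handler identifiers and a cleanup step that dispatches on them. This is not the code from this PR: `DatasetDatabaseRecord`, its field names, the `HANDLERS` registry, and `delete_dataset_databases` are all hypothetical names chosen for the example. Only the backend names (lancedb, kuzu, neo4j) come from the commit history.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class DatasetDatabaseRecord:
    """Hypothetical per-dataset row; field names are illustrative only."""

    dataset_id: str
    vector_handler: str  # identifier of the vector backend, e.g. "lancedb"
    graph_handler: str   # identifier of the graph backend, e.g. "kuzu"


def _make_deleter(backend: str) -> Callable[[str], str]:
    # Stand-in for a real backend cleanup call; it only reports what it would do.
    def delete(dataset_id: str) -> str:
        return f"{backend}:deleted:{dataset_id}"

    return delete


# Hypothetical registry mapping handler identifiers to cleanup callables.
HANDLERS: Dict[str, Callable[[str], str]] = {
    name: _make_deleter(name) for name in ("lancedb", "kuzu", "neo4j")
}


def delete_dataset_databases(record: DatasetDatabaseRecord) -> List[str]:
    """Clean up using the handlers stored on the record, not a global default."""
    results = []
    for handler_name in (record.vector_handler, record.graph_handler):
        if handler_name not in HANDLERS:
            raise ValueError(f"Unknown dataset database handler: {handler_name}")
        results.append(HANDLERS[handler_name](record.dataset_id))
    return results
```

Because the identifiers are kept on the record, cleanup in this sketch still targets the right backends even if the global configuration changes after the dataset was created.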

2025-12-12 13:22:03 +01:00
api feat: make pipeline processing cache optional (#1876) 2025-12-12 13:11:31 +01:00
cli feat: csv ingestion loader & chunk 2025-10-22 16:56:46 +08:00
eval_framework Merge branch 'main' into merge-main-vol7 2025-12-11 19:11:24 +01:00
exceptions Update deprecated Exception status codes 2025-10-22 17:38:41 +01:00
infrastructure feat: Add dataset database handler info (#1887) 2025-12-12 13:22:03 +01:00
memify_pipelines feature: adds triplet embedding via memify (#1832) 2025-12-02 18:27:08 +01:00
modules feat: Add dataset database handler info (#1887) 2025-12-12 13:22:03 +01:00
shared feat: add dataset database handler logic and neo4j/lancedb/kuzu handlers (#1776) 2025-12-11 14:15:20 +01:00
tasks Merge branch 'main' into merge-main-vol7 2025-12-11 19:11:24 +01:00
tests feat: Add dataset database handler info (#1887) 2025-12-12 13:22:03 +01:00
__init__.py feat: Add initial custom pipeline (#1716) 2025-11-04 17:58:34 +01:00
__main__.py
base_config.py Merge branch 'dev' into fix_issue_with_base_config 2025-10-29 11:42:25 +01:00
context_global_variables.py fix: Resolve DB caching issues when deleting databases 2025-12-03 18:05:47 +01:00
get_token.py
low_level.py
pipelines.py
root_dir.py fix: Add S3 URL handling in ensure_absolute_path function (#1438) 2025-09-18 11:47:34 +02:00
version.py