cognee/cognee/modules/pipelines
Igor Ilic ede884e0b0
feat: make pipeline processing cache optional (#1876)

## Description
Make the pipeline cache mechanism optional. It is turned off by default,
but `add` and `cognify` continue to use it as they have until now.
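
A minimal sketch of the behavior described above, not cognee's actual code: `run_pipeline`, the `use_pipeline_cache` parameter, the pipeline names, and the in-memory `_completed` set are all hypothetical names chosen for illustration. It shows a cache that is off by default, with `add` and `cognify` opting in explicitly.

```python
# Illustrative sketch only; every name here is hypothetical, not cognee's real API.
_completed: set = set()  # (pipeline_name, dataset) pairs that already finished


def run_pipeline(pipeline_name: str, dataset: str, use_pipeline_cache: bool = False) -> str:
    """Run a pipeline, skipping finished work only when the cache is enabled."""
    key = (pipeline_name, dataset)
    if use_pipeline_cache and key in _completed:
        return "skipped"  # early exit: this pipeline already ran for this dataset
    # ... the pipeline's tasks would execute here ...
    _completed.add(key)
    return "executed"


def add(dataset: str) -> str:
    # add opts in, keeping the caching behavior it had before the change
    return run_pipeline("add_pipeline", dataset, use_pipeline_cache=True)


def cognify(dataset: str) -> str:
    # cognify opts in to the cache as well
    return run_pipeline("cognify_pipeline", dataset, use_pipeline_cache=True)
```

Under these assumptions, a second `add("docs")` call returns `"skipped"`, while calling `run_pipeline("custom", "docs")` twice executes both times because the default leaves the cache off.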

## Type of Change
- [ ] Bug fix (non-breaking change that fixes an issue)
- [x] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Pre-submission Checklist
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.


## Summary by CodeRabbit

* **New Features**
  * Introduced pipeline caching across ingestion, processing, and custom
    pipeline flows with per-run controls to enable or disable caching.
  * Added an option for incremental loading in custom pipeline runs.

* **Behavior Changes**
  * One pipeline path now explicitly bypasses caching by default to always
    re-run when invoked.
  * Disabling cache forces re-processing instead of early exit; cache
    reset still enables re-execution.

* **Tests**
  * Added tests validating caching, non-caching, and cache-reset
    re-execution behavior.

* **Chores**
  * Added CI job to run pipeline caching tests.
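
The cache, no-cache, and cache-reset cases listed in the summary can be sketched as follows. This assumes a hypothetical in-memory status store; `PipelineStatusStore`, `run`, and `reset` are illustrative names, not cognee's API or its actual tests.

```python
# Illustrative sketch only; the class and method names are hypothetical.
class PipelineStatusStore:
    """Tracks which (pipeline, dataset) pairs have completed."""

    def __init__(self) -> None:
        self._completed: set = set()
        self.executions = 0  # how many times tasks actually ran

    def run(self, pipeline: str, dataset: str, use_cache: bool) -> bool:
        """Return True if tasks ran, False if a cached status caused an early exit."""
        key = (pipeline, dataset)
        if use_cache and key in self._completed:
            return False
        self.executions += 1
        self._completed.add(key)
        return True

    def reset(self, pipeline: str, dataset: str) -> None:
        """Forget the completed status so the next cached run executes again."""
        self._completed.discard((pipeline, dataset))


store = PipelineStatusStore()
assert store.run("p", "d", use_cache=True) is True    # first run executes
assert store.run("p", "d", use_cache=True) is False   # cached: early exit
assert store.run("p", "d", use_cache=False) is True   # cache disabled: re-processes
store.reset("p", "d")
assert store.run("p", "d", use_cache=True) is True    # after reset: executes again
```

The sketch keeps the completion status separate from the `use_cache` flag, so a reset affects only what cached runs see, and an uncached run never consults the status at all.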

2025-12-12 13:11:31 +01:00

| Name | Last commit | Date |
| --- | --- | --- |
| exceptions | Update deprecated Exception status codes | 2025-10-22 17:38:41 +01:00 |
| layers | refactor: Unify dataset resolution (#1488) | 2025-10-07 19:14:46 +02:00 |
| methods | feat: migrate pipeline status reset to add pipeline (#1289) | 2025-08-26 19:06:08 +02:00 |
| models | fix: Resolve issue with Data object serialization (#1407) | 2025-09-17 11:14:05 +02:00 |
| operations | feat: make pipeline processing cache optional (#1876) | 2025-12-12 13:11:31 +01:00 |
| queues | feat: websockets for pipeline update streaming (#851) | 2025-06-11 20:29:26 +02:00 |
| tasks | feature: tighten run_tasks_base (#730) | 2025-04-16 09:19:03 +02:00 |
| utils | feat: Add dataset_id to pipeline run info and status (#1009) | 2025-06-30 11:53:17 +02:00 |
| \_\_init\_\_.py | refactor: make run_pipeline a high-level api for running pipelines (#1294) | 2025-08-27 09:49:20 +02:00 |