Commit graph

3900 commits

Author SHA1 Message Date
hajdul88
47f0b577df
Merge branch 'dev' into feature/cog-3160-redis-session-conversation 2025-10-21 09:12:14 +02:00
Vasilije
915aa5184e
fix: Resolve issue with plain text files not having magic file info (#1564)
<!-- .github/pull_request_template.md -->

## Description
Some plain text files don't carry magic binary info identifying the file
type. If the file type guess cannot deduce the file type, treat the file
as plain text.
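
The fallback described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `FileType`, `_MAGIC_SIGNATURES`, `PLAIN_TEXT`, and `guess_file_type` are hypothetical names, and the signature table is deliberately tiny.

```python
from typing import NamedTuple


class FileType(NamedTuple):
    extension: str
    mime: str


# Hypothetical signature table, for illustration only.
_MAGIC_SIGNATURES = {
    b"%PDF-": FileType("pdf", "application/pdf"),
    b"\x89PNG\r\n\x1a\n": FileType("png", "image/png"),
}

# Assumed default when no signature matches.
PLAIN_TEXT = FileType("txt", "text/plain")


def guess_file_type(header: bytes) -> FileType:
    """Return the type matching the leading magic bytes, else plain text."""
    for signature, file_type in _MAGIC_SIGNATURES.items():
        if header.startswith(signature):
            return file_type
    # No magic bytes recognised: assume the file is plain text.
    return PLAIN_TEXT
```

With this sketch, `guess_file_type(b"hello world")` returns `PLAIN_TEXT` instead of failing because no signature matched.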

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-20 18:47:38 +02:00
hajdul88
c42f8392e1
Merge branch 'dev' into feature/cog-3160-redis-session-conversation 2025-10-20 17:28:12 +02:00
hajdul88
df038365c8
fix: fixes id in get_filtered_graph_data (#1569)
<!-- .github/pull_request_template.md -->

## Description
Fixes get_filtered_graph_data method in neo4jAdapter.


## Type of Change
<!-- Please check the relevant option -->
- [x] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-20 17:27:49 +02:00
hajdul88
dd8afe42f8
Merge branch 'dev' into feature/cog-3160-redis-session-conversation 2025-10-20 15:21:56 +02:00
Igor Ilic
0be56ee762
Merge branch 'dev' into fix-plain-txt-file-type 2025-10-20 15:08:59 +02:00
Igor Ilic
3e54b67b4d
fix: Resolve missing argument for distributed (#1563)
<!-- .github/pull_request_template.md -->

## Description
Resolve missing argument for distributed

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-10-20 15:03:35 +02:00
Igor Ilic
09c10286bd
Merge branch 'dev' into fix-plain-txt-file-type 2025-10-20 14:44:46 +02:00
hajdul88
d2d2cfb477
Merge branch 'dev' into feature/cog-3160-redis-session-conversation 2025-10-20 13:31:33 +02:00
Vasilije
407352d586
Revert "fix: search without prior cognify" (#1567)
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin

Reverts topoteretes/cognee#1548
2025-10-20 13:19:02 +02:00
hajdul88
07caedde08
Merge branch 'dev' into feature/cog-3160-redis-session-conversation 2025-10-20 13:03:38 +02:00
Vasilije
ef80eeb147
chore: update colab notebook on README (#1565)
<!-- .github/pull_request_template.md -->

## Description

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [x] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-20 12:56:55 +02:00
Daulet Amirkhanov
279d6e80f0
Revert "fix: search without prior cognify" 2025-10-20 11:56:15 +01:00
Hande
8c627d9e10 chore: update colab notebook on README 2025-10-20 12:03:40 +02:00
hajdul88
49e9d7dc27 chore: renames conversation history save method 2025-10-20 10:28:03 +02:00
hajdul88
4d3ba3f313
Merge branch 'dev' into feature/cog-3160-redis-session-conversation 2025-10-20 10:05:57 +02:00
Vasilije
055042af0f
fix: Resolve issue with multi-user mode search (#1562)
<!-- .github/pull_request_template.md -->

## Description
Resolve issue stopping multi-user search capability

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-20 07:00:07 +02:00
Igor Ilic
f88277c467 fix: Resolve issue with plain text files not having magic file info 2025-10-19 23:10:53 +02:00
Igor Ilic
400095d76d fix: Resolve issue with multi-user mode search 2025-10-19 21:30:13 +02:00
Vasilije
6bc9b1d482
feat: Extend tests (#1561)
<!-- .github/pull_request_template.md -->

## Description

Disabled the Windows 3.11 job due to a random bug; added Python 3.13.


## Type of Change
<!-- Please check the relevant option -->
- [x] Bug fix (non-breaking change that fixes an issue)
- [x] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-19 15:26:26 +02:00
vasilije
cbfa360b8f added lock file 2025-10-19 15:26:06 +02:00
vasilije
86ec2e9685 added 2025-10-19 15:06:38 +02:00
vasilije
aa577d4384 added 2025-10-19 15:02:53 +02:00
vasilije
8900b31dec added 2025-10-19 14:57:40 +02:00
vasilije
0c62916e75 added 2025-10-19 14:54:00 +02:00
vasilije
a1927548ad added 2025-10-19 14:52:02 +02:00
vasilije
66876daf85 removed docs 2025-10-19 14:38:34 +02:00
vasilije
3f7efd8b88 added fixes for tests 2025-10-19 13:33:02 +02:00
Vasilije
68e94b53f6
Feature/cog 2985 add ci tests that run more examples (#1551)
<!-- .github/pull_request_template.md -->

## Description
Added some examples to run in CI, because we had issues in the past that
we didn't see in time. Thought it would be more of a change, but we
already run most of them; only a few that could be useful to have in CI
were missing.

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [x] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-19 13:08:18 +02:00
Boris
5da8b03e0b
Merge branch 'dev' into feature/cog-2985-add-ci-tests-that-run-more-examples 2025-10-18 17:14:08 +02:00
Vasilije
1f2fd5a8a6
fix: preferred_loaders is always None in data_item_to_text_file.load_file() (#1554)
<!-- .github/pull_request_template.md -->

## Description

*LoaderEngine.py*
```python
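# Excerpt of a LoaderEngine method; requires: from typing import Any, List, Optional
async def load_file(
    self,
    file_path: str,
    file_stream: Optional[Any],
    preferred_loaders: Optional[List[str]] = None,
    **kwargs,
):
    ...
    # Debug print added to inspect the arguments actually received
    print(file_path, file_stream, preferred_loaders, kwargs)
    ...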
```

shows
`https://cognee.ai ['web_url_loader'] None {}`

i.e. `preferred_loaders` is always None when
`self.get_loader(file_path, preferred_loaders)` is called
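
One way output like the above can arise is a positional-argument mismatch at the call site. The sketch below is an assumption for illustration: the real call site is not shown in this description, and `load_file` here is a simplified synchronous stand-in that only echoes its arguments.

```python
from typing import Any, List, Optional


def load_file(
    file_path: str,
    file_stream: Optional[Any],
    preferred_loaders: Optional[List[str]] = None,
    **kwargs,
):
    # Stand-in for the real method: just report what each parameter received.
    return file_path, file_stream, preferred_loaders, kwargs


# Hypothetical buggy call: the loader list lands in the file_stream slot.
buggy = load_file("https://cognee.ai", ["web_url_loader"])

# Passing the list by keyword puts it where it belongs.
fixed = load_file("https://cognee.ai", None, preferred_loaders=["web_url_loader"])
```

In the buggy call, `file_stream` receives the loader list and `preferred_loaders` keeps its default of `None`, which matches the printed output.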

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-18 16:54:26 +02:00
Vasilije
2ac15d4fff
chore: remove memgraph from cognee repo (#1550)
<!-- .github/pull_request_template.md -->

## Description

The Memgraph adapter has been moved to, and is now maintained in, the
[`cognee-community`](https://github.com/topoteretes/cognee-community/tree/main)
repo.

This PR removes Memgraph, and updates any mentions of it in this repo.

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-18 16:53:47 +02:00
Vasilije
559d5009f7
feat: Batch document handling (#1469)
<!-- .github/pull_request_template.md -->

## Description
Add a batch system for document processing to limit the number of
documents processed in parallel in Cognee
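
A minimal sketch of the batching idea, assuming an async per-document worker. `process_in_batches`, `worker`, and `batch_size` are hypothetical names, not Cognee's API; the actual implementation in this PR may differ.

```python
import asyncio
from typing import Awaitable, Callable, List, Sequence, TypeVar

T = TypeVar("T")
R = TypeVar("R")


async def process_in_batches(
    items: Sequence[T],
    worker: Callable[[T], Awaitable[R]],
    batch_size: int,
) -> List[R]:
    """Run worker over items, with at most batch_size items in flight at once."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    results: List[R] = []
    for start in range(0, len(items), batch_size):
        batch = items[start : start + batch_size]
        # Items in one batch run concurrently; the next batch waits for this one.
        results.extend(await asyncio.gather(*(worker(item) for item in batch)))
    return results
```

Each batch waits for its slowest item before the next one starts. An `asyncio.Semaphore` would keep the pipeline fuller, at the cost of slightly more bookkeeping.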

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [x] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-18 09:48:52 +02:00
hajdul88
2d1f0e5eaa
Merge branch 'dev' into feature/cog-3160-redis-session-conversation 2025-10-17 21:45:31 +02:00
Vasilije
9d0261f375
fix: search without prior cognify (#1548)
<!-- .github/pull_request_template.md -->

## Description

Running search after `cognee.add()` has been called, but before
`cognee.cognify()` has been called, goes through the whole search
operation only to throw a cryptic error:
```
Error during graph projection: EntityNotFoundError: Empty graph projected from the database. (Status code: 404)
```

## How to reproduce
Modify `dynamic_steps_example.py` so that it does not run cognify.

## This PR

Checks graph before searching, and throws an informative exception to
ensure cognify was run

| Logs Before | Logs After |
|--------------|------------|
| `Error during graph projection: EntityNotFoundError: Empty graph projected from the database. (Status code: 404)` | `2025-10-17T11:05:58.465315 [warning ] Search attempt on an empty knowledge graph [cognee.shared.logging_utils]` |
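
The guard described in this PR could look roughly like the sketch below. It is an illustration under assumptions: `InMemoryGraph`, `EmptyGraphError`, and this `search` function are hypothetical stand-ins, not Cognee's classes. Only the warning text is taken from the log above.

```python
import logging
from typing import Iterable, List

logger = logging.getLogger("search_guard_example")


class EmptyGraphError(Exception):
    """Hypothetical error raised when searching before the graph is built."""


class InMemoryGraph:
    """Toy graph store holding node labels; stands in for a real adapter."""

    def __init__(self, nodes: Iterable[str]):
        self.nodes = list(nodes)

    def is_empty(self) -> bool:
        # Boolean emptiness check, cheap enough to run before every search.
        return not self.nodes


def search(graph: InMemoryGraph, query: str) -> List[str]:
    if graph.is_empty():
        logger.warning("Search attempt on an empty knowledge graph")
        raise EmptyGraphError("The knowledge graph is empty; run cognify before searching.")
    return [node for node in graph.nodes if query.lower() in node.lower()]
```

Failing early keeps the error close to its cause instead of surfacing later as a projection failure.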

## Type of Change
<!-- Please check the relevant option -->
- [x] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-17 19:07:56 +02:00
Daulet Amirkhanov
d85037896a
Merge branch 'dev' into chore/remove-memgraph 2025-10-17 17:36:53 +01:00
Daulet Amirkhanov
c9ccd50bd8
Merge branch 'dev' into fix/loader-engine-load-file-uses-wrong-arguments 2025-10-17 17:33:43 +01:00
Daulet Amirkhanov
50aa8aac11 refactor: remove filestream arg from LoaderEngine.load_file(...) 2025-10-17 17:33:25 +01:00
Daulet Amirkhanov
4e2a777860 tests: update tests after last refactoring 2025-10-17 14:18:47 +01:00
hajdul88
9089698896 chore: turns off shared kuzu lock for session tests 2025-10-17 14:28:33 +02:00
hajdul88
f7aa1519e4 feat: adds conversation history test to workflows 2025-10-17 14:24:58 +02:00
hajdul88
7357c81079 chore: linting fix 2025-10-17 14:24:25 +02:00
hajdul88
2182b7be5f linting fix 2025-10-17 14:18:13 +02:00
hajdul88
e9f4e2000f feat: adds e2e conversation history test 2025-10-17 14:15:18 +02:00
hajdul88
a3bbeb1c10 chore: generalizes return value in session history 2025-10-17 13:47:02 +02:00
Igor Ilic
6baf2d6806
Merge branch 'dev' into batch-document-handling 2025-10-17 13:45:32 +02:00
Daulet Amirkhanov
41fd854c7e
Merge branch 'dev' into fix/search-without-prior-cognify 2025-10-17 12:09:47 +01:00
Daulet Amirkhanov
c313fcd029 log warning on attempts to search on an empty knowledge graph 2025-10-17 12:06:35 +01:00
Daulet Amirkhanov
3ee50c192f refactor emptiness check to be boolean, and optimize query 2025-10-17 12:01:06 +01:00
hajdul88
ebb5b94265 chore: unit test fix for cache mocking 2025-10-17 11:06:34 +02:00