Commit graph

654 commits

Author SHA1 Message Date
hajdul88
cc058b4482
Merge branch 'dev' into feature/ontology-param-rest 2025-11-06 09:40:04 +01:00
Vasilije
8cc55ac0b2
refactor: Enable multi user mode by default if graph and vector db pr… (#1695)
…oviders support it

<!-- .github/pull_request_template.md -->

## Description
Enable multi user mode by default for supported graph and vector DBs

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [x] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-11-06 08:40:06 +01:00
Igor Ilic
bee2fe3ba7
feat: Add initial custom pipeline (#1716)
<!-- .github/pull_request_template.md -->

## Description
Add run_custom_pipeline to have a way to execute a custom collection of tasks in Cognee

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-11-04 17:58:34 +01:00
Igor Ilic
46c509778f refactor: Rename access control functions 2025-11-04 12:06:16 +01:00
Igor Ilic
baac00923c
Merge branch 'dev' into enable-multi-user-mode-default 2025-11-03 15:57:06 +01:00
Fahad Shoaib
a4a9e76246 feat: add ontology endpoint in REST API
- Add POST /api/v1/ontologies endpoint for file upload
  - Add GET /api/v1/ontologies endpoint for listing ontologies
  - Implement OntologyService for file management and metadata
  - Integrate ontology_key parameter in cognify endpoint
  - Update RDFLibOntologyResolver to support file-like objects
  - Add essential test suite for ontology endpoints
2025-11-02 17:05:03 +05:00
Igor Ilic
d58f85e431
Update dev from main (#1707)
<!-- .github/pull_request_template.md -->

## Description
Merge main into dev branch

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-30 17:35:07 +01:00
Igor Ilic
e2457ef277 fix: Resolve issue with text classification 2025-10-30 14:27:01 +01:00
Igor Ilic
8d4eed6101
Merge branch 'dev' into enable-multi-user-mode-default 2025-10-30 14:20:30 +01:00
lxobr
6223ecf05b
feat: optimize repeated entity extraction (#1682)
<!-- .github/pull_request_template.md -->

## Description
<!--
Please provide a clear, human-generated description of the changes in
this PR.
DO NOT use AI-generated descriptions. We want to understand your thought
process and reasoning.
-->

- Added an `edge_text` field to edges that auto-fills from
`relationship_type` if not provided.
- Containts edges now store descriptions for better embedding
- Updated and refactored indexing so that edge_text gets embedded and
exposed
- Updated retrieval to use the new embeddings 
- Added a test to verify edge_text exists in the graph with the correct
format.

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [x] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [x] Code refactoring
- [x] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-30 13:56:06 +01:00
Igor Ilic
6572cf5cb9 refactor: use boolean instead of string 2025-10-29 16:35:44 +01:00
Igor Ilic
fb7e74eaa8 refactor: Enable multi user mode by default if graph and vector db providers support it 2025-10-29 16:28:09 +01:00
vasilije
55d55946b0 add fix 2025-10-29 10:28:10 +01:00
vasilije
8efd152f4c fix network v 2025-10-28 16:03:43 +01:00
hajdul88
221a0dba01
feature: adds the concept of now to the qa for temporal queries (#1685)
<!-- .github/pull_request_template.md -->

## Description
 Adds the concept of now to the qa for temporal queries

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [x] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-28 15:27:29 +01:00
hajdul88
65003a0a0b
Merge branch 'dev' into feature/cog-3254-add-multifile-ontology-handling 2025-10-28 09:58:17 +01:00
hajdul88
2e6ee5d61e
feature: adds new tenant id (if available) to telemetry packages for basic cognee operations (#1676)
<!-- .github/pull_request_template.md -->

## Description
Adds new tenant id (if available) to telemetry packages for basic cognee
operations

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [x] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-27 16:38:23 +01:00
hajdul88
6d55da00af ruff fix 2025-10-27 13:47:04 +01:00
hajdul88
6c576883b9 Update RDFLibOntologyResolver.py 2025-10-27 13:46:42 +01:00
hajdul88
61e47c0b2f feat: adds multifile ontology handling to OntologyResolver 2025-10-27 13:39:02 +01:00
hajdul88
7c921c22dc ruff fix 2025-10-24 15:37:31 +02:00
hajdul88
cd33740d3d feat: adds cognee version to logging 2025-10-24 15:32:36 +02:00
hajdul88
5ce79fb361 feat: adds cognee version to task and pipeline based telemetry info 2025-10-24 15:12:35 +02:00
hajdul88
828a1096de feat: adds config as additional properties to Pipeline_Run_Finished event 2025-10-24 12:52:21 +02:00
lxobr
ecae650a28 refactor: unify structured and str completion 2025-10-23 12:30:55 +02:00
lxobr
66a8242cec chore: restore the feedback enrichment cot retriever functionality 2025-10-23 12:07:31 +02:00
Vasilije
407550dada
Chore: update deprecated exceptions (#1586)
<!-- .github/pull_request_template.md -->

## Description
<!--
Please provide a clear, human-generated description of the changes in
this PR.
DO NOT use AI-generated descriptions. We want to understand your thought
process and reasoning.
-->

Logs are getting polluted with these starlette deprecation warnings.

This PR replaces the depracated status code

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-22 21:04:29 +02:00
Daulet Amirkhanov
3e2dbd1846 Update deprecated Exception status codes 2025-10-22 17:38:41 +01:00
Daulet Amirkhanov
e4cbbcbf51 Add hint log for when data is added but not cognified 2025-10-22 13:21:51 +01:00
Daulet Amirkhanov
ee7db762e6 log warning and early exit when graph is empty and is queried 2025-10-22 13:21:51 +01:00
Daulet Amirkhanov
03b4547b7f validate e2e - urls are saved as htmls, and loaders are selected correctly 2025-10-21 22:47:52 +01:00
Daulet Amirkhanov
95e735d397 remove fetchers_config, use default configs for Tavily and BeautifulSoup 2025-10-21 22:46:50 +01:00
Daulet Amirkhanov
085e81c082 Clean up - remove UnsupportedPathSchemeError 2025-10-21 22:46:49 +01:00
Daulet Amirkhanov
d7417d9b06 refactor: move url data fetching logic into save_data_item_to_storage 2025-10-21 22:46:49 +01:00
Daulet Amirkhanov
1a0978fb37 incremental loading - fallback to regular, update test cases 2025-10-21 22:46:49 +01:00
hajdul88
dd8afe42f8
Merge branch 'dev' into feature/cog-3160-redis-session-conversation 2025-10-20 15:21:56 +02:00
Igor Ilic
3e54b67b4d
fix: Resolve missing argument for distributed (#1563)
<!-- .github/pull_request_template.md -->

## Description
Resolve missing argument for distributed

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-10-20 15:03:35 +02:00
hajdul88
49e9d7dc27 chore: renames conversation history save method 2025-10-20 10:28:03 +02:00
hajdul88
4d3ba3f313
Merge branch 'dev' into feature/cog-3160-redis-session-conversation 2025-10-20 10:05:57 +02:00
hajdul88
2595a419b3 ruff format 2025-10-16 18:25:13 +02:00
hajdul88
cdbdbfd755 feat: adds conversation history to most of the retrievers 2025-10-16 18:24:57 +02:00
hajdul88
9e9489c858 feat: adds conversation history to context if caching is enabled 2025-10-16 17:48:50 +02:00
hajdul88
91a22e8bc4 feat: adds session id to get_completion methods 2025-10-16 16:26:58 +02:00
hajdul88
7149f8c45b ruff format 2025-10-16 16:13:55 +02:00
hajdul88
fe1b02d937 feat: adds error handling and logging to redis cache 2025-10-16 16:12:50 +02:00
hajdul88
96f2a2f22b ruff format 2025-10-16 15:54:48 +02:00
hajdul88
abe4dfa69a feat: adds session save to retrievers where actual completion is used 2025-10-16 15:07:15 +02:00
hajdul88
0e4c4907e9 feat: centralizes session caching in util function 2025-10-16 10:46:19 +02:00
hajdul88
8454389a7d
Merge branch 'feature/cog-3142-agentic-use-case-kuzu-lock-fix-redis-integration' into feature/cog-3160-redis-session-conversation 2025-10-16 09:43:37 +02:00
Igor Ilic
2e1bfe78b1 refactor: rename variable to be more understandable 2025-10-15 20:26:59 +02:00