Commit graph

2341 commits

Author SHA1 Message Date
Jan Müller
66da3d2934
Document collection of usage data and how to opt out (#767)
## Description
Document in the readme that Cognee collects usage data, and add
instructions on how users can opt out.

---------

Co-authored-by: i555611 <jan.mueller01@sap.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-04-22 23:52:34 +02:00
Boris Arzentar
f1364fb348 version: v0.1.38 2025-04-21 15:24:08 +02:00
Boris Arzentar
7846f1c647 fix: remove filesystem extra 2025-04-21 15:22:41 +02:00
Boris Arzentar
b705479af0 chore: update cognee version 2025-04-19 21:49:04 +02:00
Boris Arzentar
a63ba8f148 Merge remote-tracking branch 'origin/dev' 2025-04-19 21:37:27 +02:00
Boris Arzentar
1cdcc0a69e version: v0.1.37 2025-04-19 21:01:24 +02:00
Vasilije
bb7eaa017b
feat: Group DataPoints into NodeSets (#680)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-04-19 20:21:04 +02:00
Vasilije
8374e402a8
fix: Clean up pokemons (#746)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-19 10:51:51 +02:00
Vasilije
8eda1eda74
fix: Bump fast api users (#759)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
2025-04-19 10:51:23 +02:00
Boris
675b66175f
test: make search unit tests deterministic (#726)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Daniel Molnar <soobrosa@gmail.com>
2025-04-18 21:55:24 +02:00
Boris
751eca7aaf
fix: cognee ui with new visualization (#733)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
2025-04-18 15:23:51 +02:00
rex993
54fb400d91
Set LiteLLM logging level (#758)

## Description
LiteLLM adds `LiteLLM:INFO` debugging lines to the output while running,
which can flood the output and make actual cognee debugging difficult.
This .env config suppresses all logging below the ERROR level.

Note that there is an open LiteLLM bug,
[https://github.com/BerriAI/litellm/issues/9815](https://github.com/BerriAI/litellm/issues/9815):
'cost calculation' lines are still shown despite ERROR-level logging.
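
A minimal sketch of the idea, using only the Python standard library rather than the actual change in this PR. It assumes the library logs through a logger named `LiteLLM` (inferred from the `LiteLLM:INFO` prefix); the environment variable name `DEMO_LOG_LEVEL` and the helper `apply_log_level` are hypothetical.

```python
import logging
import os


def apply_log_level(logger_name: str, env_var: str, default: str = "ERROR") -> int:
    """Set a named logger's level from an environment variable and return the level."""
    level_name = os.environ.get(env_var, default).upper()
    # getLevelName returns an int for known level names, a string otherwise.
    level = logging.getLevelName(level_name)
    if not isinstance(level, int):
        level = logging.ERROR  # fall back to ERROR on an unrecognized value
    logging.getLogger(logger_name).setLevel(level)
    return level


# Hypothetical usage: the value would normally come from a .env file.
os.environ["DEMO_LOG_LEVEL"] = "ERROR"
apply_log_level("LiteLLM", "DEMO_LOG_LEVEL")
```

With the level set to ERROR, INFO and DEBUG records from that logger are dropped before they reach any handler.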

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

Co-authored-by: rex <rex@epicx.me>
2025-04-18 09:28:39 +02:00
Igor Ilic
ba2de9bb22
fix: HuggingFace tokenizer (#752)

## Description
Resolve an issue noticed by [RyabykinIlya](https://github.com/RyabykinIlya)
where too many HuggingFace requests were sent because the embedding
engine was not working as a singleton per config.
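
A "singleton per config" can be sketched as a cached factory keyed by the configuration values. This is an illustration under assumptions, not the code from this PR: `EmbeddingEngine`, `get_embedding_engine`, and the `provider`/`model` parameters are hypothetical names.

```python
from functools import lru_cache


class EmbeddingEngine:
    """Stand-in for an engine whose construction is expensive (e.g. loads a tokenizer)."""

    instances_created = 0

    def __init__(self, provider: str, model: str) -> None:
        self.provider = provider
        self.model = model
        EmbeddingEngine.instances_created += 1


@lru_cache(maxsize=None)
def get_embedding_engine(provider: str, model: str) -> EmbeddingEngine:
    """Return one shared engine per distinct (provider, model) configuration."""
    return EmbeddingEngine(provider, model)


first = get_embedding_engine("huggingface", "model-a")
second = get_embedding_engine("huggingface", "model-a")  # served from the cache
other = get_embedding_engine("huggingface", "model-b")   # different config, new engine
```

Repeated calls with the same arguments reuse the cached instance, so any setup work in the constructor runs once per configuration instead of once per call.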

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Signed-off-by: Ryabykin Ilya <ryabykinia@sibur.ru>
Co-authored-by: greshish <ryabykinia@yandex.ru>
Co-authored-by: Ryabykin Ilya <ryabykinia@sibur.ru>
2025-04-17 17:07:36 +02:00
Daniel Molnar
9ba12b25ef
feat: add delete by document (#668)

## Description
Delete by document.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin
2025-04-17 15:42:10 +02:00
Igor Ilic
af276b8999
feat: Add initial cognee pipeline simplification [COG-1705] (#670)

## Description
Simplify Cognee pipeline usage for users

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin
2025-04-17 14:02:12 +02:00
Igor Ilic
da332e85fe
Add top k [COG-1862] (#743)

## Description
Add the ability to define top-k for the Cognee search types Insights, RAG
and GRAPH Completion.
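
As a general illustration of what a top-k setting does (the commit does not say how Cognee ranks results), the sketch below keeps the k highest-scoring items from a list of `(item, score)` pairs. The function name `top_k` and the data are hypothetical.

```python
import heapq
from typing import List, Tuple


def top_k(scored_results: List[Tuple[str, float]], k: int) -> List[Tuple[str, float]]:
    """Return the k results with the highest scores, best first."""
    return heapq.nlargest(k, scored_results, key=lambda item: item[1])


results = [("chunk-a", 0.2), ("chunk-b", 0.9), ("chunk-c", 0.5)]
best_two = top_k(results, 2)
```

A smaller k gives a shorter, more focused context; a larger k trades focus for recall.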

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-17 14:01:35 +02:00
hajdul88
f13607cf18
fix: Index graph edges embedding error (#750)

## Description
Fixes the embedding error for index graph edges

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-17 12:48:27 +02:00
hajdul88
0121a2b5fc
feature: Adds S3 functionality (#731)

## Description
Adds S3 support


## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-17 08:56:40 +02:00
Igor Ilic
a036787ad1
Embedding string fix [COG-1900] (#742)

## Description
Allow embedding of big strings to support full row embedding in SQL
databases

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-16 22:39:06 +02:00
Igor Ilic
acd7abbd29
Fix gemini gh action (#748)

## Description
Fix the Gemini GitHub action

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-16 15:22:16 +02:00
Vasilije
4e9ca94e78
feat: Adding rate limiting (#709)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
2025-04-16 12:03:46 +02:00
lxobr
d1eab97102
feature: tighten run_tasks_base (#730)
## Description
- Extracted run_tasks_base function into a new file run_tasks_base.py.
- Extracted four executors that execute core logic based on the task
type.
- Extracted a task handler/wrapper that safely executes the core logic
with logging and telemetry.
- Fixed the inconsistency with the batches of size 1.
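
The executor-plus-handler split described above could look roughly like the sketch below. It is an assumption that the four task types are plain functions, generators, coroutines, and async generators; the names `execute` and `handle_task` are hypothetical, and telemetry is reduced to logging.

```python
import asyncio
import inspect
import logging

logger = logging.getLogger("run_tasks_sketch")


async def execute(task, *args):
    """Run one task and collect its results into a list, whatever kind of callable it is."""
    if inspect.isasyncgenfunction(task):
        return [item async for item in task(*args)]
    if inspect.isgeneratorfunction(task):
        return list(task(*args))
    if inspect.iscoroutinefunction(task):
        return [await task(*args)]
    return [task(*args)]


async def handle_task(task, *args):
    """Wrap execution with start/finish/failure logging and re-raise errors."""
    logger.info("task %s started", task.__name__)
    try:
        results = await execute(task, *args)
    except Exception:
        logger.exception("task %s failed", task.__name__)
        raise
    logger.info("task %s finished with %d result(s)", task.__name__, len(results))
    return results


def double(x):
    return x * 2


doubled = asyncio.run(handle_task(double, 2))
```

Always returning a list means a single result and a batch of size 1 look the same to the caller.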

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-16 09:19:03 +02:00
Vasilije
ee34aac696
Update CONTRIBUTING.md 2025-04-14 22:00:04 +02:00
Vasilije
0ebd750ff7
Update README.md 2025-04-13 09:17:43 +02:00
Vasilije
3bb611ed01
Update README.md 2025-04-12 22:26:40 +02:00
Vasilije
af0dc95c45
Update README.md 2025-04-12 22:25:37 +02:00
Vasilije
2c83e71df4
fix: Update README.md (#725)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-12 22:24:45 +02:00
Igor Ilic
22b363b297
tests: Add gh action to test relational db migration [COG-1591] (#718)

## Description
Add relational db migration action

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-11 14:02:44 +02:00
Vasilije
228fba8096
fix: Refactor notebooks (#720)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-04-11 10:23:22 +02:00
Igor Ilic
87a3642ea4
Cog 1576 (#721)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: vasilije <vas.markovic@gmail.com>
Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
2025-04-11 09:31:48 +02:00
Boris
9536395468
Revert "feat: pipeline tasks needs mapping" (#717)
Reverts topoteretes/cognee#690

I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-10 12:10:12 +02:00
Vasilije
c3d33e728e
feat: New tests (#678)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin

---------

Co-authored-by: Daniel Molnar <soobrosa@gmail.com>
Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
Co-authored-by: Igor Ilic <igorilic03@gmail.com>
2025-04-10 10:52:48 +02:00
Igor Ilic
6f4c117005
fix: Resolve issue with missing name if no attribute (#707)

## Description
Resolve issue with missing name if no attribute

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-08 11:46:18 +02:00
Hande
0ca8ef2448
test: relational db migration (#695)

## Description
Test for database migration

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
2025-04-07 18:37:44 +02:00
Boris
534e7fb22c
fix: ollama test embedding endpoint (#708)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-07 18:13:39 +02:00
Igor Ilic
f5cd39c09d
fix: Resolve failing test for RAG_COMPLETION, add RAG_COMPLETION to MCP (#706)

## Description
Resolve failing test for RAG_COMPLETION, add RAG_COMPLETION to MCP

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

Co-authored-by: Boris <boris@topoteretes.com>
2025-04-07 18:13:15 +02:00
Igor Ilic
c4a6c94675
fix: Resolve duplicate chunk issue for PGVector [COG-895] (#705)

## Description
Resolve issues with duplicate chunks for PGVector

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-07 18:03:36 +02:00
Vasilije
cd0d321eda
feat: Rename COMPLETION to RAG_COMPLETION (#701)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-07 11:46:48 +02:00
lxobr
e12242b9d0
fix: get default tasks (#700)
## Description
- Fixed get_no_summary_tasks and get_just_chunks_tasks to work with the
new tasks and pipelines
- Chore: fixed the pokemon example to work with the new tasks and
pipelines

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-07 08:46:02 +02:00
Igor Ilic
f4856b4413
Mcp add search (#702)

## Description
- Fix Ollama endpoint issue
- Fix COMPLETION and GRAPH COMPLETION MCP use

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-04 19:11:07 +02:00
lxobr
8207dc8643
feat: make graph creation prompt configurable (#686)
## Description
- Added new graph creation prompts
- Exposed graph creation prompts in .cognify via get_default tasks
- Exposed graph creation prompts in eval framework
## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-04-03 11:14:33 +02:00
Igor Ilic
b618e97f98
chore: Remove outdated nodejs example, add specific versioning for mcp (#698)

## Description
Remove outdated nodejs example, add specific versioning for mcp

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-03 10:53:44 +02:00
Boris
0ce6fad24a
feat: pipeline tasks needs mapping (#690)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-03 10:52:59 +02:00
Vasilije
567b45efa6
feat: Add Benchmark results (#697)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
2025-04-02 18:48:56 +02:00
Vasilije
67e980f22e
feat: Add default user from config (#682)
## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-04-02 14:48:10 +02:00
Boris Arzentar
de50b36322 Merge remote-tracking branch 'origin/dev' 2025-04-02 12:50:53 +02:00
Boris Arzentar
0f8bec1f07 version: v0.1.36 2025-04-02 12:49:51 +02:00
James
edea54c5c3
fix: convert file path to str (#693)
## Description

Fix an error where an int has no .split method; it is not clear why the value is an int.

## DCO Affirmation

I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-02 12:35:19 +02:00
Daniel Molnar
ef2bd5c1e3
feat: follow extras in Docker (#694)

## Description
Explicit is better than implicit.
https://peps.python.org/pep-0020/#the-zen-of-python

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-02 09:53:33 +02:00
Igor Ilic
6898e8f766
Fix codify mcp (#696)

## Description
- Redirect all Cognee output to stderr for MCP (stdout is used for
communication between the MCP client and server)
- Add test for CODE search type
- Resolve missing optional GUI dependency
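
The stdout-to-stderr redirection in the first bullet can be sketched with the standard library's `contextlib.redirect_stdout`. This is one possible approach, not necessarily the one used in this PR; the helper name `run_with_stdout_on_stderr` is hypothetical.

```python
import contextlib
import sys


def run_with_stdout_on_stderr(func, *args, **kwargs):
    """Call func while anything it prints to stdout is sent to stderr instead."""
    with contextlib.redirect_stdout(sys.stderr):
        return func(*args, **kwargs)


def noisy_step() -> int:
    print("progress: step finished")  # would otherwise pollute the protocol channel
    return 7


value = run_with_stdout_on_stderr(noisy_step)
```

This keeps stdout free for protocol messages while still letting log and progress output reach the terminal through stderr.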

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-02 06:38:17 +02:00