Commit graph

2764 commits

Author SHA1 Message Date
Vasilije
eb999c24f2 fix: Delete by id draft (#1073)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <igorilic03@gmail.com>
Co-authored-by: Boris <boris@topoteretes.com>
Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-07-24 13:29:31 +02:00
Boris
09975ef555 fix: s3 file storage (#1095)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:29:31 +02:00
Vasilije
799c8abbaa fix: Remove milvus from core (#1096)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:29:30 +02:00
Matea Pesic
d69e552d7a Memgraph integration fixes (#991)
<!-- .github/pull_request_template.md -->

## Description
Created a fix for Memgraph integration where the query silently fails in
relationship creation if nodes don't exist.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: github-actions[bot] <github-actions@users.noreply.github.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
2025-07-24 13:29:29 +02:00
Igor Ilic
e49bb51a14 refactor: Add Kuzu buffer change (#1094)
<!-- .github/pull_request_template.md -->

## Description
Change Kuzu default buffer size

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:29:27 +02:00
Boris
45350ca590 release: 0.2.1.dev1 (#1089)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:29:27 +02:00
Boris
2360dd2224 feat: s3 storage (#988)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: vasilije <vas.markovic@gmail.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-07-24 13:29:25 +02:00
Vasilije
efaf6b7268 feat: Weighted edges (#1068)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
Co-authored-by: Igor Ilic <igorilic03@gmail.com>
2025-07-24 13:29:07 +02:00
Igor Ilic
9f44ac7f60 feat: Data size info tracking (#1088)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:28:58 +02:00
Igor Ilic
fa126fa1e3 Swagger endpoint docstrings (#1087)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: vasilije <vas.markovic@gmail.com>
2025-07-24 13:28:58 +02:00
Vasilije
b64e4b414a fix: Add docstrings to routers (#1081)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <igorilic03@gmail.com>
2025-07-24 13:28:58 +02:00
vasilije
4ec7e9b215 formatting 2025-07-24 13:28:58 +02:00
vasilije
61729aec73 add info 2025-07-24 13:28:58 +02:00
Hande
7d4613383b chore: Update README.md (#1137)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:28:58 +02:00
Daulet Amirkhanov
3f76fc4e1b chore: fix typo in .env.template (#1122)
## Description
Fixed a minor typo in `.env.template`. Not the biggest change, but
that's a start ¯\_(ツ)_/¯

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:28:55 +02:00
Vasilije
1885ab9e88
chore: Cog 2354 add logging (#1115)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-07-24 13:27:27 +02:00
Vasilije
ed284c820d chore: Update CONTRIBUTING.md (#1119)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:23:56 +02:00
Boris
d6727a1b4a
fix: UnstructuredDocument read method (#1141)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:23:27 +02:00
Igor Ilic
8e943beb15
Kuzu migration (#1135)
<!-- .github/pull_request_template.md -->

## Description
Added script that can migrate Kuzu versions automatically

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:13:38 +02:00
Igor Ilic
ed09750598 feat : add migration for kuzu 0.8.2 as well 2025-07-24 13:07:07 +02:00
Igor Ilic
2fff637cda feat: Add better info when migrating kuzu database 2025-07-24 12:38:31 +02:00
Igor Ilic
021437095c refactor: Change old db backup name 2025-07-24 12:18:23 +02:00
Igor Ilic
08ed43b1e9 feat: Add usage of temp directory for kuzu migration, epand on old graph db naming 2025-07-24 12:09:45 +02:00
Igor Ilic
8cd3bed1ad Merge branch 'kuzu-migration' of github.com:topoteretes/cognee into kuzu-migration 2025-07-23 20:59:47 +02:00
Igor Ilic
6abf0713b9 refactor: move kuzu-migrate to infrastructure 2025-07-23 20:59:25 +02:00
Igor Ilic
87d6fef643
Merge branch 'dev' into kuzu-migration 2025-07-23 19:56:56 +02:00
Igor Ilic
8a7c6bfef0 feat: Add automatic Kuzu migration as exception handling attempt 2025-07-23 19:55:55 +02:00
Vasilije
daa4e9acc4
fix: Remove weaviate (#1139)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 19:34:35 +02:00
Igor Ilic
ce312fb397 feat: Add automatic mapping of Kuzu version based on file 2025-07-23 19:14:03 +02:00
Hande
f4a11a236d
chore: Update README.md (#1137)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 18:17:57 +02:00
Hande
4740f87947
chore: Update README.md (#1138)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 18:12:33 +02:00
Igor Ilic
7e1492b13c
Merge branch 'dev' into kuzu-migration 2025-07-23 17:18:34 +02:00
Igor Ilic
819e244616
Merge main changes (#1136)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 17:18:24 +02:00
Igor Ilic
7eff8b825f Merge branch 'main' into merge-main-changes 2025-07-23 17:16:24 +02:00
Igor Ilic
1453b3dd1a refactor: Move kuzu migration to tools 2025-07-23 17:12:22 +02:00
Igor Ilic
98f2cd3305 refactor: add comments 2025-07-23 16:59:08 +02:00
Igor Ilic
d780b54965 feat: Added delete old flag for kuzu 2025-07-23 16:48:36 +02:00
Igor Ilic
5c9835c610 feat: add overwrite old db option 2025-07-23 15:58:05 +02:00
hajdul88
1135a5e44d
chore: Sets sqlalchemy pool_size and max overflow to a hard limit instead of default values (#1133)
<!-- .github/pull_request_template.md -->

## Description
Sets sqlalchemy pool_size and max overflow to a hard limit instead of
default values

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 15:36:29 +02:00
Boris
f77183d001
fix: UI (#1134)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 15:35:21 +02:00
Igor Ilic
c98c3a3d53 feat: Add kuzu migration script 2025-07-23 14:43:44 +02:00
hajdul88
2b1c17404c
Feature: optimizes query embedding and edge collection search (#1126)
<!-- .github/pull_request_template.md -->

## Description
Optimizes query embedding by reducing the number of query embedding
calls and avoids multiple edge collection searches when they are
available.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 11:47:22 +02:00
Igor Ilic
59594e01ac
fix: add missing await for getting default user (#1131)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 06:47:38 +02:00
Boris
245e88588f
version: v0.2.1.dev7 (#1129)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-22 19:06:26 +02:00
Boris
7ba35f2f14
fix: add api tracking events (#1128)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-22 18:20:15 +02:00
Igor Ilic
022c96de55
refactor: simplify endpoint default values (#1123)
<!-- .github/pull_request_template.md -->

## Description
Simplify Cognee endpoints so default dataset ID will be None

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-22 09:09:44 -04:00
Pedro Thompson
115585ee9c
enhancement: Optimizing embedding calls in brute_force_search (#1101)
@Vasilije1990

- Use query_vector instead of query_text in brute_force_search

<!-- .github/pull_request_template.md -->

## Description

[Here](ef1aecd835/cognee/modules/retrieval/utils/brute_force_triplet_search.py (L163))
brute_force_search uses the vector engine to perform the same search —
with the same query text — across multiple collections, making the
embedding calls unnecessarily proportional to the number of collections
being searched.

Since the
[search](ef1aecd835/cognee/infrastructure/databases/vector/vector_db_interface.py (L85))
interface is already designed to accept precomputed query vectors, I’m
submitting an optimization to brute_force_search to take advantage of
this.

If this is considered good practice, it might be worth implementing a
direct query_vector argument in
[map_vector_distances_to_graph_edges](ef1aecd835/cognee/modules/graph/cognee_graph/CogneeGraph.py (L135))
, and using it both
[here](ef1aecd835/cognee/modules/retrieval/utils/brute_force_triplet_search.py (L179))
and in any future uses of map_vector_distances_to_graph_edges.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Pedro Henrique Thompson Furtado <pedrothompson@petrobras.com.br>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
Co-authored-by: Daulet Amirkhanov <damirkhanov01@gmail.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-07-22 13:50:25 +02:00
hajdul88
dad7da2e7b
fix:Fixes missing entity to entity edges (#1118)
<!-- .github/pull_request_template.md -->

## Description
Fixes missing entity to entity edges

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-22 11:48:56 +02:00
hajdul88
4474efade3
Fix: Fixes psycopg2 error when pg_config is not installed (#1120)
<!-- .github/pull_request_template.md -->

## Description
Fix: Fixes psycopg2 error when pg_config is not installed

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <igorilic03@gmail.com>
2025-07-22 03:31:01 -04:00
Vasilije
7af7e3834f
feat: Cog 2340 remove graphistry (#1080)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <igorilic03@gmail.com>
2025-07-21 15:06:23 -04:00