Commit graph

2676 commits

Author SHA1 Message Date
Igor Ilic
dbdf04c089
Data model migration (#1143)
<!-- .github/pull_request_template.md -->

## Description
Data model migration for new release

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 15:03:16 +02:00
Vasilije
1885ab9e88
chore: Cog 2354 add logging (#1115)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-07-24 13:27:27 +02:00
Boris
d6727a1b4a
fix: UnstructuredDocument read method (#1141)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:23:27 +02:00
Igor Ilic
8e943beb15
Kuzu migration (#1135)
<!-- .github/pull_request_template.md -->

## Description
Added script that can migrate Kuzu versions automatically

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-24 13:13:38 +02:00
Igor Ilic
ed09750598 feat: add migration for kuzu 0.8.2 as well 2025-07-24 13:07:07 +02:00
Igor Ilic
2fff637cda feat: Add better info when migrating kuzu database 2025-07-24 12:38:31 +02:00
Igor Ilic
021437095c refactor: Change old db backup name 2025-07-24 12:18:23 +02:00
Igor Ilic
08ed43b1e9 feat: Add usage of temp directory for kuzu migration, expand on old graph db naming 2025-07-24 12:09:45 +02:00
Igor Ilic
8cd3bed1ad Merge branch 'kuzu-migration' of github.com:topoteretes/cognee into kuzu-migration 2025-07-23 20:59:47 +02:00
Igor Ilic
6abf0713b9 refactor: move kuzu-migrate to infrastructure 2025-07-23 20:59:25 +02:00
Igor Ilic
87d6fef643
Merge branch 'dev' into kuzu-migration 2025-07-23 19:56:56 +02:00
Igor Ilic
8a7c6bfef0 feat: Add automatic Kuzu migration as exception handling attempt 2025-07-23 19:55:55 +02:00
Vasilije
daa4e9acc4
fix: Remove weaviate (#1139)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 19:34:35 +02:00
Igor Ilic
ce312fb397 feat: Add automatic mapping of Kuzu version based on file 2025-07-23 19:14:03 +02:00
Hande
4740f87947
chore: Update README.md (#1138)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 18:12:33 +02:00
Igor Ilic
7e1492b13c
Merge branch 'dev' into kuzu-migration 2025-07-23 17:18:34 +02:00
Igor Ilic
819e244616
Merge main changes (#1136)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 17:18:24 +02:00
Igor Ilic
7eff8b825f Merge branch 'main' into merge-main-changes 2025-07-23 17:16:24 +02:00
Igor Ilic
1453b3dd1a refactor: Move kuzu migration to tools 2025-07-23 17:12:22 +02:00
Igor Ilic
98f2cd3305 refactor: add comments 2025-07-23 16:59:08 +02:00
Igor Ilic
d780b54965 feat: Added delete old flag for kuzu 2025-07-23 16:48:36 +02:00
Igor Ilic
5c9835c610 feat: add overwrite old db option 2025-07-23 15:58:05 +02:00
hajdul88
1135a5e44d
chore: Sets sqlalchemy pool_size and max overflow to a hard limit instead of default values (#1133)
<!-- .github/pull_request_template.md -->

## Description
Sets sqlalchemy pool_size and max overflow to a hard limit instead of
default values

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 15:36:29 +02:00
Boris
f77183d001
fix: UI (#1134)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 15:35:21 +02:00
Igor Ilic
c98c3a3d53 feat: Add kuzu migration script 2025-07-23 14:43:44 +02:00
hajdul88
2b1c17404c
Feature: optimizes query embedding and edge collection search (#1126)
<!-- .github/pull_request_template.md -->

## Description
Optimizes query embedding by reducing the number of query embedding
calls and avoids multiple edge collection searches when they are
available.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 11:47:22 +02:00
Igor Ilic
59594e01ac
fix: add missing await for getting default user (#1131)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-23 06:47:38 +02:00
Boris
245e88588f
version: v0.2.1.dev7 (#1129)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-22 19:06:26 +02:00
Boris
7ba35f2f14
fix: add api tracking events (#1128)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-22 18:20:15 +02:00
Igor Ilic
022c96de55
refactor: simplify endpoint default values (#1123)
<!-- .github/pull_request_template.md -->

## Description
Simplify Cognee endpoints so default dataset ID will be None

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-22 09:09:44 -04:00
Pedro Thompson
115585ee9c
enhancement: Optimizing embedding calls in brute_force_search (#1101)
@Vasilije1990

- Use query_vector instead of query_text in brute_force_search

<!-- .github/pull_request_template.md -->

## Description

[Here](ef1aecd835/cognee/modules/retrieval/utils/brute_force_triplet_search.py (L163))
brute_force_search uses the vector engine to perform the same search —
with the same query text — across multiple collections, making the
embedding calls unnecessarily proportional to the number of collections
being searched.

Since the
[search](ef1aecd835/cognee/infrastructure/databases/vector/vector_db_interface.py (L85))
interface is already designed to accept precomputed query vectors, I’m
submitting an optimization to brute_force_search to take advantage of
this.

If this is considered good practice, it might be worth implementing a
direct query_vector argument in
[map_vector_distances_to_graph_edges](ef1aecd835/cognee/modules/graph/cognee_graph/CogneeGraph.py (L135)),
and using it both
[here](ef1aecd835/cognee/modules/retrieval/utils/brute_force_triplet_search.py (L179))
and in any future uses of map_vector_distances_to_graph_edges.
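
The optimization described above can be sketched roughly as follows. This is a minimal illustration, not Cognee's code: `CountingEmbedder`, `ToyVectorEngine`, `search_by_text`, and `search_by_vector` are hypothetical stand-ins, and the only thing taken from the description is the idea that a search call can accept a precomputed query vector instead of query text.

```python
import asyncio
from typing import Dict, List, Optional, Tuple

Vector = List[float]


class CountingEmbedder:
    """Hypothetical embedding engine that counts how often it is called."""

    def __init__(self) -> None:
        self.calls = 0

    async def embed(self, text: str) -> Vector:
        self.calls += 1
        # Toy deterministic "embedding": text length and vowel count.
        return [float(len(text)), float(sum(ch in "aeiou" for ch in text))]


class ToyVectorEngine:
    """Hypothetical vector store whose search accepts text or a vector."""

    def __init__(self, embedder: CountingEmbedder,
                 collections: Dict[str, List[Vector]]) -> None:
        self.embedder = embedder
        self.collections = collections

    async def search(self, collection: str,
                     query_text: Optional[str] = None,
                     query_vector: Optional[Vector] = None) -> List[Tuple[int, float]]:
        if query_vector is None:
            if query_text is None:
                raise ValueError("Provide query_text or query_vector")
            # Falling back to text triggers one embedding call per search.
            query_vector = await self.embedder.embed(query_text)
        # Score every stored vector by squared Euclidean distance.
        scored = [
            (index, sum((a - b) ** 2 for a, b in zip(item, query_vector)))
            for index, item in enumerate(self.collections[collection])
        ]
        return sorted(scored, key=lambda pair: pair[1])


async def search_by_text(engine: ToyVectorEngine, query: str):
    # Embedding calls grow with the number of collections.
    return {name: await engine.search(name, query_text=query)
            for name in engine.collections}


async def search_by_vector(engine: ToyVectorEngine, query: str):
    # Embed once, then reuse the same vector for every collection.
    query_vector = await engine.embedder.embed(query)
    results = await asyncio.gather(
        *(engine.search(name, query_vector=query_vector)
          for name in engine.collections)
    )
    return dict(zip(engine.collections, results))
```

With three collections, the text-based variant embeds the query three times and the vector-based variant embeds it once, while both return the same rankings.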

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Pedro Henrique Thompson Furtado <pedrothompson@petrobras.com.br>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
Co-authored-by: Daulet Amirkhanov <damirkhanov01@gmail.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-07-22 13:50:25 +02:00
hajdul88
dad7da2e7b
fix: Fixes missing entity to entity edges (#1118)
<!-- .github/pull_request_template.md -->

## Description
Fixes missing entity to entity edges

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-22 11:48:56 +02:00
hajdul88
4474efade3
Fix: Fixes psycopg2 error when pg_config is not installed (#1120)
<!-- .github/pull_request_template.md -->

## Description
Fix: Fixes psycopg2 error when pg_config is not installed

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <igorilic03@gmail.com>
2025-07-22 03:31:01 -04:00
Vasilije
7af7e3834f
feat: Cog 2340 remove graphistry (#1080)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <igorilic03@gmail.com>
2025-07-21 15:06:23 -04:00
Daulet Amirkhanov
844f4830d6
chore: fix typo in .env.template (#1122)
## Description
Fixed a minor typo in `.env.template`. Not the biggest change, but
that's a start ¯\_(ツ)_/¯

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-21 21:00:53 +02:00
Vasilije
95279d0805
chore: Update CONTRIBUTING.md (#1119)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-21 15:22:59 +02:00
Raj Mandhare
3c854b384d
Added Flexible CORS Configuration via Environment Variable: added flexibility to CORS (#1109)
<!-- .github/pull_request_template.md -->

## Description
Fixes #1086 
This PR improves the security and flexibility of the Cognee API’s CORS
(Cross-Origin Resource Sharing) configuration.
It introduces a new environment variable, CORS_ALLOWED_ORIGINS, which
allows explicit control over which origins are permitted to access the
API.
The API now reads allowed origins from the CORS_ALLOWED_ORIGINS
environment variable (comma-separated).
If the variable is not set, all cross-origin requests are blocked by
default (secure by default).
For local development, you can use:
CORS_ALLOWED_ORIGINS="http://localhost:3000"
Added a commented example to .env.example to guide users in configuring
CORS origins.
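
The behaviour described above (comma-separated origins, nothing allowed when the variable is unset) might look roughly like the sketch below. The helper names `parse_allowed_origins` and `is_origin_allowed` are hypothetical and are not taken from the PR; only the `CORS_ALLOWED_ORIGINS` variable name comes from the description.

```python
import os
from typing import List, Mapping, Optional


def parse_allowed_origins(env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Read CORS_ALLOWED_ORIGINS as a comma-separated list of origins.

    Hypothetical helper: an unset or empty variable yields an empty list,
    so no cross-origin request is allowed by default.
    """
    env = os.environ if env is None else env
    raw = env.get("CORS_ALLOWED_ORIGINS", "")
    # Drop surrounding whitespace and empty entries such as a trailing comma.
    return [origin.strip() for origin in raw.split(",") if origin.strip()]


def is_origin_allowed(origin: str, allowed: List[str]) -> bool:
    """Exact-match check of a request origin against the allow list."""
    return origin in allowed


if __name__ == "__main__":
    allowed = parse_allowed_origins(
        {"CORS_ALLOWED_ORIGINS": "http://localhost:3000"}
    )
    print(allowed, is_origin_allowed("http://localhost:3000", allowed))
```

The resulting list is the kind of value a CORS middleware would typically take as its allowed origins; an empty list keeps the secure default.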
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

Signed-off-by: Raj2604 <rajmandhare26@gmail.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-07-19 20:48:52 +02:00
Vasilije
02df4a79be
fix: add numpy higher version (#1093)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <igorilic03@gmail.com>
2025-07-19 18:31:39 +02:00
Boris
eed2befde8
version: v0.2.1.dev6 (#1113)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-19 15:58:31 +02:00
Boris
468186789c
fix: s3 file system env vars (#1112)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-19 15:56:15 +02:00
Raj Mandhare
b72a75776d
Frontend hardcodes API base URL #1084 (#1104)
<!-- .github/pull_request_template.md -->

## Description
This pull request refactors the frontend application to remove hardcoded
backend API and WebSocket URLs. The environment variables are stored in
the .env.template file, which is created in this PR. This temporarily
solves the problem of binding to the same port 8000: bind the Cognee API
server to a different port and then, instead of changing the hardcoded
URLs, change only the port number in the base URL in .env. Please let me
know if you want to change the port number of the Cognee API server from
8000 to a different port, which would resolve this issue permanently.
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Signed-off-by: Raj2604 <rajmandhare26@gmail.com>
2025-07-18 18:12:57 +02:00
Boris
34c2976bb5
version: v0.2.1.dev5 (#1107)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-18 18:04:39 +02:00
Boris
43a21342a3
fix: anonymous id as env var (#1106)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-18 17:56:08 +02:00
Igor Ilic
01bab3f0c7
Fix cognify endpoint (#1105)
<!-- .github/pull_request_template.md -->

## Description
Have cognify run in background

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-18 16:37:04 +02:00
Igor Ilic
875da53c3a
fix: Resolve issue with dataset status endpoint (#1103)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-18 08:00:20 -04:00
Boris
6549b93075
version: v0.2.1.dev4 (#1098)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-16 21:46:48 +02:00
Vasilije
d4739bd40c
fix: Delete by id draft (#1073)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <igorilic03@gmail.com>
Co-authored-by: Boris <boris@topoteretes.com>
Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-07-16 21:11:35 +02:00
Boris
c5bd6bed40
fix: s3 file storage (#1095)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-16 20:36:18 +02:00
Vasilije
67c006bd2f
fix: Remove milvus from core (#1096)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-07-16 15:56:34 +02:00
Matea Pesic
a06b3fc7e4
Memgraph integration fixes (#991)
<!-- .github/pull_request_template.md -->

## Description
Created a fix for Memgraph integration where the query silently fails in
relationship creation if nodes don't exist.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: github-actions[bot] <github-actions@users.noreply.github.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
2025-07-16 15:01:41 +02:00