Commit graph

3014 commits

Author SHA1 Message Date
hajdul88
ebd4403c2f adds new error classes to keywordextration error and defines new error 2025-08-13 12:14:42 +02:00
hajdul88
6870bba5a9 feat: adds new error to delete 2025-08-13 12:03:18 +02:00
hajdul88
7bd2660d08 feat: setting base classes of data exceptions to the new ones 2025-08-13 11:58:32 +02:00
hajdul88
623148ec6a feat: adds new errors to litellm client 2025-08-13 11:53:29 +02:00
hajdul88
b40dbf3c68 feat: adds new error classes to litellm instructor 2025-08-13 11:50:24 +02:00
hajdul88
1b3898dd8b feat: updates rel_db exceptions with the new error classes 2025-08-13 11:38:38 +02:00
hajdul88
0555dd9dda Update __init__.py 2025-08-13 11:29:16 +02:00
hajdul88
ed555a731d feat: adds new Exception classes 2025-08-13 11:27:45 +02:00
hajdul88
fd9aaf57b1
feature: Adds doctype handling to delete (audio, image, unstructured) (#1239)
<!-- .github/pull_request_template.md -->

## Description
feature: Adds doctype handling to delete (audio, image, unstructured)

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-12 19:01:06 +02:00
Vasilije
a15cad05d5
Merge branch 'dev' into add_cli 2025-08-12 16:00:51 +02:00
Vasilije
0c42d19505
Update README.md 2025-08-12 15:38:30 +02:00
vasilije
e3bd4749ca git merge 2025-08-12 15:18:19 +02:00
vasilije
b2265957d9 added fix 2025-08-12 15:17:14 +02:00
vasilije
1de81c8dbf added fixes 2025-08-12 15:16:31 +02:00
vasilije
ff1614d55f format 2025-08-12 15:16:31 +02:00
Igor Ilic
a75a79f012
Lancedb async lock (#1222)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-08-12 08:46:15 -04:00
Igor Ilic
0a03ff1501
Merge branch 'dev' into configurable-path-exclusion-code-graph 2025-08-11 16:38:07 +02:00
Igor Ilic
c33536685d
fix: Resolve Kuzu S3 issue (#1235)
<!-- .github/pull_request_template.md -->

## Description
Resolve issue with pushing Kuzu DBs to S3

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-11 16:29:36 +02:00
Hande
6e7e4506cc
chore: update readme (#1234)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-11 11:16:16 +02:00
Fardeen Malik
a31986e591
Merge branch 'main' into multi-lang-codegraph 2025-08-11 13:56:18 +05:30
P-FardeenMalik
0a330683de Extend CodeGraph pipeline for multi-language support (closes #1160) 2025-08-11 13:48:37 +05:30
Vasilije
ed2bea4ff2
Potential fix for code scanning alert no. 147: Clear-text logging of sensitive information (#1229)
Potential fix for
[https://github.com/topoteretes/cognee/security/code-scanning/147](https://github.com/topoteretes/cognee/security/code-scanning/147)

To fix the problem, we should avoid logging any part of the API key.
Instead, we can log a generic message indicating that an API key is
configured, without revealing any part of the key. This maintains the
functionality of confirming that the key is present, while eliminating
the risk of exposing sensitive information. The change should be made in
`cognee-mcp/src/test_client.py`, specifically on line 60, replacing the
log statement with a safer alternative. No new imports or methods are
required.

---


_Suggested fixes powered by Copilot Autofix. Review carefully before
merging._

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
2025-08-10 12:29:51 +02:00
Emmanuel Ferdman
089fe34ec9
fix: raise error on version mapping failure (#1228)
## Description
Fixed `ValueError` that was not being properly raised when
`version_code` could not be mapped to proper Kuzu version.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-08-10 10:17:17 +02:00
Pavel Zorin
df0c0b3f7f
Chore: fix Commutity Greetings workflow permissions (#1227)
<!-- .github/pull_request_template.md -->

## Description
Fix “Resource not accessible by integration” error in greetings workflow

This PR updates the community | Greetings GitHub Actions workflow to
ensure it can successfully post greeting comments on newly opened issues
and pull requests — including PRs from forks — without hitting the
“Resource not accessible by integration” error.

## Changes
- Switched PR trigger from `pull_request` to `pull_request_target` to
run in the context of the base repository and grant write-scoped
`GITHUB_TOKEN` for commenting on forked PRs.
- Added explicit `permissions` block with:
  - `issues: write`  
  - `pull-requests: write`  
- Limited triggers to `types: [opened]` for both issues and PRs to avoid
unnecessary runs.
- Preserved existing greeting messages for issues and pull requests.

## Reason for change
The workflow was failing because the default `GITHUB_TOKEN` in
`pull_request` events is read-only for forks, preventing the bot from
posting comments.
`pull_request_target` with explicit permissions solves this while
maintaining security.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-10 07:27:02 +02:00
EricXiao
815d639132
fix: graph visualization access for users with read permissions (#1220)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->
Description
This PR fix graph visualization access for users with read permissions
(https://github.com/topoteretes/cognee/issues/1182)

- Add permission checks for graph visualization endpoints to ensure
users can only access datasets they have permission to view
- Create get_dataset_with_permissions method to validate user access
before returning a dataset
- Remove redundant dataset existence validation in datasets router and
delegate permission checking to graph data retrieval
- Add comprehensive test suite for graph visualization permissions
covering owner access and permission granting scenarios
- Update get_formatted_graph_data() to use dataset owner's ID for
context
## Testing
Tests can be run with:
```bash
pytest -s cognee/tests/test_graph_visualization_permissions.py
```

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Signed-off-by: EricXiao <taoiaox@gmail.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-08-08 20:42:57 +02:00
Vasilije
e3b41e0ed4
fix: Interface fixes (#1206)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Daulet Amirkhanov <damirkhanov01@gmail.com>
2025-08-08 20:41:33 +02:00
Vasilije
f65605b575
fix: Feature/cog 2648 evals update (#1221)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
Co-authored-by: Hande <159312713+hande-k@users.noreply.github.com>
2025-08-08 20:23:09 +02:00
Igor Ilic
5c44d01265
Merge branch 'dev' into configurable-path-exclusion-code-graph 2025-08-07 16:12:15 +02:00
lxobr
6dbd8e85a1
feat: dynamic multiple edges in datapoints (#1212)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->
- Improved list handling, removed `.index` logic from
`get_graph_from_model`, transitioned to fully datapoint-oriented
processing
- Streamlined datapoint iteration by introducing `_datapoints_generator`
with nested loops
- Generalized field processing to handle mixed lists: `[DataPoint,
(Edge, DataPoint), (Edge, [DataPoint])]`, allowing dynamic multiple
edges generation
- Small improvements and refactorings
- Added tests to `test_get_graph_from_model_flexible_edges()` covering
weighted edges and dynamic multiple edges
- Created `dynamic_multiple_edges_example.py` demonstrating dynamic
multiple edges

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-07 14:50:45 +02:00
vasilije
c8202c51a7 format fix 2025-08-06 19:11:39 +02:00
vasilije
0ea5894823 added distributed fixes 2025-08-06 19:11:20 +02:00
hajdul88
4e816ad80b
fix: changing deletion logic to use document id instead of content hash (#1210)
<!-- .github/pull_request_template.md -->

## Description
Changing deletion logic to use document id instead of content hash

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-06 16:54:35 +02:00
sssaha1989
a9e74dac42
Update LiteLLMEmbeddingEngine.py (#1205)
This will allow to deal with the issue when the user is using custom
embedding and LLM and passes the hosted_vllm option as part of the
LiteLLM documentation

<!-- .github/pull_request_template.md -->

## Description
<!-- This allows the user to use hosted_vllm with respect to LiteLLM
usage - and only gets applicable for custom embedding models -
specifically Hugging Face models -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-06 16:53:33 +02:00
Igor Ilic
ede6c881c7
fix: Resolve env template variable (#1208)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-06 11:04:34 +02:00
Igor Ilic
b54e843951
Add neo4j multi db support (#1207)
<!-- .github/pull_request_template.md -->

## Description
Add multi db support for Neo4j Enterprise users

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Signed-off-by: Raj2604 <rajmandhare26@gmail.com>
Co-authored-by: vasilije <vas.markovic@gmail.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
Co-authored-by: Daulet Amirkhanov <damirkhanov01@gmail.com>
Co-authored-by: Hande <159312713+hande-k@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
Co-authored-by: Matea Pesic <80577904+matea16@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions@users.noreply.github.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
Co-authored-by: Raj Mandhare <96978537+Raj2604@users.noreply.github.com>
Co-authored-by: Pedro Thompson <thompsonp17@hotmail.com>
Co-authored-by: Pedro Henrique Thompson Furtado <pedrothompson@petrobras.com.br>
2025-08-06 10:44:45 +02:00
Vasilije
dabd0912f8
feat: Cog 2082 add BAML to cognee (#1054)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Signed-off-by: Raj2604 <rajmandhare26@gmail.com>
Co-authored-by: Daulet Amirkhanov <damirkhanov01@gmail.com>
Co-authored-by: Hande <159312713+hande-k@users.noreply.github.com>
Co-authored-by: Igor Ilic <igorilic03@gmail.com>
Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
Co-authored-by: Matea Pesic <80577904+matea16@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions@users.noreply.github.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
Co-authored-by: Raj Mandhare <96978537+Raj2604@users.noreply.github.com>
Co-authored-by: Pedro Thompson <thompsonp17@hotmail.com>
Co-authored-by: Pedro Henrique Thompson Furtado <pedrothompson@petrobras.com.br>
2025-08-06 10:41:47 +02:00
Vasilije
4cdc42ea7a
Merge branch 'main' into feat/configurable-path-exclusion 2025-08-05 19:00:48 +02:00
Igor Ilic
8d4ed35cbe
Fix low level pipeline (#1203)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-05 17:01:48 +02:00
Raj Mandhare
e26c21670a
Merge branch 'dev' into feature/top_k-input 2025-08-05 16:29:46 +05:30
Igor Ilic
ba624660b9
Merge main vol 4 (#1201)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-05 12:55:45 +02:00
Igor Ilic
d237b806f9
Merge branch 'dev' into merge-main-vol-4 2025-08-05 12:51:15 +02:00
Raj2604
8590194806 feat: add top_k input to SearchView and send to backend 2025-08-05 16:19:33 +05:30
Igor Ilic
343d990fcc
Merge main vol 4 (#1200)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Signed-off-by: Andrew Carbonetto <andrew.carbonetto@improving.com>
Signed-off-by: Andy Kwok <andy.kwok@improving.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
Co-authored-by: vasilije <vas.markovic@gmail.com>
Co-authored-by: Andrew Carbonetto <andrew.carbonetto@improving.com>
Co-authored-by: Andy Kwok <andy.kwok@improving.com>
2025-08-05 12:48:24 +02:00
Igor Ilic
ab425e4e18 Merge branch 'main' into merge-main-vol-4 2025-08-05 11:49:55 +02:00
vasilije
11422f362f fixes to formatting 2025-08-05 10:12:05 +02:00
Andrew Carbonetto
7d2bf78c81
Add Neptune Analytics hybrid storage (#1156)
<!-- .github/pull_request_template.md -->

## Description
Adds a Neptune Analytics 'hybrid' integration layer to the Cognee.ai
memory storage layer. The following configuration will use Amazon
Neptune Analytics to store all nodes, edges, and vector embeddings for
the Cognee.ai memory conversation.

```
    cognee.config.set_graph_db_config(
        {
            "graph_database_provider": "neptune_analytics",  # Specify Neptune Analytics as provider
            "graph_database_url": graph_endpoint_url,  # Neptune Analytics endpoint with the format neptune-graph://<GRAPH_ID>
        }
    )
    cognee.config.set_vector_db_config(
        {
            "vector_db_provider": "neptune_analytics",  # Specify Neptune Analytics as provider
            "vector_db_url": graph_endpoint_url,  # Neptune Analytics endpoint with the format neptune-graph://<GRAPH_ID>
        }
    )
```

For example, see
[neptune_analytics_example.py](08a3a1d2a8/examples/database_examples/neptune_analytics_example.py)

Related: https://github.com/topoteretes/cognee-starter/pull/11

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Signed-off-by: Andrew Carbonetto <andrew.carbonetto@improving.com>
Signed-off-by: Andy Kwok <andy.kwok@improving.com>
Co-authored-by: Andy Kwok <andy.kwok@improving.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-08-05 10:05:31 +02:00
Vasilije
1a15669779
Merge branch 'dev' into add_cli 2025-08-04 19:54:11 +02:00
vasilije
84ba0b2b59 fix cognify 2025-08-04 19:51:54 +02:00
Igor Ilic
b46833b476
Cognee mcp fixes main (#1196)
<!-- .github/pull_request_template.md -->

## Description
Fix Cognee mcp issues

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-04 15:48:04 +02:00
Igor Ilic
f4a37edc3f
Cognee mcp docker fix (#1195)
<!-- .github/pull_request_template.md -->

## Description
Resolve issues with Cognee MCP docker use

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-04 12:19:12 +02:00