Commit graph

3132 commits

Author SHA1 Message Date
hajdul88
66d2c75128 chore: changing docstring 2025-08-13 15:20:24 +02:00
hajdul88
7af4775c56 chore: changed pypdf error base class 2025-08-13 15:17:01 +02:00
hajdul88
da40365932 ruff formatting 2025-08-13 15:15:39 +02:00
hajdul88
68327d3ab9 chore: Changing base classes for exceptions that were already defined 2025-08-13 15:14:43 +02:00
hajdul88
59c9204a74 Update exceptions.py 2025-08-13 15:03:44 +02:00
hajdul88
fabbd638a4 chore: renaming error 2025-08-13 15:03:12 +02:00
hajdul88
d1bfeaa0f2 fix: fixes search unit test error expectation 2025-08-13 15:00:25 +02:00
hajdul88
885f7c3f99 chore: fixing graph elements tests 2025-08-13 14:58:56 +02:00
hajdul88
544e08930b feat: removing invalidValueErrors 2025-08-13 14:42:57 +02:00
hajdul88
38329da0e8
Merge branch 'dev' into feature/cog-2717-add-better-error-management-to-cognee 2025-08-13 14:11:56 +02:00
hajdul88
bed523a36b feat: api error handling restruct 2025-08-13 13:59:12 +02:00
Igor Ilic
beea2f5e0a
Incremental loading migration (#1238)

## Description
Add a relational DB migration for incremental loading, and change incremental
loading to process documents one at a time instead of all together asynchronously

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-13 07:58:09 -04:00
hajdul88
32996aa0d0 feat: adds new error classes to llm and databases + introduces loglevel and logging from child error 2025-08-13 13:40:50 +02:00
hajdul88
6dcd59c73c feat: Adds changes to cognee graph part 2025-08-13 13:07:04 +02:00
hajdul88
91b9c11cd0 changes base class for vectordb exceptions 2025-08-13 12:39:34 +02:00
hajdul88
9fb9f68c42 adds new base errors to retrieval exceptions 2025-08-13 12:36:31 +02:00
hajdul88
5bc00f1143 feat: adds new search classes to search.py 2025-08-13 12:29:35 +02:00
hajdul88
657c775cbe feat: adds configattributeerror 2025-08-13 12:22:03 +02:00
hajdul88
ebd4403c2f adds new error classes to keyword extraction error and defines new error 2025-08-13 12:14:42 +02:00
hajdul88
6870bba5a9 feat: adds new error to delete 2025-08-13 12:03:18 +02:00
hajdul88
7bd2660d08 feat: setting base classes of data exceptions to the new ones 2025-08-13 11:58:32 +02:00
hajdul88
623148ec6a feat: adds new errors to litellm client 2025-08-13 11:53:29 +02:00
hajdul88
b40dbf3c68 feat: adds new error classes to litellm instructor 2025-08-13 11:50:24 +02:00
hajdul88
1b3898dd8b feat: updates rel_db exceptions with the new error classes 2025-08-13 11:38:38 +02:00
hajdul88
0555dd9dda Update __init__.py 2025-08-13 11:29:16 +02:00
hajdul88
ed555a731d feat: adds new Exception classes 2025-08-13 11:27:45 +02:00
hajdul88
fd9aaf57b1
feature: Adds doctype handling to delete (audio, image, unstructured) (#1239)

## Description
feature: Adds doctype handling to delete (audio, image, unstructured)

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-12 19:01:06 +02:00
Vasilije
a15cad05d5
Merge branch 'dev' into add_cli 2025-08-12 16:00:51 +02:00
Vasilije
0c42d19505
Update README.md 2025-08-12 15:38:30 +02:00
vasilije
e3bd4749ca git merge 2025-08-12 15:18:19 +02:00
vasilije
b2265957d9 added fix 2025-08-12 15:17:14 +02:00
vasilije
1de81c8dbf added fixes 2025-08-12 15:16:31 +02:00
vasilije
ff1614d55f format 2025-08-12 15:16:31 +02:00
Igor Ilic
a75a79f012
Lancedb async lock (#1222)

## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-08-12 08:46:15 -04:00
Igor Ilic
0a03ff1501
Merge branch 'dev' into configurable-path-exclusion-code-graph 2025-08-11 16:38:07 +02:00
Igor Ilic
c33536685d
fix: Resolve Kuzu S3 issue (#1235)

## Description
Resolve issue with pushing Kuzu DBs to S3

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-11 16:29:36 +02:00
Hande
6e7e4506cc
chore: update readme (#1234)

## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-11 11:16:16 +02:00
Fardeen Malik
a31986e591
Merge branch 'main' into multi-lang-codegraph 2025-08-11 13:56:18 +05:30
P-FardeenMalik
0a330683de Extend CodeGraph pipeline for multi-language support (closes #1160) 2025-08-11 13:48:37 +05:30
Vasilije
ed2bea4ff2
Potential fix for code scanning alert no. 147: Clear-text logging of sensitive information (#1229)
Potential fix for
[https://github.com/topoteretes/cognee/security/code-scanning/147](https://github.com/topoteretes/cognee/security/code-scanning/147)

To fix the problem, we should avoid logging any part of the API key.
Instead, we can log a generic message indicating that an API key is
configured, without revealing any part of the key. This maintains the
functionality of confirming that the key is present, while eliminating
the risk of exposing sensitive information. The change should be made in
`cognee-mcp/src/test_client.py`, specifically on line 60, replacing the
log statement with a safer alternative. No new imports or methods are
required.
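
The idea described above can be sketched as follows. This is a minimal illustration, not the actual change made in `cognee-mcp/src/test_client.py`: the helper name `describe_api_key`, the logger name, and the `LLM_API_KEY` variable are assumptions introduced here.

```python
import logging
import os

# Hypothetical logger name, used only for this sketch.
logger = logging.getLogger("test_client_sketch")


def describe_api_key(api_key):
    """Return a log-safe status message that contains no part of the key."""
    return "API key is configured" if api_key else "API key is not configured"


# LLM_API_KEY is an assumed environment variable name for illustration.
logger.info(describe_api_key(os.environ.get("LLM_API_KEY")))
```

The message still confirms whether a key is present, but no prefix, suffix, or length of the key ever reaches the log.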

---


_Suggested fixes powered by Copilot Autofix. Review carefully before
merging._

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
2025-08-10 12:29:51 +02:00
Emmanuel Ferdman
089fe34ec9
fix: raise error on version mapping failure (#1228)
## Description
Fixed `ValueError` that was not being properly raised when
`version_code` could not be mapped to proper Kuzu version.
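
A minimal sketch of the pattern this fix describes, assuming a lookup table from version codes to versions. The mapping values and the function name `resolve_kuzu_version` are hypothetical; the commit does not show the real ones.

```python
# Hypothetical mapping; the real codes and versions are not given in the commit.
VERSION_MAP = {1: "0.1.0", 2: "0.2.0"}


def resolve_kuzu_version(version_code):
    """Map a version code to a version string, failing loudly when unknown."""
    version = VERSION_MAP.get(version_code)
    if version is None:
        # The exception must be raised explicitly; merely constructing
        # ValueError(...) without `raise` would be silently discarded.
        raise ValueError(f"Could not map version_code {version_code} to a Kuzu version")
    return version
```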

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
2025-08-10 10:17:17 +02:00
Pavel Zorin
df0c0b3f7f
Chore: fix Community Greetings workflow permissions (#1227)

## Description
Fix “Resource not accessible by integration” error in greetings workflow

This PR updates the community | Greetings GitHub Actions workflow to
ensure it can successfully post greeting comments on newly opened issues
and pull requests — including PRs from forks — without hitting the
“Resource not accessible by integration” error.

## Changes
- Switched PR trigger from `pull_request` to `pull_request_target` to
run in the context of the base repository and grant write-scoped
`GITHUB_TOKEN` for commenting on forked PRs.
- Added explicit `permissions` block with:
  - `issues: write`  
  - `pull-requests: write`  
- Limited triggers to `types: [opened]` for both issues and PRs to avoid
unnecessary runs.
- Preserved existing greeting messages for issues and pull requests.

## Reason for change
The workflow was failing because the default `GITHUB_TOKEN` in
`pull_request` events is read-only for forks, preventing the bot from
posting comments.
`pull_request_target` with explicit permissions solves this while
maintaining security.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-10 07:27:02 +02:00
EricXiao
815d639132
fix: graph visualization access for users with read permissions (#1220)

## Description
This PR fixes graph visualization access for users with read permissions
(https://github.com/topoteretes/cognee/issues/1182)

- Add permission checks for graph visualization endpoints to ensure
users can only access datasets they have permission to view
- Create get_dataset_with_permissions method to validate user access
before returning a dataset
- Remove redundant dataset existence validation in datasets router and
delegate permission checking to graph data retrieval
- Add comprehensive test suite for graph visualization permissions
covering owner access and permission granting scenarios
- Update get_formatted_graph_data() to use dataset owner's ID for
context
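
The permission check listed above might look roughly like the sketch below. Only the name `get_dataset_with_permissions` comes from the PR description; its signature, the in-memory stores, and `PermissionDeniedError` are assumptions made for illustration and do not reflect cognee's actual implementation.

```python
class PermissionDeniedError(Exception):
    """Hypothetical error for a user lacking the requested permission."""


# In-memory stand-ins for the dataset and permission tables (assumed data).
DATASETS = {"ds1": {"id": "ds1", "owner_id": "alice"}}
PERMISSIONS = {("bob", "ds1"): {"read"}}


def get_dataset_with_permissions(user_id, dataset_id, permission="read"):
    """Return the dataset only if the user owns it or holds the permission."""
    dataset = DATASETS.get(dataset_id)
    if dataset is None:
        raise KeyError(f"Dataset {dataset_id} not found")
    granted = PERMISSIONS.get((user_id, dataset_id), set())
    if dataset["owner_id"] == user_id or permission in granted:
        return dataset
    raise PermissionDeniedError(f"{user_id} lacks '{permission}' on {dataset_id}")
```

A caller that receives the dataset can then use `dataset["owner_id"]` as the context for fetching graph data, which mirrors the last bullet above.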
## Testing
Tests can be run with:
```bash
pytest -s cognee/tests/test_graph_visualization_permissions.py
```

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Signed-off-by: EricXiao <taoiaox@gmail.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-08-08 20:42:57 +02:00
Vasilije
e3b41e0ed4
fix: Interface fixes (#1206)

## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Daulet Amirkhanov <damirkhanov01@gmail.com>
2025-08-08 20:41:33 +02:00
Vasilije
f65605b575
fix: Feature/cog 2648 evals update (#1221)

## Description

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
Co-authored-by: Hande <159312713+hande-k@users.noreply.github.com>
2025-08-08 20:23:09 +02:00
Igor Ilic
5c44d01265
Merge branch 'dev' into configurable-path-exclusion-code-graph 2025-08-07 16:12:15 +02:00
lxobr
6dbd8e85a1
feat: dynamic multiple edges in datapoints (#1212)

## Description
- Improved list handling, removed `.index` logic from
`get_graph_from_model`, transitioned to fully datapoint-oriented
processing
- Streamlined datapoint iteration by introducing `_datapoints_generator`
with nested loops
- Generalized field processing to handle mixed lists: `[DataPoint,
(Edge, DataPoint), (Edge, [DataPoint])]`, allowing dynamic multiple
edges generation
- Small improvements and refactorings
- Added tests to `test_get_graph_from_model_flexible_edges()` covering
weighted edges and dynamic multiple edges
- Created `dynamic_multiple_edges_example.py` demonstrating dynamic
multiple edges
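
To illustrate the mixed-list shape `[DataPoint, (Edge, DataPoint), (Edge, [DataPoint])]`, here is a minimal sketch of how such a field could be flattened into `(edge, target)` pairs. The `Edge` and `DataPoint` classes below are simplified stand-ins, and `iter_edge_targets` is a hypothetical helper, not the `_datapoints_generator` from the PR.

```python
from dataclasses import dataclass
from typing import Iterator, Optional, Tuple


@dataclass
class Edge:
    """Simplified stand-in for an edge with optional weight (assumed fields)."""
    relationship_type: str
    weight: Optional[float] = None


@dataclass
class DataPoint:
    """Simplified stand-in for a datapoint (assumed fields)."""
    name: str


def iter_edge_targets(items) -> Iterator[Tuple[Optional[Edge], DataPoint]]:
    """Yield one (edge, datapoint) pair per target in a mixed list."""
    for item in items:
        if isinstance(item, DataPoint):
            # Bare datapoint: no explicit edge metadata.
            yield None, item
        elif isinstance(item, tuple) and len(item) == 2 and isinstance(item[0], Edge):
            edge, target = item
            # One edge may fan out to a single target or to a list of targets.
            targets = target if isinstance(target, list) else [target]
            for datapoint in targets:
                yield edge, datapoint
        else:
            raise TypeError(f"Unsupported field item: {item!r}")
```

With this shape, a single `(Edge, [DataPoint])` entry produces several edges that share the same metadata, which is one way to read "dynamic multiple edges".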

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-07 14:50:45 +02:00
vasilije
c8202c51a7 format fix 2025-08-06 19:11:39 +02:00
vasilije
0ea5894823 added distributed fixes 2025-08-06 19:11:20 +02:00
hajdul88
4e816ad80b
fix: changing deletion logic to use document id instead of content hash (#1210)

## Description
Changing deletion logic to use document id instead of content hash

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-08-06 16:54:35 +02:00