Commit graph

56 commits

Author SHA1 Message Date
martin0731
3acb581bd0 Removed check_permissions_on_dataset.py and related references 2025-11-13 08:31:15 -05:00
Daulet Amirkhanov
56d464cfa8 rerun and update notebooks with latest cognee 2025-10-22 19:05:01 +01:00
Vasilije
666204d0db
Rerun notebooks with latest cognee - 0.3.5 (#1517)
<!-- .github/pull_request_template.md -->

## Description
<!--
Please provide a clear, human-generated description of the changes in
this PR.
DO NOT use AI-generated descriptions. We want to understand your thought
process and reasoning.
-->

Re-run notebooks:
1. validate notebooks work with latest 0.3.5 version
2. Override outputs that contained "Acknowledged" (old gpt-5 bug)

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-12 10:24:41 +02:00
Daulet Amirkhanov
63a1463073 Deprecate SearchType.INSIGHTS, replace all references to default search type - SearchType.GRAPH_COMPLETION 2025-10-08 12:13:59 +01:00
Daulet Amirkhanov
0c59bcdda2 Rerun notebooks with latest cognee - 0.3.5 2025-10-07 21:49:02 +01:00
Daulet Amirkhanov
f339b27dfa refactor: update Jupyter notebooks to prevent kernel exit during GitHub Actions 2025-08-27 15:37:29 +01:00
Daulet Amirkhanov
19b59fcd6d Update Jupyter notebooks: added execution counts, improved logging outputs, and updated Python version to 3.12.7. Removed unnecessary code cells and ensured consistent environment variable handling across demos. 2025-08-27 14:59:39 +01:00
vasilije
cf9b5a2977 added update 2025-06-30 14:20:06 +02:00
vasilije
3e43a4726f notebooks and poetry lock update 2025-06-30 14:20:06 +02:00
Boris
e7644f4b3a
feat: migrate new UI to cognee (#966)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <igorilic03@gmail.com>
2025-06-18 20:56:44 +02:00
Daniel Molnar
91f3cd9ef7
fix: notebooks (#818)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-05-13 18:13:26 +02:00
Daniel Molnar
9ba12b25ef
feat: add delete by document (#668)
<!-- .github/pull_request_template.md -->

## Description
Delete by document.

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin
2025-04-17 15:42:10 +02:00
Vasilije
228fba8096
fix: Refactor notebooks (#720)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-04-11 10:23:22 +02:00
Boris
9536395468
Revert "feat: pipeline tasks needs mapping" (#717)
Reverts topoteretes/cognee#690

I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-10 12:10:12 +02:00
Boris
0ce6fad24a
feat: pipeline tasks needs mapping (#690)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-04-03 10:52:59 +02:00
alekszievr
c1f7b667d1
feat: Eliminate the use of max_chunk_tokens and use a unified max_chunk_size instead [cog-1381] (#626)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **Refactor**
- Simplified text processing by unifying multiple size-related
parameters into a single metric across chunking and extraction
functionalities.
- Streamlined logic for text segmentation by removing redundant
calculations and checks, resulting in a more consistent chunk management
process.
- **Chores**
  - Removed the `modal` package as a dependency.
- **Documentation**
- Updated the README.md to include a new demo video link and clarified
default environment variable settings.
- Enhanced the CONTRIBUTING.md to improve clarity and engagement for
potential contributors.
- **Bug Fixes**
- Improved handling of sentence-ending punctuation in text processing to
include additional characters.
- **Version Update**
  - Updated project version to 0.1.33 in the pyproject.toml file.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-03-12 14:03:41 +01:00
Vasilije
2072c7a081
feat: improve tests add macos runners (#540)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->

## Summary by CodeRabbit

- **Chores**
- Improved automated testing setups to run across multiple operating
systems (Ubuntu and macOS) for Python 3.10, 3.11, and 3.12.
- Enhanced compatibility and coverage across diverse environments,
ensuring a more robust validation process.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: soekja <wes.hubert@gmail.com>
Co-authored-by: Boris <boris@topoteretes.com>
2025-02-15 04:19:19 +01:00
Vasilije
9ba2e0d6c1
chore: Fix and update visualization (#518)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Introduced enhanced visualization capabilities that let users launch a
dedicated server for visual displays.
  
- **Documentation**
- Updated several interactive notebooks to include execution outputs and
expanded explanatory content for better user guidance.
  
- **Style**
- Refined formatting and layout across notebooks to ensure consistent
presentation and improved readability.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
2025-02-11 19:25:01 +01:00
alekszievr
05ba29af01
Feat: log pipeline status and pass it through pipeline [COG-1214] (#501)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Enhanced pipeline execution now provides consolidated status feedback
with improved telemetry for start, completion, and error events.
- Automatic generation of unique dataset identifiers offers clearer task
and pipeline run associations.

- **Refactor**
- Task execution has been streamlined with explicit parameter handling
for more structured pipeline processing.
- Interactive examples and demos now return results directly, making
integration and monitoring more accessible.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-02-11 16:41:40 +01:00
Boris
f75e35c337
fix: custom model pipeline (#508)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit


- **New Features**
• Graph visualizations now allow exporting to a user-specified file path
for more flexible output management.
• The text embedding process has been enhanced with an additional
tokenizer option for improved performance.
• A new `ExtendableDataPoint` class has been introduced for future
extensions.
• New JSON files for companies and individuals have been added to
facilitate testing and data processing.

- **Improvements**
• Search functionality now uses updated identifiers for more reliable
content retrieval.
• Metadata handling has been streamlined across various classes by
removing unnecessary type specifications.
• Enhanced serialization of properties in the Neo4j adapter for improved
handling of complex structures.
• The setup process for databases has been improved with a new
asynchronous setup function.

- **Chores**
• Dependency and configuration updates improve overall stability and
performance.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-08 02:00:15 +01:00
Igor Ilic
5fe7ff9883
refactor: Refactor search so graph completion is used by default (#505)
<!-- .github/pull_request_template.md -->

## Description
Refactor search so query type doesn't need to be provided to make it
simpler for new users

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **Refactor**
- Improved the search interface by standardizing parameter usage with
explicit keyword arguments for specifying search types, enhancing
clarity and consistency.
- **Tests**
- Updated test cases and example integrations to align with the revised
search parameters, ensuring consistent behavior and reliable validation
of search outcomes.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-07 17:16:34 +01:00
Igor Ilic
3e29c3d8f2 docs: Update notebook to work with changes to max chunk tokens 2025-01-28 15:38:38 +01:00
hande-k
343de01d5a update notebooks with latest eval 2025-01-23 11:11:51 +01:00
vasilije
cbd15b98a5 Fix linter issues 2025-01-05 20:24:04 +01:00
vasilije
2675836149 Fix linter issues 2025-01-05 20:17:49 +01:00
vasilije
60c8fd103b ruff format 2025-01-05 19:09:08 +01:00
Boris
348610e73c
fix: refactor get_graph_from_model to return nodes and edges correctly (#257)
* fix: handle rate limit error coming from llm model

* fix: fixes lost edges and nodes in get_graph_from_model

* fix: fixes database pruning issue in pgvector (#261)

* fix: cognee_demo notebook pipeline is not saving summaries

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2024-12-06 12:52:01 +01:00
Vasilije
9d6081c7f7
feat: Add support for multiple audio and image formats (#12)
Added support for multiple audio and image formats with example

The formats added are the possible filetype library return values for
extension for Audio and Images

Feature COG-507
2024-11-23 16:31:55 +01:00
Igor Ilic
61ed516d12 docs: Add multimedia notebook
Added multimedia notebook for cognee

Docs COG-507
2024-11-20 16:21:29 +01:00
Igor Ilic
f9353d25fa fix: Update table name in notebook
Update table name in notebook

Fix COG-677
2024-11-20 15:14:38 +01:00
Igor Ilic
70fe6ac541 fix: Update table name in notebook
Update table name to use latest in notebook

Fix COG-677
2024-11-20 15:07:38 +01:00
Boris
d8b6eeded5
feat: log search queries and results (#166)
* feat: log search queries and results

* fix: address coderabbit review comments

* fix: parse UUID when logging search results

* fix: remove custom UUID type and use DB agnostic UUID from sqlalchemy

* Add new cognee_db

---------

Co-authored-by: Leon Luithlen <leon@topoteretes.com>
2024-11-17 11:59:10 +01:00
Boris
52180eb6b5
feat: COG-184 add falkordb (#192)
* feat: add falkordb adapter

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2024-11-11 18:20:52 +01:00
Igor Ilic
23ed38d615 test: Fix intentional typo in notebook
Removed typo used for testing notebook github action

Test #COG-462
2024-10-29 14:24:17 +01:00
Igor Ilic
2ba57220d8 test: Add typo in notebook to test github action
Added typo in notebook to test if github action will catch the issue

Test #COG-462
2024-10-29 14:20:47 +01:00
Igor Ilic
c183742ad5 test: Add test for Jupyter notebook
Added testing of Jupyter notebook through github actions

Test #COG-462
2024-10-29 13:47:23 +01:00
Igor Ilic
6555f4e88e fix: Resolve chunking issue for notebook
Add cleaning of local data to resolve chunking issue with repeated notebook use

Fix
2024-10-27 22:33:20 +01:00
Boris
2f832b190c
fix: various fixes for the deployment
* fix: remove groups from UserRead model

* fix: add missing system dependencies for postgres

* fix: change vector db provider environment variable name

* fix: WeaviateAdapter retrieve bug

* fix: correctly return data point objects from retrieve method

* fix: align graph object properties

* feat: add node example
2024-10-22 11:26:48 +02:00
Igor Ilic
658b6df4c6 refactor: Remove architecture overview
Removed architecture overview from notebook for now

Refactor #COG-387
2024-10-11 17:57:51 +02:00
Igor Ilic
1446606e95 fix: Fix pruning of relation database for Notebook
Fixes issue regarding no pruning of relational database for Jupyter Notebook

Fix #COG-387
2024-10-10 18:03:27 +02:00
Igor Ilic
85bf48b1f8 refactor: Raise font of last header
Bigger font for last header regarding starring cognee

Refactor #COG-387
2024-10-10 17:15:20 +02:00
Igor Ilic
9a12e29c97 refactor: Add cognee process diagram to notebook
Added cognee process diagram for the data used in the notebook

Refactor #COG-387
2024-10-10 17:05:16 +02:00
Igor Ilic
2bfe185672 refactor: Expand cognee notebook introduction
Expand information about cognee in notebook

Refactor #COG-387
2024-10-10 15:39:13 +02:00
Igor Ilic
f308cdd6fe refactor: Refactor cognee text in notebook
Added cognee description from landing page with appropriate graphs and images

Refactor #COG-387
2024-10-10 15:27:48 +02:00
Igor Ilic
e5dd99d2ca refactor: refactor rag problem image
Refactor image to accomodate to cognee style

Refactor #COG-387
2024-10-10 13:14:24 +02:00
Igor Ilic
bd8f5284a9 refactor: Add other search types to notebook
Added other search types and explanations to notebook

Refactor #COG-387
2024-10-09 20:58:16 +02:00
Igor Ilic
9ec292aac5 refactor: Add other search types to notebook
Added other search types and explanations to jupyter notebook

refactor #COG-387
2024-10-09 17:52:29 +02:00
Igor Ilic
eab6bda2d3 refactor: Add summarization task to notebook
Added summarization task to jupyter notebook

Refactor #COG-387
2024-10-09 17:12:32 +02:00
Igor Ilic
2497546cfb refactor: Add link for articles
Added hyperlinks to article mention

Refactor #COG-382
2024-10-08 20:12:25 +02:00
Igor Ilic
85694687a3 refactor: Refactor RAG issue explanation
Refactored explanation about issues with RAGs. Added information on how results are scored.

Refactor #COG-382
2024-10-08 19:56:59 +02:00