29 commits

Author SHA1 Message Date
Boris
57b0e0ef0b
Merge with main (#892)

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: vasilije <vas.markovic@gmail.com>
Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
Co-authored-by: Igor Ilic <igorilic03@gmail.com>
Co-authored-by: Hande <159312713+hande-k@users.noreply.github.com>
Co-authored-by: Matea Pesic <80577904+matea16@users.noreply.github.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
Co-authored-by: Daniel Molnar <soobrosa@gmail.com>
Co-authored-by: Diego Baptista Theuerkauf <34717973+diegoabt@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions@users.noreply.github.com>
2025-05-30 23:13:04 +02:00
Jan Müller
66da3d2934
Document collection of usage data and how to opt out (#767)
## Description
Document in the README that Cognee collects usage data, and add
instructions on how users can opt out.

---------

Co-authored-by: i555611 <jan.mueller01@sap.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-04-22 23:52:34 +02:00
rex993
54fb400d91
Set LiteLLM logging level (#758)
## Description
LiteLLM adds `LiteLLM:INFO` debugging lines to the output when running,
which can flood the output and make actual cognee debugging difficult.
This .env config restricts logging to ERROR level only.

Note that there is an open LiteLLM bug
([https://github.com/BerriAI/litellm/issues/9815](https://github.com/BerriAI/litellm/issues/9815)):
'cost calculation' lines are still shown despite ERROR-level logging.
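
For illustration only, the effect of such a setting can be sketched in
Python with the standard `logging` module. The variable name
`LITELLM_LOG`, the logger name `LiteLLM`, and the helper
`apply_log_level` are assumptions made for this sketch; they are not
taken from this PR.

```python
import logging
import os


def apply_log_level(logger_name: str, env_var: str, default: str = "ERROR") -> int:
    """Set the named logger's level from an environment variable and return it."""
    # Assumed convention: the variable holds a level name such as "ERROR".
    level_name = os.environ.get(env_var, default).upper()
    level = logging.getLevelName(level_name)
    # getLevelName returns a string for unknown names; fall back to ERROR.
    if not isinstance(level, int):
        level = logging.ERROR
    logging.getLogger(logger_name).setLevel(level)
    return level


# Hypothetical usage: quiet the (assumed) "LiteLLM" logger.
apply_log_level("LiteLLM", "LITELLM_LOG")
```

Falling back to ERROR on an unrecognized value keeps a typo in the .env
file from re-enabling the noisy output.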

Co-authored-by: rex <rex@epicx.me>
2025-04-18 09:28:39 +02:00
Vasilije
67e980f22e
feat: Add default user from config (#682)

---------

Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-04-02 14:48:10 +02:00
Igor Ilic
9f587a01a4
feat: Relational db to graph db [COG-1468] (#644)
## Description
Add ability to migrate relational database to graph database

2025-03-26 11:40:06 +01:00
Daniel Molnar
69950a04dd
feat: Kuzu integration (#628)
## Description
Let's scope it out.

## Summary by CodeRabbit

- **New Features**
  - Introduced support for the Kuzu graph database provider, enhancing
    graph operations and data management capabilities.
  - Added a comprehensive adapter for Kuzu, facilitating various graph
    database operations.
  - Expanded the enumeration of graph database types to include Kuzu.

- **Tests**
  - Launched comprehensive asynchronous tests to validate the new Kuzu
    graph integration’s performance and reliability.

- **Chores**
  - Updated dependency settings and continuous integration workflows to
    include the Kuzu provider, ensuring smoother deployments and improved
    system quality.
  - Enhanced configuration documentation to clarify Kuzu database
    requirements.
  - Modified Dockerfile to include Kuzu in the installation extras.

---------

Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
2025-03-13 17:47:09 +01:00
Dmitrii Galkin
e147fa5bde
feat: Add support for ChromaDB (#622)
## Description

# Add Support for ChromaDB

## Summary
This PR adds support for ChromaDB as a vector database option in the
Cognee application. ChromaDB is a modern, open-source embedding database
designed for AI applications.

## Changes
- Created a new ChromaDBAdapter implementation for vector database
operations
- Added comprehensive test suite for ChromaDB functionality
- Updated docker-compose.yml to include ChromaDB service
- Modified environment configuration to support ChromaDB settings
- Updated vector engine creation logic to support ChromaDB as an option

## Technical Details
- Implemented `ChromaDBAdapter.py` (347 lines) with full CRUD operations
for vector data
- Created test suite (`test_chromadb.py`) with 171 lines of test
coverage
- Updated vector engine creation process to dynamically select ChromaDB
when configured
- Modified settings router to accommodate new database option
- Updated environment template with ChromaDB configuration options

## Docker Changes
- Added ChromaDB service to docker-compose.yml with appropriate
configuration

This PR enhances Cognee's flexibility by providing an alternative vector
database option, allowing users to choose the most appropriate database
for their specific use case.



Tested with UI + tests.


## Summary by CodeRabbit

- **New Features**
  - Expanded vector database integration by adding support for ChromaDB,
    enabling enhanced data management and search functionalities.
- **Tests**
  - Added automated tests to validate the ChromaDB integration and related
    operations.
- **Chores**
  - Updated configuration guidance and dependency management to include
    ChromaDB.
  - Provided an optional container deployment template for ChromaDB.
  - Added a new entry to ignore the `.chromadb_data/` directory in version
    control.
  - Introduced a new GitHub Actions workflow for testing ChromaDB
    integration.

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-03-13 15:13:04 +01:00
SJ
0c42c10f64
Comment out the postgres configuration from .env.template (#502)
## Description
The simplest cognee Docker setup (SQLite, NetworkX, LanceDB) should not
enable the Postgres configuration in .env.template by default. I think
leaving the Postgres details commented out is better than removing them
entirely. This keeps the option visible for newcomers.

## Summary by CodeRabbit

- **Chores**
  - Updated configuration template to remove database parameters not used
    in the default setup, with clearer guidance to ensure the intended
    configuration is maintained.

2025-02-06 21:35:40 +01:00
Igor Ilic
8879f3fbbe
feat: Add gemini support [COG-1023] (#485)
## Description
PR to test Gemini PR from holchan

1. Add Gemini LLM and Gemini Embedding support 
2. Fix CodeGraph issue with chunks being bigger than maximum token value
3. Add Tokenizer adapters to CodeGraph

## Summary by CodeRabbit

- **New Features**
    - Added support for the Gemini LLM provider.
    - Expanded LLM configuration options.
    - Introduced a new GitHub Actions workflow for multimetric QA
      evaluation.
    - Added new environment variables for LLM and embedding configurations
      across various workflows.

- **Bug Fixes**
    - Improved error handling in various components.
    - Updated tokenization and embedding processes.
    - Removed warning related to missing `dict` method in data items.

- **Refactor**
    - Simplified token extraction and decoding methods.
    - Updated tokenizer interfaces.
    - Removed deprecated dependencies.
    - Enhanced retry logic and error handling in embedding processes.

- **Documentation**
    - Updated configuration comments and settings.

- **Chores**
    - Updated GitHub Actions workflows to accommodate new secrets and
      environment variables.
    - Modified evaluation parameters.
    - Adjusted dependency management for optional libraries.

---------

Co-authored-by: holchan <61059652+holchan@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
2025-01-31 18:03:23 +01:00
Igor Ilic
a8644e0bd7 feat: Use litellm max token size as default for model, if model exists in litellm 2025-01-28 17:00:47 +01:00
Igor Ilic
3db7f85c9c feat: Add max_chunk_tokens value to chunkers
Add formula and forwarding of max_chunk_tokens value through Cognee
2025-01-28 14:32:00 +01:00
Igor Ilic
1319944dcd docs: Update .env.template to include llm and embedding options 2025-01-23 18:05:45 +01:00
Igor Ilic
fb5f0cf00f chore: Make milvus an optional dependency
Make Milvus an optional dependency, expand docs with Milvus information

Chore
2024-12-03 10:37:50 +01:00
Boris
c045f737f7
feat: add vector and graph dbs state to README file (#235) 2024-11-18 17:51:41 +01:00
Igor Ilic
88ded6e1a8 Merge branch 'main' of github.com:topoteretes/cognee into COG-170-PGvector-adapter 2024-10-22 12:41:18 +02:00
Boris
2f832b190c
fix: various fixes for the deployment
* fix: remove groups from UserRead model

* fix: add missing system dependencies for postgres

* fix: change vector db provider environment variable name

* fix: WeaviateAdapter retrieve bug

* fix: correctly return data point objects from retrieve method

* fix: align graph object properties

* feat: add node example
2024-10-22 11:26:48 +02:00
Igor Ilic
9b9ae6c8aa refactor: Remove unused env parameter
Remove VECTOR_DB_NAME env parameter as it's not needed

Refactor #COG-170
2024-10-17 17:13:40 +02:00
Igor Ilic
c62dfdda9b feat: Add PGVectorAdapter
Added PGVectorAdapter

Feature #COG-170
2024-10-11 15:00:28 +02:00
Igor Ilic
d68a3be32e feat: Add config support for pgvector
Added config support for using pgvector

Feature #COG-170
2024-10-11 13:23:11 +02:00
Igor Ilic
fcd60861ba
fix: Fix Jupyter Notebook (#142)
* fix: resolve issue with dlt sqlalchemy usage
Cognee database configuration information was not handled properly by dlt; a new dlt handler
module was added to handle database configuration propagation.

* fix: resolve issue with jupyter notebook

The notebook used the cognee add function in the old way; updated it to
work with the latest state of the cognee add function, which doesn't return output.

* fix: Remove empty DB_PATH argument from .env.template

An empty value for DB_PATH in the .env file overrides the default path intended to be used by cognee.
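
A minimal sketch of the failure mode in the last item, assuming the path
is looked up with a plain dictionary-style default: a variable that is
set but empty still counts as set, so the default is never used.
`DEFAULT_DB_PATH` and both helper names are hypothetical and are not
cognee's actual code.

```python
from typing import Mapping

# Hypothetical default; the real default path is not given in this commit.
DEFAULT_DB_PATH = "/tmp/example_databases"


def resolve_db_path_naive(env: Mapping[str, str]) -> str:
    # A key that is present but empty wins over the default here.
    return env.get("DB_PATH", DEFAULT_DB_PATH)


def resolve_db_path_safe(env: Mapping[str, str]) -> str:
    # Treat an empty string the same as a missing key.
    return env.get("DB_PATH") or DEFAULT_DB_PATH
```

With `{"DB_PATH": ""}` the naive lookup returns an empty string, while
the second variant falls back to the default. Removing the empty entry
from the template avoids the problem at its source.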

---------
2024-10-07 12:58:54 +02:00
Boris Arzentar
6ca2154379 chore: remove unnecessary env variables 2024-09-12 20:32:25 +02:00
Vasilije
119fdcc359
Update .env.template 2024-06-27 21:49:08 +02:00
Boris Arzentar
d5391f903c chore: rename package in files 2024-03-13 16:27:07 +01:00
Vasilije
653fe049b4 Fixes to sqlite 2024-02-15 18:44:47 +01:00
Vasilije
2321c7ebdd Rename the build package name and refactored the local runner 2024-01-25 12:30:39 +01:00
Vasilije
2928f51343 Add all functions for architecture to work + for regular user flows, refactor and reformat issues, add boilerplate for loading jobs logic and extend search functionalities. 2023-12-27 23:56:07 +01:00
Vasilije
5ed04de27e Make sure that secrets would work on aws 2023-12-21 18:49:07 +01:00
Vasilije
6e47c1ff56 Automate summary function 2023-12-17 21:49:26 +01:00
Vasilije
dfb13a2181 Build the docker and push 2023-12-16 15:25:32 +01:00
Renamed from level_4/.env.template