cognee

Author	SHA1	Message	Date
Dmitrii Galkin	e147fa5bde	feat: Add support for ChromaDB (#622 ) <!-- .github/pull_request_template.md --> ## Description <!-- Provide a clear description of the changes in this PR --> # Add Support for ChromaDB ## Summary This PR adds support for ChromaDB as a vector database option in the Cognee application. ChromaDB is a modern, open-source embedding database designed for AI applications. ## Changes - Created a new ChromaDBAdapter implementation for vector database operations - Added comprehensive test suite for ChromaDB functionality - Updated docker-compose.yml to include ChromaDB service - Modified environment configuration to support ChromaDB settings - Updated vector engine creation logic to support ChromaDB as an option ## Technical Details - Implemented `ChromaDBAdapter.py` (347 lines) with full CRUD operations for vector data - Created test suite (`test_chromadb.py`) with 171 lines of test coverage - Updated vector engine creation process to dynamically select ChromaDB when configured - Modified settings router to accommodate new database option - Updated environment template with ChromaDB configuration options ## Docker Changes - Added ChromaDB service to docker-compose.yml with appropriate configuration This PR enhances Cognee's flexibility by providing an alternative vector database option, allowing users to choose the most appropriate database for their specific use case. ## DCO Affirmation I affirm that all code in every commit of this pull request conforms to the terms of the Topoteretes Developer Certificate of Origin Tested with UI + tests. <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit - New Features - Expanded vector database integration by adding support for Chromadb, enabling enhanced data management and search functionalities. - Tests - Added automated tests to validate the Chromadb integration and related operations. - Chores - Updated configuration guidance and dependency management to include Chromadb. - Provided an optional container deployment template for Chromadb. - Added a new entry to ignore the `.chromadb_data/` directory in version control. - Introduced a new GitHub Actions workflow for testing Chromadb integration. <!-- end of auto-generated comment: release notes by coderabbit.ai --> --------- Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com> Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>	2025-03-13 15:13:04 +01:00
Boris	f75e35c337	fix: custom model pipeline (#508 ) <!-- .github/pull_request_template.md --> ## Description <!-- Provide a clear description of the changes in this PR --> ## DCO Affirmation I affirm that all code in every commit of this pull request conforms to the terms of the Topoteretes Developer Certificate of Origin <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit - New Features • Graph visualizations now allow exporting to a user-specified file path for more flexible output management. • The text embedding process has been enhanced with an additional tokenizer option for improved performance. • A new `ExtendableDataPoint` class has been introduced for future extensions. • New JSON files for companies and individuals have been added to facilitate testing and data processing. - Improvements • Search functionality now uses updated identifiers for more reliable content retrieval. • Metadata handling has been streamlined across various classes by removing unnecessary type specifications. • Enhanced serialization of properties in the Neo4j adapter for improved handling of complex structures. • The setup process for databases has been improved with a new asynchronous setup function. - Chores • Dependency and configuration updates improve overall stability and performance. <!-- end of auto-generated comment: release notes by coderabbit.ai -->	2025-02-08 02:00:15 +01:00
Boris Arzentar	b89a4b8054	Merge remote-tracking branch 'origin/main' into code-graph	2024-12-03 21:14:19 +01:00
Boris Arzentar	d6f0d65b63	Merge remote-tracking branch 'origin/code-graph'	2024-12-01 11:51:54 +01:00
Rita Aleksziev	a4c56f118d	Connect code graph pipeline + retriever + benchmarking	2024-11-29 15:24:49 +01:00
Rita Aleksziev	f47b185a9e	feat/add correctness score calculation with LLM as a judge	2024-11-27 10:53:48 +01:00
0xideas	80b06c3acb	test: Test for code graph enrichment task Co-authored-by: lxobr <lazar@topoteretes.com>	2024-11-24 19:24:47 +01:00
Leon Luithlen	e0e93ae379	Clean up notebook merge request	2024-11-12 09:04:43 +01:00
Leon Luithlen	9fe1b6c5fa	Add code_graph_demo notebook	2024-11-12 09:01:03 +01:00
Igor Ilic	33c3748d1e	refactor: Renamed .anonymous_id file to anon_id Renamed .anonymous_id file to anon_id Refactor #COG-492	2024-11-11 11:53:09 +01:00
Boris	2f832b190c	fix: various fixes for the deployment * fix: remove groups from UserRead model * fix: add missing system dependencies for postgres * fix: change vector db provider environment variable name * fix: WeaviateAdapter retrieve bug * fix: correctly return data point objects from retrieve method * fix: align graph object properties * feat: add node example	2024-10-22 11:26:48 +02:00
Boris	dc187a81d7	feat: migrate search to tasks (#144 ) * fix: don't return anything on health endpoint * feat: add alembic migrations * feat: align search types with the data we store and migrate search to tasks	2024-10-07 14:41:35 +02:00
Vasilije	c9b2a06dff	rewrote configs	2024-06-10 13:40:05 +02:00
Vasilije	460583a40f	added gitignore updates	2024-06-02 18:08:29 +02:00
Boris Arzentar	84c0c8cab5	feat: add llm config	2024-05-22 22:36:30 +02:00
Boris Arzentar	1ac28f4cb8	feat: add initial cognee frontend	2024-05-17 13:42:14 +02:00
Boris	219afbce68	feat: add lancedb vector storage [COG-176] (#90 ) * feat: integrate lancedb * fix: use futures in weaviate adapter to enable async behaviour	2024-05-03 10:35:41 +02:00
Boris Arzentar	370b74988e	chore: add functions to improve user experience	2024-03-30 15:25:34 +01:00
Boris Arzentar	8d4be049f4	feat: add support for text and file in cognee.add	2024-03-29 13:53:59 +01:00
Boris Arzentar	2a7a545dcc	fix: remove unnecessary files	2024-03-13 17:28:52 +01:00
Boris Arzentar	d5391f903c	chore: rename package in files	2024-03-13 16:27:07 +01:00
Boris Arzentar	260a21fc22	Merge remote-tracking branch 'origin/feat/COG-24-add-qdrant' into feat/COG-24-add-qdrant	2024-03-12 20:55:31 +01:00
Boris Arzentar	769d6b5080	feat: add create-memory and remember API endpoints Add possibility to create a new Vector memory and store text data points using openai embeddings.	2024-02-25 23:56:50 +01:00
Boris Arzentar	47c3463406	chore: add debugpy and update readme	2024-02-15 10:13:19 +01:00
Vasilije	9d87eb3c23	Merge branch 'main' into code_review	2023-08-25 12:12:46 +02:00
Vasilije	a746739d32	added first pass of the code review	2023-08-24 17:53:53 +02:00
burnash	3e654dfd51	Add Python.gitignore	2023-08-17 10:43:16 +02:00

27 commits