Commit graph

320 commits

Author SHA1 Message Date
Boris
e8ab5b4797
fix: tiktoken upgrade (#587)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->

## Summary by CodeRabbit

- **Chores**
- Removed an outdated internal tracking reference to streamline
maintenance.
- Upgraded a key dependency to its latest stable release, delivering
enhanced performance and reliability for a smoother user experience.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-27 18:16:11 +01:00
Igor Ilic
f9b6630024
chore: Add ollama optional depdendency (#584)
<!-- .github/pull_request_template.md -->

## Description
Add ollama optional dependency

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->

## Summary by CodeRabbit

- **Chores**
- Updated the project’s dependency configuration to include an
additional optional package for enhanced transformation functionality.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-27 15:09:58 +01:00
Boris Arzentar
86b34657aa version: v0.1.30 2025-02-26 21:48:59 +01:00
Boris
45f7c63322
fix: notebooks errors (#565)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->

## Summary by CodeRabbit

- **New Features**
- Automatically creates a blank graph when a file isn’t found, ensuring
smoother operations.
- Updated demonstration notebooks with dynamic configurations, including
refined search operations and input prompts.
- Introduced optional support for additional graph functionalities via
an integrated dependency.

- **Refactor**
- Streamlined processing by eliminating duplicate steps and simplifying
graph rendering workflows.

- **Chores**
- Updated environment configurations and upgraded the Python runtime for
improved performance and consistency.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-19 14:07:11 -08:00
Boris Arzentar
811e932cae version: v0.1.29 2025-02-19 20:19:51 +01:00
Igor Ilic
46e026f77f
Cognee gui [COG-1307] (#530)
<!-- .github/pull_request_template.md -->

## Description
Add a simple GUI to add documents to Cognee and use GRAPH_COMPLETION
search to get answers

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Introduced an interactive file search interface with intuitive
controls. Users can easily upload files, enter search terms, and view
results in a unified display with clear notifications during processing.
  
- **Chores**
- Updated project dependencies to include `pyside6` and `qasync` for
enhanced GUI functionality.
- Refined background query processing to improve the accuracy and
relevance of search outcomes.
- Improved code readability with formatting enhancements in the search
function.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-14 15:51:33 +01:00
Boris Arzentar
67c8edb853 version: v0.1.28 2025-02-13 13:17:00 +01:00
Boris Arzentar
e7b08def82 version: v0.1.27 2025-02-13 00:00:23 +01:00
Boris
f9e6dcf837
fix: simplify code pipeline (#529)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit


- **New Features**
  - Enhanced code search and dependency analysis for improved accuracy.
  - Introduced a new high-performance text embedding option.
  - Added an additional execution entry point for code graph processing.
- New optional parameters for flexible property selection in retrieval
functions.
- Introduced new classes for handling import statements, function
definitions, and class definitions.
  - Updated embedding engine selection based on configuration options.

- **Bug Fixes**
- Improved error handling in search operations and database queries for
a more stable user experience.
  - Enhanced error logging for source code parsing.

- **Refactor**
- Streamlined asynchronous processing and refactored internal dependency
extraction.
- Updated configuration and integration settings to enhance overall
reliability.
  - Restructured functions for simplified dependency handling.

- **Chores**
- Upgraded and reorganized dependency management with optional libraries
for extended functionality.
- Added new secret parameters for embedding configuration in workflow
settings.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: vasilije <vas.markovic@gmail.com>
2025-02-12 23:58:48 +01:00
lxobr
bb8cb692e0
Cog 1293 corpus builder custom cognify tasks (#527)
<!-- .github/pull_request_template.md -->

## Description
- Enable custom tasks in corpus building
## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Introduced a configurable option to specify the task retrieval
strategy during corpus building.
- Enhanced the workflow with integrated task fetching, featuring a
default retrieval mechanism.
- Updated evaluation configuration to support customizable task
selection for more flexible operations.
- Added a new abstract base class for defining various task retrieval
strategies.
- Introduced a new enumeration to map task getter types to their
corresponding classes.
  
- **Dependencies**
  - Added a new dependency for downloading files from Google Drive.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-12 16:44:08 +01:00
hajdul88
6a0c0e3ef8
feat: Cognee evaluation framework development (#498)
<!-- .github/pull_request_template.md -->

This PR contains the evaluation framework development for cognee

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Expanded evaluation framework now integrates asynchronous corpus
building, question answering, and performance evaluation with adaptive
benchmarks for improved metrics (correctness, exact match, and F1
score).

- **Infrastructure**
- Added database integration for persistent storage of questions,
answers, and metrics.
- Launched an interactive metrics dashboard featuring advanced
visualizations.
- Introduced an automated testing workflow for continuous quality
assurance.

- **Documentation**
  - Updated guidelines for generating concise, clear answers.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-11 16:31:54 +01:00
Boris Arzentar
6930c750a7 version: v0.1.26 2025-02-08 15:23:00 +01:00
Boris Arzentar
5e350f57bd version: v0.1.25 2025-02-08 02:02:07 +01:00
Boris
f75e35c337
fix: custom model pipeline (#508)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit


- **New Features**
• Graph visualizations now allow exporting to a user-specified file path
for more flexible output management.
• The text embedding process has been enhanced with an additional
tokenizer option for improved performance.
• A new `ExtendableDataPoint` class has been introduced for future
extensions.
• New JSON files for companies and individuals have been added to
facilitate testing and data processing.

- **Improvements**
• Search functionality now uses updated identifiers for more reliable
content retrieval.
• Metadata handling has been streamlined across various classes by
removing unnecessary type specifications.
• Enhanced serialization of properties in the Neo4j adapter for improved
handling of complex structures.
• The setup process for databases has been improved with a new
asynchronous setup function.

- **Chores**
• Dependency and configuration updates improve overall stability and
performance.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-08 02:00:15 +01:00
Boris Arzentar
79a1b86161 version: v0.1.24 2025-02-05 17:53:27 +01:00
Boris Arzentar
44a4f8fd0d version: v0.1.23 2025-02-01 15:23:43 +01:00
Igor Ilic
8879f3fbbe
feat: Add gemini support [COG-1023] (#485)
<!-- .github/pull_request_template.md -->

## Description
PR to test Gemini PR from holchan

1. Add Gemini LLM and Gemini Embedding support 
2. Fix CodeGraph issue with chunks being bigger than maximum token value
3. Add Tokenizer adapters to CodeGraph

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
    - Added support for the Gemini LLM provider.
    - Expanded LLM configuration options.
- Introduced a new GitHub Actions workflow for multimetric QA
evaluation.
- Added new environment variables for LLM and embedding configurations
across various workflows.

- **Bug Fixes**
    - Improved error handling in various components.
    - Updated tokenization and embedding processes.
    - Removed warning related to missing `dict` method in data items.

- **Refactor**
    - Simplified token extraction and decoding methods.
    - Updated tokenizer interfaces.
    - Removed deprecated dependencies.
    - Enhanced retry logic and error handling in embedding processes.

- **Documentation**
    - Updated configuration comments and settings.

- **Chores**
- Updated GitHub Actions workflows to accommodate new secrets and
environment variables.
    - Modified evaluation parameters.
    - Adjusted dependency management for optional libraries.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: holchan <61059652+holchan@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
2025-01-31 18:03:23 +01:00
Igor Ilic
49f60971bb Merge branch 'dev' into COG-970-refactor-tokenizing 2025-01-28 10:12:55 +01:00
Boris
0c2c5870df
fix: use low_lever server for cognee mcp server (#470)
* fix: revert to older mcp version

* fix: use low_level server for the mcp

* fix: styling errors

* fix: mcp cognify arguments

* fix: ruff errors
2025-01-26 12:52:48 +01:00
Igor Ilic
77a72851fc Merge branch 'dev' into COG-970-refactor-tokenizing 2025-01-24 18:34:50 +01:00
Igor Ilic
cdc992750a test: Add github action to test code graph 2025-01-24 18:12:16 +01:00
Igor Ilic
6d5679f9d2 Merge branch 'dev' into COG-970-refactor-tokenizing 2025-01-23 18:14:49 +01:00
Igor Ilic
b25a82e206 chore: Add google-generativeai as gemini optional dependency to Cognee 2025-01-23 17:56:56 +01:00
Boris Arzentar
00f302c37a feat: use fastmcp for mcp server 2025-01-23 11:45:40 +01:00
Igor Ilic
964fca72c6 fix: Update ruff version for cognee 2025-01-17 10:36:04 +01:00
vasilije
b2355deeab Update format 2025-01-16 20:38:59 +01:00
vasilije
662faeb44b Update format 2025-01-16 20:07:24 +01:00
Vasilije
3ba98b2ecd
Merge branch 'dev' into COG-975 2025-01-14 10:04:21 +01:00
vasilije
047948ae54 Add data visualization for Anthropic 2025-01-14 09:57:53 +01:00
Boris Arzentar
8786fc35e7 version: Increase version to 0.1.22 2025-01-13 21:57:02 +01:00
Boris Arzentar
e983c216f0 version: Increase version to 0.1.21 2025-01-10 19:04:24 +01:00
Igor Ilic
16eefe4875 fix: resolve issue with langfuse dependency installation when integrating cognee in different packages 2025-01-10 18:02:15 +01:00
vasilije
dc91f01d07 Fix poetry issues 2025-01-08 17:52:31 +01:00
vasilije
5194353614 Fix poetry issues 2025-01-08 17:34:31 +01:00
vasilije
a5e993b857 Fix poetry issues 2025-01-08 17:14:41 +01:00
vasilije
3b649d1fa6 Fix poetry issues 2025-01-08 14:44:19 +01:00
vasilije
1a0052ba34 Fix poetry issues 2025-01-08 14:33:21 +01:00
vasilije
a856c234cb Fix poetry issues 2025-01-08 14:26:12 +01:00
vasilije
178b3ceec8 Fix poetry issues 2025-01-08 14:16:33 +01:00
vasilije
1a885fe6bc Fix poetry issues 2025-01-08 13:48:00 +01:00
vasilije
b13edb0ece Fix visualization 2025-01-08 13:29:23 +01:00
vasilije
fdd8ce7e32 Fix visualization 2025-01-08 13:20:38 +01:00
vasilije
0ff9ffa11b Fix visualization 2025-01-08 13:12:08 +01:00
vasilije
1b96a71d5a Fix ollama, work on visualization 2025-01-06 19:09:58 +01:00
vasilije
74e4b901bb Fix linter issues 2025-01-05 19:57:06 +01:00
vasilije
35460e9ad6 Fix linter issues 2025-01-05 19:53:38 +01:00
vasilije
76a0aa7e8b Fix linter issues 2025-01-05 19:48:35 +01:00
vasilije
5b115594b7 Fix linter issues 2025-01-05 19:26:43 +01:00
Igor Ilic
a90a232e95 Merge remote-tracking branch 'origin/dev' into COG-650-replace-pylint 2024-12-20 15:23:55 +01:00
Igor Ilic
f2bb1c5d47 fix: Add poetry.lock file, make langfuse mandatory
Added langfuse as mandatory dependency, added poetry.lock file

Fix
2024-12-19 16:04:24 +01:00