Commit graph

363 commits

Author SHA1 Message Date
Boris Arzentar
e7b08def82 version: v0.1.27 2025-02-13 00:00:23 +01:00
Boris
f9e6dcf837
fix: simplify code pipeline (#529)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit


- **New Features**
  - Enhanced code search and dependency analysis for improved accuracy.
  - Introduced a new high-performance text embedding option.
  - Added an additional execution entry point for code graph processing.
- New optional parameters for flexible property selection in retrieval
functions.
- Introduced new classes for handling import statements, function
definitions, and class definitions.
  - Updated embedding engine selection based on configuration options.

- **Bug Fixes**
- Improved error handling in search operations and database queries for
a more stable user experience.
  - Enhanced error logging for source code parsing.

- **Refactor**
- Streamlined asynchronous processing and refactored internal dependency
extraction.
- Updated configuration and integration settings to enhance overall
reliability.
  - Restructured functions for simplified dependency handling.

- **Chores**
- Upgraded and reorganized dependency management with optional libraries
for extended functionality.
- Added new secret parameters for embedding configuration in workflow
settings.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: vasilije <vas.markovic@gmail.com>
2025-02-12 23:58:48 +01:00
lxobr
bb8cb692e0
Cog 1293 corpus builder custom cognify tasks (#527)
<!-- .github/pull_request_template.md -->

## Description
- Enable custom tasks in corpus building
## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Introduced a configurable option to specify the task retrieval
strategy during corpus building.
- Enhanced the workflow with integrated task fetching, featuring a
default retrieval mechanism.
- Updated evaluation configuration to support customizable task
selection for more flexible operations.
- Added a new abstract base class for defining various task retrieval
strategies.
- Introduced a new enumeration to map task getter types to their
corresponding classes.
  
- **Dependencies**
  - Added a new dependency for downloading files from Google Drive.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-12 16:44:08 +01:00
hajdul88
6a0c0e3ef8
feat: Cognee evaluation framework development (#498)
<!-- .github/pull_request_template.md -->

This PR contains the evaluation framework development for cognee

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Expanded evaluation framework now integrates asynchronous corpus
building, question answering, and performance evaluation with adaptive
benchmarks for improved metrics (correctness, exact match, and F1
score).

- **Infrastructure**
- Added database integration for persistent storage of questions,
answers, and metrics.
- Launched an interactive metrics dashboard featuring advanced
visualizations.
- Introduced an automated testing workflow for continuous quality
assurance.

- **Documentation**
  - Updated guidelines for generating concise, clear answers.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-11 16:31:54 +01:00
Boris Arzentar
6930c750a7 version: v0.1.26 2025-02-08 15:23:00 +01:00
Boris Arzentar
5e350f57bd version: v0.1.25 2025-02-08 02:02:07 +01:00
Boris
f75e35c337
fix: custom model pipeline (#508)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit


- **New Features**
• Graph visualizations now allow exporting to a user-specified file path
for more flexible output management.
• The text embedding process has been enhanced with an additional
tokenizer option for improved performance.
• A new `ExtendableDataPoint` class has been introduced for future
extensions.
• New JSON files for companies and individuals have been added to
facilitate testing and data processing.

- **Improvements**
• Search functionality now uses updated identifiers for more reliable
content retrieval.
• Metadata handling has been streamlined across various classes by
removing unnecessary type specifications.
• Enhanced serialization of properties in the Neo4j adapter for improved
handling of complex structures.
• The setup process for databases has been improved with a new
asynchronous setup function.

- **Chores**
• Dependency and configuration updates improve overall stability and
performance.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-08 02:00:15 +01:00
Boris Arzentar
79a1b86161 version: v0.1.24 2025-02-05 17:53:27 +01:00
Boris Arzentar
44a4f8fd0d version: v0.1.23 2025-02-01 15:23:43 +01:00
Igor Ilic
8879f3fbbe
feat: Add gemini support [COG-1023] (#485)
<!-- .github/pull_request_template.md -->

## Description
PR to test Gemini PR from holchan

1. Add Gemini LLM and Gemini Embedding support 
2. Fix CodeGraph issue with chunks being bigger than maximum token value
3. Add Tokenizer adapters to CodeGraph

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
    - Added support for the Gemini LLM provider.
    - Expanded LLM configuration options.
- Introduced a new GitHub Actions workflow for multimetric QA
evaluation.
- Added new environment variables for LLM and embedding configurations
across various workflows.

- **Bug Fixes**
    - Improved error handling in various components.
    - Updated tokenization and embedding processes.
    - Removed warning related to missing `dict` method in data items.

- **Refactor**
    - Simplified token extraction and decoding methods.
    - Updated tokenizer interfaces.
    - Removed deprecated dependencies.
    - Enhanced retry logic and error handling in embedding processes.

- **Documentation**
    - Updated configuration comments and settings.

- **Chores**
- Updated GitHub Actions workflows to accommodate new secrets and
environment variables.
    - Modified evaluation parameters.
    - Adjusted dependency management for optional libraries.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: holchan <61059652+holchan@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
2025-01-31 18:03:23 +01:00
Igor Ilic
49f60971bb Merge branch 'dev' into COG-970-refactor-tokenizing 2025-01-28 10:12:55 +01:00
Boris
0c2c5870df
fix: use low_lever server for cognee mcp server (#470)
* fix: revert to older mcp version

* fix: use low_level server for the mcp

* fix: styling errors

* fix: mcp cognify arguments

* fix: ruff errors
2025-01-26 12:52:48 +01:00
Igor Ilic
77a72851fc Merge branch 'dev' into COG-970-refactor-tokenizing 2025-01-24 18:34:50 +01:00
Igor Ilic
cdc992750a test: Add github action to test code graph 2025-01-24 18:12:16 +01:00
Igor Ilic
6d5679f9d2 Merge branch 'dev' into COG-970-refactor-tokenizing 2025-01-23 18:14:49 +01:00
Igor Ilic
b25a82e206 chore: Add google-generativeai as gemini optional dependency to Cognee 2025-01-23 17:56:56 +01:00
Boris Arzentar
00f302c37a feat: use fastmcp for mcp server 2025-01-23 11:45:40 +01:00
Igor Ilic
964fca72c6 fix: Update ruff version for cognee 2025-01-17 10:36:04 +01:00
vasilije
b2355deeab Update format 2025-01-16 20:38:59 +01:00
vasilije
662faeb44b Update format 2025-01-16 20:07:24 +01:00
Vasilije
3ba98b2ecd
Merge branch 'dev' into COG-975 2025-01-14 10:04:21 +01:00
vasilije
047948ae54 Add data visualization for Anthropic 2025-01-14 09:57:53 +01:00
Boris Arzentar
8786fc35e7 version: Increase version to 0.1.22 2025-01-13 21:57:02 +01:00
Boris Arzentar
e983c216f0 version: Increase version to 0.1.21 2025-01-10 19:04:24 +01:00
Igor Ilic
16eefe4875 fix: resolve issue with langfuse dependency installation when integrating cognee in different packages 2025-01-10 18:02:15 +01:00
vasilije
dc91f01d07 Fix poetry issues 2025-01-08 17:52:31 +01:00
vasilije
5194353614 Fix poetry issues 2025-01-08 17:34:31 +01:00
vasilije
a5e993b857 Fix poetry issues 2025-01-08 17:14:41 +01:00
vasilije
3b649d1fa6 Fix poetry issues 2025-01-08 14:44:19 +01:00
vasilije
1a0052ba34 Fix poetry issues 2025-01-08 14:33:21 +01:00
vasilije
a856c234cb Fix poetry issues 2025-01-08 14:26:12 +01:00
vasilije
178b3ceec8 Fix poetry issues 2025-01-08 14:16:33 +01:00
vasilije
1a885fe6bc Fix poetry issues 2025-01-08 13:48:00 +01:00
vasilije
b13edb0ece Fix visualization 2025-01-08 13:29:23 +01:00
vasilije
fdd8ce7e32 Fix visualization 2025-01-08 13:20:38 +01:00
vasilije
0ff9ffa11b Fix visualization 2025-01-08 13:12:08 +01:00
vasilije
1b96a71d5a Fix ollama, work on visualization 2025-01-06 19:09:58 +01:00
vasilije
74e4b901bb Fix linter issues 2025-01-05 19:57:06 +01:00
vasilije
35460e9ad6 Fix linter issues 2025-01-05 19:53:38 +01:00
vasilije
76a0aa7e8b Fix linter issues 2025-01-05 19:48:35 +01:00
vasilije
5b115594b7 Fix linter issues 2025-01-05 19:26:43 +01:00
Igor Ilic
a90a232e95 Merge remote-tracking branch 'origin/dev' into COG-650-replace-pylint 2024-12-20 15:23:55 +01:00
Igor Ilic
f2bb1c5d47 fix: Add poetry.lock file, make langfuse mandatory
Added langfuse as mandatory dependency, added poetry.lock file

Fix
2024-12-19 16:04:24 +01:00
Vasilije
ffb44529cc
Merge branch 'dev' into LANGFUSE_FIX 2024-12-18 19:07:13 +01:00
vasilije
c448dfb96d Fix langfuse 2024-12-18 19:01:29 +01:00
Igor Ilic
875dd1055e chore: Update ruff lint options in pyproject file
Update ruff lint options in pyproject file

Chore
2024-12-18 13:31:19 +01:00
Igor Ilic
7be7265c93 feat: Add ruff pre-commit hook for linting and formatting
Added formatting and linting on pre-commit hook

Feature COG-650
2024-12-18 13:19:02 +01:00
Vasilije
b0b2b56b03
Merge branch 'dev' into dependabot/pip/langsmith-0.2.3 2024-12-12 13:16:34 +01:00
dependabot[bot]
e6a179a244
⬆️ Bump langsmith from 0.1.139 to 0.2.3
Bumps [langsmith](https://github.com/langchain-ai/langsmith-sdk) from 0.1.139 to 0.2.3.
- [Release notes](https://github.com/langchain-ai/langsmith-sdk/releases)
- [Commits](https://github.com/langchain-ai/langsmith-sdk/compare/v0.1.139...v0.2.3)

---
updated-dependencies:
- dependency-name: langsmith
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2024-12-12 12:15:58 +00:00
Vasilije
06baafe561
Merge branch 'dev' into dependabot/pip/debugpy-1.8.9 2024-12-12 13:15:37 +01:00