Commit graph

23 commits

Author SHA1 Message Date
alekszievr
17231de5d0
Test: Parse context pieces separately in MusiqueQAAdapter and adjust tests [cog-1234] (#561)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **Tests**
- Updated evaluation checks by removing assertions related to the
relationship between `corpus_list` and `qa_pairs`, now focusing solely
on `qa_pairs` limits.

- **Refactor**
- Improved content processing to append each paragraph individually to
`corpus_list`, enhancing clarity in data structure.
- Simplified type annotations in the `load_corpus` method across
multiple adapters, ensuring consistency in return types.

- **Chores**
- Updated dependency installation commands in GitHub Actions workflows
for Python 3.10, 3.11, and 3.12 to include additional evaluation-related
dependencies.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2025-02-20 14:23:53 +01:00
Vasilije
2072c7a081
feat: improve tests add macos runners (#540)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->

## Summary by CodeRabbit

- **Chores**
- Improved automated testing setups to run across multiple operating
systems (Ubuntu and macOS) for Python 3.10, 3.11, and 3.12.
- Enhanced compatibility and coverage across diverse environments,
ensuring a more robust validation process.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: soekja <wes.hubert@gmail.com>
Co-authored-by: Boris <boris@topoteretes.com>
2025-02-15 04:19:19 +01:00
Igor Ilic
8879f3fbbe
feat: Add gemini support [COG-1023] (#485)
<!-- .github/pull_request_template.md -->

## Description
PR to test Gemini PR from holchan

1. Add Gemini LLM and Gemini Embedding support 
2. Fix CodeGraph issue with chunks being bigger than maximum token value
3. Add Tokenizer adapters to CodeGraph

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
    - Added support for the Gemini LLM provider.
    - Expanded LLM configuration options.
- Introduced a new GitHub Actions workflow for multimetric QA
evaluation.
- Added new environment variables for LLM and embedding configurations
across various workflows.

- **Bug Fixes**
    - Improved error handling in various components.
    - Updated tokenization and embedding processes.
    - Removed warning related to missing `dict` method in data items.

- **Refactor**
    - Simplified token extraction and decoding methods.
    - Updated tokenizer interfaces.
    - Removed deprecated dependencies.
    - Enhanced retry logic and error handling in embedding processes.

- **Documentation**
    - Updated configuration comments and settings.

- **Chores**
- Updated GitHub Actions workflows to accommodate new secrets and
environment variables.
    - Modified evaluation parameters.
    - Adjusted dependency management for optional libraries.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->

---------

Co-authored-by: holchan <61059652+holchan@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
2025-01-31 18:03:23 +01:00
vasilije
07836255ae Fix for now 2025-01-16 21:02:58 +01:00
vasilije
f19b58a7bb Update format 2025-01-16 20:54:02 +01:00
vasilije
5b31638bef Update format 2025-01-16 20:49:50 +01:00
Igor Ilic
41613436ca chore: Set fixed ubuntu version to 22.04 2025-01-08 15:57:07 +01:00
Igor Ilic
8e1803d713 chore: Remove if in github action to allow triggering on push 2025-01-08 14:57:24 +01:00
Igor Ilic
cb2b4415af chore: Update action to trigger on pull request for any branch 2025-01-08 14:54:39 +01:00
Igor Ilic
ce4c5b136a chore: Update version of poetry install action 2025-01-08 14:50:08 +01:00
vasilije
98cc84aa28 Fixes to our demo 2024-12-16 21:30:29 +01:00
Vasilije
5609bbccc8 removed issues 2024-12-11 13:13:19 +01:00
Igor Ilic
5567370214 chore: Update gh actions to install docs extra
Update library gh actions to install docs extra to test unstructured integration tests

Chore COG-685
2024-12-09 09:32:28 +01:00
Igor Ilic
d254471023 test: Update gh actions so they can run outside of PR to main
Allow github actions to run on PRs that aren't targeting main

Test
2024-12-06 11:09:26 +01:00
Igor Ilic
97dbede4c4 test: Add fix for telemetry issue in gh actions
Set environment variable of ENV to dev for all jobs in workflows in GH actions
2024-11-25 15:31:32 +01:00
Leon Luithlen
6f4ba20003 Run integration tests in pipeline 2024-11-12 16:47:28 +01:00
Leon Luithlen
18890715cf Run unit tests in github actions 2024-11-12 16:47:28 +01:00
Boris
52180eb6b5
feat: COG-184 add falkordb (#192)
* feat: add falkordb adapter

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2024-11-11 18:20:52 +01:00
Leon Luithlen
9192b04dc8 Change to AND operator and move from concurrency to run 2024-11-11 14:55:50 +01:00
Leon Luithlen
dedd3a851c Add manual execution for python-3.X tests 2024-11-08 15:42:54 +01:00
Boris
a9433e9283
feat: add sqlalchemy as dlt destination (#137)
* feat: add sqlalchemy as dlt destination

* Fix the demo, update Readme

* fix: add 1.5 notebook

---------

Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
2024-09-21 15:58:28 +02:00
Boris
1c77337818
chore: remove unnecessary env variables (#136)
* chore: remove unnecessary env variables
2024-09-12 22:37:54 +02:00
Boris
94a674a088
feat: split document reader from chunker (#131)
* fix: abstract chunking into a separate class

* fix: yield merged text from text chunker

* fix: split python version tests

* fix: change postgres live check

* fix: remove unnecessary code

* fix: update checkout action

* fix: update setup-python action

* fix: add PG_USER env variable

* fix: make sure relationship_name is used everywhere

* fix: remove duplicate import
2024-08-19 14:36:10 +02:00