Commit graph

1983 commits

Author SHA1 Message Date
Igor Ilic
b25a82e206 chore: Add google-generativeai as gemini optional dependency to Cognee 2025-01-23 17:56:56 +01:00
Igor Ilic
b686376c54 feat: Add gemini tokenizer to cognee 2025-01-23 17:55:04 +01:00
Igor Ilic
294ed1d960 feat: Add HuggingFace Tokenizer support 2025-01-23 16:52:35 +01:00
Igor Ilic
93249c72c5 fix: Initial commit to resolve issue with using tokenizer based on LLMs
Currently TikToken is used for tokenizing by default, which is only supported by OpenAI.
This is an initial commit in an attempt to add Cognee tokenizing support for multiple LLMs.
2025-01-21 19:53:22 +01:00
Igor Ilic
77f0b45a0d refactor: Resolve issue with notebook after metadata refactor
Resolve issue with LlamaIndex notebook after refactor
2025-01-20 18:02:57 +01:00
Igor Ilic
4196a4ce89 refactor: Update test to be up to date with current metadata refactor effort 2025-01-20 17:53:54 +01:00
Igor Ilic
5c17501bb8 refactor: add missing foreign_metadata attr to tests 2025-01-20 17:38:28 +01:00
Igor Ilic
ab8d95cc30 refactor: As neo4j can't support dictionaries, add foreign metadata as string 2025-01-20 17:28:14 +01:00
Igor Ilic
49ad292592 refactor: Reduce complexity of metadata handling
Have foreign metadata be a table column in data instead of its own table to reduce complexity

Refactor COG-793
2025-01-20 16:39:05 +01:00
Igor Ilic
0c7c1d7503 refactor: Refactor ingestion to only have one ingestion task 2025-01-20 14:33:47 +01:00
alekszievr
75bc7f67eb
feat: Add incremental eval option to paramset (#446)
* QA eval dataset as argument, with hotpot and 2wikimultihop as options. Json schema validation for datasets.

* Load dataset file by filename, outsource utilities

* restructure metric selection

* Add comprehensiveness, diversity and empowerment metrics

* add promptfoo as an option

* refactor RAG solution in eval

* LLM as a judge metrics implemented in a uniform way

* Use requests.get instead of wget

* clean up promptfoo config template

* minor fixes

* get promptfoo path instead of hardcoding

* minor fixes

* Add LLM as a judge prompts

* Support 4 different rag options in eval

* Minor refactor and logger usage

* feat: make tasks a configurable argument in the cognify function

* Run eval on a set of parameters and save results as json and png

* fix: add data points task

* script for running all param combinations

* enable context provider to get tasks as param

* bugfix in simple rag

* Incremental eval of cognee pipeline

* potential fix: single asyncio run

* temp fix: exclude insights

* Remove insights, have single asyncio run, refactor

* Include incremental eval in accepted paramsets

* minor fixes

* handle pipeline slices in utils

* Handle insights and customize search types

* Handle retrieved edges more safely

* bugfix

* fix simple rag

---------

Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-01-17 18:04:31 +01:00
alekszievr
2e010f8dd1
Incremental eval of cognee pipeline (#445)
* QA eval dataset as argument, with hotpot and 2wikimultihop as options. Json schema validation for datasets.

* Load dataset file by filename, outsource utilities

* restructure metric selection

* Add comprehensiveness, diversity and empowerment metrics

* add promptfoo as an option

* refactor RAG solution in eval

* LLM as a judge metrics implemented in a uniform way

* Use requests.get instead of wget

* clean up promptfoo config template

* minor fixes

* get promptfoo path instead of hardcoding

* minor fixes

* Add LLM as a judge prompts

* Support 4 different rag options in eval

* Minor refactor and logger usage

* feat: make tasks a configurable argument in the cognify function

* Run eval on a set of parameters and save results as json and png

* fix: add data points task

* script for running all param combinations

* enable context provider to get tasks as param

* bugfix in simple rag

* Incremental eval of cognee pipeline

* potential fix: single asyncio run

* temp fix: exclude insights

* Remove insights, have single asyncio run, refactor

* minor fixes

* handle pipeline slices in utils

* include all options in params json

---------

Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-01-17 14:16:48 +01:00
Vasilije
ffa3c2daa0
Merge pull request #449 from topoteretes/feature/cog-186-run-cognee-on-windows
Feature/cog 186 run cognee on windows
2025-01-17 14:16:37 +01:00
hajdul88
b0634da43e fix: fixes typo in README 2025-01-17 11:30:45 +01:00
hajdul88
6f5d2bad47 Fix: Updates README 2025-01-17 11:29:51 +01:00
hajdul88
0b56e4b688 feat: Adds OS information to README 2025-01-17 11:22:34 +01:00
hajdul88
22ea4f0675
Merge branch 'dev' into feature/cog-186-run-cognee-on-windows 2025-01-17 10:49:53 +01:00
Vasilije
70e68fe8ff
Merge pull request #450 from topoteretes/ruff-version
fix: Update ruff version for cognee
2025-01-17 10:47:29 +01:00
Igor Ilic
be2aa9901f
Merge branch 'dev' into ruff-version 2025-01-17 10:40:50 +01:00
Igor Ilic
89b23b8728 refactor: Run ruff format 0.9.2 2025-01-17 10:40:24 +01:00
Igor Ilic
964fca72c6 fix: Update ruff version for cognee 2025-01-17 10:36:04 +01:00
lxobr
65a0c98455
COG-989 feat: make tasks a configurable argument in the cognify function (#442)
* feat: make tasks a configurable argument in the cognify function

* fix: add data points task

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2025-01-17 10:20:57 +01:00
hajdul88
4ea01b9d30 fix: fixes cognee backend on windows 2025-01-17 09:52:05 +01:00
hajdul88
08c22a542a fix: fixes typo in multimedia example 2025-01-17 09:31:48 +01:00
hajdul88
981f35c1e0 fix: fixes windows compatibility in examples 2025-01-17 09:28:10 +01:00
hajdul88
704f2c68e2 fix: fixes old 0.8.6 ruff format to 0.9.2 2025-01-17 09:25:05 +01:00
hajdul88
6e691885e6
Merge branch 'dev' into feature/cog-186-run-cognee-on-windows 2025-01-17 09:06:00 +01:00
Vasilije
7c3e46f14e
Update README.md 2025-01-17 08:15:13 +01:00
alekszievr
8ec1e48ff6
Run eval on a set of parameters and save them as png and json (#443)
* QA eval dataset as argument, with hotpot and 2wikimultihop as options. Json schema validation for datasets.

* Load dataset file by filename, outsource utilities

* restructure metric selection

* Add comprehensiveness, diversity and empowerment metrics

* add promptfoo as an option

* refactor RAG solution in eval

* LLM as a judge metrics implemented in a uniform way

* Use requests.get instead of wget

* clean up promptfoo config template

* minor fixes

* get promptfoo path instead of hardcoding

* minor fixes

* Add LLM as a judge prompts

* Support 4 different rag options in eval

* Minor refactor and logger usage

* Run eval on a set of parameters and save results as json and png

* script for running all param combinations

* bugfix in simple rag

* potential fix: single asyncio run

* temp fix: exclude insights

* Remove insights, have single asyncio run, refactor

---------

Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
2025-01-17 00:18:51 +01:00
Vasilije
6c6ba3270c
Merge pull request #432 from topoteretes/COG-975
feat: Add data visualization for Anthropic
2025-01-16 21:41:22 +01:00
vasilije
fe47253b04 Fix for now 2025-01-16 21:37:37 +01:00
vasilije
cb7b2d311e Fix for now 2025-01-16 21:36:25 +01:00
vasilije
bbd51e8c27 Fix for now 2025-01-16 21:08:54 +01:00
vasilije
07836255ae Fix for now 2025-01-16 21:02:58 +01:00
vasilije
7a4a0f4705 Fix for now 2025-01-16 20:59:23 +01:00
vasilije
72b503f198 Fix for now 2025-01-16 20:58:10 +01:00
vasilije
5aaf420f02 Fix for now 2025-01-16 20:56:54 +01:00
vasilije
f19b58a7bb Update format 2025-01-16 20:54:02 +01:00
vasilije
5b31638bef Update format 2025-01-16 20:49:50 +01:00
vasilije
b2355deeab Update format 2025-01-16 20:38:59 +01:00
vasilije
1af24dc652 Update format 2025-01-16 20:25:26 +01:00
vasilije
4ae8eb90ce Update format 2025-01-16 20:16:58 +01:00
Vasilije
816325f14b
Merge pull request #447 from topoteretes/fix_guidelines
Update CONTRIBUTING.md
2025-01-16 20:13:55 +01:00
vasilije
4a87df9e72 Update format 2025-01-16 20:12:33 +01:00
Vasilije
ee7ddc53fc
Update CONTRIBUTING.md 2025-01-16 20:09:43 +01:00
vasilije
662faeb44b Update format 2025-01-16 20:07:24 +01:00
Vasilije
b61dfd0948
Merge pull request #378 from topoteretes/COG-748
feat: Add versioning to the data point model
2025-01-16 20:04:51 +01:00
hajdul88
bd6aafe9b7 fix: fixes event loop handling on windows in dynamic steps example 2025-01-16 18:17:11 +01:00
hajdul88
935763b08d fix: fixing changed lancedb search + pruning 2025-01-16 17:32:44 +01:00
vasilije
0a02886d76 Update format 2025-01-16 13:28:35 +01:00