Commit graph

1178 commits

Author SHA1 Message Date
hande-k
c6e447f28c docs: add print statements to the simple example, update README 2024-11-20 08:47:02 +01:00
Vasilije
2331739e07
Fix syntax in simple example (#2) 2024-11-19 10:44:36 +01:00
Rita Aleksziev
07b1956b6e Fix syntax in simple example 2024-11-19 09:55:21 +01:00
Boris
c045f737f7
feat: add vector and graph dbs state to README file (#235) 2024-11-18 17:51:41 +01:00
0xideas
34e140a41d
Switch to gpt-4o-mini by default (#233)
* Switch to gpt-4o-mini by default

* Add option and make gpt-4o-mini default in frontend

* Run llama index notebook without extra arguments in poetry install

* Install extras for llama_index_notebook run
2024-11-18 17:38:54 +01:00
0xideas
ced5385186
Merge pull request #205 from topoteretes/COG-417-chunking-unit-tests
Cog 417 chunking unit tests
2024-11-18 12:52:44 +01:00
Boris
22a0e43d4a
Merge branch 'main' into COG-417-chunking-unit-tests 2024-11-17 13:40:32 +01:00
Boris
d8b6eeded5
feat: log search queries and results (#166)
* feat: log search queries and results

* fix: address coderabbit review comments

* fix: parse UUID when logging search results

* fix: remove custom UUID type and use DB agnostic UUID from sqlalchemy

* Add new cognee_db

---------

Co-authored-by: Leon Luithlen <leon@topoteretes.com>
2024-11-17 11:59:10 +01:00
Igor Ilic
d30adb53f3
Cog 337 llama index support (#186)
* feat: Add support for LlamaIndex Document type

Added support for LlamaIndex Document type

Feature #COG-337

* docs: Add Jupyer Notebook for cognee with llama index document type

Added jupyter notebook which demonstrates cognee with LlamaIndex document type usage

Docs #COG-337

* feat: Add metadata migration from LlamaIndex document type

Allow usage of metadata from LlamaIndex documents

Feature #COG-337

* refactor: Change llama index migration function name

Change name of llama index function

Refactor #COG-337

* chore: Add llama index core dependency

Downgrade needed on tenacity and instructor modules to support llama index

Chore #COG-337

* Feature: Add ingest_data_with_metadata task

Added task that will have access to metadata if data is provided from different data ingestion tools

Feature #COG-337

* docs: Add description on why specific type checking is done

Explained why specific type checking is used instead of isinstance, as isinstace returns True for child classes as well

Docs #COG-337

* fix: Add missing parameter to function call

Added missing parameter to function call

Fix #COG-337

* refactor: Move storing of data from async to sync function

Moved data storing from async to sync

Refactor #COG-337

* refactor: Pretend ingest_data was changes instead of having two tasks

Refactor so ingest_data file was modified instead of having two ingest tasks

Refactor #COG-337

* refactor: Use old name for data ingestion with metadata

Merged new and old data ingestion tasks into one

Refactor #COG-337

* refactor: Return ingest_data and save_data_to_storage Tasks

Returned ingest_data and save_data_to_storage tasks

Refactor #COG-337

* refactor: Return previous ingestion Tasks to add function

Returned previous ignestion tasks to add function

Refactor #COG-337

* fix: Remove dict and use string for search query

Remove dictionary and use string for query in notebook and simple example

Fix COG-337

* refactor: Add changes request in pull request

Added the following changes that were requested in pull request:

Added synchronize label,
Made uniform syntax in if statement in workflow,
fixed instructor dependency,
added llama-index to be optional

Refactor COG-337

* fix: Resolve issue with llama-index being mandatory

Resolve issue with llama-index being mandatory to run cognee

Fix COG-337

* fix: Add install of llama-index to notebook

Removed additional references to llama-index from core cognee lib.
Added llama-index-core install from notebook

Fix COG-337

---------
2024-11-17 11:47:08 +01:00
Vasilije
a63490b916 Bump release version 2024-11-16 14:01:28 +01:00
Vasilije
c2e265fbf8
Merge pull request #231 from topoteretes/COG-597-refactor-analytics
Cog 597 refactor analytics
2024-11-16 13:56:56 +01:00
Vasilije
d1e9870972
Merge branch 'main' into COG-597-refactor-analytics 2024-11-16 13:49:30 +01:00
Igor Ilic
2703215dec refactor: Add user_id to event properties
Adding user_id to event properties allows tracking of which user started the event

Refactor COG-597
2024-11-15 15:20:41 +01:00
Igor Ilic
d90f5fe7c1 feat: Add proxy for analytics
Added proxy usage with vercel hosting for telemetry and analytics

Feature COG-597
2024-11-15 15:05:46 +01:00
hajdul88
8c0c0b97a2
Merge pull request #229 from topoteretes/feature/cog-620-bug-code_graph_pipeline-cant-process-false-class
fix: Fixes processing false Class keyword issue
2024-11-15 14:55:12 +01:00
hajdul88
1df12c1259 fix: Fixes processing false Class keyword issue 2024-11-15 14:47:13 +01:00
Leon Luithlen
6e416ed9f0 Merge branch 'main' into COG-417-chunking-unit-tests 2024-11-14 17:20:10 +01:00
Leon Luithlen
f51a44fd76 Remove unneeded document.read in AudioDocument_test 2024-11-14 17:18:36 +01:00
Leon Luithlen
e40e7386a0 Refactor word_type yielding in chuck_by_sentence 2024-11-14 17:16:04 +01:00
Leon Luithlen
14dd60576e Fix indexing in tests in chunk_by_sentence_test 2024-11-14 17:06:16 +01:00
Leon Luithlen
928e1075c6 Test chunk_by_paragraph chunk numbering 2024-11-14 16:55:24 +01:00
Leon Luithlen
84c98f16bb Remove chunk_index attribute from chunk_by_sentence return value 2024-11-14 16:49:13 +01:00
Leon Luithlen
15420dd864 Fix paragraph_ids handling 2024-11-14 16:47:51 +01:00
Vasilije
b858f0b06b
Merge pull request #216 from topoteretes/fix/remove-dspy-dependency
Remove dspy dependency
2024-11-14 16:25:06 +01:00
Leon Luithlen
0c5ac33aff Adjust poetry.lock 2024-11-14 15:54:28 +01:00
Leon Luithlen
ed8f9cd17e Remove dspy dependency 2024-11-14 15:50:29 +01:00
Leon Luithlen
7cf8c74cf9 Merge latest main 2024-11-14 15:05:57 +01:00
Leon Luithlen
d6a6a9eaba Return sentence_cut instead of word in chunk_by_paragraph 2024-11-14 15:03:09 +01:00
Vasilije
535d8281b4
Merge pull request #215 from topoteretes/clean_dspy
Remove dspy logic that confuses
2024-11-14 14:51:51 +01:00
Vasilije
bc2e17592d
Merge branch 'main' into clean_dspy 2024-11-14 14:50:43 +01:00
Vasilije
36ada5974d
Delete cognee/modules/cognify/dataset.py 2024-11-14 14:49:45 +01:00
Vasilije
8e9040815f
Delete cognee/modules/cognify/train.py 2024-11-14 14:49:34 +01:00
Vasilije
cf09a5ea37
Delete cognee/modules/cognify/test.py 2024-11-14 14:49:23 +01:00
Vasilije
c5d132ed14
Delete cognee/modules/cognify/evaluate.py 2024-11-14 14:49:08 +01:00
hajdul88
c1007091d1
Merge pull request #196 from topoteretes/feat/COG-553-graph-memory-projection
Feat/cog 553 graph memory projection
2024-11-14 14:48:41 +01:00
0xideas
8b681529b1
Update cognee/tasks/chunks/chunk_by_paragraph.py
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2024-11-14 14:42:15 +01:00
Leon Luithlen
73f24f9e4d Fix sentence_cut return value in inappropriate places 2024-11-14 14:40:42 +01:00
Leon Luithlen
b4d509e682 Set batch_paragraph=False in run_chunking_test 2024-11-14 14:23:09 +01:00
Leon Luithlen
a52d3ac6ba Change document test ground truth values for new chunk_by_word 2024-11-14 14:20:18 +01:00
Leon Luithlen
eaf9167fa1 Change chunk_by_word to collect newlines in prior words 2024-11-14 14:19:34 +01:00
hajdul88
867e18de86 fix: Changes GraphDBInterface typing in CogneeGraph 2024-11-14 14:01:20 +01:00
Leon Luithlen
57d8149732 Save paragraph_ids in chunk_by_paragraph 2024-11-14 13:59:54 +01:00
Leon Luithlen
6721eaee83 Fix chunk_index bug in chunk_by_paragraph 2024-11-14 13:50:40 +01:00
0xideas
f2206a09c0
Update cognee/tasks/chunks/chunk_by_word.py
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2024-11-14 13:16:17 +01:00
Leon Luithlen
8260647497 Add AudioDocument and ImageDocument tests 2024-11-14 12:42:10 +01:00
Leon Luithlen
f87fd12e9b Fix lambda bug in AudioDocument and ImageDocument 2024-11-14 12:41:47 +01:00
Leon Luithlen
8b3b2f8156 Add transcribe_image and create_transcript methods 2024-11-14 11:59:46 +01:00
hajdul88
32504255ef feat: Adds unit tests to CogneeGraph class 2024-11-14 11:46:17 +01:00
hajdul88
b516862edc Fix: Fixes import paths 2024-11-14 11:44:43 +01:00
Leon Luithlen
c905510f30 Change test_input order 2024-11-14 11:44:18 +01:00