Commit graph

1184 commits

Author SHA1 Message Date
Igor Ilic
acd88e83e2
fix: Resolve issue with table names in SQL commands (#13)
Some SQL commands require lowercase characters in table names unless
table name is wrapped in quotes. Renamed all new tables to use lowercase

Fix COG-677
2024-11-20 16:16:57 +01:00
Igor Ilic
f9353d25fa fix: Update table name in notebook
Update table name in notebook

Fix COG-677
2024-11-20 15:14:38 +01:00
Igor Ilic
70fe6ac541 fix: Update table name in notebook
Update table name to use latest in notebook

Fix COG-677
2024-11-20 15:07:38 +01:00
Igor Ilic
e4d00403ba fix: Update table names in tests
Update table names in tests to accomodate to recent fix

Fix COG-677
2024-11-20 15:02:22 +01:00
Igor Ilic
15b7b8ef2b fix: Resolve issue with table names in SQL commands
Some SQL commands require lowercase characters in table names unless table name is wrapped in quotes. Renamed all new tables to use lowercase

Fix COG-677
2024-11-20 14:54:35 +01:00
Igor Ilic
4b55354dce
fix: Resolve issue with pgvector timeout (#3)
By creating PGVector as a singleton all issues regrading timeout are
resolved as there are no more parallel instances trying to communicate
with the database
2024-11-19 15:31:26 +01:00
Boris
5f144a0f92
fix: make all checks green (#1) 2024-11-19 15:30:09 +01:00
Vasilije
2331739e07
Fix syntax in simple example (#2) 2024-11-19 10:44:36 +01:00
Rita Aleksziev
07b1956b6e Fix syntax in simple example 2024-11-19 09:55:21 +01:00
Boris
c045f737f7
feat: add vector and graph dbs state to README file (#235) 2024-11-18 17:51:41 +01:00
0xideas
34e140a41d
Switch to gpt-4o-mini by default (#233)
* Switch to gpt-4o-mini by default

* Add option and make gpt-4o-mini default in frontend

* Run llama index notebook without extra arguments in poetry install

* Install extras for llama_index_notebook run
2024-11-18 17:38:54 +01:00
0xideas
ced5385186
Merge pull request #205 from topoteretes/COG-417-chunking-unit-tests
Cog 417 chunking unit tests
2024-11-18 12:52:44 +01:00
Boris
22a0e43d4a
Merge branch 'main' into COG-417-chunking-unit-tests 2024-11-17 13:40:32 +01:00
Boris
d8b6eeded5
feat: log search queries and results (#166)
* feat: log search queries and results

* fix: address coderabbit review comments

* fix: parse UUID when logging search results

* fix: remove custom UUID type and use DB agnostic UUID from sqlalchemy

* Add new cognee_db

---------

Co-authored-by: Leon Luithlen <leon@topoteretes.com>
2024-11-17 11:59:10 +01:00
Igor Ilic
d30adb53f3
Cog 337 llama index support (#186)
* feat: Add support for LlamaIndex Document type

Added support for LlamaIndex Document type

Feature #COG-337

* docs: Add Jupyer Notebook for cognee with llama index document type

Added jupyter notebook which demonstrates cognee with LlamaIndex document type usage

Docs #COG-337

* feat: Add metadata migration from LlamaIndex document type

Allow usage of metadata from LlamaIndex documents

Feature #COG-337

* refactor: Change llama index migration function name

Change name of llama index function

Refactor #COG-337

* chore: Add llama index core dependency

Downgrade needed on tenacity and instructor modules to support llama index

Chore #COG-337

* Feature: Add ingest_data_with_metadata task

Added task that will have access to metadata if data is provided from different data ingestion tools

Feature #COG-337

* docs: Add description on why specific type checking is done

Explained why specific type checking is used instead of isinstance, as isinstace returns True for child classes as well

Docs #COG-337

* fix: Add missing parameter to function call

Added missing parameter to function call

Fix #COG-337

* refactor: Move storing of data from async to sync function

Moved data storing from async to sync

Refactor #COG-337

* refactor: Pretend ingest_data was changes instead of having two tasks

Refactor so ingest_data file was modified instead of having two ingest tasks

Refactor #COG-337

* refactor: Use old name for data ingestion with metadata

Merged new and old data ingestion tasks into one

Refactor #COG-337

* refactor: Return ingest_data and save_data_to_storage Tasks

Returned ingest_data and save_data_to_storage tasks

Refactor #COG-337

* refactor: Return previous ingestion Tasks to add function

Returned previous ignestion tasks to add function

Refactor #COG-337

* fix: Remove dict and use string for search query

Remove dictionary and use string for query in notebook and simple example

Fix COG-337

* refactor: Add changes request in pull request

Added the following changes that were requested in pull request:

Added synchronize label,
Made uniform syntax in if statement in workflow,
fixed instructor dependency,
added llama-index to be optional

Refactor COG-337

* fix: Resolve issue with llama-index being mandatory

Resolve issue with llama-index being mandatory to run cognee

Fix COG-337

* fix: Add install of llama-index to notebook

Removed additional references to llama-index from core cognee lib.
Added llama-index-core install from notebook

Fix COG-337

---------
2024-11-17 11:47:08 +01:00
Vasilije
a63490b916 Bump release version 2024-11-16 14:01:28 +01:00
Vasilije
c2e265fbf8
Merge pull request #231 from topoteretes/COG-597-refactor-analytics
Cog 597 refactor analytics
2024-11-16 13:56:56 +01:00
Vasilije
d1e9870972
Merge branch 'main' into COG-597-refactor-analytics 2024-11-16 13:49:30 +01:00
Igor Ilic
2703215dec refactor: Add user_id to event properties
Adding user_id to event properties allows tracking of which user started the event

Refactor COG-597
2024-11-15 15:20:41 +01:00
Igor Ilic
d90f5fe7c1 feat: Add proxy for analytics
Added proxy usage with vercel hosting for telemetry and analytics

Feature COG-597
2024-11-15 15:05:46 +01:00
hajdul88
8c0c0b97a2
Merge pull request #229 from topoteretes/feature/cog-620-bug-code_graph_pipeline-cant-process-false-class
fix: Fixes processing false Class keyword issue
2024-11-15 14:55:12 +01:00
hajdul88
1df12c1259 fix: Fixes processing false Class keyword issue 2024-11-15 14:47:13 +01:00
Leon Luithlen
6e416ed9f0 Merge branch 'main' into COG-417-chunking-unit-tests 2024-11-14 17:20:10 +01:00
Leon Luithlen
f51a44fd76 Remove unneeded document.read in AudioDocument_test 2024-11-14 17:18:36 +01:00
Leon Luithlen
e40e7386a0 Refactor word_type yielding in chuck_by_sentence 2024-11-14 17:16:04 +01:00
Leon Luithlen
14dd60576e Fix indexing in tests in chunk_by_sentence_test 2024-11-14 17:06:16 +01:00
Leon Luithlen
928e1075c6 Test chunk_by_paragraph chunk numbering 2024-11-14 16:55:24 +01:00
Leon Luithlen
84c98f16bb Remove chunk_index attribute from chunk_by_sentence return value 2024-11-14 16:49:13 +01:00
Leon Luithlen
15420dd864 Fix paragraph_ids handling 2024-11-14 16:47:51 +01:00
Vasilije
b858f0b06b
Merge pull request #216 from topoteretes/fix/remove-dspy-dependency
Remove dspy dependency
2024-11-14 16:25:06 +01:00
Leon Luithlen
0c5ac33aff Adjust poetry.lock 2024-11-14 15:54:28 +01:00
Leon Luithlen
ed8f9cd17e Remove dspy dependency 2024-11-14 15:50:29 +01:00
Leon Luithlen
7cf8c74cf9 Merge latest main 2024-11-14 15:05:57 +01:00
Leon Luithlen
d6a6a9eaba Return sentence_cut instead of word in chunk_by_paragraph 2024-11-14 15:03:09 +01:00
Vasilije
535d8281b4
Merge pull request #215 from topoteretes/clean_dspy
Remove dspy logic that confuses
2024-11-14 14:51:51 +01:00
Vasilije
bc2e17592d
Merge branch 'main' into clean_dspy 2024-11-14 14:50:43 +01:00
Vasilije
36ada5974d
Delete cognee/modules/cognify/dataset.py 2024-11-14 14:49:45 +01:00
Vasilije
8e9040815f
Delete cognee/modules/cognify/train.py 2024-11-14 14:49:34 +01:00
Vasilije
cf09a5ea37
Delete cognee/modules/cognify/test.py 2024-11-14 14:49:23 +01:00
Vasilije
c5d132ed14
Delete cognee/modules/cognify/evaluate.py 2024-11-14 14:49:08 +01:00
hajdul88
c1007091d1
Merge pull request #196 from topoteretes/feat/COG-553-graph-memory-projection
Feat/cog 553 graph memory projection
2024-11-14 14:48:41 +01:00
0xideas
8b681529b1
Update cognee/tasks/chunks/chunk_by_paragraph.py
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2024-11-14 14:42:15 +01:00
Leon Luithlen
73f24f9e4d Fix sentence_cut return value in inappropriate places 2024-11-14 14:40:42 +01:00
Leon Luithlen
b4d509e682 Set batch_paragraph=False in run_chunking_test 2024-11-14 14:23:09 +01:00
Leon Luithlen
a52d3ac6ba Change document test ground truth values for new chunk_by_word 2024-11-14 14:20:18 +01:00
Leon Luithlen
eaf9167fa1 Change chunk_by_word to collect newlines in prior words 2024-11-14 14:19:34 +01:00
hajdul88
867e18de86 fix: Changes GraphDBInterface typing in CogneeGraph 2024-11-14 14:01:20 +01:00
Leon Luithlen
57d8149732 Save paragraph_ids in chunk_by_paragraph 2024-11-14 13:59:54 +01:00
Leon Luithlen
6721eaee83 Fix chunk_index bug in chunk_by_paragraph 2024-11-14 13:50:40 +01:00
0xideas
f2206a09c0
Update cognee/tasks/chunks/chunk_by_word.py
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2024-11-14 13:16:17 +01:00