Commit graph

2375 commits

Author SHA1 Message Date
hajdul88
d9eec77f18 feat: Implements first step of the two step retrieval 2024-11-19 16:40:27 +01:00
hajdul88
44ac9b68b4 feat: adds get_distances from collection method to LanceDB and PgVector 2024-11-19 16:39:45 +01:00
hajdul88
f2c0fddeb2 feat: Adds graph-data-science to neo4j docker image 2024-11-19 16:29:52 +01:00
0xideas
c3757cca94
Merge pull request #230 from topoteretes/COG-533-pydantic-unit-tests
Cog 533 pydantic unit tests
2024-11-19 16:08:01 +01:00
Boris
ab1328d898
Merge branch 'main' into COG-533-pydantic-unit-tests 2024-11-19 15:39:31 +01:00
Igor Ilic
4b55354dce
fix: Resolve issue with pgvector timeout (#3)
By creating PGVector as a singleton all issues regarding timeout are
resolved as there are no more parallel instances trying to communicate
with the database
2024-11-19 15:31:26 +01:00
Boris
5f144a0f92
fix: make all checks green (#1) 2024-11-19 15:30:09 +01:00
Rita Aleksziev
2948089806 Read patch generation instructions from file 2024-11-19 14:07:53 +01:00
Rita Aleksziev
d38f7c294e Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-19 13:57:35 +01:00
Rita Aleksziev
838d98238a Code cleanup 2024-11-19 13:54:04 +01:00
hajdul88
c4850f64dc feat: Implements pipeline structure for retrievers 2024-11-19 11:14:42 +01:00
0xideas
1dd07cdba6
Merge pull request #234 from topoteretes/fix/rename-remaining-query-to-query-text-kwargs
Rename remaining 'query' keyword args in cognee.search to 'query_text'
2024-11-19 11:05:40 +01:00
Leon Luithlen
b18f748c9e Merge dicts directly 2024-11-19 10:56:21 +01:00
Vasilije
2331739e07
Fix syntax in simple example (#2) 2024-11-19 10:44:36 +01:00
Rita Aleksziev
07b1956b6e Fix syntax in simple example 2024-11-19 09:55:21 +01:00
Rita Aleksziev
9973afffa1 Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-19 09:38:11 +01:00
Vasilije
722c7b081a
Update README.md 2024-11-18 21:43:10 +01:00
Boris
c045f737f7
feat: add vector and graph dbs state to README file (#235) 2024-11-18 17:51:41 +01:00
Boris
bb404f51f9
Update docs/quickstart.md 2024-11-18 17:50:15 +01:00
Boris
64bc425330
Merge branch 'main' into fix/rename-remaining-query-to-query-text-kwargs 2024-11-18 17:39:23 +01:00
0xideas
34e140a41d
Switch to gpt-4o-mini by default (#233)
* Switch to gpt-4o-mini by default

* Add option and make gpt-4o-mini default in frontend

* Run llama index notebook without extra arguments in poetry install

* Install extras for llama_index_notebook run
2024-11-18 17:38:54 +01:00
Rita Aleksziev
d07ebbe350 Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-18 15:59:42 +01:00
Rita Aleksziev
d986e7c981 minor code cleanup 2024-11-18 15:59:18 +01:00
Rita Aleksziev
98e3445c2c running swebench evaluation as subprocess 2024-11-18 15:12:36 +01:00
Leon Luithlen
7a2fc617a8 Rename remaining 'query' keyword args in cognee.search to 'query_text' 2024-11-18 14:00:14 +01:00
0xideas
ced5385186
Merge pull request #205 from topoteretes/COG-417-chunking-unit-tests
Cog 417 chunking unit tests
2024-11-18 12:52:44 +01:00
Leon Luithlen
fde56f0c3b Merge branch 'main' into COG-533-pydantic-unit-tests 2024-11-18 11:24:51 +01:00
Leon Luithlen
103eb13c77 Skip recursive pydantic tests for Python 3.9 and 3.10 2024-11-18 11:23:22 +01:00
Boris
22a0e43d4a
Merge branch 'main' into COG-417-chunking-unit-tests 2024-11-17 13:40:32 +01:00
Boris
d8b6eeded5
feat: log search queries and results (#166)
* feat: log search queries and results

* fix: address coderabbit review comments

* fix: parse UUID when logging search results

* fix: remove custom UUID type and use DB agnostic UUID from sqlalchemy

* Add new cognee_db

---------

Co-authored-by: Leon Luithlen <leon@topoteretes.com>
2024-11-17 11:59:10 +01:00
Igor Ilic
d30adb53f3
Cog 337 llama index support (#186)
* feat: Add support for LlamaIndex Document type

Added support for LlamaIndex Document type

Feature #COG-337

* docs: Add Jupyter Notebook for cognee with llama index document type

Added jupyter notebook which demonstrates cognee with LlamaIndex document type usage

Docs #COG-337

* feat: Add metadata migration from LlamaIndex document type

Allow usage of metadata from LlamaIndex documents

Feature #COG-337

* refactor: Change llama index migration function name

Change name of llama index function

Refactor #COG-337

* chore: Add llama index core dependency

Downgrade needed on tenacity and instructor modules to support llama index

Chore #COG-337

* Feature: Add ingest_data_with_metadata task

Added task that will have access to metadata if data is provided from different data ingestion tools

Feature #COG-337

* docs: Add description on why specific type checking is done

Explained why specific type checking is used instead of isinstance, as isinstance returns True for child classes as well

Docs #COG-337

* fix: Add missing parameter to function call

Added missing parameter to function call

Fix #COG-337

* refactor: Move storing of data from async to sync function

Moved data storing from async to sync

Refactor #COG-337

* refactor: Pretend ingest_data was changed instead of having two tasks

Refactor so ingest_data file was modified instead of having two ingest tasks

Refactor #COG-337

* refactor: Use old name for data ingestion with metadata

Merged new and old data ingestion tasks into one

Refactor #COG-337

* refactor: Return ingest_data and save_data_to_storage Tasks

Returned ingest_data and save_data_to_storage tasks

Refactor #COG-337

* refactor: Return previous ingestion Tasks to add function

Returned previous ingestion tasks to add function

Refactor #COG-337

* fix: Remove dict and use string for search query

Remove dictionary and use string for query in notebook and simple example

Fix COG-337

* refactor: Add changes requested in pull request

Added the following changes that were requested in pull request:

Added synchronize label,
Made uniform syntax in if statement in workflow,
fixed instructor dependency,
added llama-index to be optional

Refactor COG-337

* fix: Resolve issue with llama-index being mandatory

Resolve issue with llama-index being mandatory to run cognee

Fix COG-337

* fix: Add install of llama-index to notebook

Removed additional references to llama-index from core cognee lib.
Added llama-index-core install from notebook

Fix COG-337

---------
2024-11-17 11:47:08 +01:00
Vasilije
a63490b916 Bump release version 2024-11-16 14:01:28 +01:00
Vasilije
c2e265fbf8
Merge pull request #231 from topoteretes/COG-597-refactor-analytics
Cog 597 refactor analytics
2024-11-16 13:56:56 +01:00
Vasilije
d1e9870972
Merge branch 'main' into COG-597-refactor-analytics 2024-11-16 13:49:30 +01:00
Leon Luithlen
8a2cf2075a Add model_rebuild 2024-11-15 17:57:03 +01:00
Rita Aleksziev
ed08cdb9f9 using the code graph pipeline instead of cognify 2024-11-15 17:56:19 +01:00
Rita Aleksziev
4d6229bf7e Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-15 17:15:16 +01:00
Rita Aleksziev
721fde3d60 generating testspecs for data 2024-11-15 17:14:43 +01:00
Leon Luithlen
a3342918d9 Apply cosmetic changes and autoformat 2024-11-15 16:53:32 +01:00
Leon Luithlen
5b420ebccc Autoformat graph pydantic conversion code 2024-11-15 16:44:30 +01:00
Leon Luithlen
148eb4ed9b Add profile_graph_pydantic_conversion.py 2024-11-15 16:42:14 +01:00
Leon Luithlen
a1f72727bc Revert model_rebuild order 2024-11-15 16:17:33 +01:00
Leon Luithlen
f3f0bca9bd Revert making Person attributes optional 2024-11-15 16:03:53 +01:00
Leon Luithlen
370b59b39a Add get_graph_from_model_generative_test 2024-11-15 15:58:03 +01:00
Leon Luithlen
5a464bfca7 Refactor get_model_instance_from_graph 2024-11-15 15:57:50 +01:00
Igor Ilic
2703215dec refactor: Add user_id to event properties
Adding user_id to event properties allows tracking of which user started the event

Refactor COG-597
2024-11-15 15:20:41 +01:00
Leon Luithlen
afae70f3b5 Add get_graph_from_model_generative_test 2024-11-15 15:10:42 +01:00
Igor Ilic
d90f5fe7c1 feat: Add proxy for analytics
Added proxy usage with vercel hosting for telemetry and analytics

Feature COG-597
2024-11-15 15:05:46 +01:00
hajdul88
8c0c0b97a2
Merge pull request #229 from topoteretes/feature/cog-620-bug-code_graph_pipeline-cant-process-false-class
fix: Fixes processing false Class keyword issue
2024-11-15 14:55:12 +01:00
Leon Luithlen
3c8a52f4b0 Fix inconsistent state between nodes and added_nodes and edges and added_edges 2024-11-15 14:47:36 +01:00