Commit graph

1217 commits

Author SHA1 Message Date
lxobr
a8aefd57ef
COG-546 get_local_script_dependencies (#6)
A utility function, `get_local_script_dependencies`:

- Extracts and resolves local dependencies of a Python script using
`jedi` and `parso`.
- Returns a sorted list of unique module paths
- Optionally dependencies outside a specified repository path are
filtered out
- Includes an example/checker in `cognee/tasks/code`.

Will be used for creating a graph from a repo.
2024-11-20 16:36:03 +01:00
alekszievr
ad08b53ed7
Feat/cog 544 eval on swe bench (#5)
Evaluation script for SWE-bench benchmarking with and without cognee
2024-11-20 16:30:28 +01:00
Igor Ilic
acd88e83e2
fix: Resolve issue with table names in SQL commands (#13)
Some SQL commands require lowercase characters in table names unless
table name is wrapped in quotes. Renamed all new tables to use lowercase

Fix COG-677
2024-11-20 16:16:57 +01:00
Rita Aleksziev
98e693ad97 Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-20 16:04:15 +01:00
Rita Aleksziev
e1d8f3ea86 use acreate_structured_output instead of create_structured_output in eval script 2024-11-20 16:02:15 +01:00
Igor Ilic
f9353d25fa fix: Update table name in notebook
Update table name in notebook

Fix COG-677
2024-11-20 15:14:38 +01:00
Igor Ilic
70fe6ac541 fix: Update table name in notebook
Update table name to use latest in notebook

Fix COG-677
2024-11-20 15:07:38 +01:00
Igor Ilic
e4d00403ba fix: Update table names in tests
Update table names in tests to accomodate to recent fix

Fix COG-677
2024-11-20 15:02:22 +01:00
Igor Ilic
15b7b8ef2b fix: Resolve issue with table names in SQL commands
Some SQL commands require lowercase characters in table names unless table name is wrapped in quotes. Renamed all new tables to use lowercase

Fix COG-677
2024-11-20 14:54:35 +01:00
lxobr
f27dc0c91a fix: Rename, extract checker into a separate script 2024-11-20 12:28:10 +01:00
lxobr
263ecb9149 fix: Add input validation and error handling for paths 2024-11-20 12:28:10 +01:00
lxobr
8bc26bba97 fix: Add error handling for path conversion 2024-11-20 12:28:10 +01:00
lxobr
ebb811af87 fix: Filter out None values in module paths 2024-11-20 12:28:10 +01:00
lxobr
2417d18607 fix: Add logging instead of print 2024-11-20 12:28:10 +01:00
lxobr
1a1452e177 fix: Add error handling for Jedi analysis, with debug mode 2024-11-20 12:28:10 +01:00
lxobr
3aadda9a89 feat: Add argparse for testing purposes 2024-11-20 12:28:10 +01:00
lxobr
4bf2281cd5 feat: Enable async processing 2024-11-20 12:28:10 +01:00
lxobr
742792b6c1 refactor: Remove a comment 2024-11-20 12:28:10 +01:00
lxobr
2be2b802c0 feat: Safely handle file read errors 2024-11-20 12:28:10 +01:00
lxobr
e148d32c14 refactor: Modify sys.path in context manager 2024-11-20 12:28:10 +01:00
lxobr
ba83d71269 feat: extract script dependencies 2024-11-20 12:28:10 +01:00
lxobr
26e2dc852d feat: new repo-to-graph task 2024-11-20 12:28:10 +01:00
Igor Ilic
4b55354dce
fix: Resolve issue with pgvector timeout (#3)
By creating PGVector as a singleton all issues regrading timeout are
resolved as there are no more parallel instances trying to communicate
with the database
2024-11-19 15:31:26 +01:00
Boris
5f144a0f92
fix: make all checks green (#1) 2024-11-19 15:30:09 +01:00
Rita Aleksziev
2948089806 Read patch generation instructions from file 2024-11-19 14:07:53 +01:00
Rita Aleksziev
d38f7c294e Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-19 13:57:35 +01:00
Rita Aleksziev
838d98238a Code cleanup 2024-11-19 13:54:04 +01:00
Vasilije
2331739e07
Fix syntax in simple example (#2) 2024-11-19 10:44:36 +01:00
Rita Aleksziev
07b1956b6e Fix syntax in simple example 2024-11-19 09:55:21 +01:00
Rita Aleksziev
9973afffa1 Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-19 09:38:11 +01:00
Boris
c045f737f7
feat: add vector and graph dbs state to README file (#235) 2024-11-18 17:51:41 +01:00
0xideas
34e140a41d
Switch to gpt-4o-mini by default (#233)
* Switch to gpt-4o-mini by default

* Add option and make gpt-4o-mini default in frontend

* Run llama index notebook without extra arguments in poetry install

* Install extras for llama_index_notebook run
2024-11-18 17:38:54 +01:00
Rita Aleksziev
d07ebbe350 Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-18 15:59:42 +01:00
Rita Aleksziev
d986e7c981 minor code cleanup 2024-11-18 15:59:18 +01:00
Rita Aleksziev
98e3445c2c running swebench evaluation as subprocess 2024-11-18 15:12:36 +01:00
0xideas
ced5385186
Merge pull request #205 from topoteretes/COG-417-chunking-unit-tests
Cog 417 chunking unit tests
2024-11-18 12:52:44 +01:00
Boris
22a0e43d4a
Merge branch 'main' into COG-417-chunking-unit-tests 2024-11-17 13:40:32 +01:00
Boris
d8b6eeded5
feat: log search queries and results (#166)
* feat: log search queries and results

* fix: address coderabbit review comments

* fix: parse UUID when logging search results

* fix: remove custom UUID type and use DB agnostic UUID from sqlalchemy

* Add new cognee_db

---------

Co-authored-by: Leon Luithlen <leon@topoteretes.com>
2024-11-17 11:59:10 +01:00
Igor Ilic
d30adb53f3
Cog 337 llama index support (#186)
* feat: Add support for LlamaIndex Document type

Added support for LlamaIndex Document type

Feature #COG-337

* docs: Add Jupyer Notebook for cognee with llama index document type

Added jupyter notebook which demonstrates cognee with LlamaIndex document type usage

Docs #COG-337

* feat: Add metadata migration from LlamaIndex document type

Allow usage of metadata from LlamaIndex documents

Feature #COG-337

* refactor: Change llama index migration function name

Change name of llama index function

Refactor #COG-337

* chore: Add llama index core dependency

Downgrade needed on tenacity and instructor modules to support llama index

Chore #COG-337

* Feature: Add ingest_data_with_metadata task

Added task that will have access to metadata if data is provided from different data ingestion tools

Feature #COG-337

* docs: Add description on why specific type checking is done

Explained why specific type checking is used instead of isinstance, as isinstace returns True for child classes as well

Docs #COG-337

* fix: Add missing parameter to function call

Added missing parameter to function call

Fix #COG-337

* refactor: Move storing of data from async to sync function

Moved data storing from async to sync

Refactor #COG-337

* refactor: Pretend ingest_data was changes instead of having two tasks

Refactor so ingest_data file was modified instead of having two ingest tasks

Refactor #COG-337

* refactor: Use old name for data ingestion with metadata

Merged new and old data ingestion tasks into one

Refactor #COG-337

* refactor: Return ingest_data and save_data_to_storage Tasks

Returned ingest_data and save_data_to_storage tasks

Refactor #COG-337

* refactor: Return previous ingestion Tasks to add function

Returned previous ignestion tasks to add function

Refactor #COG-337

* fix: Remove dict and use string for search query

Remove dictionary and use string for query in notebook and simple example

Fix COG-337

* refactor: Add changes request in pull request

Added the following changes that were requested in pull request:

Added synchronize label,
Made uniform syntax in if statement in workflow,
fixed instructor dependency,
added llama-index to be optional

Refactor COG-337

* fix: Resolve issue with llama-index being mandatory

Resolve issue with llama-index being mandatory to run cognee

Fix COG-337

* fix: Add install of llama-index to notebook

Removed additional references to llama-index from core cognee lib.
Added llama-index-core install from notebook

Fix COG-337

---------
2024-11-17 11:47:08 +01:00
Vasilije
a63490b916 Bump release version 2024-11-16 14:01:28 +01:00
Vasilije
c2e265fbf8
Merge pull request #231 from topoteretes/COG-597-refactor-analytics
Cog 597 refactor analytics
2024-11-16 13:56:56 +01:00
Vasilije
d1e9870972
Merge branch 'main' into COG-597-refactor-analytics 2024-11-16 13:49:30 +01:00
Rita Aleksziev
ed08cdb9f9 using the code graph pipeline instead of cognify 2024-11-15 17:56:19 +01:00
Rita Aleksziev
4d6229bf7e Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-15 17:15:16 +01:00
Rita Aleksziev
721fde3d60 generating testspecs for data 2024-11-15 17:14:43 +01:00
Igor Ilic
2703215dec refactor: Add user_id to event properties
Adding user_id to event properties allows tracking of which user started the event

Refactor COG-597
2024-11-15 15:20:41 +01:00
Igor Ilic
d90f5fe7c1 feat: Add proxy for analytics
Added proxy usage with vercel hosting for telemetry and analytics

Feature COG-597
2024-11-15 15:05:46 +01:00
hajdul88
8c0c0b97a2
Merge pull request #229 from topoteretes/feature/cog-620-bug-code_graph_pipeline-cant-process-false-class
fix: Fixes processing false Class keyword issue
2024-11-15 14:55:12 +01:00
hajdul88
1df12c1259 fix: Fixes processing false Class keyword issue 2024-11-15 14:47:13 +01:00
Leon Luithlen
6e416ed9f0 Merge branch 'main' into COG-417-chunking-unit-tests 2024-11-14 17:20:10 +01:00