Commit graph

1229 commits

Author SHA1 Message Date
hajdul88
a59517409c chore: Fixes some of the issues based on PR review + restructures things 2024-11-26 14:45:48 +01:00
hajdul88
676cdfcc84 Merge remote-tracking branch 'origin/main' into feature/cog-537-implement-retrieval-algorithm-from-research-paper 2024-11-21 17:07:16 +01:00
hajdul88
b5d9e7a6d2 chore: adds return value and sets the entry point kg generation to true 2024-11-20 19:03:32 +01:00
hajdul88
6efe566849 fix: Adds new obligatory attributes to cognee graph tests 2024-11-20 18:40:56 +01:00
hajdul88
a114d68aef feat: Implements basic global triplet optimizing retrieval 2024-11-20 18:33:34 +01:00
hajdul88
980ae2b22c feat: Adds in time edge vector similarity calculation and triplet importances 2024-11-20 18:32:03 +01:00
lxobr
a8aefd57ef
COG-546 get_local_script_dependencies (#6)
A utility function, `get_local_script_dependencies`:

- Extracts and resolves local dependencies of a Python script using
`jedi` and `parso`.
- Returns a sorted list of unique module paths.
- Optionally filters out dependencies outside a specified repository
path.
- Includes an example/checker in `cognee/tasks/code`.

Will be used for creating a graph from a repo.
2024-11-20 16:36:03 +01:00
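The idea behind this utility can be sketched in a few lines. The version below is a simplified illustration, not the committed implementation: it swaps `jedi` and `parso` for the standard-library `ast` module, resolves only absolute imports, and uses a hypothetical function name, `local_script_dependencies`. It keeps the behaviour described above: a sorted list of unique module paths, limited to files inside a given repository path.

```python
import ast
from pathlib import Path


def local_script_dependencies(script_path, repo_path):
    """Return sorted, unique paths of repo-local modules imported by a script.

    Simplification (assumption): only absolute imports are handled, and a
    dotted module name is mapped directly to a file under repo_path.
    """
    repo = Path(repo_path).resolve()
    tree = ast.parse(Path(script_path).read_text(encoding="utf-8"))

    # Collect every absolutely imported module name in the script.
    module_names = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            module_names.update(alias.name for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            module_names.add(node.module)

    # Keep only names that resolve to a module file or package inside the repo.
    found = set()
    for name in module_names:
        base = repo.joinpath(*name.split("."))
        for candidate in (base.with_suffix(".py"), base / "__init__.py"):
            if candidate.is_file():
                found.add(str(candidate))
                break
    return sorted(found)
```

Standard-library and third-party imports drop out naturally here, because they do not resolve to a file under `repo_path`.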
alekszievr
ad08b53ed7
Feat/cog 544 eval on swe bench (#5)
Evaluation script for SWE-bench benchmarking with and without cognee
2024-11-20 16:30:28 +01:00
Igor Ilic
acd88e83e2
fix: Resolve issue with table names in SQL commands (#13)
Some SQL commands require lowercase characters in table names unless the
table name is wrapped in quotes. Renamed all new tables to use lowercase.

Fix COG-677
2024-11-20 16:16:57 +01:00
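The commit does not name the database, but the behaviour it describes matches PostgreSQL-style identifier folding, where unquoted identifiers are folded to lowercase and quoted identifiers keep their case. The sketch below models that rule in plain Python, assuming that folding behaviour; `fold_identifier` and the table name `SearchResults` are hypothetical and exist only for illustration.

```python
def fold_identifier(identifier: str) -> str:
    """Model how a table name is resolved, assuming PostgreSQL-style folding."""
    if len(identifier) >= 2 and identifier[0] == '"' and identifier[-1] == '"':
        # Quoted: case is preserved; a doubled quote inside stands for one quote.
        return identifier[1:-1].replace('""', '"')
    # Unquoted: the name is folded to lowercase before lookup.
    return identifier.lower()


stored_name = "SearchResults"  # hypothetical table created with a mixed-case name

# An unquoted reference no longer matches the stored name.
unquoted_matches = fold_identifier("SearchResults") == stored_name
# A quoted reference does match.
quoted_matches = fold_identifier('"SearchResults"') == stored_name
```

Under this assumption, an all-lowercase table name resolves identically whether or not it is quoted, which is why renaming the tables removes the problem.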
Rita Aleksziev
98e693ad97 Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-20 16:04:15 +01:00
Rita Aleksziev
e1d8f3ea86 use acreate_structured_output instead of create_structured_output in eval script 2024-11-20 16:02:15 +01:00
Igor Ilic
f9353d25fa fix: Update table name in notebook
Update table name in notebook

Fix COG-677
2024-11-20 15:14:38 +01:00
hajdul88
9f557b0c5b feat: Extends two steps retriever with graph projection 2024-11-20 15:14:36 +01:00
hajdul88
0101d43b8d feat: Adds graph node filtering by feature 2024-11-20 15:13:38 +01:00
Igor Ilic
70fe6ac541 fix: Update table name in notebook
Update table name to use latest in notebook

Fix COG-677
2024-11-20 15:07:38 +01:00
Igor Ilic
e4d00403ba fix: Update table names in tests
Update table names in tests to accommodate the recent fix

Fix COG-677
2024-11-20 15:02:22 +01:00
Igor Ilic
15b7b8ef2b fix: Resolve issue with table names in SQL commands
Some SQL commands require lowercase characters in table names unless the table name is wrapped in quotes. Renamed all new tables to use lowercase.

Fix COG-677
2024-11-20 14:54:35 +01:00
lxobr
f27dc0c91a fix: Rename, extract checker into a separate script 2024-11-20 12:28:10 +01:00
lxobr
263ecb9149 fix: Add input validation and error handling for paths 2024-11-20 12:28:10 +01:00
lxobr
8bc26bba97 fix: Add error handling for path conversion 2024-11-20 12:28:10 +01:00
lxobr
ebb811af87 fix: Filter out None values in module paths 2024-11-20 12:28:10 +01:00
lxobr
2417d18607 fix: Add logging instead of print 2024-11-20 12:28:10 +01:00
lxobr
1a1452e177 fix: Add error handling for Jedi analysis, with debug mode 2024-11-20 12:28:10 +01:00
lxobr
3aadda9a89 feat: Add argparse for testing purposes 2024-11-20 12:28:10 +01:00
lxobr
4bf2281cd5 feat: Enable async processing 2024-11-20 12:28:10 +01:00
lxobr
742792b6c1 refactor: Remove a comment 2024-11-20 12:28:10 +01:00
lxobr
2be2b802c0 feat: Safely handle file read errors 2024-11-20 12:28:10 +01:00
lxobr
e148d32c14 refactor: Modify sys.path in context manager 2024-11-20 12:28:10 +01:00
lxobr
ba83d71269 feat: extract script dependencies 2024-11-20 12:28:10 +01:00
lxobr
26e2dc852d feat: new repo-to-graph task 2024-11-20 12:28:10 +01:00
hajdul88
d9eec77f18 feat: Implements first step of the two step retrieval 2024-11-19 16:40:27 +01:00
hajdul88
44ac9b68b4 feat: adds get_distances from collection method to LanceDB and PgVector 2024-11-19 16:39:45 +01:00
hajdul88
f2c0fddeb2 feat: Adds graph-data-science to neo4j docker image 2024-11-19 16:29:52 +01:00
Igor Ilic
4b55354dce
fix: Resolve issue with pgvector timeout (#3)
By creating PGVector as a singleton, all issues regarding timeouts are
resolved, as there are no more parallel instances trying to communicate
with the database.
2024-11-19 15:31:26 +01:00
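As a rough illustration of the singleton approach, the sketch below shows one common way to guarantee a single shared instance in Python. `VectorAdapter` and `connection_string` are hypothetical names; this is not the actual PGVector adapter code from the commit, only a minimal example of the pattern.

```python
import threading


class VectorAdapter:
    """Hypothetical stand-in for a database adapter, created at most once."""

    _instance = None
    _lock = threading.Lock()

    def __new__(cls, connection_string):
        # The lock keeps two threads from creating separate instances at once.
        with cls._lock:
            if cls._instance is None:
                instance = super().__new__(cls)
                instance.connection_string = connection_string
                cls._instance = instance
        return cls._instance


first = VectorAdapter("postgresql://localhost/example")
second = VectorAdapter("postgresql://localhost/other")
same_object = first is second  # both names refer to the one shared instance
```

Later calls return the instance created first and ignore their own arguments, so every caller shares one connection configuration instead of opening parallel ones.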
Boris
5f144a0f92
fix: make all checks green (#1) 2024-11-19 15:30:09 +01:00
Rita Aleksziev
2948089806 Read patch generation instructions from file 2024-11-19 14:07:53 +01:00
Rita Aleksziev
d38f7c294e Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-19 13:57:35 +01:00
Rita Aleksziev
838d98238a Code cleanup 2024-11-19 13:54:04 +01:00
hajdul88
c4850f64dc feat: Implements pipeline structure for retrievers 2024-11-19 11:14:42 +01:00
Vasilije
2331739e07
Fix syntax in simple example (#2) 2024-11-19 10:44:36 +01:00
Rita Aleksziev
07b1956b6e Fix syntax in simple example 2024-11-19 09:55:21 +01:00
Rita Aleksziev
9973afffa1 Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-19 09:38:11 +01:00
Boris
c045f737f7
feat: add vector and graph dbs state to README file (#235) 2024-11-18 17:51:41 +01:00
0xideas
34e140a41d
Switch to gpt-4o-mini by default (#233)
* Switch to gpt-4o-mini by default

* Add option and make gpt-4o-mini default in frontend

* Run llama index notebook without extra arguments in poetry install

* Install extras for llama_index_notebook run
2024-11-18 17:38:54 +01:00
Rita Aleksziev
d07ebbe350 Merge branch 'main' into feat/COG-544-eval-on-swe-bench 2024-11-18 15:59:42 +01:00
Rita Aleksziev
d986e7c981 minor code cleanup 2024-11-18 15:59:18 +01:00
Rita Aleksziev
98e3445c2c running swebench evaluation as subprocess 2024-11-18 15:12:36 +01:00
0xideas
ced5385186
Merge pull request #205 from topoteretes/COG-417-chunking-unit-tests
Cog 417 chunking unit tests
2024-11-18 12:52:44 +01:00
Boris
22a0e43d4a
Merge branch 'main' into COG-417-chunking-unit-tests 2024-11-17 13:40:32 +01:00
Boris
d8b6eeded5
feat: log search queries and results (#166)
* feat: log search queries and results

* fix: address coderabbit review comments

* fix: parse UUID when logging search results

* fix: remove custom UUID type and use DB agnostic UUID from sqlalchemy

* Add new cognee_db

---------

Co-authored-by: Leon Luithlen <leon@topoteretes.com>
2024-11-17 11:59:10 +01:00