* Revert "fix: Add metadata reflection fix to sqlite as well"
This reverts commit 394a0b2dfb.
* COG-810 Implement a top-down dependency graph builder tool (#268)
* feat: parse repo to call graph
* Update/repo_processor/top_down_repo_parse.py task
* fix: minor improvements
* feat: file parsing jedi script optimisation
---------
* Add type to DataPoint metadata (#364)
* Add missing index_fields
* Use DataPoint UUID type in pgvector create_data_points
* Make _metadata mandatory everywhere
* feat: Add search by dataset for cognee
Added ability to search by datasets for cognee users
Feature COG-912
* feat: outsources chunking parameters to extract chunk from documents … (#289)
* feat: outsources chunking parameters to extract chunk from documents task
* fix: Remove backend lock from UI
Removed lock that prevented using multiple datasets in cognify
Fix COG-912
* COG 870 Remove duplicate edges from the code graph (#293)
* feat: turn summarize_code into generator
* feat: extract run_code_graph_pipeline, update the pipeline
* feat: minimal code graph example
* refactor: update argument
* refactor: move run_code_graph_pipeline to cognify/code_graph_pipeline
* refactor: indentation and whitespace nits
* refactor: add deprecated use comments and warnings
---------
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
* test: Added test for getting documents for search
Added test to verify getting documents related to datasets intended for search
Test COG-912
* Structured code summarization (#375)
* feat: turn summarize_code into generator
* feat: extract run_code_graph_pipeline, update the pipeline
* feat: minimal code graph example
* refactor: update argument
* refactor: move run_code_graph_pipeline to cognify/code_graph_pipeline
* refactor: indentation and whitespace nits
* refactor: add deprecated use comments and warnings
* Structured code summarization
* add missing prompt file
* Remove summarization_model argument from summarize_code and fix typehinting
* minor refactors
---------
Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
* fix: Resolve issue with cognify router graph model default value
Resolve issue with default value for graph model in cognify endpoint
Fix
* chore: Resolve typo in getting documents code
Resolve typo in code
chore COG-912
* Update .github/workflows/dockerhub.yml
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
* Update .github/workflows/dockerhub.yml
* Update .github/workflows/dockerhub.yml
* Update .github/workflows/dockerhub.yml
* Update get_cognify_router.py
* fix: Resolve syntax issue with cognify router
Resolve syntax issue with cognify router
Fix
* feat: Add ruff pre-commit hook for linting and formatting
Added formatting and linting on pre-commit hook
Feature COG-650
* chore: Update ruff lint options in pyproject file
Update ruff lint options in pyproject file
Chore
* test: Add ruff linter github action
Added linting check with ruff in github actions
Test COG-650
* feat: deletes executor limit from get_repo_file_dependencies
* feat: implements mock feature in LiteLLM engine
* refactor: Remove changes to cognify router
Remove changes to cognify router
Refactor COG-650
* fix: fixing boolean env for github actions
* test: Add test for ruff format for cognee code
Test if code is formatted for cognee
Test COG-650
* refactor: Rename ruff gh actions
Rename ruff gh actions to be more understandable
Refactor COG-650
* chore: Remove checking of ruff lint and format on push
Remove checking of ruff lint and format on push
Chore COG-650
* feat: Add deletion of local files when deleting data
Delete local files when deleting data from cognee
Feature COG-475
* fix: changes back the max workers to 12
* feat: Adds mock summary for codegraph pipeline
* refactor: Add current development status
Save current development status
Refactor
* Fix langfuse
* Fix langfuse
* Fix langfuse
* Add evaluation notebook
* Rename eval notebook
* chore: Add temporary state of development
Add temp development state to branch
Chore
* fix: Add poetry.lock file, make langfuse mandatory
Added langfuse as mandatory dependency, added poetry.lock file
Fix
* Fix: fixes langfuse config settings
* feat: Add deletion of local files made by cognee through data endpoint
Delete local files made by cognee when deleting data from database through endpoint
Feature COG-475
* test: Revert changes on test_pgvector
Revert changes on test_pgvector which were made to test deletion of local files
Test COG-475
* chore: deletes the old test for the codegraph pipeline
* test: Add test to verify deletion of local files
Added test that checks local files created by cognee will be deleted and those not created by cognee won't
Test COG-475
* chore: deletes unused old version of the codegraph
* chore: deletes unused imports from code_graph_pipeline
* Ingest non-code files
* Fixing review findings
* Ingest non-code files (#395)
* Ingest non-code files
* Fixing review findings
* test: Update test regarding message
Update assertion message, add verification of file existence
* Handle retryerrors in code summary (#396)
* Handle retryerrors in code summary
* Log instead of print
* fix: updates the acreate_structured_output
* chore: Add logging to sentry when file which should exist can't be found
Log to sentry that a file which should exist can't be found
Chore COG-475
* Fix diagram
* fix: refactor mcp
* Add Smithery CLI installation instructions and badge
* Move readme
* Update README.md
* Update README.md
* Cog 813 source code chunks (#383)
* fix: pass the list of all CodeFiles to enrichment task
* feat: introduce SourceCodeChunk, update metadata
* feat: get_source_code_chunks code graph pipeline task
* feat: integrate get_source_code_chunks task, comment out summarize_code
* Fix code summarization (#387)
* feat: update data models
* feat: naive parse long strings in source code
* fix: get_non_py_files instead of get_non_code_files
* fix: limit recursion, add comment
* handle embedding empty input error (#398)
* feat: robustly handle CodeFile source code
* refactor: sort imports
* todo: add support for other embedding models
* feat: add custom logger
* feat: add robustness to get_source_code_chunks
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
* feat: improve embedding exceptions
* refactor: format indents, rename module
---------
Co-authored-by: alekszievr <44192193+alekszievr@users.noreply.github.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
* Fix diagram
* Fix diagram
* Fix instructions
* Fix instructions
* adding and fixing files
* Update README.md
* ruff format
* Fix linter issues
* Fix linter issues
* Fix linter issues
* Fix linter issues
* Fix linter issues
* Fix linter issues
* Fix linter issues
* Fix linter issues
* Fix linter issues
* Fix linter issues
* Implement PR review
* Comment out profiling
* Comment out profiling
* Comment out profiling
* fix: add allowed extensions
* fix: adhere UnstructuredDocument.read() to Document
* feat: time code graph run and add mock support
* Fix ollama, work on visualization
* fix: Fixes faulty logging format and sets up error logging in dynamic steps example
* Overcome ContextWindowExceededError by checking token count while chunking (#413)
* fix: Fixes duplicated edges in cognify by limiting the recursion depth in add datapoints
* Adjust AudioDocument and handle None token limit
* Handle azure models as well
* Fix visualization
* Fix visualization
* Fix visualization
* Add clean logging to code graph example
* Remove setting envvars from arg
* fix: fixes create_cognee_style_network_with_logo unit test
* fix: removes a print statement left in by accident
* Fix visualization
* Fix visualization
* Fix visualization
* Get embedding engine instead of passing it. Get it from vector engine instead of direct getter.
* Fix visualization
* Fix visualization
* Fix poetry issues
* Get embedding engine instead of passing it in code chunking.
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* chore: Update version of poetry install action
* chore: Update action to trigger on pull request for any branch
* chore: Remove if in github action to allow triggering on push
* chore: Remove if condition to allow gh actions to trigger on push to PR
* chore: Update poetry version in github actions
* chore: Set fixed ubuntu version to 22.04
* chore: Update py lint to use ubuntu 22.04
* chore: update ubuntu version to 22.04
* feat: implements the first version of graph based completion in search
* chore: Update python 3.9 gh action to use 3.12 instead
* chore: Update formatting of utils.py
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Fix poetry issues
* Adjust integration tests
* fix: Fixes ruff formatting
* Handle circular import
* fix: Resolve profiler issue with partial and recursive logger imports
Resolve issue for profiler with partial and recursive logger imports
* fix: Remove logger from __init__.py file
* test: Test profiling on HEAD branch
* test: Return profiler to base branch
* Set max_tokens in config
* Adjust SWE-bench script to code graph pipeline call
* Adjust SWE-bench script to code graph pipeline call
* fix: Add fix for accessing dictionary elements that don't exist
Use get for the text key instead of direct access, to handle the case where the text key doesn't exist
* feat: Add ability to change graph database configuration through cognee
* feat: adds pydantic types to graph layer models
* test: Test ubuntu 24.04
* test: change all actions to ubuntu-latest
* feat: adds basic retriever for swe bench
* Match Ruff version in config to the one in github actions
* feat: implements code retriever
* Fix: fixes unit test for codepart search
* Format with Ruff 0.9.0
* Fix: deleting incorrect repo path
* docs: Add LlamaIndex Cognee integration notebook
Added LlamaIndex Cognee integration notebook
* test: Add github action for testing llama index cognee integration notebook
* fix: resolve issue with langfuse dependency installation when integrating cognee in different packages
* version: Increase version to 0.1.21
* fix: update dependencies of the mcp server
* Update README.md
* Fix: Fixes logging setup
* feat: deletes on-the-fly embeddings and uses edge collections
* fix: Change nbformat on llama index integration notebook
* fix: Resolve api key issue with llama index integration notebook
* fix: Attempt to resolve issue with Ubuntu 24.04 segmentation fault
* version: Increase version to 0.1.22
---------
Co-authored-by: vasilije <vas.markovic@gmail.com>
Co-authored-by: Igor Ilic <igorilic03@gmail.com>
Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
Co-authored-by: alekszievr <44192193+alekszievr@users.noreply.github.com>
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Co-authored-by: Rita Aleksziev <alekszievr@gmail.com>
Co-authored-by: Henry Mao <1828968+calclavia@users.noreply.github.com>
138 lines
4.1 KiB
TOML
[tool.poetry]
name = "cognee"
version = "0.1.22"
description = "Cognee - is a library for enriching LLM context with a semantic layer for better understanding and reasoning."
authors = ["Vasilije Markovic", "Boris Arzentar"]
readme = "README.md"
license = "Apache-2.0"
homepage = "https://www.cognee.ai"
repository = "https://github.com/topoteretes/cognee"
classifiers = [
    "Development Status :: 4 - Beta",
    "Intended Audience :: Developers",
    "License :: OSI Approved :: Apache Software License",
    "Topic :: Software Development :: Libraries",
    "Operating System :: MacOS :: MacOS X",
    "Operating System :: POSIX :: Linux",
    "Operating System :: Microsoft :: Windows"
]

[tool.poetry.dependencies]
python = ">=3.10.0,<3.13"
openai = "1.59.4"
pydantic = "2.8.2"
python-dotenv = "1.0.1"
fastapi = ">=0.109.2,<0.116.0"
uvicorn = "0.22.0"
requests = "2.32.3"
aiohttp = "3.10.10"
typing_extensions = "4.12.2"
nest_asyncio = "1.6.0"
numpy = "1.26.4"
datasets = "3.1.0"
falkordb = {version = "1.0.9", optional = true}
boto3 = "^1.26.125"
botocore = "^1.35.54"
gunicorn = "^20.1.0"
sqlalchemy = "2.0.36"
instructor = "1.7.2"
networkx = "^3.2.1"
aiosqlite = "^0.20.0"
pandas = "2.2.3"
filetype = "^1.2.0"
nltk = "^3.8.1"
dlt = {extras = ["sqlalchemy"], version = "^1.4.1"}
aiofiles = "^23.2.1"
qdrant-client = {version = "^1.9.0", optional = true}
graphistry = "^0.33.5"
tenacity = "^9.0.0"
weaviate-client = {version = "4.9.6", optional = true}
scikit-learn = "^1.5.0"
pypdf = "^4.1.0"
neo4j = {version = "^5.20.0", optional = true}
jinja2 = "^3.1.3"
matplotlib = "^3.8.3"
tiktoken = "0.7.0"
langchain_text_splitters = {version = "0.3.2", optional = true}
langsmith = {version = "0.2.3", optional = true}
langdetect = "1.0.9"
posthog = {version = "^3.5.0", optional = true}
lancedb = "0.16.0"
litellm = "1.57.2"
groq = {version = "0.8.0", optional = true}
langfuse = "^2.32.0"
pydantic-settings = "^2.2.1"
anthropic = "^0.26.1"
sentry-sdk = {extras = ["fastapi"], version = "^2.9.0"}
fastapi-users = {version = "*", extras = ["sqlalchemy"]}
alembic = "^1.13.3"
asyncpg = {version = "0.30.0", optional = true}
pgvector = {version = "^0.3.5", optional = true}
psycopg2 = {version = "^2.9.10", optional = true}
llama-index-core = {version = "^0.12.10.post1", optional = true}
deepeval = {version = "^2.0.1", optional = true}
transformers = "^4.46.3"
pymilvus = {version = "^2.5.0", optional = true}
unstructured = { extras = ["csv", "doc", "docx", "epub", "md", "odt", "org", "ppt", "pptx", "rst", "rtf", "tsv", "xlsx"], version = "^0.16.10", optional = true }
pre-commit = "^4.0.1"
httpx = "0.27.0"
bokeh = "^3.6.2"


[tool.poetry.extras]
filesystem = ["s3fs", "botocore"]
weaviate = ["weaviate-client"]
qdrant = ["qdrant-client"]
neo4j = ["neo4j"]
postgres = ["psycopg2", "pgvector", "asyncpg"]
notebook = ["notebook", "ipykernel", "overrides", "ipywidgets", "jupyterlab", "jupyterlab_widgets", "jupyterlab-server", "jupyterlab-git"]
langchain = ["langsmith", "langchain_text_splitters"]
llama-index = ["llama-index-core"]
deepeval = ["deepeval"]
posthog = ["posthog"]
falkordb = ["falkordb"]
groq = ["groq"]
milvus = ["pymilvus"]
docs = ["unstructured"]

[tool.poetry.group.dev.dependencies]
pytest = "^7.4.0"
pytest-asyncio = "^0.21.1"
coverage = "^7.3.2"
mypy = "^1.7.1"
notebook = {version = "^7.1.0", optional = true}
deptry = "^0.20.0"
debugpy = "1.8.9"
pylint = "^3.0.3"
ruff = ">=0.2.2,<0.9.0"
tweepy = "4.14.0"
gitpython = "^3.1.43"
pylance = "0.19.2"

[tool.poetry.group.docs.dependencies]
mkdocs-material = "^9.5.42"
mkdocs-minify-plugin = "^0.8.0"
mkdocstrings = {extras = ["python"], version = "^0.26.2"}

[tool.ruff] # https://beta.ruff.rs/docs/
line-length = 100
exclude = [
    "migrations/", # Ignore migrations directory
    "notebooks/", # Ignore notebook files
    "build/", # Ignore build directory
    "cognee/pipelines.py",
    "cognee/modules/users/models/Group.py",
    "cognee/modules/users/models/ACL.py",
    "cognee/modules/pipelines/models/Task.py",
    "cognee/modules/data/models/Dataset.py"
]

[tool.ruff.lint]
ignore = ["F401"]

[build-system]
requires = ["poetry-core"]
build-backend = "poetry.core.masonry.api"