Commit graph

19 commits

Author SHA1 Message Date
hajdul88
faeca138d9
fix: fixes distributed pipeline (#1454)
<!-- .github/pull_request_template.md -->

## Description
This PR fixes distributed pipeline + updates core changes in distr
logic.

## Type of Change
<!-- Please check the relevant option -->
- [x] Bug fix (non-breaking change that fixes an issue)
- [x] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [x] Code refactoring
- [x] Performance improvement
- [ ] Other (please specify):

## Changes Made
Fixes distributed pipeline:
-Changed spawning logic + adds incremental loading to
run_tasks_diistributed
-Adds batching to consumer nodes
-Fixes consumer stopping criteria by adding stop signal + handling
-Changed edge embedding solution to avoid huge network load in a case of
a multicontainer environment

## Testing
Tested it by running 1GB on modal + manually

## Screenshots/Videos (if applicable)
None

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## Related Issues
None

## Additional Notes
None

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Boris <boris@topoteretes.com>
Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-10-09 14:06:25 +02:00
Andrej Milicevic
c391be41d6 refactor: Remove falkordb adapter from main repo, since we have it on community 2025-09-17 12:09:57 +02:00
dependabot[bot]
94bf7b4c30
chore(deps): bump the pip group across 2 directories with 1 update
Updates the requirements on  and [pypdf](https://github.com/py-pdf/pypdf) to permit the latest version.

Updates `pypdf` to 6.0.0
- [Release notes](https://github.com/py-pdf/pypdf/releases)
- [Changelog](https://github.com/py-pdf/pypdf/blob/main/CHANGELOG.md)
- [Commits](https://github.com/py-pdf/pypdf/compare/4.1.0...6.0.0)

Updates `pypdf` to 6.0.0
- [Release notes](https://github.com/py-pdf/pypdf/releases)
- [Changelog](https://github.com/py-pdf/pypdf/blob/main/CHANGELOG.md)
- [Commits](https://github.com/py-pdf/pypdf/compare/4.1.0...6.0.0)

---
updated-dependencies:
- dependency-name: pypdf
  dependency-version: 6.0.0
  dependency-type: direct:production
  dependency-group: pip
- dependency-name: pypdf
  dependency-version: 6.0.0
  dependency-type: direct:production
  dependency-group: pip
...

Signed-off-by: dependabot[bot] <support@github.com>
2025-08-27 21:49:40 +00:00
Daulet Amirkhanov
b297289060
Fix/add async lock to all vector databases (#1244)
<!-- .github/pull_request_template.md -->

## Description
1. Cleans up VectorDB adapters that have been migrated to
`cognee-community` repo
2. Adds async lock protection create_collection method in remaining
VectorDB - ChromaDB

See #1222

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
2025-08-14 15:57:34 +02:00
vasilije
0ea5894823 added distributed fixes 2025-08-06 19:11:20 +02:00
Boris Arzentar
66427e725c
fix: remove obsolete files and fix unit tests 2025-07-08 22:47:09 +02:00
Boris Arzentar
a4a2742c52
fix: add retries 2025-07-08 10:26:06 +02:00
Boris Arzentar
68adf6877b
fix: handle queue error 2025-07-07 13:54:22 +02:00
Boris Arzentar
685d282f5c
fix: add error handling 2025-07-06 21:03:02 +02:00
Boris Arzentar
f8f1bb3576
fix: add queue for data points saving 2025-07-04 18:26:22 +02:00
Boris Arzentar
00dd3b8d97
fix: run cognee distributed with modal 2025-07-04 15:28:05 +02:00
Boris Arzentar
611df1e9b9
fix: unit tests and ruff lint errors 2025-07-02 11:39:10 +02:00
Boris Arzentar
86bd3e4a5a
Merge remote-tracking branch 'origin/dev' into feat/modal-parallelization 2025-07-02 11:28:22 +02:00
Boris Arzentar
dda3460715 fix: decrease batch size for chunks 2025-05-19 10:48:38 +02:00
Boris Arzentar
a1e605ca97 fix: batch datapoints on save to limit bandwidth size 2025-05-12 11:28:13 +02:00
Boris Arzentar
c46c80196e fix: lint errors 2025-04-23 13:18:04 +02:00
Boris Arzentar
bf9678065c fix: track producer features and stop the app when done 2025-04-23 13:13:29 +02:00
Boris Arzentar
9197cf9d61 feat: distributed cognee 2025-04-23 09:28:37 +02:00
Boris Arzentar
99ff4d73e6 intermidiate commit 2025-04-17 15:14:05 +02:00