Commit graph

4555 commits

Author SHA1 Message Date
shehab-badawy
9c87a10848 feat: Add delete preview for --dataset-name and --all flags
This commit introduces the preview functionality for the  command. The preview displays a summary of what will be deleted before asking for user confirmation.

The feature is fully functional for the following flags:
-  / : Correctly counts the number of data entries within the specified dataset.
- : Correctly counts the total number of datasets, data entries, and users in the system.

The logic for the  flag is a work in progress. The current implementation uses a placeholder and needs a method to query a user directly by their ID to be completed.
2025-10-02 01:44:11 -04:00
Geoff-Robin
4979f43fc0 Added playwright as a dependency 2025-10-02 02:21:33 +05:30
Geoff-Robin
c283977035 switched httpx AsyncClient to fetch webpage 2025-10-02 02:01:46 +05:30
Geoff-Robin
60499c439c Added logging 2025-10-02 01:54:56 +05:30
Geoff-Robin
925bd38195 Setup models.py and utils.py 2025-10-02 01:32:00 +05:30
Geoff-Robin
70a2cc9d65 removed scrapy and added bs4 2025-10-02 01:28:48 +05:30
Andrej Milicevic
6f0756f312 test: Rollback deduplication test 2025-10-01 18:10:57 +02:00
Igor Ilic
95fdbab406
refactor: Remove macos13 from ci/cd and support (#1489)
<!-- .github/pull_request_template.md -->

## Description
Remove MacOS13 support and CI/CD tests

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [x] Other (please specify): Remove MacOS13 support

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-01 18:01:04 +02:00
Andrej Milicevic
5b46f86be5 test: Removed long text string about qunatum computers from tests. Used a file instead. 2025-10-01 17:59:53 +02:00
Daulet Amirkhanov
0bf3490d63 chore: update cognee-cli to use MCP Docker image from main. Bring back deprecation warnings 2025-10-01 16:16:06 +01:00
Aniruddha Mandal
4412495d67 chore: update dependency specifications in pyproject.toml
- Changed "mistralai==1.9.10" to "mistralai>=1.9.10" for more flexible versioning.
- Removed "mistralai" from the optional dependencies under "mistral".
- Expanded the "docs" dependency to include "pdf" support.
2025-10-01 00:33:05 +05:30
Aniruddha Mandal
fedb945365 chore: remove uv.lock file
- Deleted the uv.lock file to streamline dependency management.
- This change may require regeneration of the lock file in future dependency updates.
2025-10-01 00:26:59 +05:30
Aniruddha Mandal
4e96e04405 chore: update dependencies in pyproject.toml and uv.lock
- Added "mistralai==1.9.10" to the dependencies in pyproject.toml.
- Updated sdist entries in uv.lock to remove unnecessary upload-time fields for various packages.
- Ensured consistency in package specifications across the project files.
2025-10-01 00:22:28 +05:30
Igor Ilic
3dba072c49
fix: resolve formatting issue (#1486)
<!-- .github/pull_request_template.md -->

## Description
ruff formatting

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-09-30 18:12:57 +02:00
Igor Ilic
7ab000d891 refactor: Add test for updating of docs and visualization 2025-09-30 18:12:22 +02:00
Vasilije
2ee5a3ca7a
feat: Enhance PDF parsing (#1445)
<!-- .github/pull_request_template.md -->

## Description
<!-- 
Please provide a clear, human-generated description of the changes in
this PR.
DO NOT use AI-generated descriptions. We want to understand your thought
process and reasoning.
-->
I've just added a new PDF parser, AdvancedPdfLoader. It uses the
unstructured library and does a much better job of handling PDFs,
especially with its layout-aware parsing, table preservation, and image
handling.

I also built in a safeguard: if unstructured isn't installed or throws
an error, it'll automatically fall back to the old PyPdfLoader so it
won't just crash. All the related unit tests and project dependencies
are taken care of, too.

https://github.com/topoteretes/cognee/issues/1342

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [x] Performance improvement
- [ ] Other (please specify):

## Changes Made
<!-- List the specific changes made in this PR -->
- Added AdvancedPdfLoader class for enhanced PDF processing using the
unstructured library.
- Integrated fallback mechanism to PyPdfLoader in case of unstructured
library import failure or exceptions.
- Updated supported loaders to include AdvancedPdfLoader.
- Added unit tests for AdvancedPdfLoader to ensure functionality and
error handling.
- Updated poetry.lock and pyproject.toml to include new dependencies and
versions.

## Testing
<!-- Describe how you tested your changes -->
pytest -v ./cognee/tests/test_advanced_pdf_loader.py
## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## Related Issues
<!-- Link any related issues using "Fixes #issue_number" or "Relates to
#issue_number" -->

## Additional Notes
<!-- Add any additional notes, concerns, or context for reviewers -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-09-30 17:46:53 +02:00
EricXiao
d868912df5
Merge branch 'dev' into feat/add-pdfproloader 2025-09-30 23:24:14 +08:00
Andrej Milicevic
45f00b022f test: Renamed s3 test. Commented out docling test. Fails until docling resolves their issue. 2025-09-30 17:22:43 +02:00
Geoff-Robin
6348c9d8de Created models.py 2025-09-30 20:46:26 +05:30
Geoff-Robin
510926f56c included scraping dependencies 2025-09-30 20:39:04 +05:30
Vasilije
24abceffa9
fix: Resolve issue with processing for gpt4 series models (#1485)
<!-- .github/pull_request_template.md -->

## Description
Use JSON_SCHEMA mode only for GPT-5 series models, it's a new mode from
OpenAI for structured output processing that they didn't properly handle
for older models

## Type of Change
<!-- Please check the relevant option -->
- [x] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-09-30 14:14:46 +02:00
Igor Ilic
f88289c425 fix: Resolve issue with processing for gpt4 series models 2025-09-30 14:05:12 +02:00
Andrej Milicevic
c8e6c1024b chore: Fix formatting. 2025-09-30 12:05:43 +02:00
Andrej Milicevic
e74ee55137 test: Add test to CI 2025-09-30 12:04:41 +02:00
Andrej Milicevic
0b5b0e5544 fix: PR comment changes 2025-09-30 11:40:42 +02:00
EricXiao
4938ad9fe9 Merge branch 'dev' into feat/add-pdfproloader
Signed-off-by: EricXiao <taoiaox@gmail.com>
2025-09-30 17:08:28 +08:00
Igor Ilic
74bc7c9420 refactor: set node_set to None for endpoint 2025-09-29 21:22:21 +02:00
Vasilije
7850c56ca8
Feature/windows compatibility fixes (#1464)
# **Pull Request: Windows Compatibility and Error Handling
Improvements**

## Description
This PR addresses multiple Windows compatibility issues and improves
error handling across the cognee CLI and frontend, making the
application fully functional on Windows systems. The changes include:

- **Windows Process Termination**: Fixed crashes when terminating
spawned processes by using `taskkill` instead of Unix-specific `killpg`
functions
- **npm Detection**: Resolved npm command failures on Windows by adding
`shell=True` for PowerShell script execution
- **Frontend SSR**: Fixed "window is not defined" errors by implementing
dynamic imports for the graph visualization component
- **Cloud API**: Improved error handling for local installations by
returning graceful responses instead of raising exceptions
- **Connection Retry**: Added retry mechanism for frontend health checks
with better error messages
- **String Formatting**: Fixed mixed f-string formatting that caused
placeholder issues
- **CLI Entry Point**: Added `cognee` command alongside `cognee-cli` for
better user experience

These changes ensure cognee works seamlessly on Windows while
maintaining backward compatibility and improving overall robustness.

## Type of Change
- [x] Bug fix (non-breaking change that fixes an issue)
- [x] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---

## 🔧 **Technical Details**

### **Files Modified:**
-  **Windows Process Termination**: `cognee/cli/_cognee.py`,
`cognee/tests/test_cognee_server_start.py`
-  **npm Detection**: `cognee/api/v1/ui/ui.py`
-  **Frontend SSR**:
`cognee-frontend/src/app/(graph)/GraphVisualization.tsx`
-  **Cloud API**: `cognee/api/v1/cloud/routers/get_checks_router.py`
-  **Connection Retry**: `cognee-frontend/src/utils/fetch.ts`
-  **String Formatting**:
`cognee/infrastructure/llm/prompts/read_query_prompt.py`
-  **CLI Entry Point**: `pyproject.toml`

### **Key Changes:**
1. **Process Termination**: Added Windows-compatible `taskkill` commands
alongside Unix `killpg`
2. **npm Commands**: Added `shell=True` for Windows PowerShell script
execution
3. **Dynamic Imports**: Implemented `ssr: false` for graph visualization
component
4. **Graceful Errors**: Return JSON responses instead of raising
exceptions for local mode
5. **Retry Logic**: Added 5-retry mechanism with 1-second delays for
health checks
6. **String Formatting**: Fixed mixed f-string and old-style formatting
issues
7. **CLI Commands**: Added `cognee` entry point alongside `cognee-cli`

### **Testing Results:**
-  All existing tests pass
-  Windows process termination works correctly
-  npm detection and commands work on Windows
-  Frontend loads without SSR errors
-  Cloud API returns graceful responses for local mode
-  Frontend connection retries work properly
-  Code formatting and linting checks pass

This PR makes cognee fully functional on Windows while improving error
handling and user experience across all platforms.
2025-09-29 20:51:27 +02:00
Vasilije
52265a67f2
Merge branch 'dev' into feature/windows-compatibility-fixes 2025-09-29 20:51:17 +02:00
Vasilije
92d183aedb
feat: Add update endpoint to Cognee [COG-3016] (#1475)
<!-- .github/pull_request_template.md -->

## Description
Add update method end endpoint for Cognee

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [x] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

<!-- CURSOR_SUMMARY -->
---

> [!NOTE]
> <sup>[Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) is
generating a summary for commit
9524109029. Configure
[here](https://cursor.com/dashboard?tab=bugbot).</sup>
<!-- /CURSOR_SUMMARY -->
2025-09-29 20:50:11 +02:00
Igor Ilic
9524109029 Merge branch 'dev' into update-endpoint 2025-09-29 20:44:44 +02:00
Igor Ilic
3c7cc597e3 Merge branch 'update-endpoint' of github.com:topoteretes/cognee into update-endpoint 2025-09-29 20:42:42 +02:00
Igor Ilic
e333a860ba refactor: Add documentation for update endpoint 2025-09-29 20:42:25 +02:00
Igor Ilic
52c978faeb
docs: Multi user authorization example (#1466)
<!-- .github/pull_request_template.md -->

## Description
Add return value of creating role and tenant, add detailed permissions
example to Cognee

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [x] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: Boris <boris@topoteretes.com>
Co-authored-by: Hande <159312713+hande-k@users.noreply.github.com>
Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
2025-09-29 20:15:50 +02:00
Vasilije
1986ccff0a
fix: Resolve issues with GPT5 models (#1483)
<!-- .github/pull_request_template.md -->

## Description
Resolve issues with gpt 5 models for structured outputs by forcing JSON
mode in instructor

## Type of Change
<!-- Please check the relevant option -->
- [x] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-09-29 19:25:25 +02:00
Andrej Milicevic
d352807a9d fix: Fix docling import so other executions don't fail 2025-09-29 17:57:58 +02:00
Igor Ilic
213bab5307
Merge branch 'dev' into fix-gpt-5-series 2025-09-29 17:57:38 +02:00
Andrej Milicevic
8ef3bf6393 feat: Add Docling as an ingestion option to cognee add. 2025-09-29 17:32:25 +02:00
Igor Ilic
6bc5c6f162 refactor: Use latest JSON tool type for structured outputs 2025-09-29 17:06:01 +02:00
Igor Ilic
bd4a605849
Merge branch 'dev' into update-endpoint 2025-09-29 14:53:02 +02:00
Igor Ilic
4c8e3b8bb3 refactor: Add docstring to update function 2025-09-29 14:40:41 +02:00
Igor Ilic
f2e216cdf7 fix: Resolve issues with GPT5 models 2025-09-29 14:11:06 +02:00
Igor Ilic
db39a43975
fix: Resolve schema migration for Neo4j (#1482)
<!-- .github/pull_request_template.md -->

## Description
Fix Neo4j issue with migrating DB schema

## Type of Change
<!-- Please check the relevant option -->
- [x] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-09-29 13:50:11 +02:00
Vasilije
18dcab3cac
Update README with cognee features and deployment info
Clarify the functionality and deployment of cognee.
2025-09-29 13:37:09 +02:00
Igor Ilic
74f7a65110
test: Use smaller files than Alice for tests. (#1474)
<!-- .github/pull_request_template.md -->

## Description
<!--
Please provide a clear, human-generated description of the changes in
this PR.
DO NOT use AI-generated descriptions. We want to understand your thought
process and reasoning.
-->
Alice in wonderland expensive to process, plus Azure OpenAI flags it as
inappropriate content. Changed this to a smaller file about quantum
computers.

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [x] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

<!-- CURSOR_SUMMARY -->
---

> [!NOTE]
> Update vector search limit-none tests to use local quantum/NLP files
and add `test_data/Quantum_computers.txt`, adjusting queries
accordingly.
> 
> - **Tests**:
> - **Vector engine limit-none coverage**
(`test_vector_engine_search_none_limit`):
> - Replace `examples/data/alice_in_wonderland.txt` with
`tests/test_data/Quantum_computers.txt` and
`tests/test_data/Natural_language_processing.txt` across
`cognee/tests/test_chromadb.py`, `cognee/tests/test_lancedb.py`, and
`cognee/tests/test_pgvector.py`.
> - Update query to "Tell me about Quantum computers" and keep assertion
verifying no implicit `limit` (ensure `len(result) > 15`).
>   - **Test data**:
>     - Add `cognee/tests/test_data/Quantum_computers.txt`.
> 
> <sup>Written by [Cursor
Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit
af1603c8f9. This will update automatically
on new commits. Configure
[here](https://cursor.com/dashboard?tab=bugbot).</sup>
<!-- /CURSOR_SUMMARY -->
2025-09-29 12:21:16 +02:00
Igor Ilic
240925c5d4
Merge branch 'dev' into feature/cog-2837-rework-limit0-for-vector-adapters 2025-09-29 10:41:50 +02:00
Igor Ilic
e3c2c776d0
Chore: Merge main to dev branch (#1478)
<!-- .github/pull_request_template.md -->

## Description
Merge main to dev branch

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [ ] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

<!-- CURSOR_SUMMARY -->
---

> [!NOTE]
> Standardizes vector search to accept `limit=None` (fetch all/count),
updates adapters/interface and retrievers, and adds tests plus minor
logging/doc assets.
> 
> - **Vector search API**:
> - Change `limit` to `Optional[int]` in `vector_db_interface` and
adapters (`ChromaDBAdapter`, `LanceDBAdapter`, `PGVectorAdapter`,
`NeptuneAnalyticsAdapter`).
> - Behavior: `limit=None` fetches all/collection count; early-return on
`limit<=0`; minor logging copy/level tweaks; `PGVector` counts via
`func.count`.
> - **Retrievers**:
> - Switch internal calls to `limit=None` (`CogneeGraph`,
`temporal_retriever`, brute-force triplet search).
> - **Tests**:
> - Add `limit=None` coverage for ChromaDB, LanceDB (new suite), and
PGVector; assert results exceed default caps.
> - **Examples/Docs**:
> - Demo script prunes before run; add tutorial notebook and data
assets; remove one markdown article.
> - **Misc**: LLM utils demote a log line to `debug`.
> 
> <sup>Written by [Cursor
Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit
7403e31738. This will update automatically
on new commits. Configure
[here](https://cursor.com/dashboard?tab=bugbot).</sup>
<!-- /CURSOR_SUMMARY -->
2025-09-29 10:41:18 +02:00
Aniruddha Mandal
3b57a3fcfe Merge branch 'dev' into feature/mistral_llm_provider 2025-09-29 13:08:08 +05:30
Vasilije
107b5af6b5
Feature/cog 2979 fix falkordb adapter (#1430)
<!-- .github/pull_request_template.md -->

## Description
<!-- 
Please provide a clear, human-generated description of the changes in
this PR.
DO NOT use AI-generated descriptions. We want to understand your thought
process and reasoning.
-->
Falkordb adapter didn't work on main repo, but we have it working on
community. Decision was to remove it from main repo, so it is removed.

## Type of Change
<!-- Please check the relevant option -->
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [x] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Changes Made
<!-- List the specific changes made in this PR -->
- 
- 
- 

## Testing
<!-- Describe how you tested your changes -->

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## Related Issues
<!-- Link any related issues using "Fixes #issue_number" or "Relates to
#issue_number" -->

## Additional Notes
<!-- Add any additional notes, concerns, or context for reviewers -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-09-28 17:02:48 +02:00
Vasilije
7403e31738
Merge branch 'dev' into merge-main-vol6 2025-09-28 15:29:23 +02:00