cognee/cognee/modules/data
alekszievr a61df966c6
feat: use external chunker [cog-1354] (#551)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Introduced a modular content chunking interface that offers flexible
text segmentation with configurable chunk size and overlap.
- Added new chunkers for enhanced text processing, including
`LangchainChunker` and improved `TextChunker`.

- **Refactor**
- Unified the chunk extraction mechanism across various document types
for improved consistency and type safety.
- Updated method signatures to enhance clarity and type safety regarding
chunker usage.
- Enhanced error handling and logging during text segmentation to guide
adjustments when content exceeds limits.

- **Bug Fixes**
- Adjusted expected output in tests to reflect changes in chunking logic
and configurations.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-21 14:10:59 +01:00
..
deletion ruff format 2025-01-05 19:09:08 +01:00
exceptions ruff format 2025-01-05 19:09:08 +01:00
extraction Fix linter issues 2025-01-05 19:48:35 +01:00
methods feat: Calculate graph metrics for networkx graph [COG-1082] (#484) 2025-02-03 18:05:53 +01:00
models feat: Cognee evaluation framework development (#498) 2025-02-11 16:31:54 +01:00
operations fix: Add reraising of general exception handling in cognee [COG-1062] (#490) 2025-02-04 10:51:05 +01:00
processing feat: use external chunker [cog-1354] (#551) 2025-02-21 14:10:59 +01:00
__init__.py Cog 174 (#84) 2024-04-26 00:16:03 +02:00