cognee/cognee/infrastructure
Vasilije 2f2a4487f0
feat: csv ingestion & chunking (#1574)

## Description
Create a dedicated CSV ingestion path with a custom loader and custom
chunker that preserves row-column relationships in the produced chunks.
#1348
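
A minimal sketch of what "preserving row-column relationships" can mean for a chunk, assuming each cell is rendered as `column: value` and a fixed number of rows is grouped per chunk. This is an illustration, not the loader or chunker added in this PR. The function name `chunk_csv_rows` and the `rows_per_chunk` parameter are hypothetical.

```python
import csv
import io
from typing import Iterator


def chunk_csv_rows(csv_text: str, rows_per_chunk: int = 2) -> Iterator[str]:
    """Yield text chunks in which every cell stays paired with its column header.

    Hypothetical helper for illustration only; not part of cognee.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    buffer = []
    for row_index, row in enumerate(reader, start=1):
        # Render each cell as "column: value" so the header travels with the value.
        cells = ", ".join(f"{column}: {value}" for column, value in row.items())
        buffer.append(f"row {row_index}: {cells}")
        if len(buffer) == rows_per_chunk:
            yield "\n".join(buffer)
            buffer = []
    # Emit the final, possibly shorter, chunk.
    if buffer:
        yield "\n".join(buffer)


sample = "name,age\nAda,36\nAlan,41\nGrace,45\n"
chunks = list(chunk_csv_rows(sample, rows_per_chunk=2))
```

Because each chunk carries its own column names, a chunk read in isolation still says which value belongs to which column, which a naive character-based split of the raw CSV would lose.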

## Type of Change
- [x] Bug fix (non-breaking change that fixes an issue)
- [x] New feature (non-breaking change that adds functionality)
- [x] Breaking change (fix or feature that would cause existing
functionality to change)
- [x] Documentation update
- [x] Code refactoring
- [x] Performance improvement
- [x] Other (please specify):

## Screenshots/Videos (if applicable)

## Pre-submission Checklist
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-11-22 14:48:27 -08:00

| Name | Last commit | Date |
|---|---|---|
| context | | |
| data | addressed issues | 2025-09-07 15:56:11 -07:00 |
| databases | feat: fs-cache (#1645) | 2025-11-12 15:34:30 +01:00 |
| engine | feat: optimize repeated entity extraction (#1682) | 2025-10-30 13:56:06 +01:00 |
| entities | | |
| files | chore: remove unnecessary csv file type | 2025-11-17 14:41:55 +08:00 |
| llm | feat: add instructor mode env variable and config parameter (#1789) | 2025-11-22 14:18:40 -08:00 |
| loaders | Merge branch 'dev' into feat/csv-ingestion | 2025-11-14 14:46:11 +08:00 |
| utils | Merge dev into main (#1398) | 2025-09-12 20:20:21 +02:00 |
| __init__.py | | |