No description
Find a file
Vasilije 2f2a4487f0
feat: csv ingestion & chunking (#1574)
<!-- .github/pull_request_template.md -->

## Description
<!--
Please provide a clear, human-generated description of the changes in
this PR.
DO NOT use AI-generated descriptions. We want to understand your thought
process and reasoning.
-->
Create a dedicated CSV ingestion path with a custom loader and custom
chunker that preserves row-column relationships in the produced chunks.
#1348

## Type of Change
<!-- Please check the relevant option -->
- [x] Bug fix (non-breaking change that fixes an issue)
- [x] New feature (non-breaking change that adds functionality)
- [x] Breaking change (fix or feature that would cause existing
functionality to change)
- [x] Documentation update
- [x] Code refactoring
- [x] Performance improvement
- [x] Other (please specify):

## Screenshots/Videos (if applicable)
<!-- Add screenshots or videos to help explain your changes -->

## Pre-submission Checklist
<!-- Please check all boxes that apply before submitting your PR -->
- [x] **I have tested my changes thoroughly before submitting this PR**
- [x] **This PR contains minimal changes necessary to address the
issue/feature**
- [x] My code follows the project's coding standards and style
guidelines
- [x] I have added tests that prove my fix is effective or that my
feature works
- [x] I have added necessary documentation (if applicable)
- [x] All new and existing tests pass
- [x] I have searched existing PRs to ensure this change hasn't been
submitted already
- [x] I have linked any relevant issues in the description
- [x] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-11-22 14:48:27 -08:00
.github Chore: Acceptance Criteria for PRs (#1781) 2025-11-12 15:40:33 +01:00
alembic feat: enable multi user for falkor (#1689) 2025-11-11 17:03:48 +01:00
assets
bin
cognee feat: csv ingestion & chunking (#1574) 2025-11-22 14:48:27 -08:00
cognee-frontend COG-3050 - remove insights search (#1506) 2025-10-11 09:09:56 +02:00
cognee-mcp Fix: MCP remove cognee.add() preprequisite from the doc 2025-11-13 17:35:16 +01:00
cognee-starter-kit
deployment
distributed chore: deletes toml and lock files from distributed directory 2025-10-14 09:55:02 +02:00
evals Deprecate SearchType.INSIGHTS, replace all references to default search type - SearchType.GRAPH_COMPLETION 2025-10-08 12:13:59 +01:00
examples Merge branch 'dev' into multi-tenancy 2025-11-06 18:55:18 +01:00
licenses
logs refactor: Return logs folder 2025-10-29 16:31:42 +01:00
notebooks rerun and update notebooks with latest cognee 2025-10-22 19:05:01 +01:00
tools
working_dir_error_replication feat: Redis lock integration and Kuzu agentic access fix (#1504) 2025-10-16 15:48:20 +02:00
.dockerignore
.env.template fix: PR comment changes 2025-11-21 16:20:19 +01:00
.gitattributes
.gitguardian.yml
.gitignore
.pre-commit-config.yaml
.pylintrc
AGENTS.md Add repository guidelines to AGENTS.md 2025-10-26 11:18:17 +01:00
alembic.ini
CODE_OF_CONDUCT.md
CONTRIBUTING.md
CONTRIBUTORS.md
DCO.md
docker-compose.yml added logs 2025-10-25 10:26:46 +02:00
Dockerfile
entrypoint.sh added logs 2025-10-25 10:26:46 +02:00
LICENSE
mypy.ini
NOTICE.md
poetry.lock feat: fs-cache (#1645) 2025-11-12 15:34:30 +01:00
pyproject.toml Fix: Remove cognee script from pyproject.toml 2025-11-13 13:35:07 +01:00
README.md Correct typo in installation section of README 2025-10-25 13:25:16 +02:00
SECURITY.md
uv.lock feat: fs-cache (#1645) 2025-11-12 15:34:30 +01:00

Cognee Logo

cognee - Memory for AI Agents in 6 lines of code

Demo . Learn more · Join Discord · Join r/AIMemory . Docs . cognee community repo

GitHub forks GitHub stars GitHub commits Github tag Downloads License Contributors Sponsor

cognee - Memory for AI Agents  in 5 lines of code | Product Hunt topoteretes%2Fcognee | Trendshift

Build dynamic memory for Agents and replace RAG using scalable, modular ECL (Extract, Cognify, Load) pipelines.

🌐 Available Languages : Deutsch | Español | français | 日本語 | 한국어 | Português | Русский | 中文

Why cognee?

Get Started

Get started quickly with a Google Colab notebook , Deepnote notebook or starter repo

About cognee

cognee works locally and stores your data on your device. Our hosted solution is just our deployment of OSS cognee on Modal, with the goal of making development and productionization easier.

Self-hosted package:

  • Interconnects any kind of documents: past conversations, files, images, and audio transcriptions
  • Replaces RAG systems with a memory layer based on graphs and vectors
  • Reduces developer effort and cost, while increasing quality and precision
  • Provides Pythonic data pipelines that manage data ingestion from 30+ data sources
  • Is highly customizable with custom tasks, pipelines, and a set of built-in search endpoints

Hosted platform:

Self-Hosted (Open Source)

📦 Installation

You can install Cognee using either pip, poetry, uv or any other python package manager..

Cognee supports Python 3.10 to 3.12

With uv

uv pip install cognee

Detailed instructions can be found in our docs

💻 Basic Usage

Setup

import os
os.environ["LLM_API_KEY"] = "YOUR OPENAI_API_KEY"

You can also set the variables by creating .env file, using our template. To use different LLM providers, for more info check out our documentation

Simple example

Python

This script will run the default pipeline:

import cognee
import asyncio


async def main():
    # Add text to cognee
    await cognee.add("Cognee turns documents into AI memory.")

    # Generate the knowledge graph
    await cognee.cognify()

    # Add memory algorithms to the graph
    await cognee.memify()

    # Query the knowledge graph
    results = await cognee.search("What does cognee do?")

    # Display the results
    for result in results:
        print(result)


if __name__ == '__main__':
    asyncio.run(main())

Example output:

  Cognee turns documents into AI memory.

Via CLI

Let's get the basics covered

cognee-cli add "Cognee turns documents into AI memory."

cognee-cli cognify

cognee-cli search "What does cognee do?"
cognee-cli delete --all

or run

cognee-cli -ui

Hosted Platform

Get up and running in minutes with automatic updates, analytics, and enterprise security.

  1. Sign up on cogwit
  2. Add your API key to local UI and sync your data to Cogwit

Demos

  1. Cogwit Beta demo:

Cogwit Beta

  1. Simple GraphRAG demo

Simple GraphRAG demo

  1. cognee with Ollama

cognee with local models

Contributing

Your contributions are at the core of making this a true open source project. Any contributions you make are greatly appreciated. See CONTRIBUTING.md for more information.

Code of Conduct

We are committed to making open source an enjoyable and respectful experience for our community. See CODE_OF_CONDUCT for more information.

Citation

We now have a paper you can cite:

@misc{markovic2025optimizinginterfaceknowledgegraphs,
      title={Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning},
      author={Vasilije Markovic and Lazar Obradovic and Laszlo Hajdu and Jovan Pavlovic},
      year={2025},
      eprint={2505.24478},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2505.24478},
}