Commit graph

25 commits

Author SHA1 Message Date
EricXiao
742866b4c9 feat: csv ingestion loader & chunk
Signed-off-by: EricXiao <taoiaox@gmail.com>
2025-10-22 16:56:46 +08:00
hajdul88
df3a3df117 feat: adds errors to classify, and chunking top level 2025-08-14 13:12:08 +02:00
hajdul88
c99b453d96 feat: adds WrongDataDocumentError to classify documents 2025-08-14 10:57:16 +02:00
Daniel Molnar
bb68d6a0df
Docstring tasks. (#878)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-05-27 21:33:16 +02:00
Vasilije
bb7eaa017b
feat: Group DataPoints into NodeSets (#680)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.

---------

Co-authored-by: lxobr <122801072+lxobr@users.noreply.github.com>
Co-authored-by: Boris <boris@topoteretes.com>
Co-authored-by: Boris Arzentar <borisarzentar@gmail.com>
2025-04-19 20:21:04 +02:00
Igor Ilic
40c0279ec5 Merge branch 'COG-793-metadata-rework' of github.com:topoteretes/cognee into COG-793-metadata-rework 2025-01-22 16:13:11 +01:00
Igor Ilic
80e67b0619 refactor: Rename foreign to external metadata
Rename foreign metadata to external metadata for metadata coming outside of Cognee
2025-01-22 16:07:35 +01:00
Igor Ilic
655ab0b8cc
Merge branch 'dev' into COG-793-metadata-rework 2025-01-21 18:20:49 +01:00
Igor Ilic
ab8d95cc30 refactor: As neo4j can't support dictionaries, add foreign metadata as string 2025-01-20 17:28:14 +01:00
Igor Ilic
49ad292592 refactor: Reduce complexity of metadata handling
Have foreign metadata be a table column in data instead of it's own table to reduce complexity

Refactor COG-793
2025-01-20 16:39:05 +01:00
hande-k
2c351c499d add docstrings any typing to cognee tasks 2025-01-17 10:30:34 +01:00
vasilije
60c8fd103b ruff format 2025-01-05 19:09:08 +01:00
Igor Ilic
62db3f8598 feat: Remove the need for libmagic for unstructured documents
Remove the need for libmagic so for unstructured documents by providing mime_type information

Feature COG-685
2024-12-08 14:37:50 +01:00
Igor Ilic
78214456a6 feat: Add unstructured document handler
Added unstructured library and handling of certain document types through their library

Feature COG-685
2024-12-06 17:50:22 +01:00
Boris
348610e73c
fix: refactor get_graph_from_model to return nodes and edges correctly (#257)
* fix: handle rate limit error coming from llm model

* fix: fixes lost edges and nodes in get_graph_from_model

* fix: fixes database pruning issue in pgvector (#261)

* fix: cognee_demo notebook pipeline is not saving summaries

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2024-12-06 12:52:01 +01:00
Leon Luithlen
15802237e9 Get metadata from metadata table 2024-11-28 09:18:49 +01:00
Leon Luithlen
cd0e505ac0 WIP 2024-11-28 09:18:49 +01:00
Leon Luithlen
7324564655 Add metadata_id attribute to Document and DocumentChunk, make ingest_with_metadata default 2024-11-28 09:18:49 +01:00
Boris
64b8aac86f
feat: code graph swe integration
Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
Co-authored-by: hande-k <handekafkas7@gmail.com>
Co-authored-by: Igor Ilic <igorilic03@gmail.com>
Co-authored-by: Vasilije <8619304+Vasilije1990@users.noreply.github.com>
Co-authored-by: Igor Ilic <30923996+dexters1@users.noreply.github.com>
2024-11-27 09:32:29 +01:00
Leon Luithlen
cd80525420 Revert to EXTENSION_TO_DOCUMENT_CLASS implementation of classify_documents 2024-11-13 14:32:10 +01:00
Leon Luithlen
826de0edbf Remove orphan dictionary 2024-11-12 16:47:28 +01:00
Leon Luithlen
83995fa548 Try old version of classify_documents 2024-11-12 16:47:28 +01:00
Leon Luithlen
8107709e98 Remove duplicate pdf key 2024-11-12 16:47:28 +01:00
Leon Luithlen
66fb2948f8 Small cleanup pull request 2024-11-12 15:37:03 +01:00
Boris
52180eb6b5
feat: COG-184 add falkordb (#192)
* feat: add falkordb adapter

---------

Co-authored-by: hajdul88 <52442977+hajdul88@users.noreply.github.com>
2024-11-11 18:20:52 +01:00