# Datasets

> Project-level containers for organization, permissions, and processing

## What is a dataset in Cognee?

A dataset is a named container that groups documents and their metadata. It is the main boundary for:

* Organizing content
* Running pipelines
* Applying permissions

**Dataset isolation** requires specific configuration. See [permissions system](../permissions-system/datasets#dataset-isolation) for details on access control requirements and supported database setups.

Datasets shape each of the main operations:

* **[Add](../main-operations/add)**:
  * Directs new content into a specific dataset (by name or ID)
  * If the dataset doesn’t exist, Cognee creates it and associates your permissions with it
  * Ingested items are linked to that dataset and deduplicated within it
* **[Cognify](../main-operations/cognify)**:
  * You choose which dataset(s) to transform into a knowledge graph
  * Cognify loads each dataset’s content, checks your rights, and runs the pipeline per dataset
  * If no datasets are specified, it processes all datasets you’re authorized to use
  * Progress is tracked per dataset for reliable re-runs
* **[Search](../main-operations/search)**:
  * Queries can be scoped by dataset
  * Results and metrics remain separated by dataset

## Access control

* Permissions (read, write, share, delete) are enforced at the dataset level
* You can share one dataset with a team and keep another private
* You can independently manage who can modify or distribute content

## Incremental processing

* Processing status is tracked per dataset
* After you add more data, Cognify focuses on new or changed items
* Items already completed for that dataset are skipped

## Datasets vs NodeSets

**Datasets** scope storage, permissions, and pipeline execution; **[NodeSets](../further-concepts/node-sets)** are semantic tags within a dataset.

* During Add, you can label items with one or more NodeSet names (e.g., "AI", "FinTech")
* Cognify propagates those labels into the graph by creating `NodeSet` nodes and linking derived chunks and entities via `belongs_to_set` relationships
* This lets you slice a single dataset’s graph by topic or team without creating new datasets, while dataset-level permissions still control overall access
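
The dataset behaviours described on this page (creation on first add, per-dataset deduplication, permission checks, incremental processing, and NodeSet labels) can be illustrated with a small in-memory toy model. This is a conceptual sketch only: `ToyStore`, `ToyDataset`, and their methods are hypothetical names invented for illustration and do not reflect Cognee's actual API or internals. Deduplicating by content hash and the specific permission each step requires are assumptions as well.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class ToyDataset:
    """Hypothetical stand-in for a dataset; not Cognee's data model."""
    name: str
    owner: str
    permissions: dict = field(default_factory=dict)  # user -> set of rights
    items: dict = field(default_factory=dict)        # content hash -> (text, labels)
    processed: set = field(default_factory=set)      # hashes handled by earlier runs


class ToyStore:
    def __init__(self):
        self.datasets = {}

    def _require(self, user, ds, right):
        # Permissions are checked per dataset, never per item.
        if right not in ds.permissions.get(user, set()):
            raise PermissionError(f"{user} lacks '{right}' on {ds.name}")

    def add(self, user, text, dataset_name, node_set=()):
        ds = self.datasets.get(dataset_name)
        if ds is None:
            # First add creates the dataset and grants the creator all rights.
            ds = ToyDataset(dataset_name, owner=user)
            ds.permissions[user] = {"read", "write", "share", "delete"}
            self.datasets[dataset_name] = ds
        self._require(user, ds, "write")
        # Assumption: duplicates are detected by content hash, within this dataset only.
        key = hashlib.sha256(text.encode("utf-8")).hexdigest()
        if key in ds.items:
            return False
        ds.items[key] = (text, tuple(node_set))
        return True

    def cognify(self, user, dataset_names=None):
        # With no names given, process every dataset the user is authorized for.
        # Assumption: this toy treats "read" as the right that cognify needs.
        if dataset_names is None:
            dataset_names = [n for n, d in self.datasets.items()
                             if "read" in d.permissions.get(user, set())]
        report = {}
        for name in dataset_names:
            ds = self.datasets[name]
            self._require(user, ds, "read")
            # Incremental step: only items not seen by a previous run are pending.
            pending = [k for k in ds.items if k not in ds.processed]
            ds.processed.update(pending)
            report[name] = len(pending)
        return report

    def texts_in_node_set(self, user, dataset_name, label):
        # Slice one dataset by a NodeSet-style label; dataset rights still apply.
        ds = self.datasets[dataset_name]
        self._require(user, ds, "read")
        return [text for text, labels in ds.items.values() if label in labels]


store = ToyStore()
store.add("alice", "Graph databases store relationships.", "research", node_set=["AI"])
store.add("alice", "Graph databases store relationships.", "research")  # duplicate, skipped
store.add("alice", "Payment APIs need idempotency keys.", "research", node_set=["FinTech"])
first = store.cognify("alice")                  # processes both stored items
store.add("alice", "Embeddings map text to vectors.", "research", node_set=["AI"])
second = store.cognify("alice", ["research"])   # processes only the new item
ai_texts = store.texts_in_node_set("alice", "research", "AI")
```

In this toy, the second `cognify` call reports a single pending item because the two earlier items were already marked as processed for that dataset. A user without rights on `research` would get a `PermissionError` from any of the three methods.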