Doc: Updated Create dataset descriptions

This commit is contained in:
writinwaters 2025-12-04 16:37:41 +08:00
parent e7022db9a4
commit bc57b08453

View file

@ -512,15 +512,18 @@ curl --request POST \
- Maximum: `2048` - Maximum: `2048`
- `"delimiter"`: `string` - `"delimiter"`: `string`
- Defaults to `"\n"`. - Defaults to `"\n"`.
- `"html4excel"`: `bool` Indicates whether to convert Excel documents into HTML format. - `"html4excel"`: `bool`
- Whether to convert Excel documents into HTML format.
- Defaults to `false` - Defaults to `false`
- `"layout_recognize"`: `string` - `"layout_recognize"`: `string`
- Defaults to `DeepDOC` - Defaults to `DeepDOC`
- `"tag_kb_ids"`: `array<string>` refer to [Use tag set](https://ragflow.io/docs/dev/use_tag_sets) - `"tag_kb_ids"`: `array<string>`
- Must include a list of dataset IDs, where each dataset is parsed using the Tag Chunking Method - IDs of datasets to be parsed using the Tag chunk method.
- `"task_page_size"`: `int` For PDF only. - Before setting this, ensure a tag set is created and properly configured. For details, see [Use tag set](https://ragflow.io/docs/dev/use_tag_sets).
- `"task_page_size"`: `int`
- For PDFs only.
- Defaults to `12` - Defaults to `12`
- Minimum: `1` - Minimum: `1
- `"raptor"`: `object` RAPTOR-specific settings. - `"raptor"`: `object` RAPTOR-specific settings.
- Defaults to: `{"use_raptor": false}` - Defaults to: `{"use_raptor": false}`
- `"graphrag"`: `object` GRAPHRAG-specific settings. - `"graphrag"`: `object` GRAPHRAG-specific settings.