doc: change to chunk_token num (#8590)
### What problem does this PR solve?

https://github.com/infiniflow/ragflow/issues/8556

### Type of change

- [x] Documentation Update
parent 8801de2772
commit 4a1680a799
2 changed files with 3 additions and 3 deletions
@@ -881,7 +881,7 @@ curl --request PUT \
 {
      "name": "manual.txt",
      "chunk_method": "manual",
-     "parser_config": {"chunk_token_count": 128}
+     "parser_config": {"chunk_token_num": 128}
 }'
 
 ```
@@ -910,7 +910,7 @@ curl --request PUT \
 - `"parser_config"`: (*Body parameter*), `object`
   The configuration settings for the dataset parser. The attributes in this JSON object vary with the selected `"chunk_method"`:
   - If `"chunk_method"` is `"naive"`, the `"parser_config"` object contains the following attributes:
-    - `"chunk_token_count"`: Defaults to `256`.
+    - `"chunk_token_num"`: Defaults to `256`.
     - `"layout_recognize"`: Defaults to `true`.
     - `"html4excel"`: Indicates whether to convert Excel documents into HTML format. Defaults to `false`.
     - `"delimiter"`: Defaults to `"\n"`.
@@ -461,7 +461,7 @@ dataset = rag_object.list_datasets(id='id')
 dataset = dataset[0]
 doc = dataset.list_documents(id="wdfxb5t547d")
 doc = doc[0]
-doc.update([{"parser_config": {"chunk_token_count": 256}}, {"chunk_method": "manual"}])
+doc.update([{"parser_config": {"chunk_token_num": 256}}, {"chunk_method": "manual"}])
 ```
 
 ---
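
The key rename documented in this change can be sketched from the caller's side as follows. This is a minimal sketch, assuming existing scripts still build `parser_config` dictionaries with the old `chunk_token_count` key. `normalize_parser_config` is a hypothetical helper introduced here for illustration; it is not part of the RAGFlow SDK or HTTP API.

```python
from typing import Any, Dict

OLD_KEY = "chunk_token_count"  # key name the docs used before this change
NEW_KEY = "chunk_token_num"    # key name the docs use after this change


def normalize_parser_config(parser_config: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of parser_config with the old key renamed to the new one.

    Hypothetical helper for illustration only; not a RAGFlow function.
    """
    normalized = dict(parser_config)  # shallow copy, the input is left untouched
    if OLD_KEY in normalized:
        old_value = normalized.pop(OLD_KEY)
        # If both keys are present, the explicit new-style value wins.
        normalized.setdefault(NEW_KEY, old_value)
    return normalized


if __name__ == "__main__":
    print(normalize_parser_config({"chunk_token_count": 128}))
```

The helper copies its input so a shared config dictionary is not modified in place, and it prefers an existing `chunk_token_num` value over a stale `chunk_token_count` one.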