convert → split → shape → extract.
convert
Normalizes a source document into clean text and pages.| Parameter | Type | Description |
|---|---|---|
source | string | URL or storage path of the document to convert. |
split
Breaks converted text into smaller chunks that the later steps can process reliably.| Parameter | Type | Description |
|---|---|---|
text | string | Converted document text to split. |
max_words | integer | Maximum number of words per chunk. Defaults to 500. |
shape
Inspects chunks and infers the schema of the data they contain.| Parameter | Type | Description |
|---|---|---|
chunks | string[] | Ordered list of text chunks to inspect. |
extract
Pulls the requested fields out of the text and returns them as structured values.| Parameter | Type | Description |
|---|---|---|
chunks | string[] | Ordered list of text chunks to extract from. |
fields | string[] | Names of the fields to pull out of the text. |
Tools called over MCP are billed the same as the equivalent API tools. See Pricing for credit rates.