Check the init and runtime parameters you can pass for this component.
YAML Init Parameters
You can specify the following parameters for AzureOCRDocumentConverter in the pipeline YAML:
Parameter | Type | Possible values | Description |
---|---|---|---|
| String | The endpoint of your Azure resource. | |
| String | Uses the | The API key to connect to your Azure resource. |
| String | Default: | The ID of the model you want to use to convert files to documents. For a list of supported models, see Microsoft documentation. |
| Integer | Default: | The number of lines before a table to extract as its preceding context. |
| Integer | Default: | The number of lines after a table to extract as its subsequent context. |
| Boolean |
| If a table contains more than one row used as a header, this parameter specifies if you want to merge multiple header rows into a single row. |
| Literal |
| Specifies the type of reading order to follow. Possible values are:
|
| Float | Default: | The threshold to determine if two recognized elements in a PDF should be grouped into a single line. This is especially relevant for section headers or numbers, which may be spatially separated on the horizontal axis from the remaining text. The threshold is specified in units of inches. |
REST API Runtime Parameters
There are no runtime parameters you can pass to this component when making a request to the Search REST API endpoint.