Skip to main content

OCR Providers

Each OCR Model wraps a single provider. The provider is selected with the provider field, and configured through config.connection (endpoint and credentials) and config.options (model and provider-specific options).

AlphaEdge 🇫🇷 🇪🇺

AlphaEdge exposes OCR through a multipart/form-data endpoint. Otoroshi resolves the document to bytes and uploads it for you.

FieldTypeDefaultDescription
config.connection.base_urlstringhttps://api-endpoints.alphaedge-ai.comAlphaEdge base URL
config.connection.tokenstringAlphaEdge API key (sent as the X-API-Key header). Supports vault references and comma-separated rotation.
config.connection.timeoutnumber180000Request timeout in milliseconds
config.options.modelstringalpha-digit-maxThe default OCR model (alpha-digit-max or alpha-digit-medium)
config.options.pdf_passwordstringOptional password used to open protected PDFs

Full entity example

{
"id": "ocr-model_xxxxxxxxx",
"name": "AlphaEdge OCR",
"description": "AlphaEdge OCR model",
"provider": "alphaedge",
"config": {
"connection": {
"base_url": "https://api-endpoints.alphaedge-ai.com",
"token": "${vault://local/ALPHAEDGE_API_KEY}",
"timeout": 180000
},
"options": {
"model": "alpha-digit-max"
}
},
"kind": "ai-gateway.extensions.cloud-apim.com/OcrModel"
}

Mistral

Mistral exposes a native JSON OCR endpoint (POST /v1/ocr) that returns page-by-page markdown. Otoroshi sends the document as a remote URL or as a base64 data-uri, depending on the input.

FieldTypeDefaultDescription
config.connection.base_urlstringhttps://api.mistral.aiMistral base URL
config.connection.tokenstringMistral API key (sent as a Bearer token). Supports vault references and comma-separated rotation.
config.connection.timeoutnumber180000Request timeout in milliseconds
config.options.modelstringmistral-ocr-latestThe default OCR model (mistral-ocr-latest or mistral-ocr-2505)

Full entity example

{
"id": "ocr-model_xxxxxxxxx",
"name": "Mistral OCR",
"description": "Mistral OCR model",
"provider": "mistral",
"config": {
"connection": {
"base_url": "https://api.mistral.ai",
"token": "${vault://local/MISTRAL_API_TOKEN}",
"timeout": 180000
},
"options": {
"model": "mistral-ocr-latest"
}
},
"kind": "ai-gateway.extensions.cloud-apim.com/OcrModel"
}