Supported providers
OpenAI
- Base URL:
https://api.openai.com/v1 - Default model:
text-embedding-3-small - API endpoint:
POST /embeddings
Models
| Model | Dimensions | Description |
|---|---|---|
text-embedding-3-small | 1536 | Small, fast, cost-effective |
text-embedding-3-large | 3072 | Higher quality, supports custom dimensions |
text-embedding-ada-002 | 1536 | Legacy model |
Supports the dimensions parameter to reduce embedding size (for text-embedding-3-* models).
Azure OpenAI
- Base URL: Computed from Azure resource configuration
- API endpoint:
POST /embeddings
Azure OpenAI supports two modes depending on the api_version setting:
v1 mode
When api_version is set to "v1", behaves identically to OpenAI. Uses the standard connection config with base_url and token.
Legacy mode
Uses Azure-specific connection settings:
{
"connection": {
"resource_name": "my-azure-resource",
"deployment_id": "my-embedding-deployment",
"api_version": "2024-02-01",
"api_key": "xxx"
}
}
| Parameter | Type | Description |
|---|---|---|
resource_name | string | Azure resource name |
deployment_id | string | Deployment ID (used as model) |
api_version | string | API version (use "v1" for OpenAI-compatible mode) |
api_key | string | API key (alternative to bearer token) |
Azure AI Foundry
- Base URL:
https://<resource>.services.ai.azure.com/models - Default model:
text-embedding-3-small - API endpoint:
POST /embeddings
Uses OpenAI-compatible API.
Mistral 🇫🇷 🇪🇺
- Base URL:
https://api.mistral.ai - Default model:
mistral-embed - API endpoint:
POST /v1/embeddings
Models
| Model | Dimensions | Description |
|---|---|---|
mistral-embed | 1024 | Multilingual embedding model |
Ollama (Local Models)
- Base URL:
http://localhost:11434 - Default model:
snowflake-arctic-embed:22m - API endpoint:
POST /api/embed
Ollama uses its native embed API (not OpenAI-compatible format). The token is optional for local installations.
Models
Any embedding model available in Ollama can be used, for example:
snowflake-arctic-embed:22mnomic-embed-textmxbai-embed-large
Cohere
- Base URL:
https://api.cohere.com - Default model:
embed-multilingual-v3.0 - API endpoint:
POST /v2/embed
Cohere uses its own API format (not OpenAI-compatible). The extension handles the translation automatically.
Models
| Model | Dimensions | Description |
|---|---|---|
embed-multilingual-v3.0 | 1024 | Multilingual, 100+ languages |
embed-english-v3.0 | 1024 | English-optimized |
Gemini
- Base URL:
https://generativelanguage.googleapis.com/v1beta/openai - Default model:
gemini-embedding-001 - API endpoint:
POST /embeddings
Uses OpenAI-compatible API.
X-AI (Grok)
- Base URL:
https://api.x.ai - Default model:
v1 - API endpoint:
POST /v1/embeddings
Deepseek
- Base URL:
https://api.deepseek.com - Default model:
deepseek-r1 - API endpoint:
POST /embeddings
Uses OpenAI-compatible API.