📄️ Overview
The Otoroshi LLM extension provides a unified, OpenAI-compatible API for computing text embeddings across multiple providers. Embeddings are vector representations of text that capture semantic meaning, enabling similarity search, clustering, and RAG (Retrieval-Augmented Generation) pipelines.
📄️ Supported providers
OpenAI
📄️ Setting up an embedding model
An embedding model entity configures a provider, its connection settings, and model options. You can create embedding models from the admin UI or via the admin API.
📄️ Embedding plugin
The Otoroshi LLM extension provides the "Cloud APIM - LLM OpenAI Compat. Embeddings" plugin, which exposes embedding models on an Otoroshi route through an API compatible with the OpenAI embeddings API.
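Because the exposed route follows the OpenAI embeddings API, a client can be sketched with nothing more than the Python standard library. The example below is a minimal sketch, not the extension's official client: the route URL, the model name, and the helper names (`build_request`, `parse_embeddings`, `cosine_similarity`) are assumptions made for illustration, and whether an API key is required depends on how the route is configured. It relies only on the request and response shapes of the OpenAI embeddings API (`input` and `model` in the request, a `data` list of `embedding` and `index` entries in the response).

```python
import json
import math
import urllib.request

# Hypothetical values: replace with your own Otoroshi route and model name.
ROUTE_URL = "https://embeddings.example.com/v1/embeddings"
MODEL = "text-embedding-3-small"


def build_request(texts, model=MODEL, url=ROUTE_URL, api_key=None):
    """Build (but do not send) an OpenAI-style embeddings request."""
    body = json.dumps({"input": texts, "model": model}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if api_key:
        # Assumption: the route is protected by a bearer token.
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def parse_embeddings(response_json):
    """Return the embedding vectors from an OpenAI-style response, ordered by index."""
    items = sorted(response_json["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Hand-written sample in the OpenAI response shape, used here in place of a live call.
    sample_response = {
        "object": "list",
        "data": [
            {"object": "embedding", "index": 1, "embedding": [0.0, 1.0]},
            {"object": "embedding", "index": 0, "embedding": [1.0, 0.0]},
        ],
    }
    first, second = parse_embeddings(sample_response)
    print(cosine_similarity(first, second))
```

To call a real route, pass the request to `urllib.request.urlopen(build_request(["some text"]))`, decode the JSON body, and hand it to `parse_embeddings`. Cosine similarity is one common way to compare the resulting vectors for similarity search; other distance measures work as well.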