📄️ Overview
Otoroshi LLM Extension provides a comprehensive set of cost optimization features to help you monitor, control, and reduce your LLM spending.
📄️ 💰 Cost tracking
Cost tracking for LLMs with a gateway means monitoring and managing the costs of using different LLMs through an API gateway.
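The core of cost tracking is attributing a dollar cost to each request from its token counts and the model's pricing. A minimal sketch of that idea, with illustrative prices and a hypothetical `request_cost` helper (not the extension's actual API):

```python
import math

# Per-1M-token prices; illustrative numbers only, not real provider pricing.
PRICES = {
    "model-a": {"input": 3.00, "output": 15.00},
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for one request, from its prompt and completion token counts."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000

# 1000 input tokens and 500 output tokens on model-a:
cost = request_cost("model-a", 1000, 500)
assert math.isclose(cost, 0.0105)
```

A gateway sitting in front of every provider can apply this computation uniformly and aggregate the results per consumer, route, or provider.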
📄️ 💰 Budgets Management
Budgets management for LLMs with a gateway means setting spending limits on LLM usage and having the gateway enforce them, so requests are blocked or flagged once a budget is exhausted.
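Enforcement reduces to a running total checked against a limit before each request is forwarded. A minimal sketch, assuming a hypothetical `BudgetGuard` (the extension's real configuration and semantics may differ):

```python
class BudgetGuard:
    """Accumulates spend and rejects requests that would exceed the budget."""

    def __init__(self, limit_usd: float):
        self.limit = limit_usd
        self.spent = 0.0

    def charge(self, cost_usd: float) -> bool:
        """Return True and record the cost if it fits in the budget, else False."""
        if self.spent + cost_usd > self.limit:
            return False
        self.spent += cost_usd
        return True

guard = BudgetGuard(limit_usd=1.00)
assert guard.charge(0.60)       # within budget
assert not guard.charge(0.50)   # would exceed the $1.00 limit
assert guard.charge(0.40)       # exactly fills the budget
```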
📄️ Managing token usage
The LLM Tokens rate limiting plugin allows you to control token consumption per time window, preventing any single consumer from using more than their fair share of LLM resources.
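Per-window token rate limiting can be pictured as a fixed-window counter per consumer: tokens accumulate within the window and reset when it rolls over. A hypothetical sketch of that mechanism, not the plugin's actual implementation:

```python
import time
from collections import defaultdict

class TokenRateLimiter:
    """Fixed-window token budget per consumer (illustrative sketch)."""

    def __init__(self, max_tokens: int, window_seconds: float):
        self.max_tokens = max_tokens
        self.window = window_seconds
        # consumer -> [window_start, tokens_used_in_window]
        self.usage = defaultdict(lambda: [0.0, 0])

    def allow(self, consumer: str, tokens: int, now=None) -> bool:
        """Return True and record usage if the consumer still has budget."""
        now = time.monotonic() if now is None else now
        start, used = self.usage[consumer]
        if now - start >= self.window:
            start, used = now, 0  # window elapsed: reset the counter
        if used + tokens > self.max_tokens:
            self.usage[consumer] = [start, used]
            return False
        self.usage[consumer] = [start, used + tokens]
        return True

limiter = TokenRateLimiter(max_tokens=100, window_seconds=60)
assert limiter.allow("team-a", 60, now=0)        # 60/100 used
assert not limiter.allow("team-a", 50, now=10)   # 110 > 100: rejected
assert limiter.allow("team-a", 50, now=61)       # new window: allowed
```

Production implementations typically use sliding windows or token buckets for smoother behavior, but the fixed window shows the core accounting.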
📄️ Simple cache
The simple cache provides exact-match caching for LLM prompts. When an identical prompt (same messages, same roles, same content) is sent again within the TTL window, the cached response is returned instantly without calling the LLM provider.
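Exact-match caching amounts to hashing the canonicalized message list and storing the response under that key with an expiry. A minimal sketch of the idea, assuming a hypothetical `SimpleLLMCache` (not the plugin's actual code):

```python
import hashlib
import json
import time

class SimpleLLMCache:
    """Exact-match LLM response cache with a TTL (illustrative sketch)."""

    def __init__(self, ttl_seconds: float = 300.0):
        self.ttl = ttl_seconds
        self.store = {}  # key -> (expiry_timestamp, cached_response)

    def _key(self, messages) -> str:
        # Canonical JSON so dict key ordering inside messages doesn't matter;
        # any change to roles or content yields a different hash, i.e. a miss.
        blob = json.dumps(messages, sort_keys=True)
        return hashlib.sha256(blob.encode("utf-8")).hexdigest()

    def get(self, messages):
        entry = self.store.get(self._key(messages))
        if entry is None:
            return None
        expiry, response = entry
        if time.monotonic() > expiry:
            return None  # entry expired past its TTL
        return response

    def put(self, messages, response):
        self.store[self._key(messages)] = (time.monotonic() + self.ttl, response)

cache = SimpleLLMCache(ttl_seconds=300)
prompt = [{"role": "user", "content": "What's the weather in Paris?"}]
cache.put(prompt, "It's sunny.")
assert cache.get(prompt) == "It's sunny."
assert cache.get([{"role": "user", "content": "Different prompt"}]) is None
```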
📄️ Semantic cache
The semantic cache goes beyond exact matching: it uses embeddings to find prompts with the same semantic meaning, even when the wording is different. For example, "What's the weather in Paris?" and "Tell me the current weather for Paris" would match semantically.
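The mechanism behind semantic matching is embedding each prompt as a vector and treating a cached entry as a hit when cosine similarity exceeds a threshold. A minimal sketch under that assumption, using a toy lookup table as a stand-in for a real embedding model (names and threshold are hypothetical, not the plugin's actual configuration):

```python
import math

def cosine(a, b) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def toy_embed(text: str):
    # Stand-in for a real embedding model: a fixed table of 2-D vectors.
    table = {
        "What's the weather in Paris?": [1.00, 0.00],
        "Tell me the current weather for Paris": [0.95, 0.05],
        "Give me a pizza recipe": [0.00, 1.00],
    }
    return table[text]

class SemanticCache:
    """Embedding-based cache: similar meaning, different wording still hits."""

    def __init__(self, embed, threshold: float = 0.9):
        self.embed = embed
        self.threshold = threshold
        self.entries = []  # list of (vector, cached_response)

    def get(self, prompt: str):
        v = self.embed(prompt)
        best_response, best_sim = None, 0.0
        for vec, response in self.entries:
            sim = cosine(v, vec)
            if sim > best_sim:
                best_response, best_sim = response, sim
        return best_response if best_sim >= self.threshold else None

    def put(self, prompt: str, response: str):
        self.entries.append((self.embed(prompt), response))

cache = SemanticCache(toy_embed, threshold=0.9)
cache.put("What's the weather in Paris?", "It's sunny.")
# Different wording, same meaning: semantic hit.
assert cache.get("Tell me the current weather for Paris") == "It's sunny."
# Unrelated prompt: miss.
assert cache.get("Give me a pizza recipe") is None
```

A real deployment would also need the TTL and invalidation behavior of the simple cache; the sketch isolates only the similarity matching.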