Skip to main content

Managing tokens usage

Our highly flexible quota management system ensures optimal resource allocation.

With Otoroshi LLM extension, you can:

  • 📏 Define quotas based on any attribute in the HTTP request
  • 🏷️ Group quotas by users, API keys, or any custom identifier
  • ⏳ Set time windows (per second, minute, hour, or custom intervals)

This enables precise control over token consumption and prevents overuse, ensuring smooth operation for all users.