Plans & Rate Limits
General Compute offers four plans to match your usage needs. All plans include access to every model.Pay As You Go
No monthly fee. Add a card to enable auto-reload.| Limit | Value |
|---|---|
| Requests per minute | 60 |
| Input tokens per minute | 100,000 |
| Output tokens per minute | 10,000 |
| Tokens per day | 1,000,000 |
| Max concurrent requests | 200 |
| Max requests per day | 1,000 |
Developer — $50/mo
For growing applications with higher throughput needs.| Limit | Value |
|---|---|
| Requests per minute | 300 |
| Input tokens per minute | 500,000 |
| Output tokens per minute | 100,000 |
| Tokens per day | 10,000,000 |
| Max concurrent requests | 500 |
| Max requests per day | 5,000 |
Scale — $1,000/mo
For production workloads with high throughput.| Limit | Value |
|---|---|
| Requests per minute | 1,000 |
| Input tokens per minute | 2,000,000 |
| Output tokens per minute | 500,000 |
| Tokens per day | 50,000,000 |
| Max concurrent requests | 10,000 |
| Max requests per day | 100,000 |
Enterprise — Custom
For organizations that need custom limits, dedicated infrastructure, or SLAs. Contact us to discuss your needs.Rate Limit Headers
Every API response includes headers to help you track your usage:Handling Rate Limits
When you exceed a rate limit, the API returns a429 Too Many Requests response. We recommend implementing exponential backoff:
Need higher limits? Upgrade your plan or contact us for Enterprise pricing.

