
# Rate Limits

> Understand rate limits and quotas for each General Compute plan.

## Plans & Rate Limits

General Compute offers four plans to match your usage needs. All plans include access to every model.

### Pay As You Go

No monthly fee. Add a card to enable auto-reload.

| Limit               | Value      |
| ------------------- | ---------- |
| Requests per minute | 100        |
| Tokens per minute   | 200,000    |
| Requests per day    | 50,000     |
| Tokens per day      | 10,000,000 |

### Developer — \$50/mo

For growing applications with higher throughput needs.

| Limit               | Value       |
| ------------------- | ----------- |
| Requests per minute | 500         |
| Tokens per minute   | 1,000,000   |
| Requests per day    | 250,000     |
| Tokens per day      | 100,000,000 |

### Scale — \$1,000/mo

For production workloads with high throughput.

| Limit               | Value       |
| ------------------- | ----------- |
| Requests per minute | 2,000       |
| Tokens per minute   | 5,000,000   |
| Requests per day    | 1,000,000   |
| Tokens per day      | 500,000,000 |

### Enterprise — Custom

For organizations that need custom limits, dedicated infrastructure, or SLAs. [Contact us](mailto:support@generalcompute.com) to discuss your needs.
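For round-the-clock traffic, it can help to check which limit binds first. The sketch below copies the request limits from the tables above and computes the highest steady request rate that fits both the per-minute and the daily limit. It assumes the daily limit covers a 24-hour window, which this page does not state; `PLANS` and `sustained_rpm` are illustrative names, not part of any SDK.

```python
MINUTES_PER_DAY = 24 * 60

# Requests-per-minute (rpm) and requests-per-day (rpd), copied from the plan tables.
PLANS = {
    "Pay As You Go": {"rpm": 100, "rpd": 50_000},
    "Developer": {"rpm": 500, "rpd": 250_000},
    "Scale": {"rpm": 2_000, "rpd": 1_000_000},
}


def sustained_rpm(plan):
    """Highest steady request rate that fits both the per-minute and daily limits.

    Assumption: the daily limit is spread over a 24-hour window.
    """
    limits = PLANS[plan]
    return min(limits["rpm"], limits["rpd"] / MINUTES_PER_DAY)


for name in PLANS:
    print(f"{name}: about {sustained_rpm(name):.1f} requests/minute sustained")
```

Under that assumption the daily limit is the tighter one on every fixed plan: Pay As You Go allows bursts of 100 requests per minute but averages out to roughly 34.7 requests per minute over a full day.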

## Rate Limit Headers

Every API response includes headers that report your request limit, how many requests you have left, and when the limit resets:

```
x-ratelimit-limit-requests: 500
x-ratelimit-remaining-requests: 299
x-ratelimit-reset-requests: 2024-01-01T00:00:00Z
```
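One way to use these headers is to work out how much quota is left and how long to wait before it resets. The following is a minimal sketch, assuming the headers arrive as a plain dictionary with lowercase keys and that the reset value is an ISO 8601 UTC timestamp like the sample above. The helper names `seconds_until_reset` and `remaining_fraction` are hypothetical and not part of the SDK.

```python
from datetime import datetime, timezone


def seconds_until_reset(headers, now=None):
    """Return seconds until the request limit resets, or None if the header is absent."""
    reset = headers.get("x-ratelimit-reset-requests")
    if reset is None:
        return None
    # Assumption: ISO 8601 UTC timestamp. Older Python versions do not
    # accept a trailing "Z" in fromisoformat, so swap it for an explicit offset.
    reset_at = datetime.fromisoformat(reset.replace("Z", "+00:00"))
    now = now or datetime.now(timezone.utc)
    return max(0.0, (reset_at - now).total_seconds())


def remaining_fraction(headers):
    """Return the share of the request quota still available (0.0 to 1.0), or None."""
    try:
        limit = int(headers["x-ratelimit-limit-requests"])
        remaining = int(headers["x-ratelimit-remaining-requests"])
    except (KeyError, ValueError):
        return None
    return remaining / limit if limit > 0 else None
```

A caller could, for example, slow down once `remaining_fraction` drops below a chosen threshold, or pause for `seconds_until_reset` when no requests remain.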

## Handling Rate Limits

When you exceed a rate limit, the API returns a `429 Too Many Requests` response. Retry these requests with exponential backoff. The SDKs do this automatically, and you can configure the number of retries:

<CodeGroup>
  ```typescript Node.js theme={null}
  import GeneralCompute from "@generalcompute/sdk";

  // The SDK automatically retries on 429 errors with exponential backoff.
  // Pass maxRetries to change how many times it retries:
  const client = new GeneralCompute({
    maxRetries: 3, // default is 2
  });
  ```

  ```python Python theme={null}
  from generalcompute import GeneralCompute

  # The SDK automatically retries on 429 errors with exponential backoff.
  # Pass max_retries to change how many times it retries:
  client = GeneralCompute(max_retries=3)  # default is 2
  ```
</CodeGroup>
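If you call the API without an SDK, you need to implement the retry loop yourself. The sketch below shows one common form of exponential backoff, with a capped delay and random jitter. It is not the SDK's internal implementation: `RateLimited`, `backoff_delays`, and `call_with_backoff` are hypothetical names, and `send` stands for any function of yours that makes the request and raises `RateLimited` when it receives a 429.

```python
import random
import time


class RateLimited(Exception):
    """Hypothetical exception your request function raises on HTTP 429."""


def backoff_delays(max_retries, base=1.0, cap=30.0):
    """Upper bound on the wait before each retry: base * 2**attempt, capped at cap."""
    return [min(cap, base * 2 ** attempt) for attempt in range(max_retries)]


def call_with_backoff(send, max_retries=3, base=1.0, cap=30.0, sleep=time.sleep):
    """Call send(); on RateLimited, wait and retry up to max_retries times."""
    for bound in backoff_delays(max_retries, base, cap):
        try:
            return send()
        except RateLimited:
            # Jitter: sleep a random amount up to the exponential bound so
            # that many clients do not all retry at the same moment.
            sleep(random.uniform(0, bound))
    # Final attempt; a RateLimited raised here propagates to the caller.
    return send()
```

With `max_retries=3` this makes at most four attempts in total. The `sleep` parameter exists so the waiting can be replaced in tests.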

<Note>
  Need higher limits? [Upgrade your plan](https://generalcompute.com) or [contact us](mailto:support@generalcompute.com) for Enterprise pricing.
</Note>
