Every API key has its own request budget, applied independently per key. Limits are split between read and write operations to keep heavy reads from starving writes (and vice versa).Documentation Index
Fetch the complete documentation index at: https://docs.clarky.ai/llms.txt
Use this file to discover all available pages before exploring further.
Limits
| Operation | Methods | Limit |
|---|---|---|
| Reads | GET, HEAD | 120 requests/minute per key |
| Writes | POST, PATCH, DELETE | 60 requests/minute per key |
429 Too Many Requests until the window resets.
Rate limit headers
Every response — successful or rate-limited — includes headers that describe your current budget:Maximum requests allowed in the current window for this method class (read or write).
Requests remaining in the current window.
Unix timestamp (seconds since epoch) when the current window resets.
What a 429 looks like
When you exceed the limit, you’ll get a429 with a standard error envelope and a Retry-After header (in seconds):
Recommended client behavior
Respect Retry-After
On a
429, wait at least the number of seconds in the Retry-After header before retrying. This is the simplest correct strategy.Use exponential backoff with jitter
For longer-running jobs, layer exponential backoff on top of
Retry-After. Add random jitter so a fleet of workers doesn’t synchronize and thunder back together.Throttle proactively
Watch
X-RateLimit-Remaining. If it’s getting close to zero, slow down before you hit the limit — it’s much cheaper than recovering from a 429.
