The IETF draft standard for rate limit headers defines three headers: `RateLimit-Limit` (the maximum requests allowed in a window), `RateLimit-Remaining` (requests left in the current window), and `RateLimit-Reset` (when the window resets, in Unix epoch seconds). The older `X-RateLimit-*` prefix is also widely used by APIs before the standard was proposed.
For scrapers calling APIs, reading rate limit headers allows the client to proactively throttle before hitting the limit rather than reactively backing off after receiving a 429. A polite client monitors `RateLimit-Remaining` and slows down as it approaches zero, then waits until `RateLimit-Reset` before resuming full speed.
Some anti-bot systems embed rate limit information in custom headers specific to their platform (Cloudflare: `cf-cache-status`, Fastly: `Fastly-RateLimit-*`). Understanding these platform-specific headers helps scrapers optimise their request pacing without triggering hard blocks.