🎁 New User? Get 20% off your first purchase with code NEWUSER20 Register Now →
Menu

Categories

Web Development Intermediate

What is API Rate Limiting?

A strategy for limiting the number of API requests a client can make within a specified time window to protect server resources.

API rate limiting prevents abuse and ensures fair resource sharing. Common algorithms include fixed window (100 requests/minute), sliding window (smoother distribution), token bucket (allows bursts), and leaky bucket (constant rate).

Rate limits are communicated via HTTP headers: X-RateLimit-Limit, X-RateLimit-Remaining, and Retry-After. Clients should implement exponential backoff when rate limited. API keys or JWT tokens identify clients for per-user limits.

Related Terms

WebSocket
A communication protocol that enables full-duplex, real-time data exchange between a browser and server over a single connection.
CDN (Content Delivery Network)
A distributed network of servers that delivers web content to users from the geographically closest location.
Cookie
A small piece of data stored in the browser by a website to remember user preferences, login state, or tracking information.
Server-Sent Events (SSE)
A web technology enabling servers to push real-time updates to browsers over a single HTTP connection, simpler than WebSockets.
Lazy Loading
A technique that delays loading non-critical resources until they are needed, improving initial page load performance.
Caching
Storing copies of frequently accessed data in faster storage to reduce load times and server processing.
View All Web Development Terms →