A flexible and robust rate limiter implementation using Redis for managing API keys and model access with multiple time windows.
Rate limits are tracked across different time windows (minute, hour, day, month). Each window operates independently, meaning a request must satisfy ALL window constraints to be allowed.
Example:
{
minute: 44, // 44 requests per minute
hour: 442, // 442 requests per hour
day: 2000 // 2000 requests per day
}
Limit borrowing allows exceeding shorter time window limits by "borrowing" from longer windows. This is useful for handling burst traffic while maintaining overall usage constraints.
Example Use Case: Consider an API with limits:
- 10 requests/minute
- 50 requests/hour
- 1000 requests/day
result = await limiter.getModel('api 1');
This returns:
{
"key": "key 1",
"model": "model 1",
"limits": { "minute": 13 }
}
When we have exhausted all model limits, then the model
property will be null.
When to Use Borrowing:
- Handling burst traffic (e.g., batch processing)
- Managing irregular usage patterns
- Processing time-sensitive operations
- Providing flexibility for premium customers
The system automatically rotates through available API keys and models when limits are reached. This helps maximize availability and distribute load.
Use Case:
const config = [
{
name: "api 1",
keys: ["key1", "key2"], // Multiple keys
models: [
{
name: "model1", // Multiple models
limits: {...}
},
{
name: "model2",
limits: {...}
}
]
}
];
The rate limiter supports different strategies for selecting both keys and models. These strategies can be configured independently:
-
Ordered (default)
- Uses keys/models in the order they are defined
- Predictable, sequential access pattern
- Best for simple use cases
-
Random
- Randomly selects keys/models
- Good for load balancing
- Prevents predictable patterns
Set strategies during initialization:
const limiter = new RateLimiter(config, {
keyStrategy: 'round-robin', // Strategy for key selection
modelStrategy: 'random', // Strategy for model selection
redis: {
/* redis options */
},
// ... other options
});
const limiter = new RateLimiter(config, {
redis: {
host: 'localhost',
port: 6379,
password: 'optional',
db: 0,
maxRetriesPerRequest: 3,
retryStrategy: (times) => Math.min(times * 50, 2000),
enableOfflineQueue: true,
connectTimeout: 10000,
},
customWindows: {
shift: 28800,
week: 604800,
},
keyStrategy: 'ordered',
modelStrategy: 'random',
});
The first parameter (config
) defines your APIs, keys, and models:
const config = [
{
name: 'api 1', // Unique API identifier
keys: ['key1', 'key2'], // Array of API keys
models: [
// Array of models
{
name: 'model1', // Unique model identifier
limits: {
// Rate limits per window
minute: 44,
hour: 442,
day: 2000,
month: 200000,
},
},
],
},
// ... more APIs
];
{
customWindows: {
// Window name: duration in seconds
halfhour: 1800, // 30 minutes
shift: 28800, // 8 hours
week: 604800, // 1 week
fortnight: 1209600, // 2 weeks
quarter: 7776000 // 3 months
}
}
Get next available key:model combination for a specific API.
// Get next available combination for "api 1"
const result = await limiter.getModel('api 1');
if (result) {
console.log('Key:', result.key);
console.log('Model:', result.model);
console.log('Limits:', result.limits);
}
Freeze a specific key:model combination for an API.
// Freeze for 5 minutes
await limiter.freezeModel({
apiName: 'api 1',
key: 'key 1',
model: 'model 1',
duration: 3000,
});
MIT