Blog
Engineering notes on reliable execution.
Practical, opinionated writing about AI infrastructure, queues, retries, rate limits, and the sharp edges of running async work in production.
Engineering
Queue vs workflow engine: what startups actually need
Temporal is amazing — and overkill for most teams. Here's a pragmatic breakdown of when a queue is enough and when you need a full workflow engine.
April 9, 2026 · 11 min
AI
How to handle OpenAI rate limits in production
Tokens-per-minute limits, retry-after headers, and shared buckets across your fleet — a practical guide that doesn't end in 429s.
March 28, 2026 · 9 min
Reliability
Why webhook retries matter
If your webhook delivery doesn't have backoff, jitter, idempotency, and a dead-letter queue, you are silently losing customer events.
March 17, 2026 · 6 min
Architecture
Building async infrastructure without overengineering
When to reach for Redis, when to reach for a hosted queue, and when to skip both. A guide to staying simple as your traffic grows.
March 5, 2026 · 10 min