Neo Kim curates 16 system cases
- Neo Kim’s April 2026 post turned system-design prep into a concrete reading list — 16 named case studies spanning AirTags, Instagram, YouTube, S3, Kafka, and Uber. (substack.com) - The list is specific enough to show the pattern: consumer apps, infra primitives, and geo systems sit together because interviews now test tradeoffs, not memorized diagrams. (substack.com) - That matters because senior design rounds increasingly reward rollbacks, failure handling, and fit-for-purpose storage choices over polished template answers. (substack.com)
System design interview prep keeps getting sold as a bag of templates. Learn a cache. Learn a queue. Memorize a URL shortener. But the useful version is messier than that. Neo(substack.com)one canonical architecture, but a set of concrete systems that force you to reason from constraints. (substack.com)t 2026 system design interviews, with examples like Airbnb, Google Docs, Apple AirTags, Spotify, Instagram at 2.5 billion users, Reddit, Bluesky, ChatGPT, Kafka, (substack.com)be on MySQL, and Amazon S3. The point was not “here is the answer.” The point was “study real systems with different bottlenecks.” (substack.com) ### Why these cases and not one template? Because these systems break in different ways. AirTags are about location privacy, intermittent connectivity, an(substack.com)anking and fanout problems. S3 is durability and object storage semantics. Uber nearby-driver search is geospatial indexing under latency pressure. If you can explain why those systems diverge, you are doing design instead of reciting. (substack.com) ### What’s the hidden pattern in the list? It mixes product systems and infrastructure systems on purpose. That matters. A lot of i(substack.com)e, “distributed systems” over there. But senior interviews usually blur them together. A feed product becomes a storage problem. A storage problem becomes a cost problem. A cost problem becomes an operability problem. Neo Kim’s list reflects that stack-crossing reality. (substack.com) ### Why are storage answers where people go wrong? Because storage is where generic advice stops helping. “Use SQL for (substack.com)ou ask about access patterns, write amplification, secondary indexes, retention, multi-region replication, or recovery time. The better engineers start from data shape and failure mode, then pick the storage engine. That is why S3, Kafka, and YouTube-on-MySQL belong on the same prep list. (substack.com) ### Why do rollbacks matter so much now? Because shipping architecture is not the same as drawing architecture. (substack.com) less elegant but easy to disable. That is where feature flags, staged rollouts, and shadow traffic become part of the architecture itself, not just release tooling. In practice, interviewers increasingly like candidates who think, “How do we back out safely?” before “How do we make this clever?” (arpitbhayani.me) ### Where do edge cases show up? At the boundaries — cross-region failover, partial outages, duplicate events, hot partitions, stale c(substack.com)eparate a whiteboard design from a production design. Plenty of systems look fine at steady state. The hard part is surviving the weird hour when one dependency is slow, another is inconsistent, and your retries start making everything worse. (substack.com) ### So what should an engineer study from this? Study systems as tradeoff stories. For each one, ask four things: what is the core object, what breaks first at scale, (arpitbhayani.me)hey give up? If you can answer those questions across feeds, storage, messaging, search, and geo, you are building the kind of judgment cross-team architecture roles actually need. (substack.com) ### Bottom line The useful takeaway from Neo Kim’s list is simple: stop preparing for system design as if there is one right diagram. The bar in 2026 is higher. You need to show that you can choos(substack.com)es finally arrive. (substack.com)