Resilience Now a Core Infra Pillar

For trading infrastructure providers, 24/7 uptime is now as critical as low latency. TNS shared its strategies for continuous operations, including redundant network paths and active-active data centers, reflecting a market where always-on digital asset trading demands constant availability.

The financial cost of infrastructure failure is staggering, with financial services firms losing an average of $152 million annually due to downtime. Revenue losses alone account for approximately $37 million of that figure each year. For large enterprises, an hour of downtime can cost anywhere from $300,000 to over $5 million, excluding regulatory fines or legal settlements. Recent exchange outages highlight the vulnerabilities in the system. A data center cooling failure at the Chicago Mercantile Exchange in November 2025 disrupted global futures and options trading for hours. Similarly, the Tokyo Stock Exchange experienced a full-day shutdown in 2020 due to a hardware failure, while the London Stock Exchange faced a similar closure from software issues the same year. To combat this, new regulations are enforcing stricter resilience standards. The EU's Digital Operational Resilience Act (DORA), which took effect in January 2025, mandates comprehensive ICT risk management, incident reporting, and resilience testing for financial entities. In the UK, financial regulators now have powers to oversee the resilience of services provided by critical third parties, requiring them to report major incidents like cyber-attacks and conduct scenario-based testing. In the quest for sub-millisecond performance, firms are moving beyond simple server co-location to hardware-level acceleration. Field-Programmable Gate Arrays (FPGAs) are becoming foundational, executing trading logic directly in silicon to achieve deterministic, nanosecond-level latency. This allows FPGAs to parse market data feeds and execute orders before a CPU can process the information. Kernel bypass techniques are another critical optimization, allowing trading applications to communicate directly with network hardware, avoiding the latency of the operating system's kernel. Using libraries like DPDK or protocols like RDMA, applications can interact directly with NIC buffers, which is essential for processing data at speeds of 10 Gbps and higher without packet loss. While on-premises infrastructure has traditionally been favored for its control and ultra-low latency, cloud platforms are becoming more viable for certain trading functions. Cloud offers scalability for deploying analytics and can connect directly to on-premise data centers via dedicated lines like Alibaba Cloud's Express Connect. However, for the most latency-sensitive high-frequency trading strategies, firms like Citadel Securities and Jump Trading still rely on customized on-premise systems.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.