AI Gateways Process Over 1 Trillion Tokens Daily
OpenRouter, the world's largest AI gateway, is now processing over one trillion tokens per day across more than 70 providers, signaling the immense scale of production LLM operations. This growth is fueling the development of an "agent web," with companies like Coinbase, Stripe, and Cloudflare building agentic layers and APIs simultaneously.
- AI gateways serve as a critical middleware layer, abstracting the complexity of integrating with multiple AI providers like OpenAI and Google. For platform teams, this means they can enforce centralized security, manage costs through rate-limiting and caching, and switch between models without altering downstream applications. This architecture is crucial for maintaining system reliability and enabling developers to focus on building features rather than managing complex AI integrations. - The concept of an "agent web" is being driven by infrastructure-level commitments from major tech companies. For instance, Cloudflare now automatically converts websites into an agent-readable markdown format, while Coinbase has launched "Agentic Wallets" for AI-to-AI transactions. These developments signal a shift towards a web where software agents are first-class citizens, capable of transacting and consuming information autonomously. - For logistics and shipping, AI-enhanced APIs are moving beyond simple transactional data exchange to create self-optimizing ecosystems. These intelligent APIs can proactively manage supply chains by predicting disruptions, optimizing routes in real-time, and automating complex processes, which is a significant evolution from traditional, passive API calls. Companies like SeaRates offer APIs that enable automated freight management and intelligent cost forecasting. - From an investment perspective, the massive build-out of AI infrastructure is a dominant market driver, with hyperscalers projected to increase capital expenditures by 73% from 2025 to 2026. This spending is fueling growth for companies throughout the AI value chain, from energy providers to data centers and chipmakers like Nvidia, which has seen its data center revenue grow 66% year-over-year. However, there are concerns that AI-related capital expenditures are outpacing actual revenue generation from AI applications. - The rise of AI agents is creating a new economic model, often called the "agent economy." Stripe's "machine payments" tool, which uses the x402 protocol and USDC stablecoin on the Base network, allows AI agents to pay for services like API calls or data access without human intervention. This enables new pay-per-use business models where autonomous agents can transact directly with services. - On the technical side, the share of AI model outputs that are "tool calls"—requests for an external function to be executed—has grown from less than 5% to over 25% in the past year. This indicates a significant shift from simple text generation to complex, multi-step tasks orchestrated by AI agents. Similarly, "reasoning tokens," which represent the model's internal thought process, now account for 50% of output tokens for some models, a capability that didn't exist in mainstream models just over a year ago. - For engineering leaders, the integration of LLMs into the development lifecycle is changing team dynamics. The role of a developer is shifting towards prompt engineering, code review of AI-generated outputs, and integrating AI components. LLMs are now used to generate test cases, suggest architectural patterns, and create deployment scripts, which requires a new set of skills and review processes. - Open-source models are steadily gaining market share, accounting for approximately one-third of token usage by late 2025. A significant portion of this growth is driven by Chinese-developed open-source models, which have grown from a negligible base to nearly 30% of total usage in some weeks. This trend towards a more diversified ecosystem of both open and closed-source models provides more options for platform teams to balance cost, performance, and customization.