AI Tools Accelerate SaaS Sprawl, Report Finds

Torii's 2026 Benchmark Report finds that the proliferation of AI tools is accelerating SaaS sprawl within enterprises. According to the report, 61% of applications are now unmanaged "shadow IT," increasing governance and security risks for companies.

- Reinforcement Learning from Human Feedback (RLHF) is a key technique for aligning models, but it faces scalability challenges and potential for human bias, as the quality depends on diverse and consistent evaluators. To address this, some AI labs use models to automate ranking or generate synthetic data, a process known as Reinforcement Learning from AI Feedback (RLAIF). - While synthetic data can be generated much faster and avoids privacy issues, it struggles with nuance and can perpetuate biases from the original datasets. Models trained on human-labeled data have been found to outperform synthetic-trained counterparts by 12-18% on complex reasoning tasks. A hybrid approach is often most effective, using synthetic data for scale and a smaller amount of human-labeled data for accuracy. - The biggest operational bottlenecks in training AI are often not the models themselves but the data pipeline, including CPU-bound data preprocessing that leaves expensive GPUs idle. Data quality is a primary source of AI/ML project failure, with data science teams spending significant time cleaning and reconciling data instead of building models. - Constitutional AI provides a more transparent, rule-based approach to model alignment by embedding principles directly into the training process, which can help in systematically detecting and mitigating bias. Emerging "runtime" constitutional AI methods take this further by validating every action an agent takes against a set of rules *before* execution, catching issues like hallucinated references or duplicate actions. - New benchmarks are being developed specifically for agentic AI, moving beyond traditional text-quality metrics to evaluate task completion, tool use, and multi-step reasoning. For example, AgentBench tests across domains like web navigation and database queries, while GAIA focuses on general intelligence tasks requiring tool use. - The go-to-market playbook for AI infrastructure startups is shifting away from traditional SaaS models, with a focus on usage-based, credit-style pricing and choosing initial customers who accelerate learning over those who are easiest to close. Key success metrics now include Return on AI Investment (ROAI), which measures revenue generated from automated workflows against the cost of model inference and compute. - The fundraising climate for AI is robust, with AI startups attracting approximately one-third of all global venture capital in recent cycles. However, investors are becoming more selective, rewarding companies that can demonstrate a clear link between capital expenditure and revenue growth, and there is a growing consolidation of investment into fewer, more mature companies. - While AI is expected to displace millions of jobs, it is also projected to create new ones, with a potential net gain of 58 million jobs globally by 2025. The primary challenge is a shift in required skills; it is estimated that up to 30% of hours worked in the US could be automated by 2030, requiring around 12 million occupational transitions.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.