Ex‑DeepMind team launches Elorian

Former Google DeepMind researchers launched Elorian to tackle visual AI, arguing current large models are still weak at interpreting visual prompts and need specialized approaches. The debut suggests there’s room for domain‑specialised model work—particularly for applications that require reliable multimodal understanding. (bloomberg.com)

Most artificial intelligence can talk about a photo the way a student bluffs through a book report: it recognizes patterns, but it often misses what is actually happening in the scene. That gap is where a new startup called Elorian says the next fight in artificial intelligence will be won. (bloomberg.com) Elorian came out of stealth on April 9 with $55 million in funding and a reported $300 million valuation. The company was co-founded by former Google DeepMind researcher Andrew Dai and is based in Palo Alto, California. (bloomberg.com) (techinasia.com) Dai’s pitch is blunt: he told Bloomberg that big lab models still reason about visual prompts like “a 3-year-old kid.” Elorian says it is building systems that can understand images in the world around them with better reasoning, not just better captioning. (bloomberg.com) (elorian.ai) That distinction matters because language is tidy and the physical world is not. A chatbot can sound smart in a paragraph and still fail at a simple question like which object in a cluttered room is blocking a door. (bloomberg.com) The industry calls this multimodal reasoning, which means one model handles words and pictures together instead of treating an image like a side attachment. Elorian’s own site says it is building “next-generation multimodal reasoning frontier models,” which is startup language for artificial intelligence that is supposed to see and think in the same system. (elorian.ai) Big companies already have image-capable models, but Elorian is betting that general-purpose systems still leave room for specialists. The company plans to release its first public reasoning model in about 12 months and is already talking with potential customers, even though Bloomberg reported it has no revenue yet. (bloomberg.com) The investors tell you what kind of market this is aimed at. Bloomberg and Tech in Asia both reported backing from Menlo Ventures, Altimeter Capital, Striker Venture Partners, Nvidia, and Google’s chief scientist Jeff Dean. (bloomberg.com) (techinasia.com) Those names make more sense if you think about robots, factories, and design software instead of chatbots. A system that reliably understands a camera feed can help a machine act in a room, while a system that only writes polished text is mostly stuck on a screen. (economictimes.indiatimes.com) (elorian.ai) The timing also says something about the artificial intelligence market in 2026. After two years of giant spending on ever-larger general models, former researchers from one of the top labs are now arguing that narrower, domain-specific systems may be the better way to solve hard real-world tasks. (bloomberg.com) So this launch is not just another ex-Big Tech startup story. It is a bet that the next valuable model may not be the one that talks the smoothest, but the one that can look at a messy scene and get the details right. (bloomberg.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.