Google launches Gemini 3.1 Flash‑Lite

- Google moved Gemini 3.1 Flash‑Lite from preview to general availability and made Workspace Studio broadly available for Business and Enterprise customers building AI agents. - The key hook is price and speed — Flash‑Lite starts at $0.25 per 1 million input tokens and $1.50 output, aimed at high‑volume workloads. - This matters because Google is bundling cheap inference with in‑workflow agent tools, pushing enterprise AI from demos toward everyday operations.

Google is trying to win the boring part of enterprise AI — the part that actually gets deployed. The flashy model news is Gemini 3.1 Flash‑Lite, now generally available after a preview launch in March. The quieter but maybe more important move is Workspace Studio, which is now broadly available inside Google Workspace for businesses that want to build agents where people already work. Together, those releases tell you the strategy: make the model cheap enough to use constantly, then put the agent builder inside Gmail, Docs, Sheets, and the rest. ### What is Flash‑Lite actually for? Flash‑Lite is Google’s low-cost, low-latency model tier. This is not the model you pick for the hardest reasoning problem in the building. It is the model you pick when you need millions of fast, cheap calls — classification, extraction, routing, short answers, chat handoffs, and other background tasks that pile up in production systems. Google is pitching it as its most cost-efficient Gemini model, optimized for high-volume traffic and positioned as a step up from earlier Flash‑Lite versions. (cloud.google.com) ### Why does the price matter so much? Because enterprise AI usually dies in the spreadsheet before it dies in the demo. Google said the preview pricing for Gemini 3.1 Flash‑Lite was $0.25 per 1 million input tokens and $1.50 per 1 million output tokens. That kind of pricing changes which use cases are even worth attempting. A workflow that calls a model once per customer support ticket is one thing. A workflow that calls a model 20 times per ticket — to classify, summarize, draft, check policy, and log actions — only works if the model is cheap enough to disappear into operating cost. (cloud.google.com) ### So what is Workspace Studio? Workspace Studio is Google’s no-code or low-code place to design, manage, and share AI agents inside Workspace. The important detail is not just that it exists. It’s that Google wants agent creation to happen natively in the same environment where employees already write email, review documents, and collaborate. Google says it is generally available with Workspace Business and Enterprise plans, and it can automate workflows using Gemini-powered agents rather than rigid rule-based automation. (blog.google) ### Why build agents inside Workspace? Because most enterprise work is not a standalone app problem. It is a workflow problem. An agent is more useful when it can see the inbox, the document, the spreadsheet, the calendar context, and the approval chain without forcing employees to jump into a separate AI console. Google has been expanding that idea with Workspace Intelligence and new reusable “skills” in Workspace Studio, which makes the product feel less like a chatbot and more like a work layer spread across familiar tools. (workspace.google.com) ### Is this just a Workspace story? Not really. Google is also building the heavier-duty side in Cloud. At Next ’26 it introduced Gemini Enterprise Agent Platform as the broader system for building, governing, orchestrating, and securing agents. So the stack is starting to look split in a sensible way — Workspace Studio for employee-facing workflow automation, and the Cloud platform for deeper enterprise plumbing, integrations, and control. (workspace.google.com) ### What’s the catch? Cheap models and easy agent builders remove one barrier, but not the hardest one. The hard part is getting agents wired into real business processes with the right permissions, handoffs, and failure handling. An agent that drafts an email is easy. An agent that can reconcile invoices, trigger approvals, or touch customer records without causing a mess is much harder. Google’s own recent messaging leans into long-running agents, orchestration, and monitoring — basically an admission that enterprise AI lives or dies on operations, not just model quality. (cloud.google.com) ### Why now? Because the market is moving from “which model is smartest?” to “which stack gets used every day?” Google already had the models. What it needed was a clearer path from model API to actual workplace behavior. Flash‑Lite lowers the cost of constant inference. Workspace Studio lowers the friction of building agents. Put together, Google is trying to make agentic work feel less like a special project and more like another feature of office software. (cloud.google.com) ### Bottom line This launch is less about a single new model than about packaging. Google is bundling cheap AI calls with in-context agent tooling and enterprise controls. If that works, the winners won’t be the teams with the coolest demos — they’ll be the teams that quietly automate the most routine work. (cloud.google.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.