Google’s AI Overviews misfires
A recent study found that Google’s AI Overviews were accurate roughly 90% of the time in tests, a rate that still translates to millions of incorrect answers at scale. Publishers and product observers warn that ‘mostly right’ can be dangerously misleading for a search product, underscoring why correctness and verification remain central engineering problems. (mobilesyrup.com, tech.yahoo.com)
Google’s search box used to hand you a list of links. Now, for many questions, it writes the answer itself at the top of the page, and a new analysis says that answer is still wrong often enough to create errors at industrial scale. (tech.yahoo.com)
The analysis, reported by The New York Times and described by several outlets from April 7 to April 9, found Google’s AI Overviews were accurate about 90% to 91% of the time on a factual benchmark called SimpleQA. That still leaves roughly 1 answer in 10 wrong. (searchengineland.com, pcmag.com)
Google handles more than 5 trillion searches a year, so a 10% error rate does not stay small for long. Outlets summarizing the analysis put the math at tens of millions of incorrect answers per hour, including one estimate of about 57 million every hour. (tech.yahoo.com, analyticsinsight.net)
That is the trap with search: a chatbot can be “mostly right” and still be dangerous, because people treat the box at the top of Google like a calculator, not like a rough draft. A wrong answer in position one looks finished even when the sources underneath do not fully support it. (pcmag.com)
Google launched AI Overviews in the United States on May 14, 2024, then expanded the feature to more than 100 countries in October 2024. Google now says AI Overviews are available in more than 120 countries and territories and 11 languages. (blog.google, blog.google, search.google)
Google has also kept swapping in newer Gemini models to improve the summaries. In February 2025, Google said Gemini 2.0 was powering AI Overviews in the United States, and by February 2026 Google said Gemini 3 had become the default model for AI Overviews. (blog.google, blog.google)
The problem is not just making up a false sentence from nowhere. Some reviews of the analysis say AI Overviews can also give answers that look correct while citing pages that do not actually prove the claim, which is like showing your homework with the wrong page numbers attached. (pcmag.com, oecd.ai)
Google has been here before. Within weeks of the May 2024 launch, the company published a response after viral examples showed AI Overviews giving bizarre advice, and Google said it was using policy updates and technical fixes to limit nonsensical or satirical results. (blog.google)
The new study suggests the product has improved from about 85% accuracy in October to about 91% in February on the same benchmark. The catch is that a better model does not erase the scale problem when the system is attached to a search engine used by more than a billion people. (searchengineland.com, blog.google)
That leaves Google trying to solve two jobs at once. It has to make the answer sound natural enough that people use it, and make the answer verifiable enough that “pretty good” does not turn into millions of confident mistakes every day. (tech.yahoo.com, blog.google)
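For readers who want to check the scale math, the per-hour estimate cited earlier can be roughly reproduced with a few lines of Python. This is a back-of-the-envelope sketch, not the analysts' method: the function name and the `overview_share` parameter are hypothetical, and the default assumes every search shows an AI Overview, which is an upper bound rather than a reported figure.

```python
HOURS_PER_YEAR = 365 * 24  # 8,760 hours in a non-leap year


def wrong_answers_per_hour(searches_per_year, accuracy, overview_share=1.0):
    """Estimate incorrect AI answers per hour.

    searches_per_year: total annual searches (the article cites more than 5 trillion)
    accuracy: fraction of answers that are correct, e.g. 0.90
    overview_share: assumed fraction of searches that show an AI Overview
                    (hypothetical parameter; 1.0 is an upper bound)
    """
    searches_per_hour = searches_per_year / HOURS_PER_YEAR
    return searches_per_hour * overview_share * (1.0 - accuracy)


if __name__ == "__main__":
    # Accuracy figures reported in the article for the SimpleQA benchmark.
    for accuracy in (0.85, 0.90, 0.91):
        estimate = wrong_answers_per_hour(5e12, accuracy)
        print(f"accuracy {accuracy:.0%}: about {estimate / 1e6:.0f} million wrong answers per hour")
```

At 90% accuracy and 5 trillion searches a year, the sketch lands at roughly 57 million wrong answers per hour, in line with the estimate quoted by the outlets. Lowering `overview_share` shrinks the figure proportionally, which is why the real number depends on how often an AI Overview actually appears.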