Netflix Deploys OpenAI-Powered Conversational Search
Netflix has launched a new AI search feature powered by OpenAI, allowing users to find content through conversational queries. The system layers LLM-driven semantic retrieval on top of its existing recommendation stack to reduce user decision fatigue. This reflects a broader industry trend of integrating generative AI into core product discovery features.
- The new search feature is part of a broader strategy to develop a centralized foundation model for recommendations, inspired by the shift to large language models in NLP, to unify the dozens of specialized ML models that currently power different parts of the user experience. - This move follows a long history of recommendation system evolution at Netflix, which began with the Cinematch collaborative filtering algorithm in 2000 and the $1 million Netflix Prize competition in 2006 to improve its accuracy. - The conversational search feature is currently in a limited, opt-in beta test for iOS users in specific regions, a cautious rollout strategy Netflix uses to gather data before wider deployment. - Competitors like YouTube and Spotify are taking similar steps; YouTube adapts Google's Gemini model by creating "Semantic IDs" to tokenize videos, effectively teaching the LLM a new "YouTube language," while Spotify uses a similar "Semantic ID" approach to help its model reason about the relationships between artists, albums, and user behavior. - Internally, Netflix is also applying LLMs to its enterprise Graph Search platform, enabling employees to query complex, federated datasets using natural language instead of a structured Domain Specific Language (DSL). - Deploying such features relies on Netflix's mature MLOps infrastructure, which includes a centralized "fact store" for raw feature data and a sophisticated Experimentation Platform that facilitates rigorous A/B testing for every product change, from UI redesigns to algorithm updates. - The new search functionality goes beyond keyword matching to interpret mood and emotional cues in queries like "a lighthearted comedy after a long day," aligning with trends in affective computing.