OpenAI's Sora Video Model to be Deployed on Android

OpenAI's AI-powered video generation model, Sora, is being prepared for deployment on Android phones. The move reflects a broader trend toward optimizing sophisticated AI models, which once required cloud infrastructure, to run natively on mobile and edge hardware.

- The technology behind Sora is a diffusion transformer, an adaptation of the architecture used for the DALL-E 3 image generator. It creates video by generating 3D "patches" of data in a latent space and then decoding them into a standard video format with a decompressor.
- OpenAI first previewed Sora in February 2024, with a public release for ChatGPT Plus and Pro users in December 2024. The more advanced Sora 2 model was unveiled on September 30, 2025, alongside an iOS app, with the Android app following in November 2025.
- Running generative AI models on-device requires significant hardware, including a powerful System-on-a-Chip (SoC) with a dedicated Neural Processing Unit (NPU) or Digital Signal Processor (DSP) for AI acceleration, and at least 8 GB of RAM. Companies like Google with its Tensor chip and Apple with its Neural Engine produce SoCs optimized for these tasks.
- The Android ecosystem supports on-device AI through platforms like Google AI Edge, which provides tools like LiteRT (formerly TensorFlow Lite) and MediaPipe. These allow developers to convert, optimize, and accelerate models for local execution on smartphone hardware.
- The Sora Android app, powered by the Sora 2 engine, features a social media-style interface for users to share their creations. It includes a "Cameo" feature that allows users to record their own likeness and voice to be inserted into the AI-generated videos.
- All videos generated by Sora include a C2PA metadata tag and a visible digital watermark to indicate that they are AI-generated, a measure intended to prevent misuse.
- Before Sora's public release, other companies had developed text-to-video models, including Meta with its Make-A-Video, Runway with Gen-2, and Google with Veo, establishing a competitive landscape for generative video technology.
- The Sora 2 model introduced significant upgrades over the original, including the ability to generate synchronized dialogue and sound effects, and a more accurate understanding of physics and complex instructions.
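
The 3D "patches" mentioned in the first bullet can be illustrated with a toy example. The sketch below is not OpenAI's implementation: the function name `patchify`, the patch size, and the tiny scalar "latent" are all hypothetical, chosen only to show how a block of latent video indexed by time, height, and width might be cut into spacetime patches that a transformer would treat as tokens.

```python
from itertools import product


def patchify(latent, patch):
    """Split a latent video (nested lists indexed [t][y][x]) into 3D patches.

    `patch` is (pt, ph, pw): frames, rows, and columns per patch.
    Returns a list of flattened patches, one "token" per patch.
    Hypothetical helper for illustration; real patch sizes are not public.
    """
    frames, height, width = len(latent), len(latent[0]), len(latent[0][0])
    pt, ph, pw = patch
    if frames % pt or height % ph or width % pw:
        raise ValueError("latent dimensions must be divisible by the patch size")

    tokens = []
    # Walk the patch grid in time-major order.
    for t0, y0, x0 in product(
        range(0, frames, pt), range(0, height, ph), range(0, width, pw)
    ):
        # Flatten one pt x ph x pw block into a single token.
        token = [
            latent[t][y][x]
            for t in range(t0, t0 + pt)
            for y in range(y0, y0 + ph)
            for x in range(x0, x0 + pw)
        ]
        tokens.append(token)
    return tokens


# Toy latent: 4 frames of 4x6 scalar values (a real latent would also have channels).
latent = [[[t * 100 + y * 10 + x for x in range(6)] for y in range(4)] for t in range(4)]
tokens = patchify(latent, patch=(2, 2, 3))
print(len(tokens), len(tokens[0]))  # 8 patches, each holding 12 values
```

Each patch spans several frames as well as a spatial region, which is what makes it "3D". In a full model, the decompressor described above would run in the opposite direction, turning generated latent patches back into pixels.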

Shared from Scout - Be the smartest in the room.