The NPU Arms Race: How Qualcomm, MediaTek, and OpenAI Are Redesigning the Smartphone for AI Agents
OpenAI is partnering with Qualcomm and MediaTek to build a smartphone where AI agents replace traditional apps, with mass production targeted for 2028 and projected annual shipments of 300 to 400 million units. This represents a fundamental reimagining of mobile hardware, where the neural processing unit (NPU), the dedicated AI engine inside a chip, becomes the central architecture rather than a bolted-on feature. The device would process lighter tasks directly on the phone while offloading complex AI reasoning to the cloud, maintaining what analysts call "full real-time state" by continuously capturing location, activity, and environmental context.
What Is an NPU and Why Does It Matter for Your Phone?
A neural processing unit is a specialized chip designed specifically to run artificial intelligence models efficiently, unlike general-purpose processors that handle everything from email to gaming. Traditional smartphones added NPUs as afterthoughts, but the new generation of phones is being engineered from the ground up around AI inference. Qualcomm's Snapdragon 8 Elite Gen 5, which powers 75 percent of Samsung's Galaxy S26 series, includes a Hexagon NPU that delivers 37 percent faster AI processing than its predecessor and supports agentic AI that learns from user behavior. MediaTek's Dimensity 9500 matches this performance at lower cost with better power efficiency.
The practical difference is significant. Instead of downloading an app to book a restaurant or order a ride, users would interact with an AI agent that handles the task directly. The phone's NPU would manage context awareness and memory, understanding what you're doing and where you are, while the cloud handles the heavy computational lifting. This architecture requires NPUs powerful enough to run continuous inference without draining the battery in hours.
How Are Phone Makers Upgrading NPU Capabilities?
- Custom Processor Design: Qualcomm and MediaTek are jointly designing a custom processor specifically for OpenAI's phone, moving beyond off-the-shelf chips to optimize for agentic AI workloads.
- Memory Architecture Innovation: Qualcomm is reportedly building custom 3D DRAM specifically optimized for AI workloads on mobile devices, addressing the bottleneck of moving data between memory and the NPU.
- Continuous Context Awareness: The upgraded sensing hub in next-generation NPUs captures real-time environmental data, location, and user activity to feed AI agents without constant cloud queries.
- Personal Knowledge Graphs: NPUs now support building personal knowledge graphs that allow AI agents to learn from user behavior and preferences over time.
Huawei is pursuing a similar strategy with its Mate XT 2, expected in October 2026. The device will run on the Kirin 9050 Pro, which features an enhanced NPU that enables features like real-time translation and AI photo editing to run directly on the device rather than requiring a cloud connection. This is particularly relevant for Huawei, which remains isolated from Google services; on-device AI fills the gap left by the absence of cloud-connected services.
Why Are Companies Betting Everything on NPU-First Design?
The shift reflects a fundamental change in how the smartphone industry views AI. Every major phone manufacturer is moving toward AI-first experiences, but most are constrained by backward compatibility with billions of existing apps and operating systems. OpenAI's approach sidesteps this problem entirely by building a device where the AI agent is the interface and the app is obsolete. Samsung's Galaxy S26 runs a triple AI engine with Gemini, Perplexity, and Bixby, while Google's Pixel 10 hands off multi-step tasks to background AI agents. Apple Intelligence processes queries on-device with an emphasis on privacy.
The financial stakes are enormous. Qualcomm's stock surged as much as 13 percent in premarket trading on the OpenAI partnership report, reflecting investor confidence in the company's NPU strategy. This matters because Qualcomm's traditional business, supplying modems and processors to phone manufacturers, is under pressure from Apple's efforts to develop its own modem chips and MediaTek's encroachment on the premium Android segment. An OpenAI partnership represents a new revenue stream in a market where Qualcomm's earnings growth has declined 46.9 percent and gross margins have fallen to 55.1 percent.
"AI agents will replace the mobile operating system and apps as the primary interaction layer, and the hardware must be designed from scratch to support continuous, power-efficient AI inference rather than retrofitting existing chipsets with neural processing units bolted on," explained Cristiano Amon, CEO of Qualcomm.
Cristiano Amon, CEO at Qualcomm
What Are the Risks of This NPU-First Bet?
The AI device category has produced more failures than successes. The Humane AI Pin, a $699 wearable with a laser projector, was permanently bricked on February 28, 2025, when HP acquired Humane's remnants for $116 million and shut down the servers. The Rabbit R1, a $199 "large action model" device, attracted 100,000 pre-orders but retained only 5,000 active users after five months, a 95 percent abandonment rate. Both failed because they created new form factors that solved no problem the smartphone did not already solve, at price points that demanded users carry a second device.
OpenAI's phone takes a fundamentally different approach. It is not an additional device; it is a replacement for the device 4.7 billion people already carry, in the same form factor, with the same basic capabilities, but with a radically different interaction model. Whether that is enough to avoid the graveyard depends on whether agents can do what apps do, better, faster, and without the friction of learning a new paradigm. The supply chain credibility is strong; Luxshare Precision Industry, which will manufacture the device, already assembles AirPods and Apple Watch components. Specifications and the supplier list are expected to be finalized by late 2026 or the first quarter of 2027, with mass production targeted for 2028.
Meanwhile, Huawei's Mate XT 2 demonstrates that NPU upgrades are happening across the industry. The device's larger 6,000mAh battery, up from 5,600mAh in the current generation, reflects the power demands of running continuous AI inference on-device. A redesigned hinge mechanism aims to reduce the visible crease where the screen folds, addressing a consistent criticism of the original model. The Mate XT 2 is tipped to inherit the optical setup from the Mate X7 series, featuring a 50MP primary camera, 50MP periscope telephoto, and 40MP ultrawide, putting it among the best-equipped foldables for photography.
The real question is not whether NPUs will become standard in smartphones; they already are. The question is whether the software paradigm of AI agents replacing apps will actually work in practice, and whether consumers will embrace a fundamentally different way of interacting with their phones. The hardware is ready. The silicon exists. What remains to be seen is whether the user experience justifies the bet.