Google's Custom AI Chips Are Now Nvidia's Real Threat, Not AMD
Google has emerged as Nvidia's most serious long-term threat, not because it builds better chips, but because it controls the entire AI stack: the hardware, the cloud infrastructure, the software, and the models themselves. While AMD remains Nvidia's closest competitor in the merchant chip market, Google's vertical integration strategy poses a fundamentally different challenge that could reshape how the world runs artificial intelligence workloads.
Why Is Google More Dangerous Than AMD?
Nvidia's dominance is staggering. In fiscal 2026, the company reported revenue of approximately 216 billion dollars with gross margins near 75 percent, commanding between 80 and 90 percent of the AI data centre chip market. CEO Jensen Huang has spoken of half a trillion dollars in chip demand visibility and projected that Nvidia will sell roughly a trillion dollars of Blackwell and Rubin generation systems across 2026 and 2027.
AMD, by contrast, has captured roughly 10 percent of the AI accelerator market in 2026, roughly double its 2024 position, built on its Instinct line including the MI300X, MI325X, and upcoming MI350 and MI400 generations. Yet much of AMD's momentum stems from customers wanting a credible second source to negotiate Nvidia's prices down, rather than a genuine preference for AMD's technology. AMD's software stack, ROCm, remains years behind Nvidia's CUDA in maturity and developer adoption.
Google's threat operates on a different plane entirely. Starting in late 2025, Google began selling and leasing its seventh-generation Tensor Processing Units, codenamed Ironwood, to external customers at scale for the first time. The company secured an extraordinary anchor deal with Anthropic, the maker of the Claude models, committing roughly 400,000 Ironwood chips worth around 10 billion dollars in finished systems, with additional capacity rented through Google Cloud. Meta has since signed a multibillion-dollar deal to lease TPUs and is negotiating to install them in its own data centres from 2027. The mere threat of switching has already won OpenAI and others discounts of approximately 30 percent on their Nvidia fleets.
What Makes Google's Approach Structurally Superior?
Google does not need to beat Nvidia on raw performance benchmarks. Instead, it offers a total cost of ownership that some analysts estimate is 30 to 50 percent lower per unit of useful computation for the workloads it targets. This advantage stems from vertical integration: Google designs the chip with Broadcom, runs the cloud infrastructure, builds the models, and writes the software, creating a closed ecosystem optimized for efficiency.
The critical insight is that the AI market is shifting from training new models to running existing models at scale, a process called inference. Inference rewards low latency, high memory bandwidth, and efficiency per watt, characteristics where specialized chips like Google's TPUs excel. Forecasters expect custom chip shipments from cloud providers to grow more than 44 percent in 2026, against approximately 16 percent for general-purpose GPUs. This growth is migrating toward the one area where Nvidia's competitive moat is shallowest.
How Does Nvidia's CUDA Advantage Protect Its Position?
Nvidia's dominance has never rested on silicon alone. It rests on CUDA, the software layer that more than five million developers have learned, optimized against, and built careers around, reinforced by Nvidia's networking, its rack-scale systems, and an architecture cadence that ships a new generation every 12 to 18 months. Switching away means rewriting and re-optimizing code, retraining teams, and accepting risk, for gains that must be overwhelming to justify the disruption.
Yet this moat is being probed at the compiler level. Tools such as OpenAI's Triton and the broader MLIR effort let developers write model code once and run it across different hardware at close to native speed, which in principle makes the underlying chip swappable and dissolves some of CUDA's lock-in. This represents the most important slow-moving threat to Nvidia, more important than any single rival chip, because it attacks the dependency rather than the hardware itself. However, this threat is incremental and most effective precisely in inference, where workloads are simpler and portability is easier to achieve.
How to Understand the Competitive Landscape in AI Chips
- Merchant Suppliers: AMD is the only other merchant supplier of data centre GPUs at meaningful scale, competing directly with Nvidia's hardware but lacking the software ecosystem and developer mindshare that CUDA provides.
- Vertical Integrators: Google, Meta, and other hyperscalers are building custom chips optimized for their specific workloads, offering lower total cost of ownership by controlling the entire stack from hardware to software to models.
- Specialized Designers: Companies like Cerebras and Groq build radical chip designs optimized for specific tasks like large-scale inference or low-latency language processing, but lack the broad ecosystem and proven unit economics to challenge Nvidia's overall dominance.
The distinction between these categories is crucial. AMD competes on the same terms as Nvidia, selling chips to customers who write their own software. Google competes on different terms entirely, offering an integrated solution where the chip is just one component of a larger value proposition.
Other specialized chip makers occupy niches. Cerebras builds wafer-scale processors, a single continuous slab of silicon that sidesteps the memory bottleneck constraining clusters of conventional GPUs, and has landed a reported 10-billion-dollar-plus commitment from OpenAI for 750 megawatts of capacity through 2028. Yet its strength is large-scale inference; its training story is weaker, its unit economics are unproven, and it lacks anything resembling CUDA's ecosystem. Groq built a deterministic inference chip called a Language Processing Unit, with on-chip memory delivering latency that GPUs struggle to match, but faces similar constraints in scale and ecosystem.
The fundamental question is not whether any technology can beat Nvidia on a single metric. The question is whether any technology, existing or yet to arrive, has a realistic chance of overtaking Nvidia across the entire market. AMD's answer is no; it will remain a credible second source but not a replacement. Google's answer is more complex: it may not overtake Nvidia in total market share, but it is carving out a structural alternative in the fastest-growing segment of the market, inference, where Nvidia's traditional advantages matter less.