Cameron Otsuka

TPUs Advance on Nvidia

Metadata
  • Description: What happens when AI accelerator demand is no longer synonymous with "Nvidia GPUs"?
  • Publication: Inference Draft 2026-19
  • Published:
  • Last Modified:
  • Type: newsletter
  • Tags: ai
  • POSSE: Substack 
Nvidia, Google, and Amazon depicted as supercomputers.

What happens when AI accelerator demand is no longer synonymous with “Nvidia GPUs”? Google (Alphabet) announced it is now delivering its TPUs to select customers’ own data centers. While Nvidia stock took a same-day leg down, likely on China export restriction revenue data they shared, I think there’s also a medium-term story of a shifting mix away from Nvidia GPUs over time.

NVDA price chart from 2026-04-22 through 2026-05-05

Why would Google’s TPUs be a credible substitute for Nvidia GPUs?

Google’s eighth-generation TPU architecture blog post offers some insight into where they might be best-used (emphasis mine):

Their wording obviously points towards hyperscalers, frontier labs, and massive organizations who have large, repetitive, well-optimized workloads. These TPUs aren’t designed to be universal substitutes for GPUs across all workloads, but with scale and engineering talent they can improve cost efficiency versus GPU stacks.

Why would customers like Anthropic choose TPUs over (or alongside) Nvidia?

The obvious answer is cost and power efficiency. The more interesting angle to consider is bargaining power: TPUs (and other AI accelerators) give customers optionality when selecting infrastructure for their workloads. The threat to Nvidia is then whether they are able to maintain pricing power despite competitors coming into the chip mix.

Why might Nvidia’s moat remain intact despite AI accelerator advances?

It’s important to note that Nvidia’s moat extends beyond its chips, inclusive of developer familiarity with CUDA and its related libraries, data centers designed specifically with Nvidia’s rack-scale systems in mind, NVLink networking, the list goes on. Even Sundar mentions that Nvidia GPUs are a core part of Google Cloud’s AI accelerator portfolio in their earnings call.

It isn’t as simple as buying a new set of chips and swapping them out in a rack as it’s often necessary to both retool the data center and redevelop the software.

Anyone else?


Mine Print Hash

On last week’s podcast, Matt and I go in-depth on the power struggle underway at the Fed, the UAE withdrawing from OPEC and its implications for the global dollar system, and round out the episode with a discussion of recent Chinese and American sovereign power plays within the AI space.


Open Threads

On privacy, or should I say, surveillance:

Agentic AI: