AI infrastructure & semiconductors

AI infrastructure and semiconductors is the silicon, systems, networking and cloud capacity layer that trains and serves frontier models. Nvidia's data centre revenue ran at a $110B+ annual rate by late 2024, the company shipped roughly 3.5M Hopper GPUs into 2024, and Blackwell ramp through 2025 took data centre revenue past $35B in a single quarter. The category covers training and inference accelerators (Nvidia, AMD, Cerebras, Groq, SambaNova and Tenstorrent), hyperscaler in-house silicon (Google TPU, AWS Trainium and Inferentia, Microsoft Maia, Meta MTIA), GPU clouds (CoreWeave, Lambda, Crusoe, Together AI and Nebius) and the networking, memory and packaging stack underneath.

The sector spans training accelerators, inference accelerators, hyperscaler in-house silicon, GPU cloud platforms, AI networking and interconnect, high-bandwidth memory and advanced packaging, AI data centre capacity and power, and edge and on-device AI silicon.

Revenue comes from GPU and accelerator hardware sales, GPU cloud rental priced per hour per GPU, foundry wafer revenue tied to advanced nodes, licensing of IP cores and architectures, networking and switching hardware, and long-dated power purchase and colocation contracts underpinning new AI data centres.

AI infrastructure & semiconductors is part of AI & ML.

$136B

Global market size

423

Public companies

Shenzhen Capital Group
European Innovation Council
Y Combinator
Samsung Catalyst Fund

Key VC investors

Cadence Design Systems
GlobalFoundries
Qualcomm
Credo Technology Group

Key strategic buyers

Business model

How AI infrastructure & semiconductors companies monetize?

AI infrastructure & semiconductors companies monetize through accelerator hardware sales, GPU cloud rental and long-dated capacity contracts.

Accelerator hardware sales

Direct hardware revenue from GPUs, AI accelerators and full rack systems. Nvidia DGX and HGX, AMD Instinct, Cerebras CS-3 and Groq LPU systems ship via OEM, ODM, hyperscaler and direct enterprise channels.

GPU cloud rental

Per-GPU per-hour pricing on H100, H200, B200 and GB200 capacity. CoreWeave, Lambda, Crusoe and Nebius sell this directly; AWS, Azure, GCP and Oracle Cloud price it inside broader commitments.

Foundry wafer revenue

Wafer fabrication revenue on advanced nodes (3nm and 2nm). TSMC captures the majority of advanced AI silicon manufacturing; Samsung Foundry and Intel Foundry Services compete with smaller share.

IP licensing and royalties

Licensing of CPU, GPU, NPU and DSP IP cores plus per-chip royalties. Arm licenses to virtually every smartphone and a growing share of data centre silicon; Synopsys and Cadence license EDA tools and IP blocks.

Networking and switching hardware

Switches, NICs, optical interconnect and DPUs for AI clusters. Nvidia (NVLink, Spectrum-X, Quantum and BlueField), Broadcom (Tomahawk and Jericho), Arista and Marvell lead AI networking.

Long-dated capacity contracts

Multi-year power, colocation and capacity contracts underpinning new AI data centres. Stargate (OpenAI, Oracle, SoftBank and MGX, $500B announced January 2025), Microsoft-CoreWeave, Anthropic-AWS Project Rainier and Meta-Hyperion are reference examples.

AI infrastructure & semiconductors valuations in May 2026

Public AI infrastructure & semiconductors comps trade at 7.9x EV/Revenue. Median revenue multiple across AI infrastructure & semiconductors M&A deals was 3.2x in the last 12 months. Median revenue multiple across AI infrastructure & semiconductors VC rounds was 29x in the last 12 months.

7.9x

Median EV/Revenue as of May 2026 for public AI infrastructure & semiconductors companies

24x

NVIDIA

NVIDIA is the highest valued public AI infrastructure & semiconductors company based on EV/Revenue (excluding outliers)

3.2x

Median EV/Revenue across AI infrastructure & semiconductors M&A deals in the last 12 months

29x

Median EV/Revenue across AI infrastructure & semiconductors VC rounds in the last 12 months

Sector breakdown

AI infrastructure & semiconductors market segments

Major AI infrastructure & semiconductors segments include training accelerators, GPU cloud platforms and high-bandwidth memory & packaging.

Training accelerators

Silicon designed for training frontier models. Nvidia H100, H200, B200 and GB200 dominate the category with sustained majority share; AMD Instinct MI300X and MI325X are the principal merchant challenger with anchor wins at Microsoft and Oracle; Cerebras CS-3 (wafer-scale) competes on specific workloads; Google TPU v5p and TPU Trillium anchor hyperscaler-native training.

Inference accelerators

Silicon optimised for low-latency model serving. Nvidia continues to dominate on broad workloads; Groq LPU (founded by Jonathan Ross), SambaNova SN40L, Tenstorrent and Etched (transformer-specific ASICs) compete on latency and cost; AWS Inferentia 2 and Google TPU serve hyperscaler workloads; on-device NPUs from Apple and Qualcomm anchor the edge tier.

Hyperscaler in-house silicon

Custom silicon designed in-house by hyperscalers to reduce Nvidia dependency. Google TPU is the most mature (TPU v5p and Trillium shipped at scale); AWS Trainium 2 and Inferentia 2 ramping; Microsoft Maia and Cobalt; Meta MTIA.

GPU cloud platforms

Pure-play providers renting Nvidia and AMD GPU capacity by the hour. CoreWeave (NASDAQ: CRWV; IPO March 2025) leads scale; Lambda, Crusoe, Nebius (Yandex spin), Together AI and Voltage Park serve the mid-market.

AI networking and interconnect

Switching, NICs and optical interconnect linking GPUs into training clusters. Nvidia NVLink and Spectrum-X, Broadcom Tomahawk and Jericho, Arista 7800R, and Marvell pluggable optics anchor the category.

High-bandwidth memory and packaging

HBM3e and HBM4 stacks plus advanced CoWoS packaging underpin every frontier accelerator. SK Hynix leads HBM share with Samsung and Micron competing; TSMC CoWoS capacity is the principal industry bottleneck for AI silicon supply and is sold out 12-18 months forward.

AI data centre capacity and power

Power-dense colocation, hyperscale build-out, grid interconnects and PPA-backed greenfield campuses. Equinix, Digital Realty, Vantage, QTS (Blackstone) and Compass build colocation; Stargate ($500B announced January 2025 by OpenAI, Oracle, SoftBank and MGX) anchors greenfield US capacity.

Edge and on-device AI silicon

NPUs and accelerators for laptops, smartphones, wearables and embedded devices. Apple Neural Engine, Qualcomm Hexagon, Intel Lunar Lake NPU and AMD XDNA define PC-class silicon; Arm and MediaTek license edge AI IP into mobile.

Fractional CFO, financial modelling and deal advice for AI infrastructure & semiconductors companies

See how Flow helps AI infrastructure & semiconductors founders.

We speak founders' language and have great operational understanding of AI infrastructure & semiconductors businesses.

Book an intro call - we'll look under the hood and recommend concrete next steps.

Explore pricing
Sector KPIs

Key AI infrastructure & semiconductors KPIs to track

Data centre revenue, gross margin, GPU shipments, utilisation and power capacity under contract are the metrics investors track in AI infrastructure & semiconductors.

KPIDefinition
Data centre revenueHardware and silicon revenue tied to AI training and inference. Nvidia data centre segment runs above a $110B annual rate; the headline gauge of AI capex.
Gross marginHardware gross margin. Nvidia data centre at 75-78%; AMD MI300 series at 50-55%; specialty AI silicon often below; commodity GPU cloud at 55-70%.
GPU shipments / installed baseVolume of accelerators shipped and active in customer clusters. Nvidia shipped roughly 3.5M Hopper GPUs into 2024; the cleanest read on training capacity available globally.
UtilisationGPU and cluster utilisation. The principal economic signal for GPU clouds; healthy operators run sustained utilisation above 75%.
Power capacity (MW) under contractMegawatts of power contracted for AI data centres. The new binding constraint on growth; tracked by hyperscalers, colocation operators and GPU clouds.
Backlog / RPORemaining performance obligations on multi-year contracts. CoreWeave reported over $30B in RPO at IPO; Nvidia and AMD report supply commitments running 12-18 months forward.
Capex intensityCapex as a share of revenue. Hyperscalers running capex above 40% of revenue on AI build-out; GPU clouds running above 200% in growth years.
Yield and packaging capacityTSMC CoWoS wafer yield and packaging capacity. The hard constraint on advanced AI silicon supply; tracked by every analyst covering the category.
Key players

Main AI infrastructure & semiconductors players globally

The most active AI infrastructure & semiconductors companies and category leaders globally.

CompanyHQOverview
Santa Clara
Dominant supplier of AI training and inference accelerators (NASDAQ: NVDA). Data centre segment passed a $110B annual rate by late 2024; market cap crossed $3T in 2024 and remained the largest or second-largest listed company globally through 2025. CUDA software stack remains the principal moat.
Santa Clara
Principal Nvidia challenger in data centre accelerators (NASDAQ: AMD). MI300X and MI325X ramped through 2024-25 with anchor commitments from Microsoft, Meta, Oracle and OpenAI; data centre AI revenue passed $5B in 2024 and ran well above $10B in 2025.
Hsinchu
World's largest semiconductor foundry (TWSE: 2330; NYSE: TSM). Fabricates the majority of advanced AI silicon (Nvidia, AMD, Apple, Google TPU and Broadcom) on 5nm, 4nm, 3nm and 2nm nodes. CoWoS advanced packaging capacity is the principal industry bottleneck.
Broadcom
broadcom.com
Palo Alto
AI networking silicon, custom ASICs and the largest enterprise software business after VMware (acquired November 2023 for $69B). Designs Google TPU and Meta MTIA accelerators; market cap above $1T through 2025.
Cerebras Systems
cerebras.net
Sunnyvale
Wafer-scale AI accelerator manufacturer. CS-3 system built around the WSE-3 chip; filed for IPO in 2024 (delayed pending CFIUS review of G42 investment); principal customer concentration in G42 (UAE).
Mountain View
Founded by Jonathan Ross (Google TPU originator); designs the LPU inference-optimised accelerator. Raised $640M Series D in 2024 at $2.8B valuation led by BlackRock; offers token-priced API competing with Nvidia GPU clouds on inference latency.
SK Hynix
skhynix.com
Icheon
World leader in HBM3e high-bandwidth memory (KOSPI: 000660). Single largest HBM supplier into Nvidia Hopper and Blackwell; HBM revenue grew several-fold across 2023-25 as AI capex ramped.
CoreWeave
coreweave.com
Livingston
GPU cloud (NASDAQ: CRWV; IPO March 2025). Largest pure-play GPU cloud globally; over $15B revenue run-rate by 2025 with anchor customers Microsoft, Meta, OpenAI and Mistral; reported over $30B in remaining performance obligations at IPO.
San Francisco
GPU cloud and on-premise AI infrastructure. Raised $480M Series D in early 2024 at $1.5B valuation; principal Nvidia partner for mid-market AI training and inference.
Crusoe
crusoe.ai
Denver
Stranded-energy-powered AI cloud and data centre operator. Raised $600M Series D in late 2024 at $2.8B valuation; builds gas-flare-powered data centres in West Texas oilfields and is delivering capacity into Stargate.

CFO-as-a-service without the overhead

Fractional CFO services priced for startups - senior operator, no permanent headcount.

Market trends

Key AI infrastructure & semiconductors market trends

The Stargate $500B build-out, power as the binding constraint and HBM & CoWoS as the supply chokepoint are reshaping AI infrastructure & semiconductors right now.

Stargate and the $500B build-out

OpenAI, Oracle, SoftBank and MGX announced the Stargate joint venture in January 2025 to deploy up to $500B into US AI data centres over four years. The first 1.2GW Abilene, Texas site is under construction with Crusoe; the programme has reset the scale of single-site AI infrastructure announcements.

Hyperscaler in-house silicon

Google TPU v5p and Trillium, AWS Trainium 2 and Inferentia 2, Microsoft Maia and Cobalt, and Meta MTIA are all in serious production deployment. Hyperscalers are not exiting Nvidia, but are routing inference and selected training workloads to in-house silicon to compress unit economics.

Power as the binding constraint

GPU supply has eased relative to 2023; power, transmission, substation interconnect and cooling have replaced it as the binding constraint on AI capacity growth. Microsoft signed a 20-year PPA with Constellation to restart Three Mile Island Unit 1 in September 2024; Amazon, Google, Oracle and Meta have all announced nuclear deals through 2024-25.

Export controls reshaping global silicon

US export controls expanded in October 2023 and again in December 2024 and January 2025 restrict advanced AI silicon to roughly 40 countries. Nvidia H20 and B20 (China-compliant variants) shipped through 2024-25 before the H20 ban in April 2025; Huawei Ascend 910C ramped to fill the gap inside China; Cambricon and Biren are growing fast on domestic substitution; the divergence between Western and Chinese AI silicon is now structural.

HBM and CoWoS as the supply chokepoint

SK Hynix, Samsung, Micron and CXMT HBM capacity plus TSMC CoWoS advanced packaging are sold out 12-18 months forward. Nvidia, AMD, Broadcom and Google each made multi-billion-dollar long-term commitments in 2024-25 to secure allocation; HBM4 ramp in 2026 will be the next inflection.

GPU cloud public market debut

CoreWeave's IPO in March 2025 marked the first scaled public-market test of the GPU cloud model. Trading dynamics, customer concentration disclosures (Microsoft anchor) and capex intensity have shaped how Nebius, Lambda, Crusoe and Together AI are pricing follow-on rounds. Multiple peers expected to file in 2026.

Similar verticals to AI infrastructure & semiconductors

Explore niches like AI software and AI consulting.

Explore other sectors

We know tech inside & out.

We live and breath tech - true understanding of how startups operate is fundamental at what we do.

SoftwareAI & MLFintechConsumer internetDigital mediaE-commerce & marketplacesConsumer productsMobilityDigital healthIndustrial technologyDigital infrastructureIT services

Recent insights across AI infrastructure & semiconductors and beyond

Talk to us

Schedule a call to get a health check on your business and see how we could help.

Fractional CFO

Stages

Countries

Cities

Sectors

© 2026 Flow Partners (London) Ltd. All rights reserved. Registered as a limited liability company in England and Wales (registered number 12969521).