Back to stories
Industry

Alibaba's Zhenwu M890 Claims 3x Nvidia's H20, Ships With Qwen 3.7-Max

Michael Ouroumis2 min read
Alibaba's Zhenwu M890 Claims 3x Nvidia's H20, Ships With Qwen 3.7-Max

Alibaba's semiconductor unit T-Head unveiled the Zhenwu M890, an in-house AI accelerator it claims runs agentic inference workloads roughly three times faster than Nvidia's H20 — the throttled Hopper part Washington still permits for export to China. It arrives paired with Qwen 3.7-Max, a model Alibaba says can run autonomously for up to 35 hours and over 1,000 tool calls without performance degradation. BABA shares slipped premarket on the news, which landed just ahead of Nvidia's quarterly earnings.

The specs that matter

The M890 is built on T-Head's in-house PPU (Parallel Processing Unit) architecture with a Transformer core engine. The headline figures:

Read the 3x-H20 claim carefully: it targets agentic inference, where memory capacity and interconnect — not peak FLOPs — are the binding constraint. Nvidia engineered the H20 with deliberately limited compute and bandwidth to clear export rules. The M890's 144GB of HBM3 lets it hold a larger KV cache and longer context per accelerator, which is exactly what long-horizon agents starve for.

A real deployment, not a paper launch

T-Head says it has shipped over 560,000 Zhenwu units to date, with 400+ external customers across 20 industries, including automakers and financial-services firms. The M890 reaches developers through Alibaba Cloud's Bailian platform and the Panjiu AL128 server — 128 M890 accelerators per rack.

A roadmap built to outpace the cadence

Alibaba published a multi-year line: the V900 in Q3 2027, promising another ~3x gain over the M890, and the J900 in Q3 2028. That is an aggressive annual rhythm aimed squarely at Nvidia's release cycle.

Why the timing

The launch is a self-sufficiency statement. US export controls have restricted advanced American silicon to Chinese entities since 2022, and the Trump administration tightened them again in April 2025 to block even China-market parts like the H20. Pairing a domestic accelerator with a domestic frontier model gives Chinese builders a vertically integrated stack — model, chip, and cloud — that no longer depends on a throttled or smuggled Nvidia GPU.

What changes for builders

For teams operating inside China or hedging against export risk, the M890 plus Qwen 3.7-Max is a credible escape hatch from Nvidia dependence. The KV-cache headroom and the 35-hour autonomous-run claim map directly onto the long-horizon agent workloads that blow up memory budgets on current inference fleets. Treat the 3x figure as a vendor benchmark, though — validate it against your own agent traces and tool-calling patterns before re-architecting around it.

Learn AI for Free — FreeAcademy.ai

Take "AI for Business: Practical Implementation" — a free course with certificate to master the skills behind this story.

More in Industry

Google Ships Antigravity 2.0: A Standalone Agent Platform That Retires the Gemini CLI
Industry

Google Ships Antigravity 2.0: A Standalone Agent Platform That Retires the Gemini CLI

At I/O 2026, Google relaunched Antigravity as a standalone agent-orchestration platform with a CLI that replaces the Gemini CLI, an SDK, Managed Agents in the Gemini API, and an enterprise tier — all defaulting to Gemini 3.5 Flash.

10 min ago2 min read
Cohere Buys Reliant AI to Build 'North for Pharma' Sovereign Biopharma Agents
Industry

Cohere Buys Reliant AI to Build 'North for Pharma' Sovereign Biopharma Agents

Cohere has acquired Montréal- and Berlin-based biopharma startup Reliant AI, folding its automated research workbench into the North platform to launch 'North for Pharma' — its second acquisition in a month as it doubles down on sovereign, regulated-industry AI.

3 hours ago2 min read
Armada Raises $230M at $2B Valuation to Mass-Produce Modular AI Data Centers
Industry

Armada Raises $230M at $2B Valuation to Mass-Produce Modular AI Data Centers

Edge-infrastructure startup Armada closed a $230M oversubscribed Series B at a $2B pre-money valuation, co-led by Overmatch, BlackRock and 8090 Industries, alongside a Johnson Controls deal to build a 400,000-sq-ft Arizona factory for its Leviathan modular data centers.

6 hours ago2 min read