Back to stories
Industry

Wirestock Raises $23M Series A to Supply Multimodal Training Data to AI Labs

Michael Ouroumis2 min read
Wirestock Raises $23M Series A to Supply Multimodal Training Data to AI Labs

Wirestock closed a $23 million Series A to scale its creator-sourced training-data platform, the company said in a release dated May 15. The round was led by Nava Ventures with participation from SBVP — the fund co-founded by Sheryl Sandberg — alongside Formula VC and I2BF Global Ventures, bringing total capital raised to about $26 million.

The company says it now sits above $40 million in annual run-rate revenue, with creator payouts growing 20x year over year since it pivoted from a marketplace into a licensed data supplier in 2023.

What the platform actually sells

Wirestock aggregates content from a network of more than 700,000 contributors — photographers, videographers, graphic designers, 3D artists, filmmakers and musicians — and licenses curated datasets to AI labs training image, video and multimodal models. Categories include stock photo, video, design files, 3D models, gaming assets and audio.

The company pitches itself as the rights-cleared alternative to scraped web corpora: creators sign explicit licensing terms, are paid monthly, and contribute structured metadata that labs can use to assemble bespoke training sets.

Why the round lands now

Frontier labs are exhausting their easy pools of high-quality visual data just as multimodal training demand spikes. Video generation, world models and embodied-AI training pipelines need orders of magnitude more annotated frames, depth maps and 3D scenes than text-only LLMs ever did — and most of that material is still locked inside stock libraries, studios and individual creator hard drives.

Licensing pressure is also escalating. Disney and Universal sued Midjourney in June 2025 over alleged mass IP infringement in training data, and Getty's long-running case against Stability AI yielded a landmark UK ruling on November 4, 2025 — largely in Stability's favor on copyright, with Getty's secondary copyright claim rejected outright after it had already abandoned its primary copyright and database-right claims before closing submissions; Stability lost only on limited trademark findings. The ruling nonetheless underscored how exposed unlicensed pipelines have become. Reports of frontier labs being pressured to retroactively scrub training datasets after rights complaints continue to circulate. A clean upstream supplier with documented chain-of-custody is starting to look less like a nice-to-have and more like procurement table stakes.

Where the money goes

Wirestock says the new capital will fund custom-dataset capacity for labs that want bespoke collections, deeper integrations with research partners, and expanded support for emerging modalities — specifically calling out motion design, 3D modeling and music as next-priority verticals. The company will also invest in creator tools, payout infrastructure and earning pathways.

For AI builders, the read-through is simple: the supply side of multimodal training data is becoming a venture-funded category, not a side hustle. Labs that depended on murky scraping at GPT-3 scale will need real procurement relationships at GPT-6 scale, and the vendor list is being established right now. Expect more $20M–$50M Series A rounds in this lane through 2026.

Learn AI for Free — FreeAcademy.ai

Take "AI for Business: Practical Implementation" — a free course with certificate to master the skills behind this story.

More in Industry

Google Ships Antigravity 2.0: A Standalone Agent Platform That Retires the Gemini CLI
Industry

Google Ships Antigravity 2.0: A Standalone Agent Platform That Retires the Gemini CLI

At I/O 2026, Google relaunched Antigravity as a standalone agent-orchestration platform with a CLI that replaces the Gemini CLI, an SDK, Managed Agents in the Gemini API, and an enterprise tier — all defaulting to Gemini 3.5 Flash.

10 min ago2 min read
Alibaba's Zhenwu M890 Claims 3x Nvidia's H20, Ships With Qwen 3.7-Max
Industry

Alibaba's Zhenwu M890 Claims 3x Nvidia's H20, Ships With Qwen 3.7-Max

Alibaba's chip unit T-Head unveiled the Zhenwu M890 AI accelerator, claiming 3x the agentic-inference performance of Nvidia's H20 and 144GB of HBM3, paired with the Qwen 3.7-Max model — a full domestic stack as US export controls tighten.

1 hours ago2 min read
Cohere Buys Reliant AI to Build 'North for Pharma' Sovereign Biopharma Agents
Industry

Cohere Buys Reliant AI to Build 'North for Pharma' Sovereign Biopharma Agents

Cohere has acquired Montréal- and Berlin-based biopharma startup Reliant AI, folding its automated research workbench into the North platform to launch 'North for Pharma' — its second acquisition in a month as it doubles down on sovereign, regulated-industry AI.

3 hours ago2 min read