TL;DR
Origin Lab has raised $8 million to develop a marketplace enabling AI research labs to purchase high-quality data derived from video game assets. This addresses a key data gap for training physical world models. The funding signals growing industry demand for licensed, high-quality training data.
Origin Lab has secured $8 million in seed funding, led by Lightspeed Ventures, to establish a marketplace that facilitates the sale of video game-derived data to AI research labs developing physical world models. This development aims to address a significant data shortage faced by labs working on robotics and object modeling, making high-quality digital assets from video games available for training.
The startup, founded by Anne-Margot Rodde and her team, will serve as an intermediary platform where AI labs such as Yann LeCun’s AI initiatives or Fei-Fei Li’s World Labs can purchase licensed, high-quality data derived from video game environments. On the other side, video game companies can monetize existing digital assets, including rendered scenes or walkthrough footage, by licensing them through Origin Lab.
According to Rodde, the platform will convert video game assets into formats suitable for training AI models, ranging from simple rendering runs to complex datasets involving hours of gameplay footage. The company aims to bridge a gap identified by industry insiders, where the lack of infrastructure and licensing options has limited the use of video game data for AI training, despite its potential value.
The funding round was led by Lightspeed Ventures partner Faraz Fatemi, who noted that the increasing demand for high-quality training data from well-capitalized AI labs makes this a lucrative opportunity. The success of data vendors like Scale.AI has underscored the market’s potential, with industry players recognizing data as a critical bottleneck for AI development.
Why It Matters
This development is significant because it addresses a key challenge in AI research: acquiring high-quality, diverse, and licensed training data. By creating a marketplace for video game assets, Origin Lab could accelerate the development of physical world AI models, impacting robotics, simulation, and virtual environment understanding. The funding indicates investor confidence in the market’s growth and the strategic importance of data licensing for AI progress.

Video Game Display Frame Compatible with Standard PS5/PS4/PS3 Game Case and Disc, Solid Wood Shadow Box with EVA Foam Lining and Black Flocked Fabric, Wall or Tabletop Gaming Room Decor
Designed to be compatible with standard PS5, PS4, and PS3 game cases and discs, including common Blu-ray-style case…
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Background
Over recent years, AI labs have sought alternative data sources to improve model training, especially for understanding physical environments. Video game footage has been considered promising but difficult to license and standardize for research purposes. In December 2024, OpenAI faced controversy when its Sora model appeared to use streamed footage from Twitch, highlighting the demand and legal complexities surrounding game data. The emergence of dedicated marketplaces like Origin Lab aims to formalize and streamline this process, responding to industry needs for scalable, licensed data sources.
“The AI systems being built now need to understand how the physical world works, and that data essentially lives in video games.”
— Anne-Margot Rodde, co-CEO of Origin Lab
“The revenue scaling for data vendors serving major labs has shown how critical data is as a bottleneck for AI development.”
— Faraz Fatemi, partner at Lightspeed Ventures

How to Make Money With Your DJI Osmo 360 Camera Standard Combo: Turn immersive imaging into licensable digital assets for AI training simulation and recurring revenue streams
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
What Remains Unclear
Details remain unclear about the specific licensing terms, the initial catalog of available game assets, and how widespread adoption will be among AI labs and game companies. It is also not yet confirmed how the platform will handle legal and copyright issues at scale.

Vulkan 3D Graphics Rendering Cookbook: Implement expert-level techniques for high-performance graphics with Vulkan
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
What’s Next
Origin Lab plans to launch its marketplace in the coming months, with initial offerings from select video game companies. Monitoring how AI labs adopt the platform and how game publishers respond will be key milestones. Further funding rounds or partnerships may follow as the platform scales.
game footage data collection tools
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Key Questions
How will Origin Lab ensure licensing compliance for game data?
Origin Lab will establish licensing agreements with game publishers to ensure legal use of digital assets for AI training, though specific terms are still being finalized.
What types of data will be available on the marketplace?
The platform aims to offer various formats, including rendered scenes, gameplay footage, and possibly automated walkthroughs, suitable for different AI training needs.
Why is video game data valuable for training physical world models?
Video game environments simulate real-world physics and object interactions, providing rich, diverse data that can help AI systems understand spatial relationships and movement.
What challenges might arise in scaling this marketplace?
Legal, copyright, and licensing issues are potential hurdles, along with ensuring data quality and standardization across different game publishers and platforms.