
World Labs' real-time generative world model that creates persistent, navigable 3D scenes from a single image on one H100 GPU.

Category
AI Simulation
RTFM is available as a free public research preview at rtfm.worldlabs.ai with no account or subscription required. Server-side inference runs on a single NVIDIA H100 GPU operated by World Labs. The underlying technology is being commercialized through Marble, World Labs' 3D world creation product, which has separate pricing.
| Plan | Details |
|---|---|
| Free | Free research preview accessible at rtfm.worldlabs.ai. No account or local installation required. Interactive demo available for public exploration. |
Quick Summary
RTFM (Real-Time Frame Model) is a generative world model developed by World Labs, the spatial intelligence company founded by Dr. Fei-Fei Li. It generates persistent, navigable 3D scenes from a single input image at interactive frame rates, with inference running on a single NVIDIA H100 GPU. Released as a research preview in October 2025, it is aimed at researchers, game developers, and creators who want to explore the current state of real-time generative world modeling and see what persistent AI-generated environments look like in practice. The interactive demo is publicly accessible at rtfm.worldlabs.ai.
Associated Tags
generative world model, real-time AI simulation, World Labs AI, AI 3D scene generation, persistent AI world, Fei-Fei Li AI, interactive AI environment
Practical workflows and real-world scenarios where RTFM is useful:
Exploring a generated 3D environment derived from a single photograph to evaluate what current real-time generative world models can produce from minimal input
Using the RTFM demo as a reference point for understanding the practical difference between generative world models and traditional 3D rendering pipelines
Researching World Labs' autoregressive diffusion transformer and spatial memory architecture as part of an AI or computer vision research study on generative 3D models
Demonstrating the current state of persistent real-time generative environments to stakeholders in game development, film, architecture, or simulation industries
Exploring how scene type, lighting conditions, and image characteristics in the input photograph affect the visual quality and consistency of the generated navigable world
Evaluating RTFM as a technical benchmark before assessing Marble, World Labs' commercial 3D world creation product built on the same underlying model
Reviewed by Sohail Akhtar
Lead Editor & Founder
Open-source AI world model by Decart and Etched that generates real-time Minecraft-style interactive gameplay at 20 FPS using next-frame prediction, with no traditional game engine required.
Tencent's open-source AI model that generates interactive, action-controllable game video sequences from a single image and keyboard inputs.
Multimodal AI world model by World Labs that generates persistent, navigable 3D environments from text, images, video, or 3D layouts, with in-scene editing and Gaussian splat, mesh, and video export.
Open-source AI model by Tencent that generates explorable, interactive 3D worlds from text or image inputs using panoramic scene reconstruction.