Skip to content
Pricing: Free
Rating: 4.2/5

World Labs' real-time generative world model that creates persistent, navigable 3D scenes from a single image on one H100 GPU.

Category

AI Simulation

View all AI Simulation tools
Editor-selected listing
Independent & reader-supported

Pricing

RTFM is available as a free public research preview at rtfm.worldlabs.ai with no account or subscription required. Server-side inference runs on a single NVIDIA H100 GPU operated by World Labs. The underlying technology is being commercialized through Marble, World Labs' 3D world creation product, which has separate pricing.

PlanDetails
FreeFree research preview accessible at rtfm.worldlabs.ai. No account or local installation required. Interactive demo available for public exploration.

What is RTFM?

Quick Summary

RTFM (Real-Time Frame Model) is a generative world model developed by World Labs — the spatial intelligence company founded by Dr. Fei-Fei Li — that generates persistent, navigable 3D scenes from a single input image at interactive framerates, running inference on a single NVIDIA H100 GPU. Released as a research preview in October 2025, it is designed for researchers, game developers, and creators who want to explore the current state of real-time generative world modeling and understand what persistent AI-generated environments can look like in practice. The interactive demo is publicly accessible at rtfm.worldlabs.ai.

RTFM is an autoregressive diffusion transformer model developed by World Labs that takes one or more 2D images of a scene as input and generates new viewpoint frames in real time as a user navigates through the generated environment. Unlike traditional 3D rendering pipelines or neural radiance field approaches, RTFM does not construct an explicit 3D representation of the scene — instead, it converts input frames into neural network activations forming an implicit world representation, then renders new frames by attending to that representation as the user moves. The architecture is trained end-to-end on large-scale video data. A spatial memory mechanism positions each generated frame in 3D space, allowing the model to retrieve contextually nearby frames when rendering new views — a system World Labs calls context juggling — which enables the world to persist indefinitely without dissolving as the user navigates away from the starting point. RTFM is primarily a research demonstration aimed at developers, game designers, and researchers who want to understand the current frontier of generative world models. Game developers studying AI-native world generation use it to evaluate what real-time generative environments can produce from minimal input. See top alternatives. AI and computer vision researchers use it as a reference implementation of persistent generative navigation — a capability distinct from pre-rendered video generation or static scene reconstruction. World Labs has since integrated RTFM's technology into Marble, its commercial 3D world creation product, which extends the underlying model with text, image, and panorama inputs, AI editing, and export to Unity, Unreal, Blender, and Houdini. RTFM is available as a free public research preview at rtfm.worldlabs.ai, with no account or subscription required to try the interactive demo. The model requires a single NVIDIA H100 GPU for server-side inference — users access it through the web interface rather than running it locally. World Labs raised $1 billion in funding in early 2026 to scale Marble and RTFM-derived research into game development, film, architecture, robotics, and education. The current research preview produces visually coherent scenes with correct lighting effects, reflections, and shadows, but operates within the capability constraints of a neural generative model rather than a deterministic renderer — output fidelity varies by scene type and visual complexity View alternatives.

Associated Tags

generative world model, real-time AI simulation, World Labs AI, AI 3D scene generation, persistent AI world, Fei-Fei Li AI, interactive AI environment

Key Features

Real-time 3D scene generation from a single image
Persistent world navigation without scene loss
Dynamic visual effects: reflections, shadows, gloss
Autoregressive diffusion transformer architecture
Spatial memory for consistent long-range exploration
Interactive framerates on single H100 GPU
Web-based demo with no installation required
Real Use Cases

How professionals leverage RTFM – World Labs Real-Time Generative World Model for Interactive 3D Exploration

Discover practical workflows and real-world scenarios where RTFM delivers key solutions.

01

Exploring a generated 3D environment derived from a single photograph to evaluate what current real-time generative world models can produce from minimal input

02

Using the RTFM demo as a reference point for understanding the practical difference between generative world models and traditional 3D rendering pipelines

03

Researching World Labs' autoregressive diffusion transformer and spatial memory architecture as part of an AI or computer vision research study on generative 3D models

04

Demonstrating the current state of persistent real-time generative environments to stakeholders in game development, film, architecture, or simulation industries

05

Exploring how scene type, lighting conditions, and image characteristics in the input photograph affect the visual quality and consistency of the generated navigable world

06

Evaluating RTFM as a technical benchmark before assessing Marble, World Labs' commercial 3D world creation product built on the same underlying model

Editor's Verdict

Official Review
RTFM is a technically significant research demonstration of persistent real-time generative world navigation from World Labs, showing that an autoregressive diffusion transformer can maintain scene coherence across extended exploration on a single H100 GPU — a meaningful step toward viable generative world models. As a research preview its output quality varies by scene and input, and users wanting to build on the underlying technology should review Marble, World Labs' production-stage product that extends RTFM into a full creation platform.
4.2 / 5.0
Editor Rating

Reviewed by Sohail Akhtar

Lead Editor & Founder

Pros

What we like

  • RTFM's spatial memory and context juggling architecture enables genuinely persistent world exploration — the generated environment does not reset or dissolve as the user navigates, which is a meaningful technical advance over prior generative video-based navigation models
  • Running interactive-framerate inference on a single NVIDIA H100 GPU rather than a distributed cluster is a significant efficiency milestone for real-time generative world models, making the technology more tractable for future commercial deployment
  • The free web-based research preview with no account requirement gives researchers, developers, and creators immediate access to evaluate the model's output quality and navigation behavior without any setup barrier

Cons

Limitations

  • RTFM is a research preview rather than a production tool — output fidelity varies by scene type and input image, and the model operates within the visual consistency constraints of a learned neural renderer rather than a deterministic 3D engine
  • Server-side inference on H100 GPU hardware means the experience depends on World Labs' infrastructure availability, and the demo's frame quality and responsiveness will reflect server load conditions rather than a fixed local performance baseline

Target Audience

Who should use RTFM?

AI and computer vision researchers studying generative world model architectures, persistence mechanisms, and real-time neural rendering approachesGame developers and technical directors evaluating AI-generated world models for potential integration into concept exploration and prototyping workflowsDevelopers and creators who want to experience the current frontier of real-time generative 3D environments without installing local softwareTechnology analysts and journalists covering advances in spatial AI and generative world modeling from leading research organizationsEducators and students exploring the difference between explicit 3D representations and neural implicit world generation through an accessible public demo
Free
Oasis AI Game

Oasis AI Game

Open-source AI world model by Decart and Etched that generates real-time Minecraft-style interactive gameplay at 20 FPS using next-frame prediction, with no traditional game engine required.

Free
Hunyuan GameCraft

Hunyuan GameCraft

Tencent's open-source AI model that generates interactive, action-controllable game video sequences from a single image and keyboard inputs.

Free
Marble by World Labs

Marble by World Labs

Multimodal AI world model by World Labs that generates persistent, navigable 3D environments from text, images, video, or 3D layouts, with in-scene editing and Gaussian splat, mesh, and video export.

Freemium
Hunyuan World 1.0

Hunyuan World 1.0

Open-source AI model by Tencent that generates explorable, interactive 3D worlds from text or image inputs using panoramic scene reconstruction.

Frequently Asked Questions

What is RTFM by World Labs?
RTFM (Real-Time Frame Model) is a generative world model from World Labs that creates persistent, navigable 3D environments from a single input image at interactive framerates, using an autoregressive diffusion transformer with spatial memory — available as a free public research preview.
Is RTFM free to use?
Yes. RTFM is available as a free public research preview at rtfm.worldlabs.ai with no account or subscription required. Inference runs on World Labs' server-side H100 GPU infrastructure.
How does RTFM generate persistent 3D worlds?
RTFM converts input image frames into neural network activations that implicitly represent the scene, then renders new viewpoint frames by attending to this representation. A spatial memory mechanism positions frames in 3D space, allowing the model to recall relevant views as the user navigates without losing world coherence.
What hardware does RTFM require?
RTFM's inference runs on a single NVIDIA H100 GPU on World Labs' servers. Users access the model through the web demo at rtfm.worldlabs.ai without local GPU requirements.
Who built RTFM?
RTFM was developed by World Labs, a spatial intelligence company founded by Stanford AI researcher Dr. Fei-Fei Li. It was released as a research preview in October 2025.
Who should use RTFM?
RTFM is best suited for AI researchers, game developers, and technology creators who want to explore the current frontier of real-time generative world modeling and evaluate what persistent AI-generated 3D navigation looks like in practice today.