Physical AI

NVIDIA Cosmos

An open platform for physical AI with world foundation models (WFMs), video data processing libraries, video evaluation, and post-training frameworks.

Cookbook   |   Documentation   |   Discord

World Foundation Models

Open Models for World Generation and Understanding

Cosmos Predict

Leading world generation model, adaptable to any physical AI task or environment.

Generate 30s predictive video worlds from text, image, or video with 2B/14B models, or post-train on your data to create custom edge cases, closed-loop policies, and multiview, robot-centric simulations.

Cosmos Transfer

Multicontrol model for simulation to photoreal transformation.

Pair with physical AI simulation frameworks, such as CARLA or NVIDIA Isaac Sim™, to accelerate synthetic data generation across various environments and lighting conditions.

Cosmos Reason

Leading vision language model (VLM) enabling robots and vision AI agents to reason like humans.

Combines prior knowledge, physics, and common sense for real-time alerts and actionable insights across public safety, traffic monitoring, logistics, quality inspection, and physical AI.

Data Processing and Evaluation

Speed up efficient dataset processing and evaluation.

NVIDIA NeMo Curator

Cosmos Curator

Quickly filter, annotate, and deduplicate large amounts of sensor data with Cosmos Curator.

Cosmos Dataset Search

Instantly query datasets and retrieve scenarios with NVIDIA Cosmos Dataset Search (CDS).

Cosmos Evaluator

Review and score generative video outputs at scale using Cosmos Evaluator.

Use Cases

How Cosmos Accelerates AI Across Industries

Use Cosmos WFMs to simulate, reason, and generate data for downstream pipelines in robotics, autonomous vehicles, and industrial vision systems.

Robot Learning

Build custom world models for downstream tasks, environments, camera or sensor layouts, and policies.

  • Post-train Cosmos Predict for robot-specific views or control policies
  • Generate synthetic data across environments and lighting conditions with Cosmos Transfer
  • Post-train Cosmos Reason using the Cosmos RL framework to build vision-language-action (VLA) models
  • Create an end-to-end synthetic data augmentation and evaluation pipeline using the Physical AI Data Factory Blueprint built on Cosmos
Train robot policies in simulation

Starting Options

Get Started With NVIDIA Cosmos

1

Ready to build? Access open models and code directly.

2

Not ready to build yet? Try Cosmos models in our hosted catalog.

3

 Need help? Start quickly with our hands-on model recipes.

Trustworthy AI

Supporting the Physical AI Community

Cosmos models, guardrails, and tokenizers are available on Hugging Face and GitHub, with resources to tackle data scarcity in training physical AI models.

AI Infrastructure

Get the Best Performance With NVIDIA Blackwell

NVIDIA RTX PRO 6000 Blackwell Series Servers accelerate physical AI development for robots, autonomous vehicles, and AI agents across training, synthetic data generation, simulation, and inference.

Unlock peak performance for Cosmos world foundation models on NVIDIA Blackwell GB200 for industrial post-training and inference workloads.

NVIDIA GB200 Grace Blackwell Superchip

Ecosystem

Adopted by Leading Physical AI Innovators

Model developers from the robotics, autonomous vehicles, and vision AI industries are using Cosmos to accelerate physical AI development.

1X Technologies logo
Agile Robots logo
Agile Robots logo
Agility Robotics logo
Ambient AI
Avathon
Carla
Centific
Field AI
Figure AI logo
Foretellix logo
Galbot logo
Gatik
General Motors logo
Hexagon
IntBot logo
Inverted AI
Li Auto
Linker Vision
Magna
Mentee Robotics
Milestone Systems
Neura Robotics logo
Nexar
Oxa
Parallel Domain
Plus
Skild AI logo
Toyota Research Institute
Uber logo
VAST Data
Virtual Incision logo
VorWerk logo
Voxel51
Wistron logo
X-humanoid

Next Steps

Join the Cosmos Community

Connect with Cosmos experts, engage with fellow developers, provide model feedback, and access continued learning through livestreams and recipes.

Cosmos Cookbook

A comprehensive guide for working with the NVIDIA Cosmos ecosystem for real-world, domain-specific applications across robotics, simulation, autonomous systems, and physical scene understanding.

Build Video Analytics AI Agents

Use Cosmos Reason with NVIDIA Blueprint for video search and summarization (VSS) to build AI agents for scalable, real-time video understanding.

Resources

The Latest From Cosmos Developers

Frequently Asked Questions

Select Location
Middle East