Converge Digest

NVIDIA’s Physical AI Dataset Aims to Boost Robot and AV Model Training

March 20, 2025

NVIDIA has released a large, open-source dataset to support the development of physical AI systems, including robotics and autonomous vehicles (AVs). Announced at the NVIDIA GTC conference in San Jose, the Physical AI Dataset is available on Hugging Face and provides 15 terabytes of data, featuring more than 320,000 robotics trajectories and up to 1,000 Universal Scene Description (OpenUSD) assets. NVIDIA plans to expand the dataset to include data supporting end-to-end AV development, with 20-second traffic scenario clips from over 1,000 U.S. cities and multiple European countries.

The dataset is intended to accelerate pretraining and post-training for AI models used in applications like warehouse robotics, humanoid surgical assistants, and AVs navigating complex traffic conditions. NVIDIA said the dataset will also feed into its existing platforms, including Cosmos, DRIVE AV, Isaac, and Metropolis. Research institutions such as the Berkeley DeepDrive Center, Carnegie Mellon Safe AI Lab, and UC San Diego’s Contextual Robotics Institute are early adopters.

NVIDIA noted that physical AI models need large, diverse datasets to become robust, and that collecting and curating such data is often cost-prohibitive, especially for smaller organizations. The company said the dataset’s scale is also intended to support safety research. Tools such as NVIDIA NeMo Curator speed up the processing of large video datasets, and developers can use NVIDIA’s Isaac GR00T workflow to generate synthetic robot manipulation data.

• NVIDIA launched a 15TB open-source dataset for physical AI development.

• Dataset includes 320,000+ robotics training trajectories and up to 1,000 OpenUSD assets.

• Planned autonomous vehicle expansion will cover more than 1,000 U.S. cities and multiple European countries.

• Supports NVIDIA Cosmos, DRIVE AV, Isaac, and Metropolis platforms.

• Early adopters: UC Berkeley, Carnegie Mellon, UC San Diego labs.

• Enables faster AI model training for safety-critical applications.

• NVIDIA NeMo Curator can process 20 million hours of video in two weeks on Blackwell GPUs.

• Dataset is available now on Hugging Face.

“We can do a lot of things with this dataset, such as training predictive AI models that help autonomous vehicles better track the movements of vulnerable road users like pedestrians to improve safety,” said Henrik Christensen, director of robotics and AV labs at UC San Diego.

Jim Carroll

Editor and Publisher, Converge! Network Digest, Optical Networks Daily - Covering the full stack of network convergence from Silicon Valley

© 2026 Converge Digest - A private dossier for networking and telecoms.