Nvidia Unveils H100 AI Chip
Nvidia Unveils H100 AI Chip
5:35

Nvidia Unveils H100 AI Chip

Tech
Speaker 1: Today we are announcing the next generation. The engine of the world's AI computing infrastructure makes a giant leap. Introducing Envidia H 100. The H 100 is a massive 80 billion transistor chip using TSMC four N process. We designed the H 100 for scale up and scale out infrastructures. [00:00:30] So bandwidth memory, networking and ENV link chip to chip data rates are vital. H 100 is the first PCI express gen five GPU and the first HBM three GPU, a single H 100 sustains 40 Teras per second of IO bandwidth to put it in perspective, 20 H 100 S can sustain the equivalent of the entire world's internet traffic. [00:01:00] The hopper architecture is a J and leap over AMPI. Let me highlight five groundbreaking inventions. First, the H 100 has incredible performance, a new tensor processing format F P eight H 100 has four plops of F P two plops of FP 16, one pedo flops of TF, 32 60 [00:01:30] Terra, flops of FP 64 and FP 32 designed for air and liquid cooling. Speaker 1: H 100 is also the first GPU to scale in performance to 700 Watts over the past six years through Pascal Volta AMPI and now hopper, we developed technologies to train with FP 32, then FP 16, and now F P eight for AI processing [00:02:00] hopper H 100 S four plops of F P eight is an amazing six times. The performance of AMPI a 100 S FP 16, our largest generational leap ever. The transformer is unquestionably. The most important deep learning model invented hopper introduces a transformer engine. The hopper transformer engine combines a new tensor core and software that uses [00:02:30] FPA and FP 16 numerical formats and dynamically processes. Layers of a transformer network transformer model training can be reduced from weeks to days for cloud computing. Multi-tenant infrastructure translates directly to revenues and cost of service. A service can partition H 100 up to seven instances. Speaker 1: AMPI can also do this. However, hopper added complete per instance, [00:03:00] isolation and per instance, IO virtualization to support multi-tendency in the cloud. H 100 can host seven cloud tenants while a 100 can only host one. Each one is equivalent in performance to two full T four GS are most popular cloud inference, GPU. Each hopper multi-instance supports confidential computing with trusted execution environment. Sensitive data is often encrypted [00:03:30] at rest and in transit over to network, but unprotected during use data can be an AI model that results from millions of dollars of trained on years of domain knowledge or company proprietary data, and is valuable or secret hopper, confidential computing, a combination of processor, architecture and software addresses this gap by protecting both data and application during use confidential computing today is [00:04:00] only CPU hopper introduces the first GPU confidential computing hopper, confidential computing protects the confidentiality and integrity of AI models and algorithms of the owners, software developers and services can now distribute and deploy their proprietary and valuable AI models on shared remote infrastructure, protecting their intellectual property and scaling their business models. Speaker 1: And there's more hopper [00:04:30] introduces a new set of instructions called DPX designed to accelerate dynamic programming algorithms. Many real world algorithms grow with commonatorial or exponential complexity. Examples include the famous traveling salesperson optimization problems. Floyd wash for shortest route optimization used for mapping Smith, Waterman pattern matching [00:05:00] for gene sequencing and protein folding and E graph optimization algorithms, dynamic programming breaks, complex problems down to simpler sub problems that are solved recursively, reducing complexity and time to polynomial scale H 100 is the newest engine of AI infrastructures. H 100 S are packaged with HBM three memories, TSMs cos two and a half D packaging and integrated with voltage [00:05:30] regulation into a super chip module called SX M.

Up Next

What is the Fediverse?
240418-fediverse-winged

Up Next

What is the Fediverse?

The Missing Piece to Apple's Eco-Friendly Mission
240418-site-omt-the-core-problem-of-apples-green-goals-v1.jpg

The Missing Piece to Apple's Eco-Friendly Mission

Boston Dynamics Retires Its HD Atlas Robot
p1022506-00-00-01-20-still001

Boston Dynamics Retires Its HD Atlas Robot

Apple and Disney's Unique Bond: Why Vision Pro Needs the Mouse
240411-site-can-disney-save-the-apple-vision-pro-v1

Apple and Disney's Unique Bond: Why Vision Pro Needs the Mouse

The Ocean Cleanup's System 03 Collects Plastic Pollution at Record Levels
The Ocean Cleanup System 03

The Ocean Cleanup's System 03 Collects Plastic Pollution at Record Levels

Latest iOS 18 Rumor Roundup: New Designs, AI Tricks
240404-yt-omt-ios-18-siri-ai-v06

Latest iOS 18 Rumor Roundup: New Designs, AI Tricks

Apple to Talk AI in June: This WWDC Is a Big Deal
240328-yt-omt-wwdc24-v07

Apple to Talk AI in June: This WWDC Is a Big Deal

What Google Gemini AI on the iPhone Could Look Like
240321-site-apple-and-gemini-ai

What Google Gemini AI on the iPhone Could Look Like

Microsoft Surface Pro 10, Surface Laptop 6 Are Here
240320-site-microsoft-surface-pros-first-look-v2

Microsoft Surface Pro 10, Surface Laptop 6 Are Here

Everything Just Announced at Google's AI Health Event
sc-googlehealthai-00-02-29-25-still001

Everything Just Announced at Google's AI Health Event

Tech Shows

The Apple Core
apple-core-w

The Apple Core

Alphabet City
alphabet-city-w

Alphabet City

CNET Top 5
cnet-top-5-w

CNET Top 5

The Daily Charge
dc-site-1color-logo.png

The Daily Charge

What the Future
what-the-future-w

What the Future

Tech Today
tech-today-w

Tech Today

Latest News All latest news

Robosen's Megatron Transformer Is Too Much Fun for an Evil Robot
240419-megatron-v04

Robosen's Megatron Transformer Is Too Much Fun for an Evil Robot

Apple May Give FineWoven Accessories One More Season
finewoven-240424-land-00-00-13-04-still003

Apple May Give FineWoven Accessories One More Season

US vs. TikTok: What Happens Next
240424-yt-tiktok-vs-us-v04

US vs. TikTok: What Happens Next

Battle of the Humanoid Robots: MenteeBot Is Ready
240423-yt-menteebot-ai-robot-v08

Battle of the Humanoid Robots: MenteeBot Is Ready

What to Expect at Apple's May 7 iPad Event
240423-yt-apple-ipad-ipad-pro-pencil-v02

What to Expect at Apple's May 7 iPad Event

Did a Week With the Apple Watch Make Me Use My iPhone Less?
240419-site-does-having-an-apple-watch-make-me-use-my-iphone-less-4

Did a Week With the Apple Watch Make Me Use My iPhone Less?

Most Popular All most popular

First Look at TSA's Self-Screening Tech (in VR!)
innovation

First Look at TSA's Self-Screening Tech (in VR!)

Samsung Galaxy S24 Ultra Review: More AI at a Higher Cost
240123-site-samsung-galaxy-s24-ultra-review-4

Samsung Galaxy S24 Ultra Review: More AI at a Higher Cost

'Circle to Search' Lets Users Google From Any Screen
circlesearchpic

'Circle to Search' Lets Users Google From Any Screen

Asus Put Two 14-inch OLEDs in a Laptop, Unleashes First OLED ROG Gaming Laptop
asus-preces-00-00-25-11-still003

Asus Put Two 14-inch OLEDs in a Laptop, Unleashes First OLED ROG Gaming Laptop

Samsung Galaxy Ring: First Impressions
samsung-galaxy-ring-clean

Samsung Galaxy Ring: First Impressions

Best of Show: The Coolest Gadgets of CES 2024
240111-site-best-of-ces-2024-1

Best of Show: The Coolest Gadgets of CES 2024

Latest Products All latest products

Robosen's Megatron Transformer Is Too Much Fun for an Evil Robot
240419-megatron-v04

Robosen's Megatron Transformer Is Too Much Fun for an Evil Robot

Battle of the Humanoid Robots: MenteeBot Is Ready
240423-yt-menteebot-ai-robot-v08

Battle of the Humanoid Robots: MenteeBot Is Ready

2025 Audi Q6, SQ6 E-Tron: Audi's Newest EV Is Its Most Compelling
cnet-audiq6

2025 Audi Q6, SQ6 E-Tron: Audi's Newest EV Is Its Most Compelling

Hands-On with Ford's Free Tesla Charging Adapter
pic3

Hands-On with Ford's Free Tesla Charging Adapter

Nuro R3 is an Adorable Self-Driving Snack Bar
240320-site-nuro-r3-first-look-v1

Nuro R3 is an Adorable Self-Driving Snack Bar

First Look: The $349 Nothing Phone 2A Aims to Brighten Your Day
240304-site-nothing-phone-2-first-look-v3

First Look: The $349 Nothing Phone 2A Aims to Brighten Your Day

Latest How To All how to videos

Tips and Tricks for the AirPods Pro 2
airpods-pro-2

Tips and Tricks for the AirPods Pro 2

How to Watch the Solar Eclipse Safely From Your Phone
screenshot-2024-04-03-at-15-47-11.png

How to Watch the Solar Eclipse Safely From Your Phone

Windows 11 Tips and Hidden Features
240311-site-windows-11-hidden-tips-and-tricks-v2

Windows 11 Tips and Hidden Features

Vision Pro App Walkthrough -- VisionOS 1.0.3
VisionOS 1.0.3

Vision Pro App Walkthrough -- VisionOS 1.0.3

Tips and Tricks for the Galaxy S24 Ultra
240216-site-galaxy-s24-ultra-tips-and-hidden-features-2

Tips and Tricks for the Galaxy S24 Ultra

TikTok Is Now on the Apple Vision Pro
tiktok-on-vision-pro-clean

TikTok Is Now on the Apple Vision Pro