Meta Introduces Project CAIRaoke

Feb 23, 2022

Speaker 1: Hey everyone, and thanks for joining us for Inside the Lab. We work on a lot of different technologies here at Meta, everything from virtual reality to designing our own data centers. And we are particularly focused on foundational technologies that can make entirely new things possible. And today we are going to focus on perhaps the most important foundational technology of our time: artificial intelligence. We're gonna share some breakthroughs in our AI research and some of [00:00:30] the problems that we need to solve as we build for the metaverse. The kinds of experiences that you'll have in the metaverse are beyond what is possible today. It's an immersive version of the internet. Instead of just looking at something on a screen, you're gonna actually feel like you're inside or right there present with another person. And that's going to require advances across a whole range of areas, from new hardware devices to software for building and exploring worlds.
Speaker 1: [00:01:00] And the key to unlocking a lot of these is advances in AI. So let's take a look at some of the challenges that we are working on. First, creating a new generation of assistants that will help us explore new worlds. Today, a lot of AI research is focused on understanding the physical world, but in the metaverse we're going to need AI that is built around helping people navigate virtual worlds, as well as our physical world [00:01:30] with augmented reality. And because these worlds will be dynamic and always changing, AI is going to need to be able to understand context and learn in the way that humans do. And when we have glasses on our faces, that will be the first time that an AI system will be able to really see the world from our perspective: see what we see, hear what we hear, and more. So the ability and expectation that we have for AI systems is going to be much higher.
Speaker 1: Now we [00:02:00] are already using simpler machine learning systems to parse information for us today. Every time you get a recommendation or search for something, or even take a photo on a phone, there is machine learning in the background. Computing is also becoming increasingly contextual. Instead of this static experience that's the same no matter where you are, the way that we use computers now adapts much more to what you're doing. And as devices have gotten better at understanding and anticipating what we want, they've also gotten more useful. [00:02:30] Now I expect that these trends will only increase in the future. The metaverse will consist of immersive worlds that you can create and interact with, with all the visual information that includes, like your position in 3D, your body language, facial gestures, and so on. And this is all from your first-person perspective. So you experience it and move through it as if you are really there.
Speaker 1: And all that adds up to a lot more input to be processed [00:03:00] and a lot more content to be generated. So we're gonna need help navigating all of this efficiently. And the work that we do to build this is gonna pave the way for assistants that can move between virtual and physical worlds too. A key part of this effort is building better models for richer and deeper communication between people and AI. So today we are announcing Project CAIRaoke, which is a fully end-to-end neural model for building on-device assistants. It combines [00:03:30] the approach behind BlenderBot with the latest in conversational AI to deliver better dialogue capabilities. And from there, to support true world creation and exploration, we need to advance well beyond the current state of the art for smart assistants.
So we are working on two areas of AI research to make this possible: egocentric perception, which is about seeing worlds from a first-person perspective, and a whole new class of generative AI [00:04:00] models that help you create anything that you can imagine. Now here's an AI concept that we created called Builder Bot, which showcases this work. It enables you to describe a world, and then it will generate aspects of that world for you. So let's take a look at how this works. Hey, Builder Bot, let's start with the scene. Let's go to a park.
Speaker 1: Actually, let's go to the beach. [00:04:30] Pretty good. Let's add some clouds. Huh, that's all AI generated. Actually, let's add some altocumulus clouds. All right. And let's add an island over there.
Speaker 2: That's cool. How about we add some trees out here by the, by the sand. Let's get a picnic blanket [00:05:00] down here. Let's put up a table. Let's put a stereo. Let's get some drinks as well. Let's get the sound of some waves and seagulls.
Speaker 1: Does that speaker work? Let's play some tropical music. And let's [00:05:30] add a hydrofoil. You gotta have a hydrofoil.
Speaker 2: You gotta teach me how to ride one in VR.
