Meta Introduces Project CAIRaoke
Meta Introduces Project CAIRaoke
5:44

Meta Introduces Project CAIRaoke

Tech
Speaker 1: Hey everyone. And thanks for joining us for inside the lab. We work on a lot of different technologies here at meta everything from virtual reality to designing our own data centers. And we are particularly focused on foundational technologies that can make entirely new things possible. And today we are going to focus on perhaps the most important foundational technology of our time, artificial intelligence. We're gonna share some breakthroughs in our AI research and some of [00:00:30] the problems that we need to solve as we build for the metaverse. The kinds of experiences that you'll have in the metaverse are beyond what is possible today. It's an immersive verse of the internet, instead of just looking at something on a screen, you're gonna actually feel like you're inside or right there present with another person. And that's going to require advances across a whole range of areas from new hardware devices, to software for building and exploring worlds. Speaker 1: [00:01:00] And the key unlocking a lot of these is advances in AI. So let's take a look at some of the challenges that we are working on first, creating a new generation of assistance that will help us explore new worlds today. A lot of AI research is focused on understanding the physical world, but in the metaverse we're going to need AI that is builder on helping people navigate virtual worlds, as well as our physical world [00:01:30] with augmented reality. And because these worlds will be dynamic and always changing, AI is going to need to be able to understand context and learn in the way that humans do. And when we have go glasses on our faces, that will be the first time that an AI system will be able to really see the world from our perspective. See what we see, hear what we hear and more so the ability and expectation that we have for AI systems is going to be much higher. Speaker 1: Now we [00:02:00] are already using simpler machine learning systems to parse information for us today. Every time you get a recommendation or search for something, or even take a photo on a phone, there is machine learning in the background. Computing is also becoming increasingly contextual instead of this static experience. That's the same, no matter where you are, the way that we use computers now adapts much more to you're doing. And as devices have gotten better at understanding and anticipating what we want, they've also gotten more useful. [00:02:30] Now I expect that these trends will only increase in the future. The metaverse will consist of immersive worlds that you can create and interact with with all the visual information that includes like your position in 3d your, your body language, facial gestures, and so on. And this is all from your first person perspective. So you experience it and move through it as if you are really there. Speaker 1: And all that adds up to a lot more input to be processed [00:03:00] and a lot more content to be generated. So we're gonna need help navigating all of this efficiently. And the work that we do to build this is gonna pave the way for assistance that can move between virtual and physical worlds too. A key part of this effort is building better models for richer and deeper communication between people and AI. So today we are announcing project karaoke, which is a fully end to end neural model for building on device assistance. It combines [00:03:30] the approach behind blender bot with the latest in conversational AI to deliver better dialogue capabilities. And from there to support true world creation and exploration, we need to advance well beyond the current state of the art for smart assistance. So we are working on two areas of AI research to make this possible egocentric perception, which is about seeing worlds from a first person perspective and a whole new class of generat AI [00:04:00] models that help you create anything that you can imagine. Now here's an AI concept that we created called builder bot, which showcases this work. It enables you to describe a world and then it will generate aspects of that world for you. So let's take a look at how this works. Hey, builder bot, let's start with the scene. Let's go to a park. Speaker 1: Actually. Let's go to the beach. [00:04:30] Pretty good. Let's add some clouds, Huh? That's all AI generated. Actually. Let's add some Alto Cumulus clouds. All right. And let's add an island over there. Speaker 2: That's cool. How about we add some trees out here by the, by the sand. Let's get a picnic blanket [00:05:00] down here. Let's put up a table. Let's put a stereo. Let's get some drinks as well. Let's get the sound of some waves and seagulls. Speaker 1: Does that speaker work? Let's play some tropical music And let's [00:05:30] add a hydrofoil. You gotta have a hydrophone. Speaker 2: You gotta teach me how to ride one in VR.

Up Next

Apple Vision Pro: I Tried Apple's AR/VR Headset
vision-pro-apple-walks-through-mixed-reality-headset-design-mp4-00-00-37-04-still001.png

Up Next

Apple Vision Pro: I Tried Apple's AR/VR Headset

Apple AirPods Get Adaptive Audio
aipodspic

Apple AirPods Get Adaptive Audio

3 Google Bard AI Settings to Change, 3 Prompts to Try
230524-yt-3-settings-bard-ai-protect-yourself-v03

3 Google Bard AI Settings to Change, 3 Prompts to Try

Razr Plus and Razr 2023 Hands On: First Look at Motorola's New Foldable
razrthumb

Razr Plus and Razr 2023 Hands On: First Look at Motorola's New Foldable

Apple, Please Bring These Apple Watch Features to WatchOS 10
thumb3

Apple, Please Bring These Apple Watch Features to WatchOS 10

RedMagic 8 Pro Review: What to Know About This Lower-Priced Gaming Phone
yt-review-redmagic-8-pro-v06

RedMagic 8 Pro Review: What to Know About This Lower-Priced Gaming Phone

Apple's WWDC 2023: What We Expect
230524-clean-wwdc-what-to-expect

Apple's WWDC 2023: What We Expect

Sony PlayStation Unveils Project Q Gaming Handheld
gaming-image-cnet

Sony PlayStation Unveils Project Q Gaming Handheld

Windows 11 Gets AI Copilot
thumbcnet

Windows 11 Gets AI Copilot

Tech Shows

The Apple Core
apple-core-w

The Apple Core

Alphabet City
alphabet-city-w

Alphabet City

CNET Top 5
cnet-top-5-w

CNET Top 5

The Daily Charge
dc-site-1color-logo.png

The Daily Charge

What the Future
what-the-future-w

What the Future

Tech Today
tech-today-w

Tech Today

Latest News All latest news

WWDC 2023: Here Are All the Major iOS 17 Features
230605-clean-ios-17-walkthrough

WWDC 2023: Here Are All the Major iOS 17 Features

Apple Vision Pro: I Tried Apple's AR/VR Headset
vision-pro-apple-walks-through-mixed-reality-headset-design-mp4-00-00-37-04-still001.png

Apple Vision Pro: I Tried Apple's AR/VR Headset

New MacBook Air: Hands-On With the 15-Inch Display
hands-on-thumb-1

New MacBook Air: Hands-On With the 15-Inch Display

First Look: Apple Mac Pro and Mac Studio
macstudiomacpro-00-01-50-22-still002

First Look: Apple Mac Pro and Mac Studio

First Impressions of Apple's Vision Pro Mixed Reality Headset
apple-reveals-vision-pro-mixed-reality-headset-mp4-00-00-12-10-still001.png

First Impressions of Apple's Vision Pro Mixed Reality Headset

Apple iOS 17: Every New Feature (Supercut)
iosstill

Apple iOS 17: Every New Feature (Supercut)

Most Popular All most popular

3 Google Bard AI Settings to Change, 3 Prompts to Try
230524-yt-3-settings-bard-ai-protect-yourself-v03

3 Google Bard AI Settings to Change, 3 Prompts to Try

iOS 17 Features Apple Needs to Add for the iPhone
wwdc

iOS 17 Features Apple Needs to Add for the iPhone

How the World's Largest Metal 3D Printer Makes Rockets
3d-printed-rocket-2

How the World's Largest Metal 3D Printer Makes Rockets

Apple, Disney Partner on Vision Pro Entertainment
appledisneypic

Apple, Disney Partner on Vision Pro Entertainment

Everything Apple Announced at WWDC 2023
230605-clean-apple-wwdc-supercut-thumbnail-1

Everything Apple Announced at WWDC 2023

Apple's WWDC 2023: Clues to a Changing iPhone
pinkhair

Apple's WWDC 2023: Clues to a Changing iPhone

Latest Products All latest products

razrthumb

Razr Plus and Razr 2023 Hands On: First Look at Motorola's New Foldable

xperia1v

Review: We Tested the Cameras on the Sony Xperia 1 V

pixelfold

Pixel Fold Hands-On: A First Look at Google's First Foldable

thumbrog1

Asus ROG Ally First Look

samsung-tv-event-cnet-00-01-22-10-still001.png

Samsung's 2023 OLED TVs Challenge LG on Price, Picture

p1100354

Galaxy A54 5G: Hands-on With Samsung's New Budget Phone

Latest How To All how to videos

230524-yt-3-settings-bard-ai-protect-yourself-v03

3 Google Bard AI Settings to Change, 3 Prompts to Try

230331-yt-howto-bard-google-ai-v04

Google's Bard AI: Here's How to Get Started

bing, bing ai, bing chat

How to Get Started With Bing AI Search and Chat

car-cam-2

How to Install Ring's New Car Cam

pc-vr-5

Connect a Meta Quest 2 VR Headset to a PC

cast-2

Cast Your Meta Quest Headset to a TV, Phone or Browser