X

Otter's free app brings voice transcripts into the AI age

Otter's artificial intelligence absorbs recorded conversations and churns out transcripts that identify speakers, suggest keywords and offer text search.

Joan E. Solsman Former Senior Reporter
Joan E. Solsman was CNET's senior media reporter, covering the intersection of entertainment and technology. She's reported from locations spanning from Disneyland to Serbian refugee camps, and she previously wrote for Dow Jones Newswires and The Wall Street Journal. She bikes to get almost everywhere and has been doored only once.
Expertise Streaming video, film, television and music; virtual, augmented and mixed reality; deep fakes and synthetic media; content moderation and misinformation online Credentials
  • Three Folio Eddie award wins: 2018 science & technology writing (Cartoon bunnies are hacking your brain), 2021 analysis (Deepfakes' election threat isn't what you'd think) and 2022 culture article (Apple's CODA Takes You Into an Inner World of Sign)
Joan E. Solsman
3 min read
Mark Licea/CNET
Watch this: Otter's app makes free voice transcriptions easy and free

If you don't hate transcribing, it's probably just because you don't have to do it very much. 

Otter, a new, free mobile app from a team of vets from Google and speech-recognition company Nuance, aims to make voice transcriptions become as easy and accessible as typing into a Google Doc. 

Voice is an obsession of tech giants right now. But companies like Amazon , Google and Apple are mostly zeroed in on voice-command assistants like Alexa or Siri , premised on the bet that voice interaction will become the next stage of computing. Getting less attention, though, are voice transcriptions -- tech that takes a recording of people talking and turns it into text -- even though that kind of technology could be transformative for people across different needs and professions.  

Most transcription apps or services fall into two buckets. If they're free, they're not often accurate. And those that give you clean transcriptions are usually expensive. That's because voice-to-text is technologically tricky, and most services that provide you with accurate transcripts usually need a human to review the recording. 

Otter, which debuted this week at Mobile World Congress , aims to make transcripts not only free and accurate but also smart. 

An iPhone shows a transcription in the Otter app

Otter is a free mobile app aiming to make transcription simple and easy.

Mark Licea/CNET

"This is a perfect time," said Sam Liang, CEO and founder of AISense, the company behind Otter. "With AI tech and deep learning in the last few years, the accuracy for speech recognition has improved dramatically. A few years ago, this system wouldn't be usable."

To use Otter, you tap a microphone icon to begin a recording, and almost immediately, a live raw transcript of what you're saying begins to unspool in front of you. It's after that recording ends that Otter's artificial intelligence can really do its work. After processing, the cleaned-up transcript separates speakers as they take turns talking. As part of your sign-up process, Otter takes a "voiceprint" of you by asking you to read a five-paragraph statement so it can learn your voice and specifically identify you next to the passages you spoke. 

Otter's transcripts also are searchable, not only within one transcript but across all your stored recordings. It automatically generates keywords you can tap to search. You can form teams in Otter, and content can be shared within the Otter app with individuals or team members. You can also send anyone a link to a transcript viewable on the web. 

In one neat feature, the text and audio are synchronized when you play back the recording, so if you tap on any part of the transcript, the audio will jump right to that place for quick accuracy checks. During an audio playback, the app also highlights each word as it's spoken. 

It's not a perfect transcription. Punctuation is sometimes out of whack. It misidentifies some words -- I told Otter, "Hi, I'm Joan" and my readout picked up my name as John, for example. And it's less successful in a crowded environment or during cross-talk. And the app, which was released publicly Monday, is somewhat buggy in its early days. Once when I opened it, all my recordings and their transcripts from the previous day were missing. Restarting the app restored them, but you may notice other bugs. 

And it lacks some tantalizing capabilities, like importing a previously recorded conversation. The app and its baseline transcription service is and will remain free, but the company plans to add a subscription tier later that unlocks extra utilities, along the same lines as Evernote's business model. 

Its privacy relationship with your materials is similar to that of an Evernote, too. Otter is confidential and encrypted. And the company isn't building an ad-based business with Otter, so it's not interested in peeking into your materials so it can create a profile that will target ads to you, the company said. 

Otter is available for devices running Google's Android system and on Apple mobile products.

Tech Enabled: CNET chronicles tech's role in providing new kinds of accessibility.

'Alexa, be more human': Inside Amazon's effort to make its voice assistant smarter, chattier and more like you.