• On MovieTome: See the TRAILER for TERMINATOR 4!
October 31, 2007 1:44 PM PDT

How digital sound works

Posted by Matt Rosoff
  • Font size
  • Print

Speaking of Zeroes and Ones...

Among audiophiles, the analog vs. digital debate rages without end. I, like a lot of other musicians and music fans, have my own preferences--I own many more LPs than CDs, and have paid dearly to record some of my bands' music onto 2-inch tape instead of direct to hard drive. But included in those preferences are some preconceptions. You've heard it before: digital music sounds "colder" or "cleaner" or "more sterile" because it's delivering a stream of 0s and 1s, instead of a pure sound wave. Or something like that.

Audio professionals don't use terms like these, largely because they're subjective and imprecise, and sometimes inaccurate. Recently, one of these professionals presented the best explanation of analog vs. digital sound that I've ever heard. Here's a super-condensed version of an already simplified explanation.

A pure tone can be represented as a perfect sine wave. Each point on the sine wave has two values, height (volume) and its point in time. Looking at the overall wave, the distance between the top and bottom of that wave is the volume of that tone. The distance between one peak and the next is the frequency, or pitch, of that tone.

But pure tones don't occur in nature. Think of a roomful of people all saying their names at the exact same time. At any given moment, there are dozens of individual voices, all at different pitches and volumes.

Imagine a microphone recording this incident: at any given moment, the diaphragm of that microphone can only be at one position. So, at each moment, it's taking the average of all the frequencies and volumes of all the noises in the room and presenting a value for it. A stream of such values, over time, can be charted as on a graph. But instead of a perfect sine wave, it appears as a complicated squiggly line. There's only one value at any given moment, but over the course of a second or so, the changes in that average give you the overall character of the sound, which your ear interprets as a group of people saying their names. Or, in a more relevant example, a group of musicians playing instruments.

In recording, the question is: how do you capture that changing value? Tape uses tiny magnetizable bits of metal. Very roughly, the more of those pieces aligned in the same direction, the higher the value. (Remember, we've taken time out of the equation because this is one point in the tape, so all that's left to measure is the value itself.) Even if there's no sound, the metal pieces are still passing by the head, which creates tape noise. Other types of noise arise from irregularities in the surface of the tape (modulation and asperity), or from trying to force more signal onto the tape than it can handle (oversaturation). For various reasons--physics and years of conditioning--these types of noise can sound acceptable, or even desirable, to many people.

In digital recording, a software program takes samples of that sound. If you imagine the graph of the squiggly line, it's plotted against a theoretical top (the highest value the program can record) and bottom (the lowest possible value). The program asks a series of questions to determine the value of the sound at the moment it's taking the sample. The first question: is the value above or below the halfway point? Let's say it's above half, which the program records as a "1." The second question: if you cut that first half into half again, would the value be higher or lower? Above=1, below=0. And so on, until you've narrowed it enough to come up with a number that's close to the real value.

Bitrate represents the number of questions that the program asks at each moment. So, 16-bit, which is standard CD-quality sound, asks 16 questions, and each value is represented by a 16-digit binary number, something like 0100010101101111. Some professional audio programs record at 24-bit, which lets it ask eight more "above or below" questions per moment, making it 256 times more accurate.

Sample rate represents the number of samples the program takes in a second. There's a hard and fast rule (the Nyquist theorem) stating that the sample rate must be more than twice as high as the highest note you want to record. If a note that's higher than that frequency is allowed to enter the sound stream, it will create a false frequency, leading to harmonic distortion. While some forms of analog distortion sound good to some ears, digital distortion never does. So, the programs use a filter to make sure that these high notes aren't recorded.

Standard CD sound is 44.1KHz, or 44,100 samples per second, which means the highest note it can record is about 20kHz. Most professional audio programs are capable of recording at 48kHz, and 96kHz isn't unheard of.

After all these samples are taken, you end up with a very fine stairstep wave, which is then fed through various programs to smooth it out and a digital-to-analog converter to translate it back into an electrical signal form that audio equipment can then play through a speaker or headphone.

Analog lovers might argue that, although the human ear cannot hear tones above about 20KHz (or 22KHz, or even 48KHz), those tones nonetheless affect the character of the overall signal at any given moment. Therefore, eliminating those tones makes the tone "inaccurate." Digital lovers might respond that there are far more opportunities for distortion in analog forms of recording and playback.

Matt Rosoff is an analyst with Directions on Microsoft, where he covers Microsoft's consumer products and corporate news. He's written about the technology industry since 1995, and reviewed the first Rio MP3 player for CNET.com in 1998. He is a member of the CNET Blog Network. Disclosure.
Recent posts from Digital Noise: Music and Tech
Yes, Apple should sell a $99 iPhone
CBS adds Launchcast to its online radio arsenal
MSN Unsigned seems half-hearted
McCartney's freak-folk goes on sale
Sampling 'Chinese Democracy'
Sounds like the Storm isn't much of a music phone
Byrne/Eno succeed in cutting out the middleman
Zune Pass adds 10 permanent downloads per month
Add a Comment (Log in or register) 2 comments
it's all an illusion...
by Mick O November 8, 2007 12:48 AM PST
I find it a bit absurd to say that an analog recording is more accurate than a
digital. There is no such thing as an accurate recording! Put a microphone in
front of an instrument, and that's it, you've changed its sound forever, long
before it gets onto tape, hard disc, or whatever. At every stage in the
recording process, the original signal is being manipulated, tweaked, and
distorted. Every inch of cable, every mixing desk, tape machine, A/D
converter, effect unit, and so on, changes what we hear in the end.

Of course analog and digital sound different; they're completely different
methods of recording. But any two tape machines will sound different, as will
any two CD players, speakers, amplifiers, etc. Even supposedly identical units
will sound different! By the time the music hits your ear, the original is long
gone, and you're left with an approximate facsimile, no matter how it was
recorded, or reproduced.
Reply to this comment
by OStrolphant December 25, 2007 9:39 AM PST
I have never had the 16-bit part of a recording so I understood it. coolio! Now I just don't understand the benefits of "1-bit sound"? what's the benefit of that?
Reply to this comment
advertisement

In the news now

Slowing expectations at a green-tech start-up

Six months ago, biofuels start-up Mascoma had the wind in its sails, as did the rest of the clean-tech sector. Now, the company is treading carefully and scaling back.


With JavaFX, Sun seeks new coders, new revenue

With the launch of JavaFX 1.0, Sun is trying to reclaim Java's strength as a foundation for rich Internet applications. But it's no longer the incumbent.


Tim Lincecum, motion capture star

San Francisco Giants pitcher, who won the Cy Young award last month, dons a motion capture suit for 2K Sports' Major League Baseball 2K9 video game.


Resource center from CNET News sponsors
Business. Ready.
Sony VAIO® Professional PCs.

Click Here!
A new grade in mobility demands a new kind of notebook. And Sony delivers.Tough, portable and featuring up to 7.5 hours of battery life! VAIO® Professional notebooks are built for business. Learn more.

Click Here!
Built tough for business.

Learn more about the rigorous quality testing Sony puts its notebooks through.

Protect your investment.

Find out why VAIO® tech support recently won a Laptop Editors' Choice Award, July 2008.

Long battery life.

Up to 7.5 hours of battery life! See how VAIO® PCs will keep you productive longer when on the road.

Travel light

Check out our ultraportable line-up, starting at 2.87 lbs.

PCs for every need.

Find out which VAIO® notebook is right for you.

About Digital Noise: Music and Tech

Matt Rosoff is an analyst with Directions on Microsoft, where he covers Microsoft's consumer products and corporate news. He's written about the technology industry since 1995 and reviewed the first Rio MP3 player for CNET.com in 1998. He's also a bass guitarist and an avid collector (and digitizer) of LP records. DISCLAIMER: This blog contains the personal opinions of the author and does not necessarily represent the opinions of his employers or of CNET Networks. As an IT industry analyst, the author occasionally agrees to nondisclosure agreements from Microsoft or other companies, and he will not violate the terms of such agreements on this blog.

He is a member of the CNET Blog Network and is not an employee of CNET.

Disclosure.

Add this feed to your online news reader

Digital Noise: Music and Tech topics

advertisement
advertisement

Inside CNET News

Scroll Left Scroll Right