A quick unofficial Apple Music Spatial Audio FAQ

So, what is it?

Music mixed in Dolby Atmos.

So, like, surround sound?

Yes, with an asterisk we’ll come back to. Most surround systems use multiple channels: the original Dolby Surround used four (left, right, center, and rear), then moved to five (splitting the rear into left rear and right rear), and a few Even More Channels variants. Dolby Atmos, though, doesn’t have channels. Instead, it assigns audio tracks to “audio objects,” which have three-axis positions in virtual space. Each object has metadata that says, “this object should sound like it’s coming from a point this far between the left and right walls, this far between the rear and front walls, and this far between the floor and ceiling.” When you play the Atmos soundtrack, the decoder—called a “renderer” by Dolby—knows how many speakers it has available in the room and their positions within 3-D space, and it maps the audio objects onto specific speakers at specific values. It’s remixing the audio to match the playback environment on the fly, with as high fidelity to the original audio object positions as possible.

That’s pretty cool! What about, uh, not movie theaters?

The home version of Atmos works essentially the same way, just mapping to fewer speakers—typically just five, although you can get fancy and go up to eleven.

Is that a Spinal Tap reference?

If you want.

So how do you get all that fancy three-dimensional positioning with only two speakers, or a pair of headphones?

Congratulations, you’ve found that asterisk I said I was going to come back to!

Yay! Explain.

For headphones or any other two-channel system, Atmos positional data has to be down-mixed into two-channel audio. If you want to hear Atmos music with the highest fidelity to the recording, you’re going to need to play it over a system with at least four physical speakers. As far as I know, the only way to do that with Apple Music is to use an Apple TV box: it outputs Dolby Atmos over HDMI.

So it’s all a lie!

Well, not so fast. What Dolby Atmos does for Headphones is render binaural audio. Binaural audio is, per Wikipedia, “a method of recording sound that uses two microphones arranged with the intent to create a 3-D stereo sound sensation for the listener of actually being in the room with the performers or instruments.” Basically, the theory is that we only have two ears, yet we can clearly hear when sounds are up, down, front, or back, not just left or right. In theory, two channels should be enough to capture full positional information if we record the sound properly.

So: when there’s only two speakers to output to, the Atmos renderer takes each Atmos audio object and calculates volume and left/right panning settings to synthesize a binaural mix. All the positional information is still used.

Is it better stereo, or just different?

That’s hard to answer. First off, you may not hear much difference between binaural stereo and normal stereo. I hear what audio nerds call a “wider soundstage,” but I rarely think sounds are in front of me or behind me. Occasionally, I get that “enveloping sound” sensation they talk about, but I can also get that from a good normal stereo recording. In my experience, I’m more likely to get it from a pair of great full-range speakers than from headphones. From what I’ve read, I’m not an outlier here; the truth is that most people just don’t have great speakers.

That’s very audiophile snob of you.

Sorry.

You said “first off.” Second?

Second, the original multichannel Atmos mix just might not be as good, at least subjectively, as the stereo mix. Tracks are definitionally going to be placed differently, and you might find some are overemphasized or underemphasized to your tastes. This gets even more complicated if you’re listening to headphones, because you’re relying on Apple’s renderer to synthesize the binaural mix. (Atmos lets music producers include cues on how to make the best binaural version of an Atmos mix, but apparently Apple doesn’t use that, because as we all know, Apple Knows Best.)

So is this an Apple exclusive thing? Apple talks about it like they’re doing amazing things nobody else has.

Apple talks about everything that way. Atmos Music has been a thing since at least 2017, and TIDAL and Amazon Music already stream it. When Apple says they’ve been working with studios on this, I have a suspicion that what they mean is “Dolby has been working with studios on this.” I haven’t seen any indication yet that Apple’s getting exclusive tracks and deals.

Huh. So what does any of this have to do with AirPods “Spatial Audio”?

Apple seems to be using that to describe two unrelated things:

So (Atmos) Spatial Audio isn’t (Apple) Spatial Audio.

Right. Except when it is.

Argh. What?

If you’re using an Apple TV box, then you get the actual Atmos mix. If you want to listen to the synthesized binaural audio on non-Apple headphones, though, you’ll need to go into the Music app’s settings and set Atmos to “Always On” rather than “Automatic.” Apple Music warns that isn’t “supported” on all speakers, but I would read that as a warning that it may not sound good on all speakers—remember, the Atmos binaural rendering happens in Apple Music itself. It should be fine on any decent set of headphones, though.

What about two external speakers?

I wouldn’t, but if you want to, go nuts.

So is this really the future of music?

If anyone can finally make multichannel audio mainstream, it’s Apple, but that’s a big if—the consumer audio market has been rejecting multichannel audio for going on five decades. In its favor: multi-speaker setups have become very common for home theater systems, which they weren’t when SACD and DVD-Audio tried 25 years ago (and quadraphonic sound tried 25 years before that). Against it: a lot of these systems probably won’t actually sound that great with multichannel music. Movies and TV shows mostly just send dialogue to the center speaker and effects to surround speakers, so they don’t have to be “full-range.” But they do with multichannel music. So it’s quite possible that normal stereo tracks will sound better—not “you must be a golden-eared audiophile” better, but “obvious to even the most casual of listeners” better—on most home theater systems. And I’m dubious whether binaural downmixes for headphones will prove sufficiently compelling on their own.

(Updated in April 2024)

Back to Articles