Why the Decoder Sounds the Way It Does – Methodological Context

2026-03-21
3-minute read
intermediate

Level: Intermediate | Audience: Composer, student, technically curious reader.

The basic principle: field instead of objects

B-format describes a sound field, not individual objects or channels. This field consists of spherical harmonics that describe sound in all directions of space at the same time. The decoder projects this field onto a concrete loudspeaker arrangement without changing the field itself.

This distinguishes Ambisonics fundamentally from vector-based approaches such as VBAP or ALLRAD, where individual direction vectors are distributed to nearby loudspeakers. There the field is broken into objects; here it remains intact as a whole.

What spherical harmonics have to do with it

Spherical harmonics divide the surface of a sphere into directional components, much like Fourier series divide a time curve into frequency components. Each order refines directional resolution:

0th order (W): omnidirectional — uniform energy in all directions
1st order (X, Y, Z): axis-oriented — front/back, left/right, up/down
Higher orders: sharper localisation, more channels

A 3rd-order B-format signal (HOA3) contains 16 channels and localises sources much more precisely than 1st order (4 channels). The Ambisonics Order in the decoder therefore has to match the source material.

Rotation invariance

B-format is rotation-invariant: the sound field can be rotated, mirrored, or transformed without having to be re-rendered, because spherical harmonics form a complete orthogonal basis that does not lose information under rotation. With object-based formats such as Dolby Atmos bed-plus-object workflows, this is not possible in the same direct way.

Geometric accuracy and psychoacoustic coherence

Geometry: Loudspeaker coordinates enter directly into the decoder matrix. Wrong angles or distances cause deviations in the spatial image. Measured setups therefore sound better than guessed ones.

Weighting: The Basic, In-Phase, and Max-rE schemes control how strongly different orders contribute to loudspeaker distribution. Max-rE balances energy focus and localisation sharpness and is the right starting point for most setups. In-Phase is often more stable under difficult listening conditions, but less precise in localisation.

What this means in practice

When setting up a new system, this order is a good rule:

Order first — match it to the source material and loudspeaker density
Geometry second — enter positions as accurately as possible
Weighting last — Max-rE as the default start, In-Phase for irregular arrays
Scaling when needed — for delay compensation with unequal distances

The decoder does not make artistic decisions. It calculates exactly what you tell it to calculate.

Why this matters for composers

This theoretical distinction has direct practical consequences:

You compose into a field, not into a fixed loudspeaker layout.
The same B-format work can survive across different playback systems.
Decoder decisions shape monitoring, but they do not redefine the work itself.
Weightings change how clearly or how stably spatial gestures are perceived.

In practice, this means: keep the B-format Master stable, choose decoder settings consciously, and evaluate key passages on the real target setup whenever possible.