Frequency
→ Pitch
physical vs. perceived height of tone
Amplitude
→ Loudness
physical vs. perceived strength of sound
Human hearing is nonlinear, and sensitivity changes with frequency and sound level. We are most sensitive to the range of speech perception.
Two tones with identical physical amplitude can be perceived as having different loudness:
Frequency weighting curves apply frequency-dependent filters to sound level measurements to approximate human hearing sensitivity at different SPLs.
A-weighting (dBA) approximates the inverse of the 40-phon equal-loudness contour and is the standard for audio engineering, environmental noise, and equipment specifications.
Phon: The unit phon is derived from equal-loudness contours (isophones) and corresponds to the sound pressure level (SPL) in dB of a 1 kHz sine tone that is perceived as equally loud as a sound of any other frequency.
Sone: The sone is a unit of perceived loudness where 1 sone corresponds to the loudness of a 1 kHz tone at 40 dB SPL. The loudness in sones doubles for every 10 dB increase in SPL, reflecting the logarithmic nature of loudness perception.
| Phon | 40 | 50 | 60 | 70 | 80 | 90 | 100 | 110 | 120 | 130 | 140 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Sone | 1 | 2 | 4 | 8 | 16 | 32 | 64 | 128 | 256 | 512 | 1024 |
Understanding psychoacoustics also has practical consequences for health and safe listening practices
Safe exposure levels:
→ Noise-induced hearing loss is permanent. Hair cells do not regenerate!
Frequency: Physical property of sound wave (Hz)
Pitch: Subjective perception, ordering sounds low to high
Many pitch-related illusions arise from the interaction of place and temporal coding:
Place theory:
Temporal theory:
→ Neurons fire at a lower rate than the maximum audible frequency.
The "missing fundamental" of a complex tone is a psychoacoustic phenomenon where the brain perceives a fundamental frequency even when it is absent from the actual sound signal:
repetition patterns of higher harmonics (periodicity)
Application: pseudo low frequency psycho-acoustic sensation (MaxxBass)
Missing fundamental
The auditory system functions as a series of bandpass filters, modeling the frequency-selective response of the basilar membrane.
Equivalent Rectangular Bandwidth (ERB) approximates auditory filter bandwidth:
where
→ ERB increases with frequency—auditory filters become broader at higher frequencies.
The just noticeable difference (JND) is the smallest change in a sound property that can be reliably detected by human listeners.
The JND for loudness is approximately 1 dB SPL.
The JND for pitch varies with frequency:
→ The human auditory system can perceive approximately 1400 distinct pitch steps across the hearing range.
The Bark scale, named after Heinrich Barkhausen, is a psychoacoustic scale that reflects the spatial frequency mapping of the basilar membrane.
The Mel scale models perceived pitch.
Auditory masking is a psychoacoustic phenomenon where the perception of one sound is affected by the presence of another, making the target sound less audible, depending on factors like frequency, intensity, and temporal proximity.
The degree of masking is defined as the difference between the masked threshold and the unmasked threshold.
A strong sound in one frequency band masks weaker sounds in nearby frequencies. (1kHz example).
1 kHz masker increases hearing thresholds
A loud sound masks a softer one that occurs before (pre-masking) or after (post-masking) it within a short time window.
Sounds close in frequency share the same critical band, making one mask the other.
In mixing:
Critical bands are ranges where tones influence each other during perception.
This interaction explains effects such as:
→ All effects result from limited frequency resolution of the auditory system (critical bands).
Perceptual audio codecs (MP3, AAC, Opus) use psychoacoustic masking to reduce file size without perceptible quality loss:
→ Psychoacoustic models allow a reduction from ~50 MB to ~5 MB without audible loss.
Interference of sound waves with slightly different frequencies.
Two sound waves with slightly different frequencies interfere, causing a cyclical change in volume known as beating. This beat frequency equals the absolute difference between the two frequencies.
Used for tuning instruments.
Interference of sound waves with slightly different frequencies.
Auditory illusion from two slightly different frequencies presented dichotically (one to each ear).
Roughness occurs when two tones of similar amplitude fall within the same critical bandwidth.
→ Limited frequency resolution of the basilar membrane.
When two tones sound simultaneously, an additional tone may be perceived, which corresponds to the sum or difference of the fundamental frequencies of the two tones:
→ Similar to electronic ring modulation
The sonic characteristics of a sound, determined by the combination of its fundamental tone, overtones, noise, and the amplitude signature of its frequency components.
→ This explains why the same musical pitch (note) sounds different when played on different instruments.
A formant is a local maximum of energy around a specific frequency in the sound spectrum, caused by resonances.
Formants contribute to the characteristic timbre of instruments and, in phonetics, are critical for distinguishing vowels and voiced sounds.
→ Formant regions are largely independent of the pitch of the fundamental
Examples: Vowels /ä/, /i/, /o/ have distinct formant patterns regardless of pitch
ASA is how the auditory system groups sounds into distinct, meaningful streams, making sense of complex environments.
Key concepts:
The ability to focus on a single conversation in a noisy environment while filtering out competing sounds.
Stream segregation in a cycle of six tones
Perceptual continuation of a gliding tone
through a noise burst
The Shepard–Risset glissando is an auditory illusion where a continuously ascending or descending pitch appears to rise or fall endlessly, even though the physical stimulus is cyclically structured
Psychoacoustics connects physics with perception:
Original content: © 2025 Lorenz Schwarz
Licensed under CC BY 4.0. Attribution required for all reuse.
Includes: text, diagrams, illustrations, photos, videos, and audio.
Third-party materials: Copyright respective owners, educational use.
Contact: lschwarz@hfg-karlsruhe.de
Georg Friedrich Haas: In Vain ℗ KAIROS © HNE Rights GmbH Künstler:innen-Biografie Austrian composer Beat Furrer founded Klangforum Wien, a chamber orchestra devoted to contemporary music, in Vienna in 1985. The group is made up of 24 players and considers itself a collective, with its members