Screen Shot 2020-03-29 at 12.54.36 PM.png

Written 2016

Streaming services use volume normalization to create a balanced listening experience across playlists and albums. Services like Spotify, Tidal and Apple Music determine an average loudness value for singles, EPs & LPs using a loudness measurement called LUFS. The song, EP or LP's loudness value is used to normalize playback volume to a target level set by the streaming service. The Audio Engineering Society suggests a streaming target level of -16LUFS, however, most streaming services use a louder target level between -13 and -16LUFS. Streaming service's target levels are much lower in volume than master levels preferred by modern artists, producers and engineers. Because of the difference between master levels and streaming service target levels, the louder a master recording is made, the more streaming services will turn down the recording to match their target level. For example, if you master an album to -8LUFS (loud) and submit the files to Spotify, they will turn down the songs playback volume 6dB to match their target volume of -14LUFS.

Streaming Service Target Volumes:

Apple Music (soundcheck on):
- -16LUFS
Spotify:
- -14LUFS
Tidal:
- -14LUFS
YouTube:
- -14LUFS

Mastering Levels and Streaming Service Target Volumes:

Below are five different masters of a single song at a different loudness levels (-8LUFS, -10LUFS, -12LUFS, -14LUFS and -16LUFS). The target playback level in the below example is -14LUFS (Spotify & Tidal). Loud master recordings (pink, orange, yellow) are turned down to the streaming service's target volume. Lower level masters are not turned down as much and provide for a greater peak to loudness ratio than albums that are mastered at loud volumes. in effect, the louder you master your album, the lower your peak to loudness ratio. However, this does not impact perceived loudness as you would expect. A more dynamic master will not result in a louder perceived playback. LUFS measurement uses a k- weighting filtering and creating a mix with an appropriate spectral balance will contribute to perceived loudness more than your master level. However, different master levels are attributed to genre. Electronic records often sound more appropriate at louder levels. Some pop or rock records sound more ‘together’ at louder master levels. The decision regarding master level is one of personal taste more than perceived loudness on streaming platforms.

The -8LUFS master (pink) is turned down 6dB, -10LUFS is turned down 4dB, -12LUFS is turned down 2dB, -14LUFS file is uneffected and the -16LUFS file is amplified by 2dB, potentially approaching the service's playback limiter.

Test Files Submitted to Streaming Services:

To illustrate the playback volume manipulation performed by streaming services I have submitted test files of master levels to streaming services. Each file consists of an identical sequences of pink noise calibrated to specific loudness levels. The five songs were submitted as "singles" to streaming services so that each track's volume is assessed individually, and not as an average for the entire EP or LP. Some streaming services have an "album mode" which normalizes the entire album's average volume to the streaming service's target volume and maintains the individual level differences between tracks set by the mastering engineer. The below test files were submitted as singles to avoid the album mode loudness averaging. This simulates what a song would do when it is added to a playlist. You can download the 16 bit, 44.1kHz test files below and the AAC files here.

Test File Info:

"8 Times" -8LUFS, -1.9dBTP
"10 Shoes" -10LUFS, -4dBTP
"12 Dozen" -12LUFS, -5.9dBTP
"14 Team" -14LUFS, -7.9dBTP
"16 Ounces" -16LUFS, -9.9dBTP

Loudness Test Files - 16bit, 44.1kHz

Streaming Services & Normalized Playback Volume:

Click the below links to open in-app playlists of the above test files.

Spotify

Apple Music

Tidal

The track "8 Times" is mastered 8 dB louder than "16 Ounces", but both tracks play back at a very similar perceived volume on all Spotify, Tidal and Apple Music.

Conclusion:

As you can hear in the above playlists, louder masters do not create a louder playback experience for the listener. Songs mastered at different volume levels are streamed at almost identical playback levels. Even though each streaming services has a different approach to loudness normalization, they all use a target level far below the master volume preferred by many modern artists, producers & engineers. Generally, masters that are perceived louder on streaming platforms have more to do with spectral balance than master levels. For example, excessive mid range or high frequencies will trigger a higher measurement and result in a lower playback level. Determining a master level is a decision about personal preference and genre. Loud master can sometimes sound more contained or together. Extremely dynamic masters can sound more open or loose. These differences are more about what is appropriate for the genre and personal taste. As a music maker you should worry less about level and more about feel, tone, balance and energy. You will read a lot of articles telling you the best levels for masters — do not listen to them. If you prefer the sound of louder masters, make it loud. If you prefer the sound of open and dynamic songs, master at lower levels. The most important aspect you should consider the spectral balance of your masters and the intentionality and clarity of your mastering processing. Obviously, music is not made by measurement and some forms of music simply sound better with more compression and limiting in the master recordings, while other styles of music will benefit from a more gentle approach. You should work with your mastering engineer to determine an appropriate target level that suits your particular project and genre.

Soundcloud:

Bandcamp:

<a href="http://thepinknoises.bandcamp.com/album/the-pink-noises">The Pink Noises by The Pink Noises</a>

96kHz & The Music Industry's Next Digital Supply Chain

June 12, 2022Ryan Schwabe

June 25th, 2016

Most modern songs are created in digital audio workstations that default to 24-bit wav file format and 44.1kHz sampling rate. The 44.1kHz sampling rate has been the de facto standard for music distributors since the first commercial CD was released in August, 1982 by the Dutch technology company, Philips. In 2016, 24-bit wav, 96kHz sampling rate is becoming the high resolution audio standard for the new music industry's digital supply chain.

44.1kHz sample rate was originally chosen for the CD because it is the minimum sampling rate necessary to satisfy the Nyquist – Shannon Theorem. The Nyquist – Shannon Theorem states that in order to faithfully create a digitization of a sound, the sample rate must be twice that of the highest recorded frequency. Technically, the human ear can hear frequencies up to 20kHz. Therefore, the minimum sampling rate must be 40kHz in order to properly reconstruct the signal. The incorrect reproduction of frequencies beyond the Nyquist Shannon Theorem is known as aliasing.

The red source signal requires 4 samples within the 2 wave cycles in order to properly capture the sound.  The blue line represents the aliasing created by the DAC when the sample rate is not twice that of the source.       … — The red source signal requires 4 samples within the 2 wave cycles in order to properly capture the sound. The blue line represents the aliasing created by the DAC when the sample rate is not twice that of the source.

Since 1982, the music industry has delivered music to consumers using the 44.1kHz sampling rate. However, the new streaming based digital supply chain is slowly adopting the 24-bit, 96kHz file format.

Mastered for iTunes logo

In February of 2012, the Recording Academy and Apple iTunes worked together to create the “Mastered for iTunes “ digital delivery standard. This standard is largely misunderstood, but creates a method for the mastering engineer to compare what he or she hears in the studio with what the consumer will hear. The MfiT standard also protects against peak distortion that can be created during the format conversion process. A common approach to protecting against peak distortion during the conversion process is to create -1.5 to -0.5dBFS of unused headroom in the master digital audio file, creating headroom in the top of the master. If your limiter is set to a maximum output level of -0.1dBFS, or even -0.3dBFS, peak distortion can be created in the consumer file when your file is converted form a wav file to a consumer file format. By leaving at least -0.5dBFS of headroom the encoding process will stay within full scale (0.0dBFS), reducing the chance of peak distortion. The MfiT applet allows you to perform the conversion process and hear the AAC file before it hits retail.

The below picture shows a wav file with a limiter's output set to a maximum loudness of -.5dBFS. When the master file is encoded to an MP3 or AAC by the retailer, the codec will encode overs above your limiter level. If you limiter is set with some headroom the encoded peaks will not result in distortion. It will simply take advantage of the headroom you left in the master.

Peak distortion created during the format conversion process performed by digital music retailers.  The above photo shows amplitude (up, down) and time (L, R) . — Peak distortion created during the format conversion process performed by digital music retailers. The above photo shows amplitude (up, down) and time (L, R) .

The MfiT protocol prefers 24-bit wav, 96kHz sample rate files for AAC encoding. Technically, you can deliver a 24-bit wav, 44.1kHz file to your distributor and it will still be considered "Mastered for iTunes", but 24-bit 96kHz files are preferred. In my opinion, the MfiT guidelines work extremely well across the entire digital supply chain, not just the iTunes marketplace.

High Resolution Audio Logo created by the Consumer Technology Association

In February of 2016, The Consumer Technology Association created a classification for “High Resolution Audio” as “better than CD quality”. In addition to High Resolution Audio standards, streaming services are slowly moving to High Resolution Audio with the incorporation of “Master Quality Authenticated” encoding and decoding technology developed by Bob Stuart of Meridian Audio.

The MQA process allows for the encoding and decoding of 96kHz, 24-bit files by streaming services, but at a fraction of the file size. Tidal has adopted the technology and other streaming services are showing interest in Meridian's breakthroughs. MQA audio streaming will require a hardware decoder to playback the full bandwidth 96kHz, 24-bit stream. However, normal playback devices such as an iPhone or laptop will support "CD quality" MQA streams without an MQA decoder.

As you can see, the largest supplier of music (iTunes) has incorporated a high resolution audio as the archival standard with it's “Mastered for iTunes“ program. Apple is currently amassing the largest database of 24-bit, 96kHz music in the world. The Consumer Technology Association has designated a minimum standard and logo for High Resolution Audio and they plan on licensing the logo to appear dynamically within streaming services.

As streaming services continue to innovate we will hear higher quality audio and greater integration of metadata delivered to consumers. The Digital Data Exchange (DDEX) worked with the Recording Academy to set standards for the formatting metadata that will travel down the digital supply chain to digital distributors. Once metadata is integrated into the digital supply chain it will change the way we discover new music and learn about the people who make it. It will not be long before there will be a high resolution audio streaming service with a fully integrated digital credits list allowing consumers to discover new music in a whole new way.