History - Technical

 

Super Audio CD

When Compact Disc was brought out by Philips and Sony in 1982, it was believed to be the perfect medium. Twenty years later, Compact Disc is still in full development. Many experienced and knowledgeable people, such as recording engineers, and leaders of the audio industry claimed that the digital medium was not that perfect. They quickly realised that the 16/44.1 resolution that conventional (“Red Book”) Compact Disc had on offer was not enough to match or even improve upon the musical qualities the LP had to offer. ( for an extensive description of the shortcomings of CD, see the CD-page).Although most of the engineers knew that the resolution of CD had to be enhanced, the technology simply wasn’t available. By incremental progress they succeeded in getting a 24 bit/96kHz resolution onto a conventional CD, but only if the CD was recorded in 24/96 ánd if the digital-analog converter in the CD-player was compatible with that kind of resolution. That way, conventional CD came very close to offering a sound that sounded was almost ‘analog’. It seems a strange paradox that the digital revolution of CD, and the emission of LP was followed by a process of further development in which the engineers tried to make the CD sound as good as the medium it had made disappear. This is of course because of the fact that the consumers adapted to the new medium so quickly due to its practical advantages, and not its ultimate sound quality. The CD may have offered digital encoding, but compared with the often-abused LP, it was CD's random access, portability, indestructibility, and lack of surface noise that helped it establish a beachhead.

It was the appearance of the Digital Versatile Disc (DVD) that made it possible for the engineers to come up with a succesor of CD, that could meet the consumers demands of practicality, ánd offer truly superior sound quality. They could now record in a resolution of 24 bits/192 kHz, resulting in a sound finally surpassing that of LP, and match the ease of handling of CD. For these reasons of sound quality they called it Super Audio CD. It was originally co-developed by Sony and Philips, but it took only a few months for other companies to buy their participation and develop SACD-players of their own. The recording technology however is still monopolised by Sony, and is called Direct Stream Digital.

The technique behind SACD

A key provision of SACD is the potential for a dual-layer "hybrid" disc. A hybrid disc has two information layers; the bottom layer is identical to that of a CD, and contains 16-bit/44.1kHz digital audio (the so-called "Red Book" layer). The second "high-density" layer uses much smaller information-carrying pits to store more data. This high-density layer is semi-transmissive, meaning it reflects light from the SACD playback laser, but is transparent to a CD player’s laser.
A hybrid SACD can present CD-quality sound to any CD player, and also deliver high-resolution digital audio when played on an SACD machine. SACD’s backward compatibility with CD has several important implications. The first is that you can play a hybrid SACD in your car or portable CD player, then move the disc inside for SACD playback in your listening room. Second, there’s no dual inventory for record companies and retailers; they carry a single disc for both CD and SACD customers. Third, CD consumers will buy into the SACD format without making a conscious decision to stop buying music in one format and start buying it in a new, incompatible format. After several years of buying what they think are CDs, the average consumer will theoretically have amassed a library of SACD titles.
The hybrid disc is only one of three possible disc options. Record companies could also choose to release music only in high-resolution form, with a single high-density layer and no backward-compatible CD layer. Or the third option, a disc with two high-density layers for about 150 minutes of high-resolution playing time.
The high-density layer contains not only a two-channel stereo version of the music, but can also store a six-channel mix . Artists and record companies will decide whether to release two-channel or multi-channel discs. SACD thus works equally well for two-channel and surround-sound listeners.

Packing eight channels (a two-channel plus a six-channel mix) of high-resolution digital audio in the same space that contained two channels of relatively low-resolution CD-quality digital audio is made possible by three techniques. First, the information-carrying pits in the high-density layer are about half the size of conventional CD pits. Second, the distance between tracks is about half that of CD. Third, the digital audio data are compressed with a lossless coding system that more efficiently encodes that data. For example, rather than store a stream of eight consecutive zeros, the code 8X0 may be recorded instead. This is a crude oversimplification of a very complex technique. What’s important, however, is that this data compression system is perfectly lossless – that is, the same ones and zeros that go in are the same ones and zeros recovered from the disc. Consequently, there’s no reduction in sound quality. SACD’s lossless compression is contrasted with lossy compression systems such as Dolby Digital that discard musical information and degrade fidelity.
Direct Stream Digital (DSD) Encoding
The compact disc and most professional digital audio systems rely on Pulse Code Modulation (PCM) to convert analog audio into digital information. In PCM encoding, the audio signal is sampled at precise intervals. At each sample point, a binary number is assigned to represent the audio signal’s amplitude at the time the sample was taken. Assigning this number is called "quantization." Sampling is akin to taking snapshots of the audio waveform; quantization is "weighing" the audio signal contained in the snap-shot. A movie camera is a form of sampler; the images aren’t continuous, but are taken at discrete time intervals.
Sony and Philips believed that PCM encoding had been so improved that any future advancement would be minimal. Consequently, they devised a radically different method of encoding digital audio called Direct Stream Digital (DSD). In DSD encoding, the analog audio waveform is sampled at 2.8224 million times per second. Each sample, however, generates only one bit of information. The musical information is encoded in the width of the 1-bit-high pulses.
The data rate from DSD is exactly four times that of 16-bit/44.1kHz digital audio. (CD produces an audio datastream of 705,600 bits per second per channel.) This four-fold increase in the number of bits representing the music allows DSD to encode more musical information over a wider audio band-width compared with CD.
Using a single bit to encode the musical signal’s amplitude results in a very high noise floor. But because DSD’s 2.8224MHz sampling frequency produces a theoretical bandwidth of 1.4MHz (according to the Nyquist Theorem, the audio bandwidth is half the sampling frequency), that noise floor is spread over this very wide frequency range. This noise floor can also be shifted from one frequency range to another, a technique called noise shaping. In DSD, noise in the audio band is shifted to a higher frequency where it becomes inaudible (Figure 3). Note that the total noise power in the audio signal doesn’t change; noise shaping merely shifts its frequency away from the audioband.

This technique of quantizing the audio signal with one bit is far from new; the earliest digital audio systems used this so-called "delta-sigma modulation." One advantage of delta-sigma modulation is that it is a much simpler process than PCM encoding and decoding. DSD requires no decimation and interpolation filtering as does PCM, processes that introduce noise and errors into the signal. In fact, the DSD bitstream looks remarkably like the analog signal it represents . In theory, this DSD bitstream could be converted to analog with a low-pass filter made from a single capacitor.
DSD provides a dynamic range of 120dB and an audio bandwidth of more than 100kHz. Note that the 120dB dynamic range is available up to 20kHz; above that frequency the noise shaping technique described earlier reduces dynamic range. Nonetheless, DSD can encode information well above 20kHz, a first for a consumer digital audio system. DSD can also encode a 10kHz square wave, something 44.1kHz PCM cannot come close to achieving. Some designers consider this ability a fundamental prerequisite for high sound quality.
A Direct Stream Digital signal can be converted to a 16-bit/44.1kHz PCM signal for storage on an SACD’s CD-compatible layer. This processing, which Sony calls Super Bit Mapping Direct, preserves some qualities of the high-resolution DSD bitstream. Record companies can thus issue hybrid CDs from a single DSD master. This has the benefit of improving the CD layer’s sound quality.

Al these technical advantages would of course be useless if they would not result in a thouroughly enhanced sound quality offered by SACD. As with CD, several recording engineers, leaders of the industry and influentials in the matter will be quoted on their opinion on the new medium.

How do SACD and DVD-A compare ?

back to top