With both DVD-Audio and Super Audio CD on the horizon, the compact disc may face two serious challenges to its long-held position as the premium carrier for prerecorded music. Of course, even if consumer acceptance of these new formats-both scheduled for launch this year-is remarkably rapid, the CD will reign for the foreseeable future. But just as the CD did to the LP and the cassette on its way up, these new formats will take their toll on the prestige of the CD well before they actually eclipse the CD in units sold. A psychological turning point will likely come when the CD is no longer seen as the flagship format, at which time further efforts at improving its sonic fidelity may seem somewhat pointless.
This scenario poses an interesting dilemma for companies such as Pacific Microsonics of Union City, Calif. Because the CD is the only viable consumer digital delivery format available, the company’s efforts to promote its High Definition Compatible Digital technology have until recently tended to focus on HDCD’s benefits for that particular format, and its success to date has been built largely on its ability to provide 20-bit performance in a 16-bit medium. But with the introduction of new formats offering better resolution without special processing, Pacific Microsonics must now address questions of HDCD’s future relevance, even as it successfully moves the process from the audiophile fringes to the mainstream of consumer audio electronics.
HDCD is a multifaceted process that is not specifically tied either to the CD or to the format’s underlying Red Book specification, which calls for linear PCM audio at a 16-bit word length and 44.1kHz sample rate. “HDCD is really a brand name for a family of synergetic and interrelated technologies,” says Michael Ritter, Pacific Microsonics co-founder and VP of professional audio. “All of the technologies are aimed at providing lower distortion and higher resolution through any form of digital audio recording and playback. Some parts of the system apply through all release formats, while subsets of the technology only apply to certain release formats.” To help ensure a role for HDCD in new PCM formats such as DVD, the company is introducing this month at the Tokyo AES the Model Two HDCD processor, which will work not only at the 44.1/88.2kHz sample rates, but also at 48/96/176.4/192 kHz.
MASTERING THE MASS MARKETUp until now, the Model One processor has been Pacific Microsonics’ sole product for the professional market, and its only hardware product. Used mainly in mastering rooms, more than 70 of the units are now in use worldwide at top facilities such as Sony Music, Sterling Sound and Gateway Mastering on the East Coast, Future Disc, OceanView Digital and Precision Mastering in the West, and Georgetown Masters and MasterMix in Nashville. Bernie Grundman Mastering is in the process of acquiring a total of ten units (six in Hollywood and four in Tokyo). But Pacific Microsonics’ main focus is on licensing HDCD decoding for inclusion in consumer audio hardware. According to Ritter, 1999 looks like the year when HDCD licensing will move into mass market goods, largely because of synergy between developments in the realms of consumer and professional audio.
“The continued and growing acceptance of HDCD on the pro side,” Ritter says, “has translated into our current list of well over 2,000 HDCD CD titles released so far.” According to the company, that’s double the number of HDCD titles available since the end of 1997. And with more than 125 million HDCD-encoded CDs sold so far, it’s hard to argue that HDCD is simply an obscure phenomenon. Many of the encoded CDs are very high-profile projects, including recent releases from artists such as Bruce Springsteen, Garth Brooks and Jewel.
“Look at the Billboard Top 200,” Ritter says. “At any given time you’ll find between 20 and 30 HDCD titles. So it’s not just a lot of titles, but titles from important artists. That has really caught the notice of some of the larger companies in the consumer electronics arena. In the past, the consumer hardware manufacturers who have licensed HDCD-more than 90 companies-were relatively high-end. But this last year we started to see products that are at prices that will really move us into the mass market.” As evidence of this trend, Ritter cites recent HDCD licensing announcements, including a $399 DVD player from Toshiba, a $400 CD changer from Harman-Kardon, a five-disc changer from Denon at the $299 price point, a Marantz consumer CD-R recorder (HDCD decoding only), and upcoming products from Kenwood and TEAC.
Although these recent announcements indicate that HDCD is gaining momentum in the consumer hardware market, the current installed base of CD players and receivers equipped with HDCD decoding covers perhaps 1% of the total installed base of players. That means most of the people who buy HDCD releases, at least the many Gold and Platinum titles, are not actually listening to them with HDCD decoding. Yet there’s been no groundswell among fans to demand that record companies sell the same recordings in non-encoded versions. By itself, this does not prove Pacific Microsonics’ claim that HDCD encoding makes music sound better even if it is not decoded. But it does add credibility to the word “compatible” in the product’s name. And the fact that many top facilities are using the process despite the limited installed base of decoders indicates that mastering experts believe that some of HDCD’s conversion and encoding processes improve the translation of audio signals into the digital domain, and that these improvements are evident in many playback situations even without decoding. All of this ties in neatly with Pacific Microsonics’ argument that its future is assured even if the CD is dethroned.
THE HDCD PROCESSThe HDCD process is the brainchild of Keith Johnson and Michael “Pflash” Pflaumer, who were brought together by Ritter in the mid-1980s to address what the three men perceived as shortcomings in the CD’s digital sound. According to an AES paper describing their work, the trio’s goal was to create a system that could deliver audio reproduction that “changes the sound as little as possible.” The group was familiar with research by Louis Fielder, R.A. Greiner, Jeff Eggers and others suggesting that true fidelity in critical listening situations requires a dynamic range in excess of 120 dB, as well as a way to mitigate the effects of the “brick wall” filtering commonly used in 44.1 or 48kHz A/D conversion.
The methodology of the HDCD development team involved a thorough review-conducted intermittently over the first several years-of every component in the signal chain in order to isolate and either eliminate or minimize negative effects. Johnson and Pflaumer were guided not simply by test measurements but also by extensive listening tests. “Frequently,” they say in their paper, “changing a design parameter produced a clear perceptual correlate that eluded quantification by measurement.” They refer to the system they came up with as a “conjugate system of encoding and decoding,” but they believe (as noted above) that some benefits are derived from improvements in encoding, particularly in the area of filtering, that do not require complementary decoding to be realized upon playback.
According to Ritter, HDCD’s sonic improvements begin with the Model One’s unique A/D conversion process. “The actual A/D converter in the Model One runs at 24 bits and 176.4 kHz currently; the Model Two will also convert at 192 kHz. We improve the linearity of our conversion with a high-amplitude broadband dither signal that we mix in with the program in the analog domain. The dither appears to be random, but the system knows at any given instant precisely what the amplitude of that dither signal is. And because we use our own custom, discrete, full-ladder converter with excellent amplitude and phase accuracy, we are able to apply an ‘anti-dither’ signal, exactly out-of-phase and matched in time, in the digital domain after conversion. That nulls the dither noise out of the signal.”
The next steps in the process are related to filtering, and they depend on the sample rate of the destination (release) format. “If it’s going to be a 176.4 or 192kHz DVD-Audio release, then we will not decimate that signal; we use a proprietary filter [non-oversampled] optimized to that sample rate. If it’s going to be 88.2/96 kHz, we use 2:1 decimation, and once again we use a filter optimized to that frequency. But in both high-resolution settings, the Nyquist frequency is high enough that we don’t use the ‘dynamic decimation’ process that becomes necessary when we go down to 44.1 or 48 kHz.”
Dynamic-decimation filtering is HDCD’s response to the well-known problems inherent in filter design for digital conversion systems where the Nyquist frequency is only slightly above the range of human hearing. “A filter designer who has to make a ‘brick wall’ filter at 22 kHz is confronted with conflicting requirements,” Ritter explains. “You want to have flat frequency response out to at least 20 kHz, but you can’t have any energy above 22 kHz or you will get alias distortion. This requires a very sharp multipole filter with a very steep transition between the passband and the stopband, which has a number of distortive effects on the signal. It smears transients and causes significant ripples in the passband. If you try to simplify the filter, then to avoid totally unacceptable aliasing you have to start rolling off at 13 to 15 kHz, and even then the signal will not be completely cut off by 22 kHz.”
Ritter describes the HDCD approach to this problem: “We slightly delay the 88.2kHz signal, not enough to cause any sync problems but enough that we can do a continuous Fast Fourier Transform. The resultant information is digitally analyzed in real time by an algorithm that determines, based upon a model of the mechanics of hearing and psychoacoustics, what is perceptually dominant in the signal from instant to instant. And that information is used to optimize the decimation filter. One moment you might have a sudden sharp transient, so it uses a filter with minimum time dispersion to pass the transient cleanly. The next instant, there might be a cymbal crash, so it uses a filter that minimizes alias distortion. All the filters are the same length, so you are not getting a phase shift as this is going on.”
Another element in the process of downconverting for CD is word-length reduction to 16 bits. “We never simply truncate,” Ritter says. “And with the introduction of Version 2.0 of the Model One at the end of 1998, available as a flash-ROM upgrade to existing units, we now have a palette of four 16-bit dither options.” The dither and the dynamic decimation together, Ritter believes, add up to a big improvement over typical CD sound. “The reduced distortion-sharper transient response and reduced aliasing-becomes part of the digital recording and will be heard on any player, whether it has HDCD decoding or not,” he says. Nonetheless, the optimal playback setting is one in which the playback filters are matched to those used in recording. To achieve this, the Model One hides control information in the signal that tells the HDCD decoder which filter to use. This data is encoded as a pattern in the dither used for word-length reduction. It occurs only 1% to 2% of the time, and the company says that extensive testinghas shown that it is inaudible.
In addition to dynamic decimation filtering, HDCD uses the control data to fit a 20-bit dynamic range into a 16-bit signal. Two types of complementary amplitude encoding/decoding are available; the use of either is optional. At the high end of the dynamic range, “peak extend” allows the user to boost gain by up to 6 dB. For quiet signals, “low level range extend” may be used to add up to 7 dB of gain. With both dynamic processes, the control data allows the decoder to restore the dynamics of the original signal. “When you have control over the whole signal path from start to end, you can optimize the record side for what you know will be happening on the play side,” Ritter says. “You just can’t take it to that degree of optimization if you have no control over playback.”
In sum, then, HDCD is a multitiered process, with some aspects involved at all supported sample rates and word lengths, and others used only in certain contexts (see page 95). “Our proprietary A/D conversion, D/A conversion and filtering processes are used at all times,” Ritter says, “though the filters used at the 176.4/192 kHz sample rates are different from those used at 88.2/96 kHz. When you get down to the 44.1/48kHz sample rates, then you have ‘dynamic decimation filtering’ going on. And when you use 16-bit resolution, you also have the option of using ‘amplitude encode/decode’ processing.”
HDCD FOR DVD AND DTSThe ability of the new HDCD encoder to work with the 48kHz sample rate family will allow HDCD encoding to be used on the audio tracks of a DVD-Video disc. Toshiba will introduce two DVD-Video player models featuring HDCD decoding in the second quarter of the year. “The HDCD technology in a DVD-Video player will typically handle several tasks,” Ritter says. “It will provide proprietary precision digital filtering, and interpolation for the D/A converter, for all stereo sources , as well as for the two front channels of surround sound material. In addition, for playback of Red Book audio for HDCD-encoded CDs, the playback circuit will provide amplitude decoding to enable up to 20-bit dynamic range.”
Another area where Pacific Microsonics is working to broaden HDCD applications is multichannel delivery. In January, the company announced an agreement with DTS Entertainment (Digital Theater Systems Inc.) to “work together to enable producers, engineers and record labels to use the HDCD process as the front-end recording technology for DTS Digital Surround 5.1 music CD releases.” Ritter explains that “in the DTS header there is now a place for a flag that has been identified as an HDCD flag. That feature is implemented now in DSPs from companies such as Analog Devices that handle decoding of both DTS and HDCD. If a player has both DTS and HDCD in it, then if a DTS recording was made with HDCD, the HDCD playback filter will be selected. But there will be no amplitude processing going on.” Ritter also notes that with the Model Two, HDCD encoding will be possible not only for DTS CDs, but also for DTS tracks on DVD-Video discs.
The multichannel processing required for implementing the DTS capability is achieved by using three HDCD processors together. “For surround,” Ritter says, “you put one of the units into master mode and the other two into slave mode. There is an AC-coupled precision word clock that comes out of the master that you daisy-chain through the slaves, and then you terminate the last unit. That gives you a conversion clock that is synchronous throughout all the converters.”
The use of multiple processors will also allow HDCD to be used to create high-resolution surround material for the DVD-Audio format. One potential complication there is the Meridian Lossless Packing scheme used to fit high-resolution multichannel into the DVD transfer rate. But Ritter says MLP should have no effect. “Any system that is bit-for-bit lossless,” he says, “will have no impact on the HDCD signal.”
Granting that HDCD will work for DVD-Audio, the question arises again as to why you would want to use it in a format that already has such high resolution. Ritter’s answer is that better resolution actually increases the need for optimizing all links in the recording/playback chain. “Let’s take a camera analogy,” he says, “where HDCD is like a lens with very high resolution and very low chromatic aberration. That quality can actually be better appreciated if you are making a 4×5 plate than a 35mm image. So the truth is that the advantages of all the work we do to achieve extremely low distortion and extremely high resolution are even more evident in a high-resolution format like DVD-Audio than with the CD.”
Beyond the purely technical considerations, Ritter adds that “CDs will continue to be the dominant format for a long time to come, so player manufacturers will want to include HDCD in their DVD players for playing back CDs. And once that capability is there, it doesn’t cost any more to make it available for DVD playback, as well. That means that in addition to the advantages in performance, there are also no obstacles in terms of cost or marketing. So I think we are going to do very well as far as getting HDCD into future generations of audio players.”