Chapter 6. The compact disc and digital audio

Bạn đang xem bản rút gọn của tài liệu. Xem và tải ngay bản đầy đủ của tài liệu tại đây (14.34 MB, 389 trang )

234

The Compact Disc and Digital Audio

Unfortunately, each time a successive analogue tape copy is made, some

degradation of the original signal will occur, in respect of bandwidth and

signal-to-noise ratio, and as a result of minor tape malfunctions and

'dropouts'. So by the time the recording has been converted into a cassette

tape, or an undulating groove on the surface of a vinyl disc, a lot of the

immediacy and transparency of the original recording will have been lost.

By comparison with this, a 'digital' recording- in which the analogue

signal has been converted into a 'digitally encoded' electronic equivalent

where the continuously variable voltage levels of the original signal are

represented by a repetitively sampled sequence of alternating '0's and 'l's,

which signify clearly defined, constant and distinct electrical voltage levels

- is, at least in principle, capable of being copied over and over again,

without any degradation at all. Any minor errors in the received '0' or '1'

levels can be automatically corrected, and freed from any spurious noisea process which is obviously impracticable with any signal in analogue

form.

In addition, the incoming signal, once converted into its digital form,

need no longer exist in any specific time domain. After all, it is now just a

collection of data, divided into a sequence of blocks. This allows the data

to be divided, stored and manipulated, and reassembled in any way

necessary for the purposes of recording or handling. It also means that,

once the signal is converted into digital form, it is intrinsically free from

any added 'rumble', 'wow' or 'flutter' or other intrusions due to the speed

irregularities of the subsequent recording or replay systems. However,

there are also snags.

PROBLEMS WITH DIGITAL ENCODING

Quantisation noise

Although a number of ways exist by which an analogue signal can be

converted into its digital equivalent, the most popular, and the technique

used in the CD, is the one known as 'pulse code modulation', usually

referred to as 'PCM'. In this, the incoming signal is sampled at a

sufficiently high repetition rate to permit the desired audio bandwidth to

be achieved. In practice this demands a sampling frequency somewhat

greater than twice the required maximum audio frequency. The measured

signal voltage level, at the instant of sampling, is then represented

numerically as its nearest equivalent value in binary coded form (a process

which is known as 'quantisation').

This has the effect of converting the original analogue signal, after

encoding and subsequent decoding, into a voltage 'staircase' of the kind

shown in Fig. 6.1. Obviously, the larger the number of voltage steps in

235

The Compact Disc and Digital Audio

I-- Input waveform

t

i

>

~staircase'

e~

t_

g.

I

i,,

I

I

~Fig. 6.1

I

I,

I

I

,

,

Time

Digitally encoded~decoded waveform.

which the analogue signal can be stored in digital form (that shown in the

figure is encoded at '4-bit' - 24 or 16 possible voltage levels), the smaller

each of these steps will be, and the more closely the digitally encoded

waveform will approach the smooth curve of the incoming signal.

The difference between the staircase shape of the digital version and the

original analogue waveform causes a defect of the kind shown in Fig. 6.2,

known as 'quantisation error', and since this error voltage is not directly

related in frequency or amplitude to the input signal it has many of the

characteristics of noise, and is therefore also known as 'quantisation noise'.

This error increases in size as the number of encoding levels is reduced. It

will be audible if large enough, and is the first problem with digitally

encoded signals. I will consider this defect, and the ways by which it can be

minimised, later in this chapter.

t

"O

:3

r

E

L

o

L

tu

Fig. 6.2

Time

Quantisation error.

236

The Compact Disc and Digital Audio

Bandwidth

The second practical problem is that of the bandwidth which is necessary to

store or transmit such a digitally encoded signal. In the case of the CD, the

specified audio bandwidth is 20 I-Iz to 20 kHz, which requires a sampling

frequency somewhat greater than 40 kI-Iz. In practice, a sampling

frequency of 44.1 kHz is used. In order to reduce the extent of the staircase

waveform quantisation error, a 16-bit sampling resolution is used in the

recording of the CD, equivalent to 216 or 65 536 possible voltage steps. If

16 bits are to be transmitted in each sampling interval, then, for a stereo

signal, the required bandwidth will be 2 x 16 x 44100 I-Iz, or 1.4112 mHz,

which is already seventy times greater than the audio bandwidth of the

incoming signal. However, in practice, additional digital 'bits' will be

added to this signal for error correction and other purposes, which will

extend the required bandwidth even further.

Translation non-linearity

The conversion of an analogue signal both into and from its binary coded

digital equivalent carries with it the problem of ensuring that the

magnitudes of the binary voltage steps are defined with adequate

precision. If, for example, '16-bit' encoding is used, the size of the 'most

significant bit' (MSB) will be 32 768 times the size of the 'least significant

bit' (LSB). If it is required that the error in defining the LSB shall be not

worse than +0.5%, then the accuracy demanded of the MSB must be at

least within +0.0000152% if the overall linearity of the system is not to be

degraded.

The design of any switched resistor network, for encoding or decoding

purposes, which demanded such a high degree of component precision

would be prohibitively expensive, and would suffer from great problems as

a result of component ageing or thermal drift. Fortunately, techniques are

available which lessen the difficulty in achieving the required accuracy in

the quantisation steps. The latest technique, known as 'low bit' or

'bit-stream' decoding, side-steps the problem entirely by effectively using a

time-division method, since it is easier to achieve the required precision in

time, rather than in voltage or current, intervals.

Detection and correction of transmission errors

The very high bandwidths which are needed to handle or record PCM

encoded signals means that the recorded data representing the signal must

be very densely packed. This leads to the problem that any small blemish

on the surface of the CD, such as a speck of dust, or a scratch, or a thumb

The Compact Disc and Digital Audio

237

print could blot out, or corrupt, a significant part of the information

needed to reconstruct the original signal. Because of this, the real-life

practicability of all digital record/replay systems will depend on the

effectiveness of electronic techniques for the detection, correction or, if the

worst comes to the worst, masking of the resultant errors. Some very

sophisticated systems have been devised, which will also be examined

later.

Filtering for bandwidth limitation and signal recovery

When an analogue signal is sampled, and converted into its PCM encoded

digital equivalent, a spectrum of additional signals is created, of the kind

shown in Fig. 6.3(a), where f~ is the sampling frequency and fm is the upper

modulation frequency. Because of the way in which the sampling process

operates, it is not possible to distinguish between a signal having a

frequency which is somewhat lower than half the sampling frequency and

one which is the same distance above it; a problem which is called

'aliasing'. In order to avoid this, it is essential to limit the bandwidth of the

incoming signal to ensure that it contains no components above fs/2.

If, as is the case with the CD, the sampling frequency is 44.1 kHz, and

the required audio bandwidth is 20 Hz to 20 kHz, +0/ 1 dB, an input

'anti-aliasing' filter must be employed to avoid this problem. This filter

must allow a signal magnitude which is close to 100% at 20 kHz, but nearly

zero (in practice, usually - 6 0 dB) at frequencies above 22.05 kHz. It is

possible to design a steep-cut, low-pass filter which approximates closely to

this characteristic using standard linear circuit techniques, but the phase

shift, and group delay (the extent to which signals falling within the

affected band will be delayed in respect of lower frequency signals)

introduced by this filter would be too large for good audio quality or stereo

image presentation.

This difficulty is illustrated by the graph of Fig. 6.4, which shows the

relative group delay and phase shift introduced by a conventional low-pass

analogue filter circuit of the kind shown in Fig. 6.5. The circuit shown gives

only a modest - 9 0 dB/octave attenuation rate, while the actual slope

necessary for the required anti-aliasing characteristics (say, 0 dB at 20 kHz

and - 6 0 dB at 22.05 kHz) would be - 4 2 6 dB/octave. If a group of filters of

the kind shown in Fig. 6.5 were connected in series to increase the

attenuation rate from - 9 0 dB to - 4 2 6 dB/octave, this would cause a group

delay, at 20 kHz, of about I ms with respect to I kHz, and a relative phase

shift of some 3000~, which would be clearly audible.*

*In the recording equipment it is possible to employ steep-cut filter systems in which the

phase and group delay characteristics are more carefully controlled than would be practicable

in a mass-produced CD replay system where both size and cost must be considered.

Out )ut

k

__....~ I

etc.

m

m

Frequency

9

0

I

22.05 kHz

I

~

44.1 kHz

I

88.2 kHz

v

i

J

132.3 kHz

!

I

k

176.4 kHz

(a)

Output

I ,

Frequency

"

0

m

22.05 kHz

9

44. I kHz

'

88.2 kHz

,

132.3 kHz

,

176.4 kHz

(b)

Fig. 6.3 PCM frequency spectrum (a) when sampled at 44.1 kHz and (b) when four times oversampled.

The Compact Disc and Digital Audio

239

Similarly, since the frequency spectrum produced by a PCM encoded

20 kHz bandwidth audio signal will look like that shown in Fig. 6.3(a), it is

necessary, on replay, to introduce yet another equally steep-cut low-pass

filter to prevent the generation of spurious audio signals which would result

from the heterodyning of signals equally disposed on either side of fs/2.

An improved performance in respect both of relative phase error and of

group delay in such 'brick wall' filters can be obtained using so-called

'digital' filters, particularly when combined with pre-filtering phase

correction. However, this problem was only fully solved, and then only on

replay (because of the limitations imposed by the original Philips CD

patents), by the use of 'over-sampling' techniques, in which, for example,

the sampling frequency is increased to 176.4 kHz ('four times oversampling'), which moves the aliasing frequency from 22.05 kHz up to

154.35 kHz, giving the spectral distribution shown in Fig. 6.3(b). It is then

a relatively easy matter to design a filter, such as that shown in Fig. 6.14,

having good phase and group delay characteristics, which has a

transmission near to 100% at all frequencies up to 20 kHz, but near zero at

154.35 kHz.

THE RECORD-REPLAY SYSTEM

The recording system layout

How the signal is handled, on its way from the microphone or other signal

source to the final CD, is shown in the block diagram of Fig. 6.6. Assuming

the signal has by now been reduced to a basic L - R stereo pair, this is

amplitude limited to ensure that no signals greater than the possible

encoding amplitude limit are passed on to the analogue-to-digital converter

(ADC) stage. These input limiter stages are normally crosslinked in

operation to avoid disturbance of the stereo image position if the maximum

permitted signal level is exceeded, and the channel gain reduced in

consequence of this, in only a single channel.

The signal is then passed to a very steep-cut 20 kHz anti-aliasing filter

(often called a 'brick wall filter') to limit the bandwidth offered for

encoding. This bandwidth limitation is a specific requirement of the digital

encoding/decoding process, for the reasons already considered. It is

necessary to carry out this filtering process after the amplitude limiting

stage, because it is possible that the action of peak clipping may generate

additional high frequency signal components. This would occur because

'squaring-off' the peaks of waveforms will generate a Fourier series of

higher frequency harmonic components.

The audio signal, which is still, at this stage, in analogue form, is then

passed to two parallel operating 16-bit ADCs, and, having now been

converted into a digital data stream, is fed into a temporary data-storage

Gain (dB)

0dB -

-0

200

"0

-50dB -

-

Phase

.200

0

0

0

0

Oe ,

-100 d B -

I

-.4oo ~

~P

-1ooE

W

n

l

--6OO

-150 dB -

-200de I

10

-

I

"

100

I

Ik

Frequency (Hz)

Fig. 6.4 Responses of low-pass LC filter

I

10k

I

lOOk

.800

--0

L1

Input C

SourcempedancekO

i

I

L2

L3

L4

L5

17.35 i

mH

24.84mH =~

25.26mH

24.84mH

17.35mH

ryY

C1

=~8.905nF

ov 0

Fig. 6.5 Steep-cut L P filter circuit.

C2 nF

=~ 9.467

9C'.3.4nF

67

C4 nF

8.905

"=

~ Output

l lkO

~R Load

0

ov

L

Inputs

R

Limitan

,,,,I

Fig. 6.6 Basic CD recording system.

The Compact Disc and Digital Audio

243

device - usually a 'shift register' - from which the output data stream is

drawn as a sequence of 8-bit blocks, with the 'L' and 'R' channel data now

arranged in a consecutive but interlaced time sequence.

From the point in the chain at which the signal is converted into digitally

encoded blocks of data, at a precisely controlled 'clock' frequency, to the

final transformation of the encoded data back into analogue form, the

signal is immune to frequency or pitch errors as a result of motor speed

variations in the disc recording or replay process.

The next stage in the process is the addition of data for error correction

purposes. Because of the very high packing density of the digital data on

the disc, it is very likely that the recovered data will have been corrupted to

some extent by impulse noise or blemishes, such as dust, scratches, or

thumb prints on the surface of the disc, and it is necessary to include

additional information in the data code to allow any erroneous data to be

corrected. A number of techniques have been evolved for this purpose, but

the one used in the CD is known as the 'Cross-interleave Reed-Solomon

code' or CIRC. This is a very powerful error correction method, and allows

complete correction of faulty data arising from quite large disc surface

blemishes.

Because all possible '0' or '1' combinations may occur in the 8-bit

encoded words, and some of these would offer bit sequences which were

rich in consecutive 'O's or 'l's, which could embarrass the disc speed or spot

and track location servo-mechanisms, or, by inconvenient juxtaposition,

make it more difficult to read the pit sequence recorded on the disc

surface, a bit-pattern transformation stage known as the 'eight to fourteen

modulation' (EFM) converter is interposed between the output of the

error correction (CIRC) block and the final recording. This expands the

recorded bit sequence into the form shown in Fig. 6.7, to facilitate the

operation of the recording and replay process. I shall explain the functions

and method of operation of all these various stages in more detail later in

this chapter.

Disc recording

This follows a process similar to that used in the manufacture of vinyl EP

and LP records, except that the recording head is caused to generate a

spiral pattern of pits in an optically flat glass plate, rather than a spiral

groove in a metal one, and that the width of the spiral track is very much

smaller (about 1/60th) than that of the vinyl groove. (Detail of the CD

groove pattern is, for example, too fine to be resolved by a standard optical

microscope.) When the master disc is made, 'mother' and 'daughter' discs

are then made preparatory to the production of the stampers which are

used to press out the track pattern on a thin (1.4 mm) plastics sheet, prior

to the metallisation of the pit pattern for optical read-out in the final disc.

8114 bit ROM

'~

I

~"~

I

~'~

I

I, Iololoi !Ioi, I, I, I, I01, I, I, I, Ioi' I

.i-~

4----

k_J--k_/

/

Oisc indentations,

(if EFM was not employed)

'

'

4-----

Joining bits

x_/-~__/----x

'

'

i

!

Shift register

4-----

14-bit symbol

14-bit symbol

Joining bits

--~ ~IoI, IolIoI,I01oI, Iololo olol,lolOloJ[, lolollol,lolol,lolo-Out

/

\

/

Disc indentations (with EFM)

Fig. 6.7

The EFM process.

\

J

\_

/

Xem Thêm

Chapter 6. The compact disc and digital audio

Tài liệu liên quan

Tài liệu bạn tìm kiếm đã sẵn sàng tải về