Enchanting Instruments 32 - The History of Music and Electricity 8: Data Structure of CD

2022-01-17

The CD is not a direct recording of uncompressed audio data. There are techniques in place to ensure that the data is read as accurately as possible with each track.

■ CD Data Structure

The data structure of an audio CD is as follows:

■ Lead-in and Lead-out

A CD is read in a spiral pattern starting from the inner ring. It begins with a section called the "lead-in." This area contains information that’s like the table of contents of the CD; it includes the time data for each track and the total playback time. After that, there is the area where the track data is recorded. When all the tracks are played, the CD enters the "lead-out" area and ends the playback. The lead-out contains only 90 seconds of silence and does not store any special information.

The length of the spiral track can be easily calculated because the linear velocity is constant. With a linear velocity of 1.25 m/sec, up to 74 minutes of audio can be recorded, which would require 5,550 meters of tracks. So, on a 120mm diameter disc, there is over 5 kilometers of track.

Between the tracks, there is an area known as the "gap," which represents a period of silence. If the gap time is set to 0, the next track will start without any interruption. This feature is useful for live albums, where the aim is to remove any gaps between the tracks.

Track data is managed in units called sectors. A sector contains data for 1/75th of a second, so 75 sectors are required to play one second of audio. Within each sector, there are 98 EFM (Eight-to-Fourteen Modulation) frames.

■ EFM (Eight to Fourteen Modulation)

The internal structure of an EFM frame is complex. A 24-bit synchronization signal always starts at the beginning of the frame, followed by subcodes, with the P and Q channels used in CD-DA. These channels contain data such as track numbers and time. After that, audio data is included along with error correction data. Additionally, 3-bit merge bits are inserted between each piece of data. These are mechanisms to prevent consecutive 1s or isolated 0s.

As shown in the diagram, the pit lengths of a CD are defined as nine types from T3 to T11. The number of consecutive 0s is restricted to between 2 and 10, and consecutive 1s are not allowed. To achieve this rule, EFM is used. As the name suggests, it is a system that converts 8 bits into 14 bits. Although the audio signal on a CD is 16 bits, it is split into two 8-bit segments, and each of these 8-bit segments is converted into 14 bits.

The conversion is done according to the table below. Since it's 8 bits, the conversion list ranges from 0 to 255. The EFM frame has a total size of 588 bits, but as audio data, it only contains the equivalent of 12 samples (6 samples x 2).

10 decimals	2 decimals (8-bit)	EFM(14-bit)
0	00000000	01001000100000
1	00000001	10000100000000
2	00000010	10010000100000
3	00000011	10001000100000
4	00000100	01000100000000
5	00000101	00000100010000
6	00000110	00010000100000
7	00000111	00100100000000
8	00001000	01001001000000
・	・	・
・	・	・
255	11111111	00100000010010

■ Merge Bits

If we convert 255 and 1 into 14-bit EFM format, it becomes 00100000010010 and 10000100000000. If we connect them directly, we get:

00100000010010 10000100000000

In this case, the 0 at the junction is isolated, so we need to use one of the four merge bits to prevent the isolated 0. The merge bit that can be used here is "000".

0010000001001 000 010000100000000

The merge bit doesn't have any data significance, but it is an essential component to ensure the rules are followed.

■ Error Correction: CIRS (Cross-Interleaved Reed-Solomon)

Error correction refers to the system that repairs inaccurately read bits and restores them to the original signal. In reality, it's difficult to read the data accurately with just one optical trace as errors of several bits per second occur. Therefore, error correction is essential. The CIRS used in CDs combines two error correction methods: Cross-Interleave and Reed-Solomon. It’s a highly reliable error correction system, and as a result, errors on a CD are reduced to about 1 bit. If the condition of the disc is good, it can be considered that errors are virtually nonexistent.

■ Cross-Interleave

Cross-Interleave is an effective method for dealing with long errors. By distributing the data across the CD, Cross-Interleave avoids long errors caused by dirt or scratches. In other words, the data is not arranged along the time axis but is dispersed across the CD. After reading the data, the data is reassembled into its original time axis, restoring the audio signal.

■ Reed-Solomon

Reed-Solomon is an effective method for short errors that occur frequently. These errors are common in optical reading. It has multiple mechanisms for recovering unreadable or misread data by combining it with surrounding data. One method involves calculating the sum of the data, so even if part of the data is lost, it can be reconstructed from the sum. Another method uses simultaneous equations to restore the data more powerfully. In simple terms, it's a system that checks and verifies the data that has been read.

■ CD Audio Data Percentages

The above describes the overall structure of data recorded on a CD. When we look at just the frames of the audio data on a CD, only 12 samples are included in the total 588-bit frame. The final converted data becomes 16 bits x 12 samples = 192 bits. If we compare this, the audio data makes up only about 33% of the total frame. Alternatively, if we calculate based on EFM with 14 bits, 57% of the data is audio. When including the margin bits, about 70% of the data is audio. The remaining 30% is mainly data used for error correction. Looking at it this way, it may seem like a lot of the space is taken up by non-audio elements, and it could appear to be inefficient, but these elements are indispensable for accurately restoring the data.

The “sound & person” column is made up of contributions from you.
For details about contributing, click here.

achapi

楽器メーカーで楽器開発していました。楽器は不思議な道具で、人間が生きていく上で、必要不可欠でもないのに、いつの時代も、たいへんな魅力を放っています。音楽そのものが、実用性という意味では摩訶不思議な立ち位置ですが、その音楽を奏でる楽器も、道具としては一風変わった存在なのです。そんな掴み所のない楽器について、作り手視点で、あれこれ書いていきたいと思います。
blog https://achapi2718.blogspot.com/
HP https://achapi.cloudfree.jp