Title:
Speech coding algorithms : foundation and evolution of standardized coders
Personal Author:
Publication Information:
Haboken, NJ : Wiley, 2003
Physical Description:
xxiv, 558 p. : ill. ; 24 cm.
ISBN:
9780471373124
Available:*
Library | Item Barcode | Call Number | Material Type | Item Category 1 | Status |
---|---|---|---|---|---|
Searching... | 30000010177949 | TK7882.S65 C488 2003 | Open Access Book | Book | Searching... |
On Order
Summary
Summary
Speech coding is a highly mature branch of signal processing deployed in products such as cellular phones, communication devices, and more recently, voice over internet protocol This book collects many of the techniques used in speech coding and presents them in an accessible fashion Emphasizes the foundation and evolution of standardized speech coders, covering standards from 1984 to the present The theory behind the applications is thoroughly analyzed and proved
Author Notes
Wai C. Chu is currently at DoCoMo USA Labs (San Jose, California), he is involved with R&D activities in speech/audio coding, digital signal processing, and multimedia applications
Table of Contents
Preface | p. xiii |
Acronyms | p. xix |
Notation | p. xxiii |
1 Introduction | p. 1 |
1.1 Overview of Speech Coding | p. 2 |
1.2 Classification of Speech Coders | p. 8 |
1.3 Speech Production and Modeling | p. 11 |
1.4 Some Properties of the Human Auditory System | p. 18 |
1.5 Speech Coding Standards | p. 22 |
1.6 About Algorithms | p. 26 |
1.7 Summary and References | p. 31 |
2 Signal Processing Techniques | p. 33 |
2.1 Pitch Period Estimation | p. 33 |
2.2 All-Pole and All-Zero Filters | p. 45 |
2.3 Convolution | p. 52 |
2.4 Summary and References | p. 57 |
Exercises | p. 57 |
3 Stochastic Processes and Models | p. 61 |
3.1 Power Spectral Density | p. 62 |
3.2 Periodogram | p. 67 |
3.3 Autoregressive Model | p. 69 |
3.4 Autocorrelation Estimation | p. 73 |
3.5 Other Signal Models | p. 85 |
3.6 Summary and References | p. 86 |
Exercises | p. 87 |
4 Linear Prediction | p. 91 |
4.1 The Problem of Linear Prediction | p. 92 |
4.2 Linear Prediction Analysis of Nonstationary Signals | p. 96 |
4.3 Examples of Linear Prediction Analysis of Speech | p. 101 |
4.4 The Levinson-Durbin Algorithm | p. 107 |
4.5 The Leroux-Gueguen Algorithm | p. 114 |
4.6 Long-Term Linear Prediction | p. 120 |
4.7 Synthesis Filters | p. 127 |
4.8 Practical Implementation | p. 131 |
4.9 Moving Average Prediction | p. 137 |
4.10 Summary and References | p. 138 |
Exercises | p. 139 |
5 Scalar Quantization | p. 143 |
5.1 Introduction | p. 143 |
5.2 Uniform Quantizer | p. 147 |
5.3 Optimal Quantizer | p. 149 |
5.4 Quantizer Design Algorithms | p. 151 |
5.5 Algorithmic Implementation | p. 155 |
5.6 Summary and References | p. 158 |
Exercises | p. 158 |
6 Pulse Code Modulation and its Variants | p. 161 |
6.1 Uniform Quantization | p. 161 |
6.2 Nonuniform Quantization | p. 166 |
6.3 Differential Pulse Code Modulation | p. 172 |
6.4 Adaptive Schemes | p. 175 |
6.5 Summary and References | p. 180 |
Exercises | p. 181 |
7 Vector Quantization | p. 184 |
7.1 Introduction | p. 185 |
7.2 Optimal Quantizer | p. 188 |
7.3 Quantizer Design Algorithms | p. 189 |
7.4 Multistage VQ | p. 194 |
7.5 Predictive VQ | p. 216 |
7.6 Other Structured Schemes | p. 219 |
7.7 Summary and References | p. 221 |
Exercises | p. 222 |
8 Scalar Quantization of Linear Prediction Coefficient | p. 227 |
8.1 Spectral Distortion | p. 227 |
8.2 Quantization Based on Reflection Coefficient and Log Area Ratio | p. 232 |
8.3 Line Spectral Frequency | p. 239 |
8.4 Quantization Based on Line Spectral Frequency | p. 252 |
8.5 Interpolation of LPC | p. 256 |
8.6 Summary and References | p. 258 |
Exercises | p. 260 |
9 Linear Prediction Coding | p. 263 |
9.1 Speech Production Model | p. 264 |
9.2 Structure of the Algorithm | p. 268 |
9.3 Voicing Detector | p. 271 |
9.4 The FS1015 LPC Coder | p. 275 |
9.5 Limitations of the LPC Model | p. 277 |
9.6 Summary and References | p. 280 |
Exercises | p. 281 |
10 Regular-Pulse Excitation Coders | p. 285 |
10.1 Multipulse Excitation Model | p. 286 |
10.2 Regular-Pulse-Excited-Long-Term Prediction | p. 289 |
10.3 Summary and References | p. 295 |
Exercises | p. 296 |
11 Code-Excited Linear Prediction | p. 299 |
11.1 The CELP Speech Production Model | p. 300 |
11.2 The Principle of Analysis-by-Synthesis | p. 301 |
11.3 Encoding and Decoding | p. 302 |
11.4 Excitation Codebook Search | p. 308 |
11.5 Postfilter | p. 317 |
11.6 Summary and References | p. 325 |
Exercises | p. 326 |
12 The Federal Standard Version of Celp | p. 330 |
12.1 Improving the Long-Term Predictor | p. 331 |
12.2 The Concept of the Adaptive Codebook | p. 333 |
12.3 Incorporation of the Adaptive Codebook to the CELP Framework | p. 336 |
12.4 Stochastic Codebook Structure | p. 338 |
12.5 Adaptive Codebook Search | p. 341 |
12.6 Stochastic Codebook Search | p. 344 |
12.7 Encoder and Decoder | p. 346 |
12.8 Summary and References | p. 349 |
Exercises | p. 350 |
13 Vector Sum Excited Linear Prediction | p. 353 |
13.1 The Core Encoding Structure | p. 354 |
13.2 Search Strategies for Excitation Codebooks | p. 356 |
13.3 Excitation Codebook Searches | p. 357 |
13.4 Gain Related Procedures | p. 362 |
13.5 Encoder and Decoder | p. 366 |
13.6 Summary and References | p. 368 |
Exercises | p. 369 |
14 Low-Delay Celp | p. 372 |
14.1 Strategies to Achieve Low Delay | p. 373 |
14.2 Basic Operational Principles | p. 375 |
14.3 Linear Prediction Analysis | p. 377 |
14.4 Excitation Codebook Search | p. 380 |
14.5 Backward Gain Adaptation | p. 385 |
14.6 Encoder and Decoder | p. 389 |
14.7 Codebook Training | p. 391 |
14.8 Summary and References | p. 393 |
Exercises | p. 394 |
15 Vector Quantization of Linear Prediction Coefficient | p. 396 |
15.1 Correlation Among the LSFs | p. 396 |
15.2 Split VQ | p. 399 |
15.3 Multistage VQ | p. 403 |
15.4 Predictive VQ | p. 407 |
15.5 Summary and References | p. 418 |
Exercises | p. 419 |
16 Algebraic Celp | p. 423 |
16.1 Algebraic Codebook Structure | p. 424 |
16.2 Adaptive Codebook | p. 425 |
16.3 Encoding and Decoding | p. 433 |
16.4 Algebraic Codebook Search | p. 437 |
16.5 Gain Quantization Using Conjugate VQ | p. 443 |
16.6 Other ACELP Standards | p. 446 |
16.7 Summary and References | p. 451 |
Exercises | p. 451 |
17 Mixed Excitation Linear Prediction | p. 454 |
17.1 The MELP Speech Production Model | p. 455 |
17.2 Fourier Magnitudes | p. 456 |
17.3 Shaping Filters | p. 464 |
17.4 Pitch Period and Voicing Strength Estimation | p. 466 |
17.5 Encoder Operations | p. 474 |
17.6 Decoder Operations | p. 477 |
17.7 Summary and References | p. 481 |
Exercises | p. 482 |
18 Source-Controlled Variable Bit-Rate Celp | p. 486 |
18.1 Adaptive Rate Decision | p. 487 |
18.2 LP Analysis and LSF-Related Operations | p. 494 |
18.3 Decoding and Encoding | p. 496 |
18.4 Summary and References | p. 498 |
Exercises | p. 499 |
19 Speech Quality Assessment | p. 501 |
19.1 The Scope of Quality and Measuring Conditions | p. 501 |
19.2 Objective Quality Measurements for Waveform Coders | p. 502 |
19.3 Subjective Quality Measures | p. 504 |
19.4 Improvements on Objective Quality Measures | p. 505 |
Appendix A Minimum-Phase Property of the Forward Prediction-Error Filter | p. 507 |
Appendix B Some Properties of Line Spectral Frequency | p. 514 |
Appendix C Research Directions in Speech Coding | p. 518 |
Appendix D Linear Combiner for Pattern Classification | p. 522 |
Appendix E Celp: Optimal Long-Term Predictor to Minimize the Weighted Difference | p. 531 |
Appendix F Review of Linear Algebra: Orthogonality, Basis, Linear Independence, and the Gram-Schmidt Algorithm | p. 537 |
Bibliography | p. 542 |
Index | p. 553 |