Skip to:Content
|
Bottom
Cover image for Voice compression and communications : principles and applications for fixed and wireless channels
Title:
Voice compression and communications : principles and applications for fixed and wireless channels
Personal Author:
Series:
IEEE Press digital and mobile communications series
Publication Information:
New York : Wiley-Interscience, 2001
ISBN:
9780471150398

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010047089 TK7882.S65 H36 2001 Open Access Book Book
Searching...

On Order

Summary

Summary

Up-to-date, expert coverage of topics in wireless voice communications
Voice communication is the most important facet of mobile radio service. Even when the predicted surge of wireless data and Internet services becomes a reality, voice will remain the most natural means of human communication.
Voice Compression and Communications details issues in wireless voice communications and treats compression, channel coding, and wireless transmission as a joint subject. Part I covers background material, whereas Part II provides detailed information on both proprietary and standardized analysis-by-synthesis codecs, including the speech codecs of virtually all existing wireline-based and wireless systems. Parts III and IV discuss mainly research-based wideband, audio, as well as very low-rate schemes likely to find their way into future standards.
Voice Compression and Communications describes fundamental concepts in a non-mathematical way early in the book for those with only a background knowledge of signal processing and communications. More advanced readers will find detailed discussions of theoretical principles, future concepts, and solutions to various specific wireless voice communications problems.


Author Notes

LAJOS HANZO has coauthored five books on mobile radio communications and published more than 300 research papers on a variety of topics in wireless multimedia communications. He holds a chair in telecommunications at the Department of Electronics and Computer Science, University of Southampton, UK, and he is an IEEE Distinguished Lecturer.
F. CLARE A. SOMERVILLE is with the Global Wireless Systems Research Department, Bell Laboratories, Swindon, UK. His current research involves real-time techniques for transmission of voice over GPRS and the resultant speech quality attained.
JASON P. WOODARD is with UbiNetics Ltd., where he is responsible for the development and implementation of various algorithms for third-generation mobile communications products.


Reviews 1

Choice Review

Hanzo (Univ. of Southampton, UK), Sommerville (Bell Laboratories, UK), and Woodard offer a treatise on voice compression theory and practice that comprehensively treats this field's evolution and current state of the art. The book features four major sections; the first two, "Speech Signals and Waveform Coding" and "Analysis by Synthesis Coding," discuss fundamental concepts, codec implementations (codecs are algorithms used to encode or decode, compress or decompress, various types of data to save disk space, such as sound or video files), and many related design topics. The final two sections, "Wideband Coding and Transmission" and "Very Low-Rate Coding and Transmission," focus on current research initiatives. The book is intended as a resource and design guide for practitioners of voice communications systems, e.g., wireless telephony. Other than for the initial chapters, readers will require a relatively advanced grasp of mathematics and related engineering skills. More than 300 references to other papers, books, and standards; topic and author indexes. The authors are experienced and knowledgeable and have produced a significant new addition to this field's literature. Researchers; faculty; professionals. E. M. Aupperle University of Michigan


Table of Contents

Prefacep. xxiii
Acknowledgmentsp. xxix
Part I Speech Signals and Waveform Codingp. 1
Chapter 1 Speech Signals and Introduction to Speech Codingp. 3
1.1 Motivation of Speech Compressionp. 3
1.2 Basic Characterization of Speech Signalsp. 4
1.3 Classification of Speech Codecsp. 7
1.4 Waveform Codingp. 11
1.5 Chapter Summaryp. 26
Chapter 2 Predictive Codingp. 27
2.1 Forward Predictive Codingp. 27
2.2 DPCM Codec Schematicp. 28
2.3 Predictor Designp. 29
2.4 Adaptive One-Word-Memory Quantizationp. 36
2.5 DPCM Performancep. 37
2.6 Backward-Adaptive Predictionp. 39
2.7 The 32 kbps G.721 ADPCM Codecp. 43
2.8 Subjective and Objective Speech Qualityp. 49
2.9 Variable-Rate G.726 and Embedded G.727 ADPCMp. 50
2.10 Rate-Distortion in Predictive Codingp. 58
2.11 Chapter Summaryp. 62
Part II Analysis by Synthesis Codingp. 63
Chapter 3 Analysis-by-Synthesis Principlesp. 65
3.1 Motivationp. 65
3.2 Analysis-by-Synthesis Codec Structurep. 66
3.3 The Short-Term Synthesis Filterp. 67
3.4 Long-Term Predictionp. 70
3.5 Excitation Modelsp. 78
3.6 Adaptive Short-Term and Long-Term Post-Filteringp. 81
3.7 Lattice-Based Linear Predictionp. 83
3.8 Chapter Summaryp. 89
Chapter 4 Speech Spectral Quantizationp. 90
4.1 Log-Area Ratiosp. 90
4.2 Line Spectral Frequenciesp. 95
4.3 Vector Quantization of Spectral Parametersp. 105
4.4 Spectral Quantizers for Wideband Speech Codingp. 113
4.5 Chapter Summaryp. 126
Chapter 5 Regular Pulse Excited Codingp. 127
5.1 Theoretical Backgroundp. 127
5.2 The 13 kbps RPE-LTP GSM Speech Encoderp. 133
5.3 The 13 kbps RPE-LTP GSM Speech Decoderp. 137
5.4 Bit Sensitivity of the 13 kbps GSM RPE-LTP Codecp. 140
5.5 Application Example: A Toolbox-Based Speech Transceiverp. 142
5.6 Chapter Summaryp. 144
Chapter 6 Forward-Adaptive Code Excited Linear Predictionp. 145
6.1 Backgroundp. 145
6.2 The Original CELP Approachp. 146
6.3 Fixed Codebook Searchp. 149
6.4 CELP Excitation Modelsp. 151
6.5 Optimization of the CELP Codec Parametersp. 160
6.6 The Error-Sensitivity of CELP Codecsp. 175
6.7 Application Example: A Dual-Mode 3.1 kBd Speech Transceiverp. 187
6.8 Multi-Slot PRMA Transceiverp. 200
6.9 Chapter Summaryp. 206
Chapter 7 Standard Forward-Adaptive CELP Codecsp. 207
7.1 Backgroundp. 207
7.2 The U.S. DoD FS-1016 4.8 kbits/s CELP Codecp. 207
7.3 The IS-54 DAMPS kbps Pan American Speech Codecp. 213
7.4 The 6.7 kbps Japanese Digital Cellular System's Speech Codecp. 216
7.5 The Qualcomm Variable-Rate CELP Codecp. 218
7.6 Japanese Half-Rate Speech Codecp. 225
7.7 The Half-Rate GSM Codecp. 233
7.8 The 8 kbits/s G.729 Codecp. 237
7.9 The Reduced Complexity G.729 Annex A Codecp. 256
7.10 The 12.2 kbps Enhanced Full-Rate GSM Speech Codecp. 259
7.11 The Enhanced Full-Rate 7.4 kbps IS-136 Speech Codecp. 264
7.12 The ITU G.723.1 Dual-Rate Codecp. 268
7.13 Chapter Summaryp. 277
Chapter 8 Backward-Adaptive Code Excited Linear Predictionp. 279
8.1 Introductionp. 279
8.2 Motivation and Backgroundp. 279
8.3 Backward-Adaptive G.728 Codec Schematicp. 282
8.4 Backward-Adaptive G.728 Coding Algorithmp. 284
8.5 Reduced-Rate G.728-Like Codec: Variable-Length Excitation Vectorp. 298
8.6 The Effects of Long-Term Predictionp. 300
8.7 Closed-Loop Codebook Trainingp. 305
8.8 Reduced-Rate G.728-Like Codec II: Constant-Length Excitation Vectorp. 309
8.9 Programmable-Rate 8-4 kbps Low-Delay CELP Codecsp. 310
8.10 Backward-Adaptive Error Sensitivity Issuesp. 327
8.11 A Low-Delay Multimode Speech Transceiverp. 333
8.12 Chapter Summaryp. 338
Part III Wideband Coding and Transmissionp. 339
Chapter 9 Wideband Speech Codingp. 341
9.1 Sub-band-ADPCM Wideband Coding at 64 kbpsp. 341
9.2 Wideband Transform Coding at 32 kbpsp. 357
9.3 Sub-Band-Split Wideband CELP Codecsp. 360
9.4 Fullband Wideband ACELP Codingp. 363
9.5 A Turbo-Coded Burst-by-Burst Adaptive Wideband Speech Transceiverp. 368
9.6 Chapter Summaryp. 384
Part IV Very Low-Rate Coding and Transmissionp. 385
Chapter 10 Overview of Low-Rate Speech Codingp. 387
10.1 Low-Bitrate Speech Codingp. 387
10.2 Linear Predictive Coding Modelp. 400
10.3 Speech Quality Measurementsp. 403
10.4 Speech Databasep. 406
10.5 Chapter Summaryp. 409
Chapter 11 Linear Predictive Vocoderp. 411
11.1 Overview of a Linear Predictive Vocoderp. 411
11.2 Line Spectrum Frequencies Quantizationp. 412
11.3 Pitch Detectionp. 417
11.4 Unvoiced Framesp. 428
11.5 Voiced Framesp. 429
11.6 Adaptive Post-Filterp. 430
11.7 Pulse Dispersion Filterp. 432
11.8 Results for Linear Predictive Vocoderp. 437
11.9 Chapter Summaryp. 440
Chapter 12 Wavelets and Pitch Detectionp. 441
12.1 Conceptual Introduction to Waveletsp. 441
12.2 Introduction to Wavelet Mathematicsp. 444
12.3 Pre-Processing the Wavelet Transform Signalp. 449
12.4 Voiced-Unvoiced Decisionp. 452
12.5 Wavelet-Based Pitch Detectorp. 453
12.6 Summary and Conclusionsp. 460
Chapter 13 Zinc Function Excitationp. 461
13.1 Introductionp. 461
13.2 Overview of Prototype Waveform Interpolation Zinc Function Excitationp. 462
13.3 Zinc Function Modelingp. 466
13.4 Pitch Detectionp. 470
13.5 Voiced Speechp. 473
13.6 Excitation Interpolation Between Prototype Segmentsp. 477
13.7 Unvoiced Speechp. 483
13.8 Adaptive Post-Filterp. 483
13.9 Results for Single Zinc Function Excitationp. 483
13.10 Error Sensitivity of the 1.9 kbps PWI-ZFE Coderp. 486
13.11 Multiple Zinc Function Excitationp. 490
13.12 A Sixth-Rate, 3.8 kbps GSM-Like Speech Transceiverp. 496
13.13 Chapter Summaryp. 500
Chapter 14 Mixed-Multiband Excitationp. 501
14.1 Introductionp. 501
14.2 Overview of Mixed-Multiband Excitationp. 502
14.3 Finite Impulse Response Filterp. 504
14.4 Mixed-Multiband Excitation Encoderp. 507
14.5 Mixed-Multiband Excitation Decoderp. 510
14.6 Performance of the Mixed-Multiband Excitation Coderp. 513
14.7 A Higher Rate 3.85 kbps Mixed-Multiband Excitation Schemep. 520
14.8 A 2.35 kbit/s Joint-Detection-Based CDMA Speech Transceiverp. 523
14.9 Chapter Summaryp. 530
Chapter 15 Sinusoidal Transform Coding Below 4 kbpsp. 531
15.1 Introductionp. 531
15.2 Sinusoidal Analysis of Speech Signalsp. 532
15.3 Sinusoidal Synthesis of Speech Signalsp. 534
15.4 Low-Bitrate Sinusoidal Codersp. 536
15.5 Incorporating Prototype Waveform Interpolationp. 539
15.6 Encoding the Sinusoidal Frequency Componentp. 541
15.7 Determining the Excitation Componentsp. 543
15.8 Quantizing the Excitation Parametersp. 548
15.9 Sinusoidal Transform Decoderp. 556
15.10 Speech Coder Performancep. 558
15.11 Chapter Summaryp. 563
Chapter 16 Conclusions on Low-Rate Codingp. 565
16.1 Overviewp. 565
16.2 Listening Testsp. 565
16.3 Summary of Very Low-Rate Codingp. 567
16.4 Further Researchp. 568
Chapter 17 Comparison of Speech Codecs and Transceiversp. 569
17.1 Background to Speech Quality Evaluationp. 569
17.2 Objective Speech Quality Measuresp. 570
17.3 Subjective Measuresp. 577
17.4 Comparison of Subjective and Objective Measuresp. 578
17.5 Subjective Speech Quality of Various Codecsp. 580
17.6 Error Sensitivity Comparison of Various Codecsp. 582
17.7 Objective Speech Performance of Various Transceiversp. 583
Appendix A Constructing the Quadratic Spline Waveletsp. 589
Appendix B Zinc Function Excitationp. 593
Appendix C Probability Density Function for Amplitudesp. 597
Bibliographyp. 601
Indexp. 623
Author Indexp. 631
Go to:Top of Page