Skip to:Content
|
Bottom
Cover image for Speech and audio processing in adverse environments
Title:
Speech and audio processing in adverse environments
Series:
Springer series on signals and communication technology
Publication Information:
Berlin : Springer, 2008
Physical Description:
xx, 735 p. : ill. ; 24 cm.
ISBN:
9783540706014

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010194200 TK5102.9 S634 2008 Open Access Book Book
Searching...

On Order

Summary

Summary

Users of signal processing systems are never satis?ed with the system they currently use. They are constantly asking for higher quality, faster perf- mance, more comfort and lower prices. Researchers and developers should be appreciative for this attitude. It justi?es their constant e?ort for improved systems. Better knowledge about biological and physical interrelations c- ing along with more powerful technologies are their engines on the endless road to perfect systems. This book is an impressive image of this process. After "Acoustic Echo 1 and Noise Control" published in 2004 many new results lead to "Topics in 2 Acoustic Echo and Noise Control" edited in 2006 . Today - in 2008 - even morenew?ndingsandsystemscouldbecollectedinthisbook.Comparingthe contributions in both edited volumes progress in knowledge and technology becomesclearlyvisible:Blindmethodsandmultiinputsystemsreplace"h- ble" low complexity systems. The functionality of new systems is less and less limited by the processing power available under economic constraints. The editors have to thank all the authors for their contributions. They cooperated readily in our e?ort to unify the layout of the chapters, the ter- nology, and the symbols used. It was a pleasure to work with all of them. Furthermore, it is the editors concern to thank Christoph Baumann and the Springer Publishing Company for the encouragement and help in publi- ing this book.


Table of Contents

Abbreviations and Acronymsp. 1
1 IntroductionE. Hänsler and G. Schmidt
1.1 Overview about the Bookp. 8
Part I Speech Enhancement
2 Low Delay Filter-Banks for Speech and Audio ProcessingH. W. Löllmann and P. Vary
2.1 Introductionp. 13
2.2 Analysis-Synthesis Filter-Banksp. 15
2.2.1 General Structurep. 15
2.2.2 Tree-Structured Filter-Banksp. 16
2.2.3 Modulated Filter-Banksp. 17
2.2.4 Frequency Warped Filter-Banksp. 20
2.2.5 Low Delay Filter-Banksp. 26
2.3 The Filter-Bank Equalizerp. 29
2.3.1 Conceptp. 29
2.3.2 Prototype Filter Designp. 31
2.3.3 Relation between GDFT and GDCTp. 33
2.3.4 Realization for Different Filter Structuresp. 35
2.3.5 Polyphase Network Implementationp. 37
2.3.6 The Non-Uniform Filter-Bank Equalizerp. 41
2.3.7 Comparison between FBE and AS FBp. 43
2.3.8 Algorithmic Complexityp. 43
2.4 Further Measures for Signal Delay Reductionp. 44
2.4.1 Conceptp. 45
2.4.2 Approximation by a Moving-Average Filterp. 45
2.4.3 Approximation by an Auto-Regressive Filterp. 46
2.4.4 Algorithmic Complexityp. 47
2.4.5 Warped Filter Approximationp. 48
2.5 Application to Noise Reductionp. 49
2.5.1 System Configurationsp. 49
2.5.2 Instrumental Quality Measuresp. 50
2.5.3 Simulation Results for the Uniform Filter-Banksp. 51
2.5.4 Simulation Results for the Warped Filter-Banksp. 53
2.6 Conclusionsp. 55
Referencesp. 56
3 A Pre-Filter for Hands-Free Car Phone Noise Reduction: Suppression of Harmonic Engine Noise ComponentsH. Puder
3.1 Introductionp. 63
3.2 Analysis of the Different Car Noise Componentsp. 64
3.2.1 Wind Noisep. 65
3.2.2 Tire Noisep. 65
3.2.3 Engine Noisep. 66
3.3 Engine Noise Removal Based on Notch Filtersp. 68
3.4 Compensation of Engine Harmonics with Adaptive Filtersp. 73
3.4.1 Step-Size Controlp. 75
3.4.2 Calculating the Optimal Step-Sizep. 78
3.4.3 Results of the Compensation Approachp. 80
3.5 Evaluation and Comparison of the Results Obtained by the Notch Filter and the Compensation Approachp. 84
3.6 Conclusions and Summaryp. 85
3.6.1 Conclusionp. 85
3.6.2 Summaryp. 86
Referencesp. 87
4 Model-Based Speech EnhancementM. Krini and G. Schmidt
4.1 Introductionp. 89
4.2 Conventional Speech Enhancement Schemesp. 91
4.3 Speech Enhancement Schemes Based on Nonlinearitiesp. 93
4.4 Speech Enhancement Schemes Based on Speech Reconstructionp. 97
4.4.1 Feature Extraction and Controlp. 99
4.4.2 Reconstruction of Speech Signalsp. 110
4.5 Combining the Reconstructed and the Noise Suppressed Signalp. 124
4.5.1 Adding the Fully Reconstructed Signalp. 125
4.5.2 Adding only the Voiced Part of the Reconstructed Signalp. 129
4.6 Summary and Outlookp. 133
Referencesp. 133
5 Bandwidth Extension of Telephony SpeechB.Iser and G.Schmidt
5.1 Introductionp. 135
5.2 Organization of the Chapter 137
5.3 Basicsp. 138
5.3.1 Human Speech Generationp. 139
5.3.2 Source-Filter Modelp. 141
5.3.3 Parametric Representations of the Spectral Envelopep. 143
5.3.4 Distance Measuresp. 147
5.4 Non-Model-Based Algorithms for Bandwidth Extensionp. 149
5.4.1 Oversampling with Imagingp. 149
5.4.2 Spectral Shiftingp. 151
5.4.3 Application of Non-Linear Characteristicsp. 153
5.5 Model-Based Algorithms for Bandwidth Extensionp. 153
5.5.1 Generation of the Excitation Signalp. 155
5.5.2 Vocal Tract Transfer Function Estimationp. 159
5.6 Evaluation of Bandwidth Extension Algorithmsp. 176
5.6.1 Objective Distance Measuresp. 177
5.6.2 Subjective Measuresp. 180
5.7 Conclusionsp. 181
Referencesp. 182
6 Dereverberation and Residual Echo Suppression in Noisy EnvironmentsE. A. P. Habets and S. Gannot and I. Cohen
6.1 Introductionp. 186
6.2 Problem Formulationp. 188
6.3 OM-LSA Estimator for Multiple Interferencesp. 191
6.3.1 OM-LSA Estimatorp. 191
6.3.2 A priori SIR Estimatorp. 193
6.4 Dereverberation of Noisy Speech Signalsp. 195
6.4.1 Short Introduction to Speech Dereverberationp. 195
6.4.2 Problem Formulationp. 197
6.4.3 Statistical Reverberation Modelp. 199
6.4.4 Late Reverberant Spectral Variance Estimatorp. 200
6.4.5 Summary and Discussionp. 203
6.5 Residual Echo Suppressionp. 203
6.5.1 Problem Formulationp. 204
6.5.2 Late Residual Echo Spectral Variance Estimatorp. 206
6.5.3 Parameter Estimationp. 208
6.5.4 Summaryp. 210
6.6 Joint Suppression of Reverberation, Residual Echo, and Noisep. 210
6.7 Experimental Resultsp. 212
6.7.1 Experimental Setupp. 214
6.7.2 Joint Suppression of Reverberation and Noisep. 214
6.7.3 Suppression of Residual Echop. 216
6.7.4 Joint Suppression of Reverberation, Residual Echo, and Noisep. 221
6.8 Summary and Outlookp. 223
Referencesp. 224
7 Low Distortion Noise Cancellers -- Revival of a Classical TechniqueA. Sugiyama
7.1 Introductionp. 229
7.2 Distortions in Widrow's Adaptive Noise Cancellerp. 230
7.2.1 Distortion by Interferencep. 230
7.2.2 Distortion by Crosstalkp. 232
7.3 Paired Filter (PF) Structurep. 233
7.3.1 Algorithmp. 233
7.3.2 Evaluationsp. 235
7.4 Crosstalk Resistant ANC and Cross-Coupled Structurep. 239
7.4.1 Crosstalk Resistant ANCp. 240
7.4.2 Cross-Coupled Structurep. 241
7.5 Cross-Coupled Paired Filter (CCPF) Structurep. 242
7.5.1 Algorithmp. 242
7.5.2 Evaluationsp. 245
7.6 Generalized Cross-Coupled Paired Filter (GCCPF) Structurep. 247
7.6.1 Algorithmp. 250
7.6.2 Evaluation by Recorded Signalsp. 251
7.7 Demonstration in a Personal Robotp. 261
7.8 Conclusionsp. 261
Referencesp. 263
Part II Echo Cancellation
8 Nonlinear Echo Cancellation Based on Spectral ShapingO. Hoshuyama and A. Sugiyama
8.1 Introductionp. 267
8.2 Frequency-Domain Model of Highly Nonlinear Residual Echop. 268
8.2.1 Spectral Correlation Between Residual Echo and Echo Replicap. 269
8.2.2 Model of Residual Echo Based on Spectral Correlationp. 273
8.3 Echo Canceller Based on the New Residual Echo Modelp. 274
8.3.1 Overall Structurep. 274
8.3.2 Estimation of Near-End Speechp. 275
8.3.3 Spectral Gain Controlp. 276
8.4 Evaluationsp. 277
8.4.1 Objective Evaluationsp. 277
8.4.2 Subjective Evaluationp. 279
8.5 DSP Implementation and Real-Time Evaluationp. 280
8.6 Conclusionsp. 280
Referencesp. 281
Part III Signal and System Quality Evaluation
9 Telephone-Speech QualityU. Heute
9.1 Telephone-Speech Signalsp. 287
9.1.1 Telephone Scenariop. 287
9.1.2 Telephone-Scenario Modelp. 287
9.2 Speech-Signal Qualityp. 289
9.2.1 Intelligibilityp. 289
9.2.2 Speech-Sound Qualityp. 290
9.3 Speech-Quality Assessmentp. 292
9.3.1 Auditory Quality Assessmentp. 292
9.3.2 Aimsp. 292
9.3.3 Instrumental Quality Assessmentp. 293
9.4 Compound-System Quality Predictionp. 293
9.4.1 The System-Planning Taskp. 293
9.4.2 ETSI Network-Planning Model (E-Model)p. 293
9.5 Auditory Total-Quality Assessmentp. 294
9.5.1 Conversation Testsp. 294
9.5.2 Listening Testsp. 296
9.5.3 LOTs with Pair Comparisonsp. 296
9.5.4 Absolute-Category Rating (ACR) LOTsp. 297
9.6 Auditory Quality-Attribute Analysisp. 298
9.6.1 Quality Attributesp. 298
9.6.2 Attribute-Oriented LOTsp. 298
9.6.3 Search for Suitable Attributesp. 302
9.6.4 Integral-Quality Estimation from Attributesp. 305
9.7 Instrumental Total-Quality Measurementp. 306
9.7.1 Signal Comparisonsp. 306
9.7.2 Evaluation Approachesp. 306
9.7.3 Psychoacoustically Motivated Measuresp. 312
9.8 Instrumental Attribute-Based Quality Measurementsp. 320
9.8.1 Basic Ideasp. 320
9.8.2 Loudnessp. 322
9.8.3 Sharpnessp. 323
9.8.4 Roughnessp. 323
9.8.5 Directness/Frequency Content (DFC)p. 324
9.8.6 Continuityp. 326
9.8.7 Noisinessp. 329
9.8.8 Combined Direct and Attribute-Based Total Quality Determinationp. 331
9.9 Conclusions, Outlook, and Final Remarksp. 331
Referencesp. 332
10 Evaluation of Hands-free TerminalsF. Kettler and H.-W. Gierlich
10.1 Introductionp. 339
10.2 Quality Assessment of Hands-free Terminalsp. 340
10.3 Subjective Methods for Determining the Communicational Qualityp. 342
10.3.1 General Setup and Opinion Scales Used for Subjective Performance Evaluationp. 343
10.3.2 Conversation Testsp. 345
10.3.3 Double Talk Testsp. 346
10.3.4 Talking and Listening Testsp. 347
10.3.5 Listening-only Tests (LOT) and Third Party Listening Testsp. 348
10.3.6 Experts Tests for Assessing Real Life Situationsp. 349
10.4 Test Environmentp. 350
10.4.1 The Acoustical Environmentp. 351
10.4.2 Background Noise Simulation Techniquesp. 351
10.4.3 Positioning of the Hands-Free Terminalp. 352
10.4.4 Positioning of the Artificial Headp. 352
10.4.5 Influence of the Transmission Systemp. 354
10.5 Test Signals and Analysis Methodsp. 354
10.5.1 Speech and Perceptual Speech Quality Measuresp. 356
10.5.2 Speech-like Test Signalsp. 356
10.5.3 Background Noisep. 360
10.5.4 Applicationsp. 363
10.6 Result Representationp. 365
10.6.1 Interpretation of HFT "Quality Pies"p. 366
10.6.2 Examplesp. 368
10.7 Related Aspectsp. 368
10.7.1 The Lombard Effectp. 368
10.7.2 Intelligibility Outside Vehiclesp. 372
Referencesp. 375
Part IV Multi-Channel Processing
11 Correlation-Based TDOA-Estimation for Multiple Sources in Reverberant EnvironmentsJ. Scheuing and B. Yang
11.1 Introductionp. 381
11.2 Analysis of TDOA Ambiguitiesp. 383
11.2.1 Signal Modelp. 383
11.2.2 Multipath Ambiguityp. 384
11.2.3 Multiple Source Ambiguityp. 384
11.2.4 Ambiguity due to Periodic Signalsp. 386
11.2.5 Principles of TDOA Disambiguationp. 386
11.3 Estimation of Direct Path TDOAsp. 390
11.3.1 Correlation and Extremum Positionsp. 390
11.3.2 Raster Matchingp. 392
11.4 Consistent TDOA Graphsp. 397
11.4.1 TDOA Graphp. 397
11.4.2 Strategies of Consistency Checkp. 398
11.4.3 Properties of TDOA Graphsp. 399
11.4.4 Efficient Synthesis Algorithmp. 402
11.4.5 Initialization and Terminationp. 404
11.4.6 Estimating the Number of Active Sourcesp. 405
11.5 Experimental Resultsp. 406
11.5.1 Localization Systemp. 406
11.5.2 TDOA Estimation of a Single Signal Blockp. 408
11.5.3 Source Position Estimationp. 412
11.5.4 Evaluation of Continuous Measurementsp. 412
11.6 Summaryp. 414
Referencesp. 415
12 Microphone Calibration for Multi-Channel Signal ProcessingM. Buck and T. Haulick and H.-J. Pfleiderer
12.1 Introductionp. 417
12.2 Beamforming with Ideal Microphonesp. 418
12.2.1 Principle of Beamformingp. 418
12.2.2 Evaluation of Beamformersp. 421
12.2.3 Statistically Optimum Beamformersp. 424
12.3 Microphone Mismatch and its Effect on Beamformingp. 427
12.3.1 Model for Non-Ideal Microphone Characteristicsp. 428
12.3.2 Effect of Microphone Mismatch on Fixed Beamformersp. 429
12.3.3 Effect of Microphone Mismatch on Adaptive Beamformersp. 430
12.3.4 Comparison of Fixed and Adaptive Beamformersp. 432
12.4 Calibration Techniques and their Limits for Real-World Applicationsp. 432
Go to:Top of Page