Title:
Phase-based speech processing
Publication Information:
Singapore : World Scientific Publishing Company, 2006
ISBN:
9789812566126
Available:
Item Barcode:
30000010148672
Call Number:
TK7882.S65 P42 2006
Material Type:
Open Access Book
Item Category 1:
Book

Summary

This is the first book that takes a detailed look at the importance of phase in the design of speech processing systems. Phase, in comparison with amplitude, is often ignored in speech recognition applications. This book therefore highlights some of the important ways in which the phase of speech signals can be utilized for sound localization, enhancement, and recognition. It also discusses state-of-the-art research in phase-based speech processing, starting from the basics of signal processing and recording and moving on to single-microphone speech recognition, the processing of speech by humans, the importance of phase in human speech recognition, and multi-microphone phase-based speech processing.


Table of Contents

1 Introduction p. 1
1.1 Motivation p. 1
1.2 The Meaning of Phase p. 2
1.3 Dual Microphone Speech Processing, or Why Two Ears Are Better Than One p. 3
1.4 The Microphone From the 22nd Century - The Human Ear p. 5
1.5 Why Smart Computers Are Hard To Find p. 6
1.6 The Bigger Picture p. 7
1.7 Book Overview p. 7
2 Signal Processing Basics p. 9
2.1 Continuous and Discrete Time Signals p. 9
2.2 Continuous Time Fourier Transform p. 12
2.2.1 Useful Mathematical Identities p. 13
2.3 Sampling p. 17
2.4 Spectral Analysis of Discrete Time Signals p. 20
2.4.1 The Effect of Sampling on the Fourier Transform of a Signal p. 20
2.4.2 The Reconstruction Theorem p. 23
2.4.3 The Discrete Time Fourier Transform (DTFT) p. 26
2.4.4 Sampling in the Frequency Domain p. 28
2.4.5 The Discrete Fourier Transform (DFT) p. 33
2.5 Windowing p. 48
2.6 Delaying Discrete Time Signals by Non-Integer Amounts p. 58
3 Single-Microphone Speech Processing p. 63
3.1 Introduction p. 63
3.2 Background p. 63
3.3 The Role of Phase in Speech Enhancement p. 64
3.4 The Role of Phase in Speech Recognition p. 66
3.4.1 The Fundamentals of HMM Based ASR p. 66
3.4.2 Performance Overview of ASR Systems p. 71
3.5 Phase Estimation from Magnitude p. 76
3.5.1 Signal Estimation from the Modified STFT p. 76
3.5.2 Signal Estimation from the STFT Magnitude p. 77
3.6 Recent Developments in Phase Utilization p. 78
3.7 Summary p. 81
4 Human Hearing p. 83
4.1 Anatomy of the Ear p. 83
4.1.1 External Ear p. 83
4.1.2 Middle Ear p. 84
4.1.3 Inner Ear p. 84
4.2 Physiology of the Ear p. 84
4.2.1 Transmission of Sound Through the Middle Ear p. 84
4.2.2 Physiology of the Cochlea p. 85
4.2.3 Inner Ear Performs Super Fast Fourier Transform p. 86
4.2.4 The "Place" Principle p. 86
4.2.5 Action Potential and Determination of Loudness p. 86
4.2.6 Detection of the Change in the Loudness and the Power Law p. 88
4.2.7 Threshold for Hearing and Frequency Range of Hearing p. 88
4.3 Hearing in the Central Nervous System p. 89
4.3.1 Parallel Processing of Sound in the Cerebral Cortex p. 89
4.3.2 Importance of the Cerebral Cortex in Hearing p. 89
4.4 The Importance of Phase in Human Speech Processing p. 90
4.5 Experimental Setup p. 91
4.6 Experimental Results p. 92
4.7 Modeling the Effect of Phase p. 94
4.8 Speech Recognition Experiments Incorporating Phase Restoration p. 97
4.8.1 Experimental Setup p. 97
4.8.2 Experimental Results p. 97
4.9 Conclusions p. 98
5 Multi-Microphone Phase-Based Speech Processing p. 101
5.1 Introduction p. 101
5.1.1 Dual Microphone Sound Model p. 105
5.1.2 Frequency Dependent Nature of Phase Wrapping p. 106
5.2 Delay-and-sum Beamforming p. 109
5.2.1 Two-microphone Sum Beamforming p. 109
5.2.2 Multi-element Sum Beamforming p. 112
5.2.3 Steering the Array p. 114
5.3 Sound Localization Using a Delay-and-sum Beamformer p. 116
5.4 TDOA Based Sound Localization p. 116
5.4.1 TDOA Estimation p. 119
5.5 A Detailed Look at the Phase Error p. 122
5.6 The Relationship Between Phase-Error and SNR p. 124
5.7 Probabilistic constraints on the SNRs p. 127
5.8 Phase-Based Time-Varying Filters p. 131
5.9 Beamforming as a Phase-Error Filter p. 134
6 Concluding Remarks p. 137
6.1 Summary (i.e. things you would have known if you had read the book) p. 137
6.2 Directions for Future Research p. 138
6.3 Where Does it End? p. 139
6.4 Disclaimer p. 140
Bibliography p. 141