Cover image for Genomic signal processing
Title:
Genomic signal processing
Personal Author:
Series:
Princeton series in applied mathematics
Publication Information:
Princeton, NJ : Princeton University Press, 2007
Physical Description:
xiii, 298 p. : ill. ; 35 cm.
ISBN:
9780691117621
Added Author:

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010192057 QP517.C45 S45 2007 Open Access Book Book
Searching...

On Order

Summary

Summary

Genomic signal processing (GSP) can be defined as the analysis, processing, and use of genomic signals to gain biological knowledge, and the translation of that knowledge into systems-based applications that can be used to diagnose and treat genetic diseases. Situated at the crossroads of engineering, biology, mathematics, statistics, and computer science, GSP requires the development of both nonlinear dynamical models that adequately represent genomic regulation, and diagnostic and therapeutic tools based on these models. This book facilitates these developments by providing rigorous mathematical definitions and propositions for the main elements of GSP and by paying attention to the validity of models relative to the data. Ilya Shmulevich and Edward Dougherty cover real-world situations and explain their mathematical modeling in relation to systems biology and systems medicine.



Genomic Signal Processing makes a major contribution to computational biology, systems biology, and translational genomics by providing a self-contained explanation of the fundamental mathematical issues facing researchers in four areas: classification, clustering, network modeling, and network intervention.


Author Notes

Ilya Shmulevich , an associate professor at the Institute for Systems Biology, is the coauthor of Microarray Quality Control and the coeditor of Computational and Statistical Approaches to Genomics . Edward R. Dougherty is professor of electrical and computer engineering and director of the Genomic Signal Processing Laboratory at Texas A&M University, and director of the Computational Biology Division at the Translational Genomics Research Institute. His thirteen previous books include Random Processes for Image and Signal Processing .


Table of Contents

Prefacep. ix
1 Biological Foundations
1.1 Geneticsp. 1
1.1.1 Nucleic Acid Structurep. 2
1.1.2 Genesp. 5
1.1.3 RNAp. 6
1.1.4 Transcriptionp. 6
1.1.5 Proteinsp. 9
1.1.6 Translationp. 10
1.1.7 Transcriptional Regulationp. 12
1.2 Genomicsp. 16
1.2.1 Microarray Technologyp. 17
1.3 Proteomicsp. 20
Bibliographyp. 22
2 Deterministic Models of Gene Networks
2.1 Graph Modelsp. 23
2.2 Boolean Networksp. 30
2.2.1 Cell Differentiation and Cellular Functional Statesp. 33
2.2.2 Network Properties and Dynamicsp. 35
2.2.3 Network Inferencep. 49
2.3 Generalizations of Boolean Networksp. 53
2.3.1 Asynchronyp. 53
2.3.2 Multivalued Networksp. 56
2.4 Differential Equation Modelsp. 59
2.4.1 A Differential Equation Model Incorporating Transcription and Translationp. 62
2.4.2 Discretization of the Continuous Differential Equation Modelp. 65
Bibliographyp. 70
3 Stochastic Models of Gene Networks
3.1 Bayesian Networksp. 77
3.2 Probabilistic Boolean Networksp. 83
3.2.1 Definitionsp. 86
3.2.2 Inferencep. 97
3.2.3 Dynamics of PBNsp. 99
3.2.4 Steady-State Analysis of Instantaneously Random PBNsp. 113
3.2.5 Relationships of PBNs to Bayesian Networksp. 119
3.2.6 Growing Subnetworks from Seed Genesp. 125
3.3 Interventionp. 129
3.3.1 Gene Interventionp. 130
3.3.2 Structural Interventionp. 140
3.3.3 External Controlp. 145
Bibliographyp. 151
4 Classification
4.1 Bayes Classifierp. 160
4.2 Classification Rulesp. 162
4.2.1 Consistent Classifier Designp. 162
4.2.2 Examples of Classification Rulesp. 166
4.3 Constrained Classifiersp. 168
4.3.1 Shatter Coefficientp. 171
4.3.2 VC Dimensionp. 173
4.4 Linear Classificationp. 176
4.4.1 Rosenblatt Perceptronp. 177
4.4.2 Linear and Quadratic Discriminant Analysisp. 178
4.4.3 Linear Discriminants Based on Least-Squares Errorp. 180
4.4.4 Support Vector Machinesp. 183
4.4.5 Representation of Design Error for Linear Discriminant Analysisp. 186
4.4.6 Distribution of the QDA Sample-Based Discriminantp. 187
4.5 Neural Networks Classifiersp. 189
4.6 Classification Treesp. 192
4.6.1 Classification and Regression Treesp. 193
4.6.2 Strongly Consistent Rules for Data-Dependent Partitioningp. 194
4.7 Error Estimationp. 196
4.7.1 Resubstitutionp. 196
4.7.2 Cross-validationp. 198
4.7.3 Bootstrapp. 199
4.7.4 Bolsteringp. 201
4.7.5 Error Estimator Performancep. 204
4.7.6 Feature Set Rankingp. 207
4.8 Error Correctionp. 209
4.9 Robust Classifiersp. 213
4.9.1 Optimal Robust Classifiersp. 214
4.9.2 Performance Comparison for Robust Classifiersp. 216
Bibliographyp. 221
5 Regularization
5.1 Data Regularizationp. 225
5.1.1 Regularized Discriminant Analysisp. 225
5.1.2 Noise Injectionp. 228
5.2 Complexity Regularizationp. 231
5.2.1 Regularization of the Errorp. 231
5.2.2 Structural Risk Minimizationp. 233
5.2.3 Empirical Complexityp. 236
5.3 Feature Selectionp. 237
5.3.1 Peaking Phenomenonp. 237
5.3.2 Feature Selection Algorithmsp. 243
5.3.3 Impact of Error Estimation on Feature Selectionp. 244
5.3.4 Redundancyp. 245
5.3.5 Parallel Incremental Feature Selectionp. 249
5.3.6 Bayesian Variable Selectionp. 251
5.4 Feature Extractionp. 254
Bibliographyp. 259
6 Clustering
6.1 Examples of Clustering Algorithmsp. 263
6.1.1 Euclidean Distance Clusteringp. 264
6.1.2 Self-Organizing Mapsp. 265
6.1.3 Hierarchical Clusteringp. 266
6.1.4 Model-Based Cluster Operatorsp. 268
6.2 Cluster Operatorsp. 269
6.2.1 Algorithm Structurep. 269
6.2.2 Label Operatorsp. 271
6.2.3 Bayes Clustererp. 273
6.2.4 Distributional Testing of Cluster Operatorsp. 274
6.3 Cluster Validationp. 276
6.3.1 External Validationp. 276
6.3.2 Internal Validationp. 277
6.3.3 Instability Indexp. 278
6.3.4 Bayes Factorp. 280
6.4 Learning Cluster Operatorsp. 281
6.4.1 Empirical-Error Cluster Operatorp. 281
6.4.2 Nearest-Neighbor Clustering Rulep. 283
Bibliographyp. 292
Indexp. 295