Skip to:Content
|
Bottom
Cover image for Data analysis and visualization in genomics and proteomics
Title:
Data analysis and visualization in genomics and proteomics
Publication Information:
Hoboken, NJ : John Wiley, 2005
ISBN:
9780470094396

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000004993592 QH452.7 D37 2005 Open Access Book Book
Searching...

On Order

Summary

Summary

Data Analysis and Visualization in Genomics and Proteomics is the first book addressing integrative data analysis and visualization in this field. It addresses important techniques for the interpretation of data originating from multiple sources, encoded in different formats or protocols, and processed by multiple systems.

One of the first systematic overviews of the problem of biological data integration using computational approaches This book provides scientists and students with the basis for the development and application of integrative computational methods to analyse biological data on a systemic scale Places emphasis on the processing of multiple data and knowledge resources, and the combination of different models and systems


Author Notes

Dr Francisco Azuaje, Faculty of Informatics, University of Ulster, Jordanstown, Northern Ireland.

Dr.'Joaquin Dopazo, Head of Bioinformatics, Spanish National Cancer Centre, Madrid, Spain.


Table of Contents

Francisco Azuaje and Joaquin DopazoAllyson L. Williams and Paul J. Kersey and Manuela Pruess and Rolf ApweilerFrancisco Azuaje and Joaquin Dopazo and Haiying WangMartin Krallinger and Alfonso ValenciaLong J. Lu and Yu Xia and Haiyuan Yu and Alexander Rives and Haoxin Lu and Falk Schubert and Mark GersteinAmanda ClareFatima Al-Shahrour and Joaquin DopazoAlban Chesnau and Claude SardetSteve R. Pettifer and James R. Sinnott and Teresa K. AttwoodQizheng Sheng and Yves Moreau and Frank De Smet and Kathleen Marchal and Bart De MoorOlga G. TroyanskayaRamon Diaz-UriartePedro Larranaga and Inaki Inza and Jose L. FloresInge Jonassen
Prefacep. xi
List of Contributorsp. xiii
Section I Introduction - Data Diversity and Integrationp. 1
1 Integrative Data Analysis and Visualization: Introduction to Critical Problems, Goals and Challengesp. 3
1.1 Data Analysis and Visualization: An Integrative Approachp. 3
1.2 Critical Design and Implementation Factorsp. 5
1.3 Overview of Contributionsp. 8
Referencesp. 9
2 Biological Databases: Infrastructure, Content and Integrationp. 11
2.1 Introductionp. 11
2.2 Data Integrationp. 12
2.3 Review of Molecular Biology Databasesp. 17
2.4 Conclusionp. 23
Referencesp. 26
3 Data and Predictive Model Integration: an Overview of Key Concepts, Problems and Solutionsp. 29
3.1 Integrative Data Analysis and Visualization: Motivation and Approachesp. 29
3.2 Integrating Informational Views and Complexity for Understanding Functionp. 31
3.3 Integrating Data Analysis Techniques for Supporting Functional Analysisp. 34
3.4 Final Remarksp. 36
Referencesp. 38
Section II Integrative Data Mining and Visualization - Emphasis on Combination of Multiple Data Typesp. 41
4 Applications of Text Mining in Molecular Biology, from Name Recognition to Protein Interaction Mapsp. 43
4.1 Introductionp. 44
4.2 Introduction to Text Mining and NLPp. 45
4.3 Databases and Resources for Biomedical Text Miningp. 47
4.4 Text Mining and Protein-Protein Interactionsp. 50
4.5 Other Text-Mining Applications in Genomicsp. 55
4.6 The Future of NLP in Biomedicinep. 56
Acknowledgementsp. 56
Referencesp. 56
5 Protein Interaction Prediction by Integrating Genomic Features and Protein Interaction Network Analysisp. 61
5.1 Introductionp. 62
5.2 Genomic Features in Protein Interaction Predictionsp. 63
5.3 Machine Learning on Protein-Protein Interactionsp. 67
5.4 The Missing Value Problemp. 73
5.5 Network Analysis of Protein Interactionsp. 75
5.6 Discussionp. 79
Referencesp. 80
6 Integration of Genomic and Phenotypic Datap. 83
6.1 Phenotypep. 83
6.2 Forward Genetics and QTL Analysisp. 85
6.3 Reverse Geneticsp. 87
6.4 Prediction of Phenotype from Other Sources of Datap. 88
6.5 Integrating Phenotype Data with Systems Biologyp. 90
6.6 Integration of Phenotype Data in Databasesp. 93
6.7 Conclusionsp. 95
Referencesp. 95
7 Ontologies and Functional Genomicsp. 99
7.1 Information Mining in Genome-Wide Functional Analysisp. 99
7.2 Sources of Information: Free Text Versus Curated Repositoriesp. 100
7.3 Bio-Ontologies and the Gene Ontology in Functional Genomicsp. 101
7.4 Using GO to Translate the Results of Functional Genomic Experiments into Biological Knowledgep. 103
7.5 Statistical Approaches to Test Significant Biological Differencesp. 104
7.6 Using FatiGO to Find Significant Functional Associations in Clusters of Genesp. 106
7.7 Other Toolsp. 107
7.8 Examples of Functional Analysis of Clusters of Genesp. 108
7.9 Future Prospectsp. 110
Referencesp. 110
8 The C. elegans Interactome: its Generation and Visualizationp. 113
8.1 Introductionp. 113
8.2 The ORFeome: the first step toward the interactome of C. elegansp. 116
8.3 Large-Scale High-Throughput Yeast Two-Hybrid Screens to Map the C. elegans Protein-Protein Interaction (Interactome) Network: Technical Aspectsp. 118
8.4 Visualization and Topology of Protein-Protein Interaction Networksp. 121
8.5 Cross-Talk Between the C. elegans Interactome and other Large-Scale Genomics and Post-Genomics Data Setsp. 123
8.6 Conclusion: From Interactions to Therapiesp. 129
Referencesp. 130
Section III Integrative Data Mining and Visualization - Emphasis on Combination of Multiple Prediction Models and Methodsp. 135
9 Integrated Approaches for Bioinformatic Data Analysis and Visualization - Challenges, Opportunities and New Solutionsp. 137
9.1 Introductionp. 137
9.2 Sequence Analysis Methods and Databasesp. 139
9.3 A View Through a Portalp. 141
9.4 Problems with Monolithic Approaches: One Size Does Not Fit Allp. 142
9.5 A Toolkit Viewp. 143
9.6 Challenges and Opportunitiesp. 145
9.7 Extending the Desktop Metaphorp. 147
9.8 Conclusionsp. 151
Acknowledgementsp. 151
Referencesp. 152
10 Advances in Cluster Analysis of Microarray Datap. 153
10.1 Introductionp. 153
10.2 Some Preliminariesp. 155
10.3 Hierarchical Clusteringp. 157
10.4 k-Means Clusteringp. 159
10.5 Self-Organizing Mapsp. 159
10.6 A Wish List for Clustering Algorithmsp. 160
10.7 The Self-Organizing Tree Algorithmp. 161
10.8 Quality-Based Clustering Algorithmsp. 162
10.9 Mixture Modelsp. 163
10.10 Biclustering Algorithmsp. 166
10.11 Assessing Cluster Qualityp. 168
10.12 Open Horizonsp. 170
Referencesp. 171
11 Unsupervised Machine Learning to Support Functional Characterization of Genes: Emphasis on Cluster Description and Class Discoveryp. 175
11.1 Functional Genomics: Goals and Data Sourcesp. 175
11.2 Functional Annotation by Unsupervised Analysis of Gene Expression Microarray Datap. 177
11.3 Integration of Diverse Functional Data For Accurate Gene Function Predictionp. 179
11.4 MAGIC - General Probabilistic Integration of Diverse Genomic Datap. 180
11.5 Conclusionp. 188
Referencesp. 189
12 Supervised Methods with Genomic Data: a Review and Cautionary Viewp. 193
12.1 Chapter Objectivesp. 193
12.2 Class Prediction and Class Comparisonp. 194
12.3 Class Comparison: Finding/Ranking Differentially Expressed Genesp. 194
12.4 Class Prediction and Prognostic Predictionp. 198
12.5 ROC Curves for Evaluating Predictors and Differential Expressionp. 201
12.6 Caveats and Admonitionsp. 203
12.7 Final Note: Source Code Should be Availablep. 209
Acknowledgementsp. 210
Referencesp. 210
13 A Guide to the Literature on Inferring Genetic Networks by Probabilistic Graphical Modelsp. 215
13.1 Introductionp. 215
13.2 Genetic Networksp. 216
13.3 Probabilistic Graphical Modelsp. 218
13.4 Inferring Genetic Networks by Means of Probabilistic Graphical Modelsp. 229
13.5 Conclusionsp. 234
Acknowledgementsp. 235
Referencesp. 235
14 Integrative Models for the Prediction and Understanding of Protein Structure Patternsp. 239
14.1 Introductionp. 239
14.2 Structure Predictionp. 241
14.3 Classifications of Structuresp. 244
14.4 Comparing Protein Structuresp. 246
14.5 Methods for the Discovery of Structure Motifsp. 249
14.6 Discussion and Conclusionsp. 252
Referencesp. 254
Indexp. 257
Go to:Top of Page