Cover image for Algorithmic aspects of bioinformatics
Title:
Algorithmic aspects of bioinformatics
Personal Author:
Series:
Natural computing series
Publication Information:
Berlin : Springer, 2007
ISBN:
9783540719120
General Note:
Available online version
Added Author:
Electronic Access:
Fulltext

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010150736 QH324.2 B62 2007 Open Access Book Book
Searching...

On Order

Summary

Summary

Advances in bioinformatics and systems biology require improved computational methods for analyzing data, while progress in molecular biology is in turn influencing the development of computer science methods. This book introduces some key problems in bioinformatics, discusses the models used to formally describe these problems, and analyzes the algorithmic approaches used to solve them.

After introducing the basics of molecular biology and algorithmics, Part I explains string algorithms and alignments; Part II details the field of physical mapping and DNA sequencing; and Part III examines the application of algorithmics to the analysis of biological data. Exciting application examples include predicting the spatial structure of proteins, and computing haplotypes from genotype data.

This book describes topics in detail and presents formal models in a mathematically precise, yet intuitive manner, with many figures and chapter summaries, detailed derivations, and examples. It is well suited as an introduction into the field of bioinformatics, and will benefit students and lecturers in bioinformatics and algorithmics, while also offering practitioners an update on current research topics.


Table of Contents

1 Introductionp. 1
Part I Introduction and Basic Algorithms
2 Basics of Molecular Biologyp. 7
2.1 Proteinsp. 7
2.2 Nucleic Acidsp. 9
2.3 Hereditary Information and Protein Biosynthesisp. 12
2.4 Experimental Techniquesp. 15
2.4.1 Basic Terms and Methodsp. 15
2.4.2 Duplication of DNAp. 15
2.4.3 Gel Electrophoresis and Direct Sequencingp. 16
2.4.4 DNA Chipsp. 19
2.5 Bibliographic Notesp. 20
3 Basic Concepts: Strings, Graphs, and Algorithmsp. 23
3.1 Stringsp. 23
3.2 Graphsp. 25
3.3 Algorithms and Complexityp. 28
3.4 Bibliographic Notesp. 35
4 String Algorithmsp. 37
4.1 The String Matching Problemp. 37
4.2 String Matching Automatap. 39
4.3 The Boyer-Moore Algorithmp. 44
4.4 Suffix Treesp. 50
4.5 Further Applications of Suffix Treesp. 58
4.5.1 Generalized Suffix Trees and the Substring Problemp. 58
4.5.2 Longest Common Substringsp. 61
4.5.3 Efficient Computation of Overlapsp. 63
4.5.4 Repeats in Stringsp. 66
4.6 Suffix Arraysp. 68
4.7 Summaryp. 77
4.8 Bibliographic Notesp. 78
5 Alignment Methodsp. 81
5.1 Alignment of Two Stringsp. 82
5.1.1 Basic Definitionsp. 82
5.1.2 Global Alignmentp. 84
5.1.3 Local and Semiglobal Alignmentp. 89
5.1.4 Generalized Scoring Functionsp. 94
5.2 Heuristic Methods for Database Searchp. 97
5.2.1 The FASTA Heuristicp. 98
5.2.2 The BLAST Heuristicp. 99
5.3 Multiple Alignmentsp. 101
5.3.1 Definition and Scoring of Multiple Alignmentsp. 101
5.3.2 Exact Computation of Multiple Alignmentsp. 104
5.3.3 Combining Pairwise Alignmentsp. 109
5.4 Summaryp. 114
5.5 Bibliographic Notesp. 114
Part II DNA Sequencing
6 Introduction and Overviewp. 119
7 Physical Mappingp. 123
7.1 Restriction Site Mappingp. 123
7.1.1 The Double Digest Approachp. 124
7.1.2 The Partial Digest Approachp. 131
7.1.3 Comparison of Methods for Restriction Site Mappingp. 141
7.2 Hybridization Mappingp. 143
7.2.1 Mapping with Unique Probesp. 146
7.2.2 Mapping with Unique Probes and Errorsp. 157
7.2.3 Mapping with Non-unique Probesp. 165
7.3 Summaryp. 166
7.4 Bibliographic Notesp. 168
8 DNA Sequencingp. 171
8.1 Shotgun Sequencingp. 171
8.1.1 Crucial Points to Be Considered in a Suitable Modelp. 174
8.1.2 The Shortest Common Superstring Problemp. 176
8.1.3 Refined Models for Fragment Assemblyp. 196
8.2 Sequencing by Hybridizationp. 201
8.3 Summaryp. 207
8.4 Bibliographic Notesp. 208
Part III Analyzing Biological Data
9 Finding Signals in DNA Sequencesp. 213
9.1 Identical and Similar Substringsp. 213
9.2 Tandem Repeatsp. 217
9.3 Frequent and Infrequent Substringsp. 223
9.4 Hidden Markov Modelsp. 228
9.5 Summaryp. 235
9.6 Bibliographic Notesp. 235
10 Genome Rearrangementsp. 237
10.1 Modelingp. 237
10.2 Sorting Undirected Permutationsp. 239
10.3 Sorting Directed Permutationsp. 247
10.4 Computing the Syntenic Distancep. 249
10.5 Summaryp. 255
10.6 Bibliographic Notesp. 255
11 Phylogenetic Treesp. 257
11.1 Ultrametric Distancesp. 258
11.2 Additive Treesp. 265
11.3 Characters with Binary Statesp. 268
11.4 The Parsimony Principle and the Quartet Methodp. 275
11.5 Summaryp. 283
11.6 Bibliographic Notesp. 285
12 Haplotypingp. 287
12.1 Inferring Haplotypes from a Populationp. 288
12.2 Haplotyping a Single Individualp. 305
12.3 Summaryp. 316
12.4 Bibliographic Notesp. 316
13 Molecular Structuresp. 319
13.1 RNA Secondary Structure Predictionp. 320
13.1.1 Minimizing the Free Energyp. 322
13.1.2 Stochastic Context-Free Grammarsp. 329
13.2 Structure-Based Comparison of Biomoleculesp. 337
13.3 Protein Structure Predictionp. 349
13.3.1 De Novo Structure Prediction - The HP Modelp. 352
13.3.2 Protein Threadingp. 363
13.4 Summaryp. 369
13.4.1 RNA Secondary Structure Predictionp. 369
13.4.2 Structure-Based Comparison of Biomoleculesp. 371
13.4.3 Protein Structure Predictionp. 371
13.5 Bibliographic Notesp. 372
13.5.1 RNA Secondary Structure Predictionp. 372
13.5.2 Structure-Based Comparison of Biomoleculesp. 373
13.5.3 Protein Structure Predictionp. 374
Referencesp. 377
Indexp. 389