Skip to:Content
|
Bottom
Cover image for Bioinformatics, biocomputing and Perl : an introduction to bioinformatics computing skills and practice
Title:
Bioinformatics, biocomputing and Perl : an introduction to bioinformatics computing skills and practice
Personal Author:
Publication Information:
Chichester : Wiley, 2004
ISBN:
9780470853313
Added Author:

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000004890848 QH324.2 M66 2004 Open Access Book Book
Searching...

On Order

Summary

Summary

Bioinformatics, Biocomputing and Perl presents a modern introduction to bioinformatics computing skills and practice. Structuring its presentation around four main areas of study, this book covers the skills vital to the day-to-day activities of today's bioinformatician. Each chapter contains a series of maxims designed to highlight key points and there are exercises to supplement and cement the introduced material.

Working with Perl presents an extended tutorial introduction to programming through Perl, the premier programming technology of the bioinformatics community. Even though no previous programming experience is assumed, completing the tutorial equips the reader with the ability to produce powerful custom programs with ease.

Working with Data applies the programming skills acquired to processing a variety of bioinformatics data. In addition to advice on working with important data stores such as the Protein DataBank, SWISS-PROT, EMBL and the GenBank, considerable discussion is devoted to using bioinformatics data to populate relational database systems. The popular MySQL database is used in all examples.

Working with the Web presents a discussion of the Web-based technologies that allow the bioinformatics researcher to publish both data and applications on the Internet.

Working with Applications shifts gear from creating custom programs to using them. The tools described include Clustal-W, EMBOSS, STRIDE, BLAST and Xmgrace. An introduction to the important Bioperl Project concludes this chapter and rounds off the book.


Author Notes

Paul Barry works as a Lecturer in Computing Science at The Institute of Technology, Carlow in Ireland.


Table of Contents

Prefacep. xv
1 Setting the Biological Scenep. 1
1.1 Introducing Biological Sequence Analysisp. 1
1.2 Protein and Polypeptidesp. 4
1.3 Generalised Models and their Usep. 5
1.4 The Central Dogma of Molecular Biologyp. 6
1.5 Genome Sequencingp. 10
1.6 The Example DNA-gene-protein system we will usep. 12
Where to from Herep. 13
2 Setting the Technological Scenep. 15
2.1 The Layers of Technologyp. 15
2.2 Finding perp. 17
Where to from Herep. 18
I Working with Perlp. 19
3 The Basicsp. 21
3.1 Let's Get Started!p. 21
3.2 Iterationp. 26
3.3 More Iterationsp. 30
3.4 Selectionp. 34
3.5 There Really is MTOWTDIp. 36
3.6 Processing Data Filesp. 41
3.7 Introducing Patternsp. 44
Where to from Herep. 46
The Maxims Repeatedp. 46
4 Places to Put Thingsp. 49
4.1 Beyond Scalarsp. 49
4.2 Arrays: Associating Data with Numbersp. 49
4.3 Hashes: Associating Data with Wordsp. 60
Where to from Herep. 68
The Maxims Repeatedp. 68
5 Getting Organisedp. 71
5.1 Named Blocksp. 71
5.2 Introducing Subroutinesp. 73
5.3 Creating Subroutinesp. 74
5.4 Visibility and Scopep. 85
5.5 In-built Subroutinesp. 90
5.6 Grouping and Reusing Subroutinesp. 92
5.7 The Standard Modulesp. 96
5.8 CPAN: The Module Repositoryp. 96
Where to from Herep. 100
The Maxims Repeatedp. 100
6 About Filesp. 103
6.1 I/O: Input and Outputp. 103
6.2 Reading Filesp. 105
6.3 Writing Filesp. 116
6.4 Chopping and Chompingp. 118
Where to from Herep. 119
The Maxims Repeatedp. 119
7 Patterns, Patterns and More Patternsp. 121
7.1 Pattern Basicsp. 121
7.2 Introducing the Pattern Metacharactersp. 124
7.3 Anchorsp. 132
7.4 The Binding Operatorsp. 134
7.5 Remembering What Was Matchedp. 135
7.6 Greedy by Defaultp. 137
7.7 Alternative Pattern Delimitersp. 138
7.8 Another Useful Utilityp. 139
7.9 Substitutions: Search and Replacep. 140
7.10 Finding a Sequencep. 142
Where to from Herep. 146
The Maxims Repeatedp. 146
8 Perl Grabbagp. 147
8.1 Introductionp. 147
8.2 Strictnessp. 147
8.3 Perl One-linersp. 149
8.4 Running Other Programs from perp. 152
8.5 Recovering from Errorsp. 153
8.6 Sortingp. 155
8.7 HERE Documentsp. 159
Where to from Herep. 160
The Maxims Repeatedp. 161
II Working with Datap. 163
9 Downloading Datasetsp. 165
9.1 Let's Get Datap. 165
9.2 Downloading from the Webp. 165
Where to from Herep. 171
The Maxims Repeatedp. 171
10 The Protein Databankp. 173
10.1 Introductionp. 173
10.2 Determining Biomolecule Structuresp. 174
10.3 The Protein Databankp. 177
10.4 The PDB Data-file Formatsp. 179
10.5 Accessing Data in PDB Entriesp. 182
10.6 Accessing PDB Annotation Datap. 183
10.7 Contact Mapsp. 192
10.8 STRIDE: Secondary Structure Assignmentp. 196
10.9 Assigning Secondary Structuresp. 197
10.10 Introducing the mmCIF Protein Formatp. 205
Where to from Herep. 210
The Maxims Repeatedp. 210
11 Non-redundant Datasetsp. 211
11.1 Introducing Non-redundant Datasetsp. 211
11.2 Non-redundant Protein Structuresp. 213
Where to from Herep. 217
The Maxims Repeatedp. 217
12 Databasesp. 219
12.1 Introducing Databasesp. 219
12.2 Available Database Systemsp. 224
12.3 SQL: the Language of Databasesp. 226
12.4 A Database Case Study: MERp. 227
Where to from Herep. 269
The Maxims Repeatedp. 269
13 Databases and Perlp. 273
13.1 Why Program Databases?p. 273
13.2 Perl Database Technologiesp. 274
13.3 Preparing Perlp. 275
13.4 Programming Databases with DBIp. 276
13.5 Customising Outputp. 282
13.6 Customising Inputp. 285
13.7 Extending SQLp. 289
Where to from Herep. 292
The Maxims Repeatedp. 292
III Working with the Webp. 295
14 The Sequence Retrieval Systemp. 297
14.1 An Example of What's Possiblep. 297
14.2 Why SRS?p. 298
14.3 Using SRSp. 298
Where to from Herep. 300
The Maxims Repeatedp. 300
15 Web Technologiesp. 303
15.1 The Web Development Infrastructurep. 303
15.2 Creating Content for the WWWp. 305
15.3 Preparing Apache for Perlp. 310
15.4 Sending Data to a Web Serverp. 315
15.5 Web Databasesp. 320
Where to from Herep. 327
The Maxims Repeatedp. 327
16 Web Automationp. 329
16.1 Why Automate Surfing?p. 329
16.2 Automated Surfing with Perlp. 330
Where to from Herep. 335
The Maxims Repeatedp. 336
IV Working with Applicationsp. 337
17 Tools and Datasetsp. 339
17.1 Introductionp. 339
17.2 Sequence Databasesp. 340
17.3 General Concepts and Methodsp. 347
17.4 Introducing Bioinformatics Toolsp. 357
17.5 BLASTp. 362
Where to from Herep. 371
The Maxims Repeatedp. 371
18 Applicationsp. 373
18.1 Introductionp. 373
18.2 Scientific Background to Mer Operonp. 374
18.3 Downloading the Raw DNA Sequencep. 377
18.4 Initial BLAST Sequence Similarity Searchp. 378
18.5 GeneMarkp. 380
18.6 Structural Prediction with SWISS-MODELp. 388
18.7 DeepView as a Structural Alignment Toolp. 396
18.8 PROSITE and Sequence Motifsp. 401
18.9 Phylogeneticsp. 407
Where to from Here?p. 410
The Maxims Repeatedp. 411
19 Data Visualisationp. 413
19.1 Introducing Visualisationp. 413
19.2 Displaying Tabular Data Using HTMLp. 415
19.3 Creating High-quality Graphics with GDp. 422
19.4 Plotting Graphsp. 431
Where to from Herep. 439
The Maxims Repeatedp. 439
20 Introducing Bioperlp. 441
20.1 What is Bioperl?p. 441
20.2 Bioperl's Relationship to Project Ensemblp. 442
20.3 Installing Bioperlp. 442
20.4 Using Bioperl: Fetching Sequencesp. 444
20.5 Remote BLAST Searchesp. 448
Where to from Herep. 451
The Maxims Repeatedp. 452
A Appendix Ap. 453
B Appendix Bp. 457
C Appendix Cp. 459
D Appendix Dp. 461
E Appendix Ep. 467
F Appendix Fp. 471
Indexp. 475
Go to:Top of Page