Cover image for Applied parallel computing
Title:
Applied parallel computing
Personal Author:
Publication Information:
Singapore ; Hackensack, NJ : World Scientific, 2013
Physical Description:
x, 207 p. : ill. ; 24 cm.
ISBN:
9789814307604

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010306300 QA76.58 D46 2013 Open Access Book Book
Searching...

On Order

Summary

Summary

The book provides a practical guide to computational scientists and engineers to help advance their research by exploiting the superpower of supercomputers with many processors and complex networks. This book focuses on the design and analysis of basic parallel algorithms, the key components for composing larger packages for a wide range of applications.


Table of Contents

Prefacep. v
Chapter 1 Introductionp. 1
1.1 Definition of Parallel Computingp. 1
1.2 Evolution of Computersp. 4
1.3 An Enabling Technologyp. 8
1.4 Cost Effectivenessp. 9
Chapter 2 Performance Metrics and Modelsp. 13
2.1 Parallel Activity Tracep. 13
2.2 Speedupp. 14
2.3 Parallel Efficiencyp. 15
2.4 Load Imbalancep. 15
2.5 Granularityp. 16
2.6 Overheadp. 17
2.7 Scalabilityp. 18
2.8 Amdahl's Lawp. 18
Chapter 3 Hardware Systemsp. 19
3.1 Node Architecturesp. 19
3.2 Network Interconnectionsp. 21
3.3 Instruction and Data Streamsp. 28
3.4 Processor-Memory Connectivityp. 29
3.5 IO Subsystemsp. 29
3.6 System Convergencep. 31
3.7 Design Considerationsp. 31
Chapter 4 Software Systemsp. 35
4.1 Node Softwarep. 35
4.2 Programming Modelsp. 37
4.3 Parallel Debuggersp. 43
4.4 Parallel Profilersp. 43
Chapter 5 Design of Algorithmsp. 45
5.1 Algorithm Modelsp. 46
5.2 Examples of Collective Operationsp. 54
5.3 Mapping Tasks to Processorsp. 56
Chapter 6 Linear Algebrap. 65
6.1 Problem Decompositionp. 65
6.2 Matrix Operationsp. 68
6.3 Solution of Linear Systemsp. 81
Chapter 7 Differential Equationsp. 89
7.1 Integration and Differentiationp. 89
7.2 Partial Differential Equationsp. 92
Chapter 8 Fourier Transformsp. 105
8.1 Fourier Transformsp. 105
8.2 Discrete Fourier Transformsp. 106
8.3 Fast Fourier Transformsp. 107
8.4 Simple Parallelizationp. 111
8.5 The Transpose Methodp. 112
8.6 Complexity Analysis for FFTp. 113
Chapter 9 Optimizationp. 115
9.1 Monte Carlo Methodsp. 116
9.2 Parallelizationp. 119
Chapter 10 Applicationsp. 123
10.1 Newton's Equation and Molecular Dynamicsp. 124
10.2 Schrodinger's Equations and Quantum Mechanicsp. 133
10.3 Partition Function, DFT and Material Sciencep. 134
10.4 Maxwell's Equations and Electrical Engineeringp. 135
10.5 Diffusion Equation and Mechanical Engineeringp. 135
10.6 Navier-Stokes Equation and CFDp. 136
10.7 Other Applicationsp. 136
Appendix A MPIp. 139
A.1 An MPI Primerp. 139
A.2 Examples of Using MPIp. 159
A.3 MPI Toolsp. 161
A.4 Complete List of MPI Functionsp. 167
Appendix B OpenMPp. 171
B.1 Introduction to OpenMPp. 171
B.2 Memory Model of OpenMPp. 172
B.3 OpenMP Directivesp. 172
B.4 Synchronizationp. 174
B.5 Runtime Library Routinesp. 175
B.6 Examples of Using OpenMPp. 178
B.7 The Futurep. 180
Appendix C Projectsp. 181
Project C.1 Watts and Flops of Supercomputersp. 181
Project C.2 Review of Supercomputersp. 181
Project C.3 Top500 and BlueGene Supercomputersp. 181
Project C.4 Say Hello in Orderp. 182
Project C.5 Broadcast on Torusp. 183
Project C.6 Competing with MPI on Broadcast, Scatter, etcp. 183
Project C.7 Simple Matrix Multiplicationp. 183
Project C.8 Matrix Multiplication on 4D Torusp. 183
Project C.9 Matrix Multiplication and PATp. 184
Project C.10 Matrix Inversionp. 184
Project C.11 Simple Analysis of an iBT Networkp. 185
Project C.12 Compute Eigenvalues of Adjacency Matrices of Networksp. 185
Project C.13 Mapping Wave Equation to Torusp. 185
Project C.14 Load Balance in 3D Meshp. 186
Project C.15 Wave Equation and PATp. 186
Project C.16 Computing Coulomb's Forcesp. 187
Project C.17 Timing Model for MDp. 187
Project C.18 Minimizing Lennard-Jones Potentialp. 188
Project C.19 Install and Profile CP2Kp. 188
Project C.20 Install and Profile CPMDp. 189
Project C.21 Install and Profile NAMDp. 190
Project C.22 FFT on Beowulfp. 190
Project C.23 FFT on BlueGene/Qp. 191
Project C.24 Word Analysisp. 191
Project C.25 Cost Estimate of a 0.1 Pflops Systemp. 191
Project C.26 Design of a Pflops Systemp. 191
Appendix D Program Examplesp. 193
D.1 Matrix-Vector Multiplicationp. 193
D.2 Long Range N-body Forcep. 195
D.3 Integrationp. 201
Referencesp. 203
Indexp. 205