Cover image for Scaling up machine learning : parallel and distributed approaches
Title:
Scaling up machine learning : parallel and distributed approaches
Publication Information:
Cambridge ; New York : Cambridge University Press, 2012
Physical Description:
xvi, 475 pages : illustrations ; 26 cm.
ISBN:
9780521192248
Abstract:
"This book presents an integrated collection of representative approaches for scaling up machine learning and data mining methods on parallel and distributed computing platforms. Demand for parallelizing learning algorithms is highly task-specific: in some settings it is driven by the enormous dataset sizes, in others by model complexity or by real-time performance requirements. Making task-appropriate algorithm and platform choices for large-scale machine learning requires understanding the benefits, trade-offs, and constraints of the available options"--provided by publisher

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010328960 Q325.5 S33 2012 Open Access Book Book
Searching...

On Order

Summary

Summary

This book presents an integrated collection of representative approaches for scaling up machine learning and data mining methods on parallel and distributed computing platforms. Demand for parallelizing learning algorithms is highly task-specific: in some settings it is driven by the enormous dataset sizes, in others by model complexity or by real-time performance requirements. Making task-appropriate algorithm and platform choices for large-scale machine learning requires understanding the benefits, trade-offs and constraints of the available options. Solutions presented in the book cover a range of parallelization platforms from FPGAs and GPUs to multi-core systems and commodity clusters, concurrent programming frameworks including CUDA, MPI, MapReduce and DryadLINQ, and learning settings (supervised, unsupervised, semi-supervised and online learning). Extensive coverage of parallelization of boosted trees, SVMs, spectral clustering, belief propagation and other popular learning algorithms, and deep dives into several applications, make the book equally useful for researchers, students and practitioners.


Table of Contents

1 Scaling up machine learning: introductionRon Bekkerman and Mikhail Bilenko and John Langford
Part I Frameworks for Scaling Up Machine Learning:
2 Mapreduce and its application to massively parallel learning of decision tree ensemblesBiswanath Panda and Joshua S. Herbach and Sugato Basu and Roberto J. Bayardo
3 Large-scale machine learning using DryadLINQMihai Budiu and Dennis Fetterly and Michael Isard and Frank McSherry and Yuan Yu
4 IBM parallel machine learning toolboxEdwin Pednault and Elad Yom-Tov and Amol Ghoting
5 Uniformly fine-grained data parallel computing for machine learning algorithmsMeichun Hsu and Ren Wu and Bin Zhang
Part II Supervised and Unsupervised Learning Algorithms:
6 PSVM: parallel support vector machines with incomplete Cholesky FactorizationEdward Chang and Hongjie Bai and Kaihua Zhu and Hao Wang and Jian Li and Zhihuan Qiu
7 Massive SVM parallelization using hardware acceleratorsIgor Durdanovic and Eric Cosatto and Hans Peter Graf and Srihari Cadambi and Venkata Jakkula and Srimat Chakradhar and Abhinandan Majumdar
8 Large-scale learning to rank using boosted decision trees KrystaM. Svore and Christopher J. C. Burges
9 The transform regression algorithmRamesh Natarajan and Edwin Pednault
10 Parallel belief propagation in factor graphsJoseph Gonzalez and Yucheng Low and Carlos Guestrin
11 Distributed Gibbs sampling for latent variable models Arthur AsuncionPadhraic Smyth and Max Welling and David Newman and Ian Porteous and Scott Triglia
12 Large-scale spectral clustering with Mapreduce and MPIWen-Yen Chen and Yangqiu Song and Hongjie Bai and Chih-Jen Lin and Edward Y. Chang
13 Parallelizing information-theoretic clustering methodsRon Bekkerman and Martin Scholz
Part III Alternative Learning Settings:
14 Parallel online learningDaniel Hsu and Nikos Karampatziakis and John Langford and Alex J. Smola
15 Parallel graph-based semi-supervised learningJeff Bilmes and Amarnag Subramanya
16 Distributed transfer learning via cooperative matrix factorizationEvan Xiang and Nathan Liu and Qiang Yang
17 Parallel large-scale feature selectionJeremy Kubica and Sameer Singh and Daria Sorokina
Part IV Applications:
18 Large-scale learning for vision with GPUSAdam Coates and Rajat Raina and Andrew Y. Ng
19 Large-scale FPGA-based convolutional networks Clement FarabetYann LeCun and Koray Kavukcuoglu and Berin Martini and Polina Akselrod and Selcuk Talay and Eugenio Culurciello
20 Mining tree structured data on multicore systemsShirish Tatikonda and Srinivasan Parthasarathy
21 Scalable parallelization of automatic speech recognitionJike Chong and Ekaterina Gonina and Kisun You and Kurt Keutzer