Scaling up machine learning : parallel and distributed approaches

Select an Action

Place Hold(s)
Add to My Lists
Email
Print

Title:

Publication Information:

Cambridge ; New York : Cambridge University Press, 2012

Physical Description:

xvi, 475 pages : illustrations ; 26 cm.

ISBN:

9780521192248

Abstract:

"This book presents an integrated collection of representative approaches for scaling up machine learning and data mining methods on parallel and distributed computing platforms. Demand for parallelizing learning algorithms is highly task-specific: in some settings it is driven by the enormous dataset sizes, in others by model complexity or by real-time performance requirements. Making task-appropriate algorithm and platform choices for large-scale machine learning requires understanding the benefits, trade-offs, and constraints of the available options"--provided by publisher

Subject Term:

Machine learning

Data mining

Parallel algorithms

Parallel programs (Computer programs)

Added Author:

Bekkerman, Ron,

Bilenko, Mikhail, 1978-

Langford, John, 1975-,

Available:*

Library	Item Barcode	Call Number	Material Type	Item Category 1	Status
Searching... PSZ JB	30000010328960	Q325.5 S33 2012	Open Access Book	Book	Searching... Unknown

On Order

Summary

This book presents an integrated collection of representative approaches for scaling up machine learning and data mining methods on parallel and distributed computing platforms. Demand for parallelizing learning algorithms is highly task-specific: in some settings it is driven by the enormous dataset sizes, in others by model complexity or by real-time performance requirements. Making task-appropriate algorithm and platform choices for large-scale machine learning requires understanding the benefits, trade-offs and constraints of the available options. Solutions presented in the book cover a range of parallelization platforms from FPGAs and GPUs to multi-core systems and commodity clusters, concurrent programming frameworks including CUDA, MPI, MapReduce and DryadLINQ, and learning settings (supervised, unsupervised, semi-supervised and online learning). Extensive coverage of parallelization of boosted trees, SVMs, spectral clustering, belief propagation and other popular learning algorithms, and deep dives into several applications, make the book equally useful for researchers, students and practitioners.

1 Scaling up machine learning: introductionRon Bekkerman and Mikhail Bilenko and John Langford

Part I Frameworks for Scaling Up Machine Learning:

2 Mapreduce and its application to massively parallel learning of decision tree ensemblesBiswanath Panda and Joshua S. Herbach and Sugato Basu and Roberto J. Bayardo

3 Large-scale machine learning using DryadLINQMihai Budiu and Dennis Fetterly and Michael Isard and Frank McSherry and Yuan Yu

4 IBM parallel machine learning toolboxEdwin Pednault and Elad Yom-Tov and Amol Ghoting

5 Uniformly fine-grained data parallel computing for machine learning algorithmsMeichun Hsu and Ren Wu and Bin Zhang

Part II Supervised and Unsupervised Learning Algorithms:

6 PSVM: parallel support vector machines with incomplete Cholesky FactorizationEdward Chang and Hongjie Bai and Kaihua Zhu and Hao Wang and Jian Li and Zhihuan Qiu

7 Massive SVM parallelization using hardware acceleratorsIgor Durdanovic and Eric Cosatto and Hans Peter Graf and Srihari Cadambi and Venkata Jakkula and Srimat Chakradhar and Abhinandan Majumdar

8 Large-scale learning to rank using boosted decision trees KrystaM. Svore and Christopher J. C. Burges

9 The transform regression algorithmRamesh Natarajan and Edwin Pednault

10 Parallel belief propagation in factor graphsJoseph Gonzalez and Yucheng Low and Carlos Guestrin

11 Distributed Gibbs sampling for latent variable models Arthur AsuncionPadhraic Smyth and Max Welling and David Newman and Ian Porteous and Scott Triglia

12 Large-scale spectral clustering with Mapreduce and MPIWen-Yen Chen and Yangqiu Song and Hongjie Bai and Chih-Jen Lin and Edward Y. Chang

13 Parallelizing information-theoretic clustering methodsRon Bekkerman and Martin Scholz

Part III Alternative Learning Settings:

14 Parallel online learningDaniel Hsu and Nikos Karampatziakis and John Langford and Alex J. Smola

15 Parallel graph-based semi-supervised learningJeff Bilmes and Amarnag Subramanya

16 Distributed transfer learning via cooperative matrix factorizationEvan Xiang and Nathan Liu and Qiang Yang

17 Parallel large-scale feature selectionJeremy Kubica and Sameer Singh and Daria Sorokina

Part IV Applications:

18 Large-scale learning for vision with GPUSAdam Coates and Rajat Raina and Andrew Y. Ng

19 Large-scale FPGA-based convolutional networks Clement FarabetYann LeCun and Koray Kavukcuoglu and Berin Martini and Polina Akselrod and Selcuk Talay and Eugenio Culurciello

20 Mining tree structured data on multicore systemsShirish Tatikonda and Srinivasan Parthasarathy

21 Scalable parallelization of automatic speech recognitionJike Chong and Ekaterina Gonina and Kisun You and Kurt Keutzer

Available:*

On Order

Summary

Summary

Table of Contents