Cover image for Machine learning for multimedia content analysis
Title:
Machine learning for multimedia content analysis
Personal Author:
Series:
Multimedia systems and applications series ; 30
Publication Information:
New York : Springer, 2007
Physical Description:
xv, 277 p. : ill. ; 23 cm.
ISBN:
9780387699387

9780387699424
General Note:
Also available in online version
Added Author:
Electronic Access:
Fulltext
DSP_RESTRICTION_NOTE:
Remote access restricted to users with a valid UTM ID via VPN

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010177439 QA76.575 G66 2007 Open Access Book Book
Searching...
Searching...
30000010184072 QA76.575 G66 2007 Open Access Book Book
Searching...

On Order

Summary

Summary

Challenges in complexity and variability of multimedia data have led to revolutions in machine learning techniques. Multimedia data, such as digital images, audio streams and motion video programs, exhibit richer structures than simple, isolated data items. A number of pixels in a digital image collectively conveys certain visual content to viewers. A TV video program consists of both audio and image streams that unfold the underlying story. To recognize the visual content of a digital image, or to understand the underlying story of a video program, we may need to label sets of pixels or groups of image and audio frames jointly.

Machine Learning for Multimedia Content Analysis introduces machine learning techniques that are particularly powerful and effective for modeling spatial, temporal structures of multimedia data and for accomplishing common tasks of multimedia content analysis. This book systematically covers these techniques in an intuitive fashion and demonstrates their applications through case studies. This volume uses a large number of figures to illustrate and visualize complex concepts, and provides insights into the characteristics of many algorithms through examinations of their loss functions and straightforward comparisons.

Machine Learning for Multimedia Content Analysis is designed for an academic and professional audience. Researchers will find this book an invaluable tool for applying machine learning techniques to multimedia content analysis. This volume is also suitable for practitioners in industry.


Table of Contents

1 Introductionp. 1
1.1 Basic Statistical Learning Problemsp. 2
1.2 Categorizations of Machine Learning Techniquesp. 4
1.2.1 Unsupervised vs. Supervisedp. 4
1.2.2 Generative Models vs. Discriminative Modelsp. 4
1.2.3 Models for Simple Data vs. Models for Complex Datap. 6
1.2.4 Model Identification vs. Model Predictionp. 7
1.3 Multimedia Content Analysisp. 8
Part I Unsupervised Learning
2 Dimension Reductionp. 15
2.1 Objectivesp. 15
2.2 Singular Value Decompositionp. 16
2.3 Independent Component Analysisp. 20
2.3.1 Preprocessingp. 23
2.3.2 Why Gaussian is Forbiddenp. 24
2.4 Dimension Reduction by Locally Linear Embeddingp. 26
2.5 Case Studyp. 30
Problemsp. 34
3 Data Clustering Techniquesp. 37
3.1 Introductionp. 37
3.2 Spectral Clusteringp. 39
3.2.1 Problem Formulation and Criterion Functionsp. 39
3.2.2 Solution Computationp. 42
3.2.3 Examplep. 46
3.2.4 Discussionsp. 50
3.3 Data Clustering by Non-Negative Matrix Factorizationp. 51
3.3.1 Single Linear NMF Modelp. 52
3.3.2 Bilinear NMF Modelp. 55
3.4 Spectral vs. NMFp. 59
3.5 Case Study: Document Clustering Using Spectral and NMF Clustering Techniquesp. 61
3.5.1 Document Clustering Basicsp. 62
3.5.2 Document Corporap. 64
3.5.3 Evaluation Metricsp. 64
3.5.4 Performance Evaluations and Comparisonsp. 65
Part II Generative Graphical Models
4 Introduction of Graphical Modelsp. 73
4.1 Directed Graphical Modelp. 74
4.2 Undirected Graphical Modelp. 77
4.3 Generative vs. Discriminativep. 79
4.4 Content of Part IIp. 80
5 Markov Chains and Monte Carlo Simulationp. 81
5.1 Discrete-Time Markov Chainp. 81
5.2 Canonical Representationp. 84
5.3 Definitions and Terminologiesp. 88
5.4 Stationary Distributionp. 91
5.5 Long Run Behavior and Convergence Ratep. 94
5.6 Markov Chain Monte Carlo Simulationp. 100
5.6.1 Objectives and Applicationsp. 100
5.6.2 Rejection Samplingp. 101
5.6.3 Markov Chain Monte Carlop. 104
5.6.4 Rejection Sampling vs. MCMCp. 110
Problemsp. 112
6 Markov Random Fields and Gibbs Samplingp. 115
6.1 Markov Random Fieldsp. 115
6.2 Gibbs Distributionsp. 117
6.3 Gibbs - Markov Equivalencep. 120
6.4 Gibbs Samplingp. 123
6.5 Simulated Annealingp. 126
6.6 Case Study: Video Foreground Object Segmentation by MRFp. 133
6.6.1 Objectivep. 134
6.6.2 Related Worksp. 134
6.6.3 Method Outlinep. 135
6.6.4 Overview of Sparse Motion Layer Computationp. 136
6.6.5 Dense Motion Layer Computation Using MRFp. 138
6.6.6 Bayesian Inferencep. 140
6.6.7 Solution Computation by Gibbs Samplingp. 141
6.6.8 Experimental Resultsp. 143
Problemsp. 146
7 Hidden Markov Modelsp. 149
7.1 Markov Chains vs. Hidden Markov Modelsp. 149
7.2 Three Basic Problems for HMMsp. 153
7.3 Solution to Likelihood Computationp. 154
7.4 Solution to Finding Likeliest State Sequencep. 158
7.5 Solution to HMM Trainingp. 160
7.6 Expectation-Maximization Algorithm and its Variancesp. 162
7.6.1 Expectation-Maximization Algorithmp. 162
7.6.2 Baum-Welch Algorithmp. 164
7.7 Case Study: Baseball Highlight Detection Using HMMsp. 167
7.7.1 Objectivep. 167
7.7.2 Overviewp. 167
7.7.3 Camera Shot Classificationp. 169
7.7.4 Feature Extractionp. 172
7.7.5 Highlight Detectionp. 173
7.7.6 Experimental Evaluationp. 174
Problemsp. 175
8 Inference and Learning for General Graphical Modelsp. 179
8.1 Introductionp. 179
8.2 Sum-product algorithmp. 182
8.3 Max-product algorithmp. 188
8.4 Approximate inferencep. 189
8.5 Learningp. 191
Problemsp. 196
Part III Discriminative Graphical Models
9 Maximum Entropy Model and Conditional Random Fieldp. 201
9.1 Overview of Maximum Entropy Modelp. 202
9.2 Maximum Entropy Frameworkp. 204
9.2.1 Feature Functionp. 204
9.2.2 Maximum Entropy Model Constructionp. 205
9.2.3 Parameter Computationp. 208
9.3 Comparison to Generative Modelsp. 210
9.4 Relation to Conditional Random Fieldp. 213
9.5 Feature Selectionp. 215
9.6 Case Study: Baseball Highlight Detection Using Maximum Entropy Modelp. 217
9.6.1 System Overviewp. 218
9.6.2 Highlight Detection Based on Maximum Entropy Modelp. 220
9.6.3 Multimedia Feature Extractionp. 222
9.6.4 Multimedia Feature Vector Constructionp. 226
9.6.5 Experimentsp. 227
Problemsp. 232
10 Max-Margin Classificationsp. 235
10.1 Support Vector Machines (SVMs)p. 236
10.1.1 Loss Function and Riskp. 237
10.1.2 Structural Risk Minimizationp. 237
10.1.3 Support Vector Machinesp. 239
10.1.4 Theoretical Justificationp. 243
10.1.5 SVM Dualp. 244
10.1.6 Kernel Trickp. 245
10.1.7 SVM Trainingp. 248
10.1.8 Further Discussionsp. 255
10.2 Maximum Margin Markov Networksp. 257
10.2.1 Primal and Dual Problemsp. 257
10.2.2 Factorizing Dual Problemp. 259
10.2.3 General Graphs and Learning Algorithmp. 262
10.2.4 Max-Margin Networks vs. Other Graphical Modelsp. 262
Problemsp. 264
A Appendixp. 267
Referencesp. 269
Indexp. 275