Cover image for Multimedia semantics : metadata, analysis and interaction
Title:
Multimedia semantics : metadata, analysis and interaction
Personal Author:
Publication Information:
Chichester, West Sussex, U.K. : Wiley, 2011
Physical Description:
xxii, 305 p. : ill., maps ; 25 cm.
ISBN:
9780470747001

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010280866 QA76.575 T76 2011 Open Access Book Book
Searching...

On Order

Summary

Summary

In this book, the authors present the latest research results in the multimedia and semantic web communities, bridging the "Semantic Gap"

This book explains, collects and reports on the latest research results that aim at narrowing the so-called multimedia "Semantic Gap": the large disparity between descriptions of multimedia content that can be computed automatically, and the richness and subjectivity of semantics in user queries and human interpretations of audiovisual media. Addressing the grand challenge posed by the "Semantic Gap" requires a multi-disciplinary approach (computer science, computer vision and signal processing, cognitive science, web science, etc.) and this is reflected in recent research in this area. In addition, the book targets an interdisciplinary community, and in particular the Multimedia and the Semantic Web communities. Finally, the authors provide both the fundamental knowledge and the latest state-of-the-art results from both communities with the goal of making the knowledge of one community available to the other.

Key Features:

Presents state-of-the art research results in multimedia semantics: multimedia analysis, metadata standards and multimedia knowledge representation, semantic interaction with multimedia Contains real industrial problems exemplified by user case scenarios Offers an insight into various standardisation bodies including W3C, IPTC and ISO MPEG Contains contributions from academic and industrial communities from Europe, USA and Asia Includes an accompanying website containing user cases, datasets, and software mentioned in the book, as well as links to the K-Space NoE and the SMaRT society web sites ( http://www.multimediasemantics.com/ )

This book will be a valuable reference for academic and industry researchers /practitioners in multimedia, computational intelligence and computer science fields. Graduate students, project leaders, and consultants will also find this book of interest.


Author Notes

Dr. Raphaël Troncy, Centre for Mathematics and Computer Science, Netherlands
Raphaël Troncy obtained his Master's thesis with honours in computer science at the University Joseph Fourier of Grenoble, France. He received his PhD with honours in 2004. His research interests include Semantic Web and Multimedia Technologies, Knowledge Representation, Ontology Modeling and Alignment. Raphaël Troncy is an expert in audio visual metadata and in combining existing metadata standards (such as MPEG-7) with current Semantic Web technologies.

Dr. Benoit Huet, Institut EURECOM, France
Benoit Huet received his BSc degree in computer science and engineering from the Ecole Superieure de Technologie Electrique (Groupe ESIEE, France) in 1992. In 1993, he was awarded the MSc degree in Artificial Intelligence from the University of Westminster (UK) with distinction. He received his PhD degree in Computer Science from the University of York (UK). His research interests include computer vision, content-based retrieval, multimedia data mining and indexing (still and/or moving images) and pattern recognition.

Simon Schenk, University of Koblenz-Landau, Germany
Simon Schenk is a research and teaching assistant at the Information Systems and Semantic Web Group of University of Koblenz-Landau.Simon is working towards his PhD degree under the supervision of Professor Dr. Steffen Staab. Previously, he has worked as a consultant for Capgemini. Schenk studied at NORDAKADEMIE University of Applied Sciences, Germany and Karlstads Universitet, Sweden and received his diploma in Computer Science and Business Management from NORDAKADEMIE in 2004.


Table of Contents

Raphaël Troncy and Benoit Huet and Simon SchenkWerner Bailer and Susanne Boll and Oscar Celma and Michael Hausenblas and Yves RaimondLynda Hardman and Zeljko Obrenovic and Frank NackRachid Benmokhtar and Benoit Huet and Gaël Richard and Slim EssidSlim Essid and Marine Campedel and Gaël Richard and Tomas Piatrik and Rachid Benmokhtar and Benoit HuetEyal Oren and Simon SchenkAntoine Isaac and Simon Schenk and Ansgar ScherpPeter Schallauer and Werner Bailer and Raphael Troncy and Florian KaiserThomas Franz and Raphaël Troncy and Miroslav VacuraThanos Athanasiadis and Phivos Mylonas and Georgios Th. Papadopoulos and Vasileios Mezaris and Yannis Avrithis and Ioannis Kompatsiaris and Michael G. StrintzisNikolaos Simon and Giorgos Stoilos and Carsten Saathoff and Jan Nemrava and Vojtech Svdtek and Petr Berka and Vassilis TzouvarasNoel E. O'Connor and David A. Sadlier and Bart Lehane and Andrew Salway and Jan Nemrava and Paul BuitelaarCarsten Saathoff and Krishna Chandramouli and Werner Bailer and Peter Schallauer and Raphaël TroncyFrank Hopfgartner and Reede Ren and Thierry Urruty and Joemon M. JoseMichiel Hildebrand and Jacco van Ossenbruggen and Lynda HardmanRaphaël Troncy and Benoit Huet and Simon Schenk
Forewordp. xi
List of Figuresp. xiii
List of Tablesp. xvii
List of Contributorsp. xix
1 Introductionp. 1
2 Use Case Scenariosp. 7
2.1 Photo Use Casep. 8
2.1.1 Motivating Examplesp. 8
2.1.2 Semantic Description of Photos Todayp. 9
2.1.3 Services We Need for Photo Collectionsp. 10
2.2 Music Use Casep. 10
2.2.1 Semantic Description of Music Assetsp. 11
2.2.2 Music Recommendation and Discoveryp. 12
2.2.3 Management of Personal Music Collectionsp. 13
2.3 Annotation in Professional Media Production and Archivingp. 14
2.3.1 Motivating Examplesp. 15
2.3.2 Requirements for Content Annotationp. 17
2.4 Discussionp. 18
Acknowledgementsp. 19
3 Canonical Processes of Semantically Annotated Media Productionp. 21
3.1 Canonical Processesp. 22
3.1.1 Premeditatep. 23
3.1.2 Create Media Assetp. 23
3.1.3 Annotatep. 23
3.1.4 Packagep. 24
3.1.5 Queryp. 24
3.1.6 Construct Messagep. 25
3.1.7 Organizep. 25
3.1.8 Publishp. 26
3.1.9 Distributep. 26
3.2 Example Systemsp. 27
3.2.1 CeWe Color Photo Bookp. 27
3.2.2 SenseCamp. 29
3.3 Conclusion and Future Workp. 33
4 Feature Extraction for Multimedia Analysisp. 35
4.1 Low-Level Feature Extractionp. 36
4.1.1 What Are Relevant Low-Level Features?p. 36
4.1.2 Visual Descriptorsp. 36
4.1.3 Audio Descriptorsp. 45
4.2 Feature Fusion and Multi-modalityp. 54
4.2.1 Feature Normalizationp. 54
4.2.2 Homogeneous Fusionp. 55
4.2.3 Cross-modal Fusionp. 56
4.3 Conclusionp. 58
5 Machine Learning Techniques for Multimedia Analysisp. 59
5.1 Feature Selectionp. 61
5.1.1 Selection Criteriap. 61
5.1.2 Subset Searchp. 62
5.1.3 Feature Rankingp. 63
5.1.4 A Supervised Algorithm Examplep. 63
5.2 Classificationp. 65
5.2.1 Historical Classification Algorithmsp. 65
5.2.2 Kernel Methodsp. 67
5.2.3 Classifying Sequencesp. 71
5.2.4 Biologically Inspired Machine Learning Techniquesp. 73
5.3 Classifier Fusionp. 75
5.3.1 Introductionp. 75
5.3.2 Non-trainable Combinersp. 75
5.3.3 Trainable Combinersp. 76
5.3.4 Combination of Weak Classifiersp. 77
5.3.5 Evidence Theoryp. 78
5.3.6 Consensual Clusteringp. 78
5.3.7 Classifier Fusion Propertiesp. 80
5.4 Conclusionp. 80
6 Semantic Web Basicsp. 81
6.1 The Semantic Webp. 82
6.2 RDFp. 83
6.2.1 RDF Graphsp. 86
6.2.2 Named Graphsp. 87
6.2.3 RDF Semanticsp. 88
6.3 RDF Schemap. 90
6.4 Data Modelsp. 93
6.5 Linked Data Principlesp. 94
6.5.1 Dereferencing Using Basic Web Look-upp. 95
6.5.2 Dereferencing Using HTTP 303 Redirectsp. 95
6.6 Development Practicalitiesp. 96
6.6.1 Data Storesp. 97
6.6.2 Toolkitsp. 97
7 Semantic Web Languagesp. 99
7.1 The Need for Ontologies on the Semantic Webp. 100
7.2 Representing Ontological Knowledge Using OWLp. 100
7.2.1 OWL Constructs and OWL Syntaxp. 100
7.2.2 The Formal Semantics of OWL and its Different Layersp. 102
7.2.3 Reasoning Tasksp. 106
7.2.4 OWL Flavorsp. 107
7.2.5 Beyond OWLp. 107
7.3 A Language to Represent Simple Conceptual Vocabularies: SKOSp. 108
7.3.1 Ontologies versus Knowledge Organization Systemsp. 108
7.3.2 Representing Concept Schemes Using SKOSp. 109
7.3.3 Characterizing Concepts beyond SKOSp. 111
7.3.4 Using SKOS Concept Schemes on the Semantic Webp. 112
7.4 Querying on the Semantic Webp. 113
7.4.1 Syntaxp. 113
7.4.2 Semanticsp. 118
7.4.3 Default Negation in SPARQLp. 123
7.4.4 Well-Formed Queriesp. 124
7.4.5 Querying for Multimedia Metadatap. 124
7.4.6 Partitioning Datasetsp. 126
7.4.7 Related Workp. 127
8 Multimedia Metadata Standardsp. 129
8.1 Selected Standardsp. 130
8.1.1 MPEG-7p. 130
8.1.2 EBU P_Metap. 132
8.1.3 SMPTE Metadata Standardsp. 133
8.1.4 Dublin Corep. 133
8.1.5 TV-Anytimep. 134
8.1.6 METS and VRAp. 134
8.1.7 MPEG-21p. 135
8.1.8 XMP, IPTC in XMPp. 135
8.1.9 EXIFp. 136
8.1.10 DIG35p. 137
8.1.11 ID3/MP3p. 137
8.1.12 NewsML G2 and rNewsp. 138
8.1.13 W3C Ontology for Media Resourcesp. 138
8.1.14 EBUCorep. 139
8.2 Comparisonp. 140
8.3 Conclusionp. 143
9 The Core Ontology for Multimediap. 145
9.1 Introductionp. 145
9.2 A Multimedia Presentation for Granddadp. 146
9.3 Related Workp. 149
9.4 Requirements for Designing a Multimedia Ontologyp. 150
9.5 A Formal Representation for MPEG-7p. 150
9.5.1 DOLCE as Modeling Basisp. 151
9.5.2 Multimedia Patternsp. 151
9.5.3 Basic Patternsp. 155
9.5.4 Comparison with Requirementsp. 157
9.6 Granddad's Presentation Explained by COMMp. 157
9.7 Lessons Learnedp. 159
9.8 Conclusionp. 160
10 Knowledge-Driven Segmentation and Classificationp. 163
10.1 Related Workp. 164
10.2 Semantic Image Segmentationp. 165
10.2.1 Graph Representation of an Imagep. 165
10.2.2 Image Graph Initializationp. 165
10.2.3 Semantic Region Growingp. 167
10.3 Using Contextual Knowledge to Aid Visual Analysisp. 170
10.3.1 Contextual Knowledge Formulationp. 170
10.3.2 Contextual Relevancep. 173
10.4 Spatial Context and Optimizationp. 177
10.4.1 Introductionp. 177
10.4.2 Low-Level Visual Information Processingp. 177
10.4.3 Initial Region-Concept Associationp. 178
10.4.4 Final Region-Concept Associationp. 179
10.5 Conclusionsp. 181
11 Reasoning for Multimedia Analysisp. 183
11.1 Fuzzy DL Reasoningp. 184
11.1.1 The Fuzzy DLf-SKLMp. 184
11.1.2 The Tableaux Algorithmp. 185
11.1.3 The FiRE Fuzzy Reasoning Enginep. 187
11.2 Spatial Features for Image Region Labelingp. 192
11.2.1 Fuzzy Constraint Satisfaction Problemsp. 192
11.2.2 Exploiting Spatial Features Using Fuzzy Constraint Reasoningp. 193
11.3 Fuzzy Rule Based Reasoning Enginep. 196
11.4 Reasoning over Resources Complementary to Audiovisual Streamsp. 201
12 Multi-Modal Analysis for Content Structuring and Event Detectionp. 205
12.1 Moving Beyond Shots for Extracting Semanticsp. 206
12.2 A Multi-Modal Approachp. 207
12.3 Case Studiesp. 207
12.4 Case Study 1: Field Sportsp. 208
12.4.1 Content Structuringp. 208
12.4.2 Concept Detection Leveraging Complementary Text Sourcesp. 213
12.5 Case Study 2: Fictional Contentp. 214
12.5.1 Content Structuringp. 215
12.5.2 Concept Detection Leveraging Audio Descriptionp. 219
12.6 Conclusions and Future Workp. 221
13 Multimedia Annotation Toolsp. 223
13.1 State of the Artp. 224
13.2 SVAT: Professional Video Annotationp. 225
13.2.1 User Interfacep. 225
13.2.2 Semantic Annotationp. 228
13.3 KAT: Semi-automatic, Semantic Annotation of Multimedia Contentp. 229
13.3.1 Historyp. 231
13.3.2 Architecturep. 232
13.3.3 Default Pluginsp. 234
13.3.4 Using COMM as an Underlying Model: Issues and Solutionsp. 234
13.3.5 Semi-automatic Annotation: An Examplep. 237
13.4 Conclusionsp. 239
14 Information Organization Issues in Multimedia Retrieval Using Low-Level Featuresp. 241
14.1 Efficient Multimedia Indexing Structuresp. 242
14.1.1 An Efficient Access Structure for Multimedia Datap. 243
14.1.2 Experimental Resultsp. 245
14.1.3 Conclusionp. 249
14.2 Feature Term Based Indexp. 249
14.2.1 Feature Termsp. 250
14.2.2 Feature Term Distributionp. 251
14.2.3 Feature Term Extractionp. 252
14.2.4 Feature Dimension Selectionp. 253
14.2.5 Collection Representation and Retrieval Systemp. 254
14.2.6 Experimentp. 256
14.2.7 Conclusionp. 258
14.3 Conclusion and Future Trendsp. 259
Acknowledgementp. 259
15 The Role of Explicit Semantics in Search and Browsingp. 261
15.1 Basic Search Terminologyp. 261
15.2 Analysis of Semantic Searchp. 262
15.2.1 Query Constructionp. 263
15.2.2 Search Algorithmp. 265
15.2.3 Presentation of Resultsp. 267
15.2.4 Survey Summaryp. 269
15.3 Use Case A: Keyword Search in ClioPatriap. 270
15.3.1 Query Constructionp. 270
15.3.2 Search Algorithmp. 270
15.3.3 Result Visualization and Organizationp. 273
15.4 Use Case B: Faceted Browsing in ClioPatriap. 274
15.4.1 Query Constructionp. 274
15.4.2 Search Algorithmp. 276
15.4.3 Result Visualization and Organizationp. 276
15.5 Conclusionsp. 277
16 Conclusionp. 279
Referencesp. 281
Author Indexp. 301
Subject Indexp. 303