Skip to:Content
|
Bottom
Cover image for Advanced data warehouse design : from conventional to spatial and temporal applications
Title:
Advanced data warehouse design : from conventional to spatial and temporal applications
Personal Author:
Series:
Data-centric systems and applications
Publication Information:
Berlin : Springer-Verlag, 2007
ISBN:
9783540744047
Added Author:

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010163964 QA76.9.D37 M34 2008 Open Access Book Book
Searching...

On Order

Summary

Summary

A data warehouse stores large volumes of historical data required for analytical purposes. This data is extracted from operational databases; transformed into a coherent whole using a multidimensional model that includes measures, dimensions, and hierarchies; and loaded into a data warehouse during the extraction-transformation-loading (ETL) process.

Malinowski and Zimányi explain in detail conventional data warehouse design, covering in particular complex hierarchy modeling. Additionally, they address two innovative domains recently introduced to extend the capabilities of data warehouse systems, namely the management of spatial and temporal information. Their presentation covers different phases of the design process, such as requirements specification, conceptual, logical, and physical design. They include three different approaches for requirements specification depending on whether users, operational data sources, or both are the driving force in the requirements gathering process, and they show how each approach leads to the creation of a conceptual multidimensional model. Throughout the book the concepts are illustrated using many real-world examples and completed by sample implementations for Microsoft's Analysis Services 2005 and Oracle 10g with the OLAP and the Spatial extensions.

For researchers this book serves as an introduction to the state of the art on data warehouse design, with many references to more detailed sources. Providing a clear and a concise presentation of the major concepts and results of data warehouse design, it can also be used as the basis of a graduate or advanced undergraduate course. The book may help experienced data warehouse designers to enlarge their analysis possibilities by incorporating spatial and temporal information. Finally, experts in spatial databases or in geographical information systems could benefit from the data warehouse vision for building innovative spatial analytical applications.


Author Notes

Elzbieta Malinowski is a professor at the department of Computer and Information
Science at the Universidad de Costa Rica and a professional consultant in
Costa Rica in the area of the Data Warehousing. She received her master degrees
from Saint Petersburg Electrotechnical University, Russia (1982) and
University of Florida, USA (1996), and her Ph.D. degree from
Université Libre de Bruxelles, Belgium (2006). Her research interests
include data warehouses, OLAP systems, geographic information systems,
and temporal databases.

Esteban Zimányi is a professor of computer science at the Engineering Department of the Université Libre de Bruxelles (ULB), Belgium. He received the BSc degree (1988) and the doctorate degree (1992) in computer science from the Sciences Department at the ULB. His current research interests include conceptual modeling, geographic information systems, spatio-temporal databases, and semantic web.


Table of Contents

1 Introductionp. 1
1.1 Overviewp. 2
1.1.1 Conventional Data Warehousesp. 2
1.1.2 Spatial Databases and Spatial Data Warehousesp. 4
1.1.3 Temporal Databases and Temporal Data Warehousesp. 5
1.1.4 Conceptual Modeling for Databases and Data Warehousesp. 6
1.1.5 A Method for Data Warehouse Designp. 7
1.2 Motivation for the Bookp. 8
1.3 Objective of the Book and its Contributions to Researchp. 11
1.3.1 Conventional Data Warehousesp. 12
1.3.2 Spatial Data Warehousesp. 13
1.3.3 Temporal Data Warehousesp. 13
1.4 Organization of the Bookp. 14
2 Introduction to Databases and Data Warehousesp. 17
2.1 Database Conceptsp. 18
2.2 The Entity-Relationship Modelp. 19
2.3 Logical Database Designp. 23
2.3.1 The Relational Modelp. 23
2.3.2 The Object-Relational Modelp. 32
2.4 Physical Database Designp. 38
2.5 Data Warehousesp. 41
2.6 The Multidimensional Modelp. 43
2.6.1 Hierarchiesp. 44
2.6.2 Measure Aggregationp. 45
2.6.3 OLAP Operationsp. 47
2.7 Logical Data Warehouse Designp. 49
2.8 Physical Data Warehouse Designp. 51
2.9 Data Warehouse Architecturep. 55
2.9.1 Back-End Tierp. 56
2.9.2 Data Warehouse Tierp. 57
2.9.3 OLAP Tierp. 58
2.9.4 Front-End Tierp. 58
2.9.5 Variations of the Architecturep. 59
2.10 Analysis Services 2005p. 59
2.10.1 Defining an Analysis Services Databasep. 60
2.10.2 Data Sourcesp. 61
2.10.3 Data Source Viewsp. 61
2.10.4 Dimensionsp. 62
2.10.5 Cubesp. 64
2.11 Oracle 10g with the OLAP Optionp. 66
2.11.1 Multidimensional Modelp. 67
2.11.2 Multidimensional Database Designp. 68
2.11.3 Data Source Managementp. 69
2.11.4 Dimensionsp. 70
2.11.5 Cubesp. 71
2.12 Conclusionp. 73
3 Conventional Data Warehousesp. 75
3.1 MultiDim: A Conceptual Multidimensional Modelp. 76
3.2 Data Warehouse Hierarchiesp. 79
3.2.1 Simple Hierarchiesp. 81
3.2.2 Nonstrict Hierarchiesp. 88
3.2.3 Alternative Hierarchiesp. 93
3.2.4 Parallel Hierarchiesp. 94
3.3 Advanced Modeling Aspectsp. 97
3.3.1 Modeling of Complex Hierarchiesp. 97
3.3.2 Role-Playing Dimensionsp. 100
3.3.3 Fact Dimensionsp. 101
3.3.4 Multivalued Dimensionsp. 101
3.4 Metamodel of the MultiDim Modelp. 106
3.5 Mapping to the Relational and Object-Relational Modelsp. 107
3.5.1 Rationalep. 107
3.5.2 Mapping Rulesp. 108
3.6 Logical Representation of Hierarchiesp. 112
3.6.1 Simple Hierarchiesp. 112
3.6.2 Nonstrict Hierarchiesp. 120
3.6.3 Alternative Hierarchiesp. 123
3.6.4 Parallel Hierarchiesp. 123
3.7 Implementing Hierarchiesp. 124
3.7.1 Hierarchies in Analysis Services 2005p. 124
3.7.2 Hierarchies in Oracle OLAP 10gp. 126
3.8 Related Workp. 128
3.9 Summaryp. 130
4 Spatial Data Warehousesp. 133
4.1 Spatial Databases: General Conceptsp. 134
4.1.1 Spatial Objectsp. 134
4.1.2 Spatial Data Typesp. 134
4.1.3 Reference Systemsp. 136
4.1.4 Topological Relationshipsp. 136
4.1.5 Conceptual Models for Spatial Datap. 138
4.1.6 Implementation Models for Spatial Datap. 138
4.1.7 Models for Storing Collections of Spatial Objectsp. 139
4.1.8 Architecture of Spatial Systemsp. 140
4.2 Spatial Extension of the MultiDim Modelp. 141
4.3 Spatial Levelsp. 143
4.4 Spatial Hierarchiesp. 143
4.4.1 Hierarchy Classificationp. 143
4.4.2 Topological Relationships Between Spatial Levelsp. 149
4.5 Spatial Fact Relationshipsp. 152
4.6 Spatiality and Measuresp. 153
4.6.1 Spatial Measuresp. 153
4.6.2 Conventional Measures Resulting from Spatial Operationsp. 156
4.7 Metamodel of the Spatially Extended MultiDim Modelp. 157
4.8 Rationale of the Logical-Level Representationp. 159
4.8.1 Using the Object-Relational Modelp. 159
4.8.2 Using Spatial Extensions of DBMSsp. 160
4.8.3 Preserving Semanticsp. 161
4.9 Object-Relational Representation of Spatial Data Warehousesp. 162
4.9.1 Spatial Levelsp. 162
4.9.2 Spatial Attributesp. 164
4.9.3 Spatial Hierarchiesp. 165
4.9.4 Spatial Fact Relationshipsp. 170
4.9.5 Measuresp. 172
4.10 Summary of the Mapping Rulesp. 174
4.11 Related Workp. 175
4.12 Summaryp. 178
5 Temporal Data Warehousesp. 181
5.1 Slowly Changing Dimensionsp. 182
5.2 Temporal Databases: General Conceptsp. 185
5.2.1 Temporality Typesp. 185
5.2.2 Temporal Data Typesp. 186
5.2.3 Synchronization Relationshipsp. 187
5.2.4 Conceptual and Logical Models for Temporal Databasesp. 189
5.3 Temporal Extension of the MultiDim Modelp. 190
5.3.1 Temporality Typesp. 190
5.3.2 Overview of the Modelp. 192
5.4 Temporal Support for Levelsp. 195
5.5 Temporal Hierarchiesp. 196
5.5.1 Nontemporal Relationships Between Temporal Levelsp. 196
5.5.2 Temporal Relationships Between Nontemporal Levelsp. 198
5.5.3 Temporal Relationships Between Temporal Levelsp. 198
5.5.4 Instant and Lifespan Cardinalitiesp. 199
5.6 Temporal Fact Relationshipsp. 201
5.7 Temporal Measuresp. 202
5.7.1 Temporal Support for Measuresp. 202
5.7.2 Measure Aggregation for Temporal Relationshipsp. 207
5.8 Managing Different Temporal Granularitiesp. 207
5.8.1 Conversion Between Granularitiesp. 208
5.8.2 Different Granularities in Measures and Dimensionsp. 208
5.8.3 Different Granularities in the Source Systems and in the Data Warehousep. 210
5.9 Metamodel of the Temporally Extended MultiDim Modelp. 211
5.10 Rationale of the Logical-Level Representationp. 213
5.11 Logical Representation of Temporal Data Warehousesp. 214
5.11.1 Temporality Typesp. 214
5.11.2 Levels with Temporal Supportp. 216
5.11.3 Parent-Child Relationshipsp. 220
5.11.4 Fact Relationships and Temporal Measuresp. 226
5.12 Summary of the Mapping Rulesp. 228
5.13 Implementation Considerationsp. 229
5.13.1 Integrity Constraintsp. 229
5.13.2 Measure Aggregationp. 234
5.14 Related Workp. 237
5.14.1 Types of Temporal Supportp. 237
5.14.2 Conceptual Models for Temporal Data Warehousesp. 238
5.14.3 Logical Representationp. 240
5.14.4 Temporal Granularityp. 241
5.15 Summaryp. 242
6 Designing Conventional Data Warehousesp. 245
6.1 Current Approaches to Data Warehouse Designp. 246
6.1.1 Data Mart and Data Warehouse Designp. 246
6.1.2 Design Phasesp. 248
6.1.3 Requirements Specification for Data Warehouse Designp. 248
6.2 A Method for Data Warehouse Designp. 250
6.3 A University Case Studyp. 251
6.4 Requirements Specificationp. 253
6.4.1 Analysis-Driven Approachp. 253
6.4.2 Source-Driven Approachp. 261
6.4.3 Analysis/Source-Driven Approachp. 265
6.5 Conceptual Designp. 265
6.5.1 Analysis-Driven Approachp. 266
6.5.2 Source-Driven Approachp. 275
6.5.3 Analysis/Source-Driven Approachp. 278
6.6 Characterization of the Various Approachesp. 280
6.6.1 Analysis-Driven Approachp. 280
6.6.2 Source-Driven Approachp. 282
6.6.3 Analysis/Source-Driven Approachp. 283
6.7 Logical Designp. 283
6.7.1 Logical Representation of Data Warehouse Schemasp. 283
6.7.2 Defining ETL Processesp. 287
6.8 Physical Designp. 288
6.8.1 Data Warehouse Schema Implementationp. 288
6.8.2 Implementation of ETL Processesp. 294
6.9 Method Summaryp. 295
6.9.1 Analysis-Driven Approachp. 296
6.9.2 Source-Driven Approachp. 296
6.9.3 Analysis/Source-Driven Approachp. 297
6.10 Related Workp. 298
6.10.1 Overall Methodsp. 300
6.10.2 Requirements Specificationp. 301
6.11 Summaryp. 305
7 Designing Spatial and Temporal Data Warehousesp. 307
7.1 Current Approaches to the Design of Spatial and Temporal Databasesp. 308
7.2 A Risk Management Case Studyp. 308
7.3 A Method for Spatial-Data-Warehouse Designp. 310
7.3.1 Requirements Specification and Conceptual Designp. 310
7.3.2 Logical and Physical Designp. 321
7.4 Revisiting the University Case Studyp. 324
7.5 A Method for Temporal-Data-Warehouse Designp. 325
7.5.1 Requirements Specification and Conceptual Designp. 326
7.5.2 Logical and Physical Designp. 333
7.6 Method Summaryp. 337
7.6.1 Analysis-Driven Approachp. 337
7.6.2 Source-Driven Approachp. 338
7.6.3 Analysis/Source-Driven Approachp. 339
7.7 Related Workp. 340
7.8 Summaryp. 342
8 Conclusions and Future Workp. 345
8.1 Conclusionsp. 345
8.2 Future Workp. 348
8.2.1 Conventional Data Warehousesp. 348
8.2.2 Spatial Data Warehousesp. 349
8.2.3 Temporal Data Warehousesp. 351
8.2.4 Spatiotemporal Data Warehousesp. 352
8.2.5 Design Methodsp. 353
A Formalization of the MultiDim Modelp. 355
A.1 Notationp. 355
A.2 Predefined Data Typesp. 355
A.3 Metavariablesp. 356
A.4 Abstract Syntaxp. 357
A.5 Examples Using the Abstract Syntaxp. 359
A.5.1 Conventional Data Warehousep. 359
A.5.2 Spatial Data Warehousep. 361
A.5.3 Temporal Data Warehousep. 364
A.6 Semanticsp. 366
A.6.1 Semantics of the Predefined Data Typesp. 367
A.6.2 The Space Modelp. 367
A.6.3 The Time Modelp. 371
A.6.4 Semantic Domainsp. 372
A.6.5 Auxiliary Functionsp. 372
A.6.6 Semantic Functionsp. 375
B Graphical Notationp. 383
B.1 Entity-Relationship Modelp. 383
B.2 Relational and Object-Relational Modelsp. 385
B.3 Conventional Data Warehousesp. 386
B.4 Spatial Data Warehousesp. 388
B.5 Temporal Data Warehousesp. 389
Referencesp. 391
Glossaryp. 411
Indexp. 425
Go to:Top of Page