|
|
|
Lecture 7 |
|
Professor Burns |
|
|
|
|
A planned, integrated, managed store of relevant
corporate data optimized for analysis, query and reporting functions. |
|
Optimal design is a series of conformed
dimension tables and transaction-grained |
|
Reasonable Staging, if necessary |
|
Clean (correct), consistent ( 1 version),
relevant(data organized to answer business questions) |
|
|
|
|
3NF EDW – large, mainframe oriented, years |
|
3NF EDW w/ dependent data marts – requires a 3NF
EDW to exist |
|
Independent, unintegrated data marts -
redundancy |
|
Integrated, dimensional enterprise data
warehouse – cross process analyses without redundancy, deliver value early |
|
|
|
|
3NF data warehouse precedes any and all
dependent data marts (each addressing a single subject area). |
|
Building data marts in advance would only lead
to redundancy |
|
|
|
|
|
Developed by Greg Jones and advocated by authors |
|
Consider two immutable laws of DW |
|
The most complex part of building a data
warehouse is the loading of data, and |
|
It is impossible to evaluate any design until
legacy data is loaded and shown to the users. |
|
|
|
|
|
Jones proposed 2 changes to traditional approach |
|
Build an EDW a subject area at a time, loading
data into the first subject area design as soon as technically feasible,
and iterate collection of feedback from users thereafter (weeks, not months). |
|
Abandon the mandate of building a large,
comprehensive, data warehouse prior to building star schemas for individual
subject areas. |
|
Prove value earlier, reconcile onerous issues
early, leverage advances in tools and software. |
|
Dimensional EDW, done as prescribed, actually
leads to less redundancy and improved consistency without a 3NF data warehouse. |
|
|
|
|
|
|
|
5 Business Elements |
|
Concept of Operations – organizational structure
(products, divisions, geographies, ect) |
|
Business Model (how are $$$ generated and costs
allocated) |
|
Corporate Strategies (forward looking) |
|
Competitive Environment (relative strengths and
weaknesses of a business) |
|
Operational Environment (key operational
processes in service of the business model) |
|
See Table 14.2 |
|
|
|
|
|
|
Identify and conform primary dimensions for the
entire business (Figure 14.4). Core
Operational Concepts. |
|
Identify key transaction grain fact tables for
the entire business (Figure 14.5) |
|
Create a high-level dimensional EDW design
expressing all relationships (Figure 14.6) |
|
Prioritize subject areas for development
(business need) |
|
Identify source system changes (risk management) |
|
Select data storage, movement, and analysis
sofotware |
|
Document objectives, assumptions and business
requirements |
|
See also “A Practical Guide to Building Data
Marts” on the text CD. |
|
|
|
|
Plan an EDW, build a data mart (Figures 14.7 and
14.8) |
|