MIS 385/MBA 664 Systems Implementation With DBMS/ Database Management
Objectives
Definition of terms
Reasons for information gap between information needs and availability
Reasons for need of data warehousing
Describe three levels of data warehouse architectures
Describe two components of star schema
Estimate fact table size
Design a data mart
Develop requirements for a data mart
Definition
Data Warehouse:
Data Mart:
[Figure: data warehouse with E, T, L (extract, transform, load) steps. One, companywide warehouse. Periodic extraction: data is not completely current in warehouse.]
[Figure: independent data marts. Separate ETL for each independent data mart. Data access complexity due to multiple data marts.]
[Figure: enterprise data warehouse (EDW). Single ETL for enterprise data warehouse.]
[Figure: near real-time ETL for data warehouse.]
Data Characteristics
Status vs. Event Data
Event = a database action (create/update/delete) that results from a transaction
[Figure: a status, followed by an event, followed by a status]
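The status/event distinction above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Event` class, the `apply_event` function, and the account data are hypothetical names and values chosen for the example, not part of the course material.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Event:
    """Hypothetical record of a database action that results from a transaction."""
    action: str                  # "create", "update", or "delete"
    key: str                     # which record the action touches
    value: Optional[int] = None  # new value for create/update


def apply_event(status: Dict[str, int], event: Event) -> Dict[str, int]:
    """Return the status after the event; the input status is left unchanged."""
    after = dict(status)
    if event.action in ("create", "update"):
        after[event.key] = event.value
    elif event.action == "delete":
        after.pop(event.key, None)
    else:
        raise ValueError(f"unknown action: {event.action}")
    return after


# Made-up example data: one account balance before and after an update.
status_before = {"acct-1": 750}
withdrawal = Event(action="update", key="acct-1", value=700)
status_after = apply_event(status_before, withdrawal)

print("status before:", status_before)
print("event:        ", withdrawal)
print("status after: ", status_after)
```

Here the two dictionaries play the role of status data (the state at a point in time), and the `Event` object plays the role of event data (the action that moves the database from one status to the next).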
Data Characteristics
Transient vs. Periodic Data
With transient data, changes to existing records are written over previous records, thus destroying the previous data
Data Characteristics
Transient vs. Periodic Data
Periodic data are never physically altered or deleted once they have been added to the store
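A minimal sketch of the difference between the two kinds of data, assuming a transient store can be modeled as a dictionary that is overwritten in place and a periodic store as an append-only list. The function names, field names, and sample values are hypothetical and chosen only for illustration.

```python
def update_transient(store, key, row):
    """Transient data: the new row overwrites the old one, which is lost."""
    store[key] = row


def update_periodic(store, key, row, as_of):
    """Periodic data: every change is appended; earlier rows are never altered."""
    store.append({"key": key, "as_of": as_of, **row})


# Made-up example: the same two changes applied to each kind of store.
transient_store = {}
periodic_store = []

changes = [
    ("2024-01-05", {"balance": 750}),
    ("2024-01-06", {"balance": 700}),
]

for as_of, row in changes:
    update_transient(transient_store, "acct-1", row)
    update_periodic(periodic_store, "acct-1", row, as_of)

print(transient_store)      # only the latest balance survives
for record in periodic_store:
    print(record)           # both versions are kept, each with its date
```

After both changes the transient store holds a single record with the latest balance, while the periodic store holds one dated record per change, so the history can still be queried.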
Derived Data
Objectives
Characteristics
Star schema
Modeling dates
Tracking events
Inventory coverage
Multivalued Dimensions
Hierarchies
OLAP Operations
Example of drill-down
Starting with summary data, users can obtain details for particular cells
[Figure: summary report, followed by drill-down with color added]
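The drill-down idea can be sketched with plain Python. The sales rows, the dimension names (`region`, `product`), and the helper functions `summarize` and `drill_down` are all hypothetical, used here only to show a summary cell being expanded into its details.

```python
from collections import defaultdict

# Made-up detail rows; a real warehouse would hold these in a fact table.
sales = [
    {"region": "East", "product": "Widget", "units": 10},
    {"region": "East", "product": "Gadget", "units": 5},
    {"region": "West", "product": "Widget", "units": 7},
    {"region": "West", "product": "Gadget", "units": 3},
]


def summarize(rows, by):
    """Total units for each value of one dimension."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[by]] += row["units"]
    return dict(totals)


def drill_down(rows, cell_dim, cell_value, detail_dim):
    """Expand one summary cell by breaking it out along another dimension."""
    selected = [row for row in rows if row[cell_dim] == cell_value]
    return summarize(selected, detail_dim)


summary = summarize(sales, "region")
detail = drill_down(sales, "region", "East", "product")

print("summary by region:", summary)
print("drill-down on East:", detail)
```

The summary shows one total per region; drilling down on the East cell adds the product dimension and shows how that total breaks out.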
Techniques
Statistical regression
Decision tree induction
Clustering and signal processing
Affinity
Sequence association
Case-based reasoning
Rule discovery
Neural nets
Fractals