CS614 : Data Warehousing

Course Overview

Course Synopsis

The focal area of this course is to provide awareness of data warehouse basic components, importance of data warehouse in business, important steps and techniques to be considered during data warehouse development, and future trends and usage of data warehouse. This course will provide the knowledge and skills to design and implement a Data Warehouse. Presented in the regular lectures and 5 lab lectures, participants will experience all phases of a Data Warehouse implementation from Extract, Transform and Load (ETL) of the data to running queries on the final database. The course will have a look at OLAP, MOLAP, ROLAP and other systems and how they differ from one another. It will provide an understanding of how to use data warehousing techniques in implementing it on web.

Course Learning Outcomes

After completing this course you should be able to:

  • Design and implement a Data Warehouse
  • Define the basic concepts and importance of data warehouse
  • Identify the business areas where data warehouse is required
  • Use data warehouse for data mining projects


Course Calendar

1 Introduction & Overview
2 Introduction & Overview
3 Introduction & Overview
4 Introduction & Overview
5 Introduction & Overview
6 Normalization
7 De-Normalization
8 De-Normalization Techniques
Assignment No1
9 Issues of De-Normalization
10 Online Analytical Processing (OLAP)
11 Multidimensional OLAP (MOLAP)
12 Relational OLAP (ROLAP)
13 Dimensional Modeling (DM)
14 Process of Dimensional Modeling
15 Issues of Dimensional Modeling
Quiz No. 1
16 Extract Transform Load (ETL)
17 Issues of ETL
18 ETL Detail: Data Extraction & Transformation
Graded Discussion Board
19 ETL Detail: Data Cleansing
20 Data Duplication Elimination & BSN Method
21 Introduction to Data Quality Management (DQM
Quiz No. 2
22 DQM: Quantifying Data Quality
Mid Term
23 Total DQM
24 Need for Speed: Parallelism
25 Need for Speed: Hardware Techniques
26 Conventional Indexing Techniques
27 Need for Speed: Special Indexing Techniques
28 Join Techniques
29 A Brief Introduction to Data mining (DM)
30 What Can Data Mining Do?
Assignment No. 2
31 Supervised Vs. Unsupervised Learning
32 DWH Lifecycle: Methodologies
33 DWH Implementation: Goal Driven Approach
34 DWH Implementation: Goal Driven Approach
Quiz No. 3
35 DWH Life Cycle: Pitfalls, Mistakes, Tips
36 Course Project
37 Case Study: Agri-Data Warehouse
38 Case Study: Agri-Data Warehouse
Quiz No. 4
39 Web Warehousing: An introduction
40 Web Warehousing: Issues
41 Data Transformation Services
42 Lab Data Set
43 Extracting Data Using Wizard
44 Data Profiling
45 Data Transformation and Standardization
46 Pre-Assessment
Final Term