CS614 : Data Warehousing

Course Overview

Course Synopsis

The focal area of this course is to provide awareness of data warehouse basic components, importance of data warehouse in business, important steps and techniques to be considered during data warehouse development, and future trends and usage of data warehouse. This course will provide the knowledge and skills to design and implement a Data Warehouse. Presented in the regular lectures and 5 lab lectures, participants will experience all phases of a Data Warehouse implementation from Extract, Transform and Load (ETL) of the data to running queries on the final database. The course will have a look at OLAP, MOLAP, ROLAP and other systems and how they differ from one another. It will provide an understanding of how to use data warehousing techniques in implementing it on web.

Course Learning Outcomes

After completing this course you should be able to:

  • Design and implement a Data Warehouse
  • Define the basic concepts and importance of data warehouse
  • Identify the business areas where data warehouse is required
  • Use data warehouse for data mining projects


Course Calendar

1 Introduction & Overview
2 Introduction & Overview
3 Introduction & Overview
4 Introduction & Overview
5 Introduction & Overview
6 Normalization
7 De-Normalization
8 De-Normalization Techniques
9 Issues of De-Normalization
Assignment No. 1
10 Online Analytical Processing (OLAP)
Quiz No. 1
11 Multidimensional OLAP (MOLAP)
12 Relational OLAP (ROLAP)
13 Dimensional Modeling (DM)
14 Process of Dimensional Modeling
15 Issues of Dimensional Modeling
GDB
16 Extract Transform Load (ETL)
17 Issues of ETL
Quiz No. 2
18 ETL Detail: Data Extraction & Transformation
Mid-Term Exam
19 ETL Detail: Data Cleansing
20 Data Duplication Elimination & BSN Method
21 Introduction to Data Quality Management (DQM
22 DQM: Quantifying Data Quality
23 Total DQM
24 Need for Speed: Parallelism
25 Need for Speed: Hardware Techniques
26 Conventional Indexing Techniques
27 Need for Speed: Special Indexing Techniques
Assignment No. 2
28 Join Techniques
29 A Brief Introduction to Data mining (DM)
30 What Can Data Mining Do?
31 Supervised Vs. Unsupervised Learning
32 DWH Lifecycle: Methodologies
33 DWH Implementation: Goal Driven Approach
34 DWH Implementation: Goal Driven Approach
35 DWH Life Cycle: Pitfalls, Mistakes, Tips
Quiz No. 3
36 Course Project
37 Case Study: Agri-Data Warehouse
38 Case Study: Agri-Data Warehouse
39 Web Warehousing: An introduction
40 Web Warehousing: Issues
41 Data Transformation Services
Quiz No. 4
42 Lab Data Set
43 Extracting Data Using Wizard
44 Data Profiling
45 Data Transformation and Standardization
Final Term Exam
46 Pre-Assessment