Close

DWDM-P05
(PGP 2005-07 : Term-VI


DATA WAREHOUSING & DATA MINING
Course Outline on
Instructor: Pradip Kumar Bala
Objective: To impart knowledge on the emerging trends in Data Warehousing & Data Miming, enabling students understand and appreciate the importance of making meaningful use of large volume of data in decision-making process

Session Details:
Session
Topic
Session Details
1
Introduction-Introduction

-Data Warehousing & Data Mining as a subject

-Motivation behind data mining

2
Data Warehouse-What is a data warehouse?

-Definition

-Multidimensional data model

3
Data Warehouse vs Database-Difference between Operational Database Systems and Data Warehouses

-Basic elements of the data warehouse

-Commercial Importance of data warehouse

4
Data Warehousing Architecture -DW Architecture

-Enterprise Warehouse

-Data Marts

-Virtual Data Warehouse

-Metadata

5
Multidimensional data modelMultidimensional Representation of data

-Dimension Modeling & Hierarchy

-Lattice of Cubouds

-Summary Measures

6
OLAP Operations-Slicing & Dicing

-Drill-up & Drill-down

-Drill within & Drill Across

-Pivot

7
Warehouse Schema-Normalization vs Dimensional Modeling

-Star Schema, Snowflake Schema and Fact Constellation

8
OLAP Engine-Specialized SQL Server

-ROLAP, MOLAP and HOLAP

9
Data Warehouse Implementation-Efficient Computation of Data Cubes

-Indexing OLAP Data

-Backend Processes

10
DW Case Study 1 : Retail Sales

DW Case Study 2:

Inventory

-Retail Schema in action

-Retail Schema Extensibility

-Promotion Dimension

-Degenerate Transaction Number Dimension

-Market Basket analysis

-Inventory Periodic snapshot

-Inventory Transactions

-Inventory Accumulating Snapshot

11
DW Case Study 3:

Procurement

-Slowly Changing Dimensions (SCD)

-How to handle with SCD

12
DW Case Study 4:

CRM

DW Case Study 5:

Banking

DW Study 6: Insurance

-Large Changing Customer Dimensions

-Analyzing Customer Data from Multiple Business Processes

-Banking case study

-Insurance case study

13
Data Mining-What is data mining

-KDD vs Data Mining

-DBMS vs DM

-DM Techniques

-DM Application Areas

-DM applications: Case Studies

14
Association Rules-Introduction

-What is an association rule?

-Support, Confidence, and Lift paradigm

-Generalized Association Rule (Numeric,categoric,temporal,spatial etc)

-Case study on use of association rules in Market Basket Analysis and Inventory Management

15
Methods to discover association rules-Discussion of DM algorithms (A priori, DIC, FP Tree Growth)
16
Clustering & Classification-Definitions of clustering & classification

-Algorithms for clustering

-Case Study on clustering

17
Classification-Decision Trees based on information entropy

-Case Study on building Decision Tree

18
Classification-Neural Network

-Case study: Data Mining using Neural Network

19
Data Mining in soft computing paradigm-Genetic Algorithms

-Fuzzy and Neuro Fuzzy Approaches

-Rough Set

-Support Vector Machine

20
Web Mining-Web Content Mining

-Text Mining

Suggested Readings:
(i) J. Hahn and Micheline Kamber - Data Mining: Concepts and Techniques
(Morgan Kaufmann)
(ii) R.Kimball - DataWarehouse Toolkit (J.Wiley)
(iii) O.P.Rud – Data Mining: Modeling Data for Marketing, Risk and CRM (Wiley)
(iv)A.K.Pujari -Data mining (University Press)
Evaluation Methodology: Mid Term: 30, Quizzes (surprise or announced): 10, Class Participation: 10, Presentation: 10, End-Term: 40
Created By: Lingaraj Pattanaik on 08/31/2006 at 12:21 PM
Category: PGP-II Doctype: Document

...........................