You don’t have to be great to start, but you have to start to be great.
--Your friends at LectureNotes

Note for Data Mining And Data Warehousing - DMDW by Ashutosh Jaiswal

  • Data Mining And Data Warehousing - DMDW
  • Note
  • 299 Offline Downloads
  • Uploaded 5 months ago
Ashutosh Jaiswal
Ashutosh Jaiswal
0 User(s)
Download PDFOrder Printed Copy

Share it with your friends

Leave your Comments

Text from page-1

Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 1

Text from page-2

Why Mine Data? Commercial Viewpoint  Lots of data is being collected and warehoused – Web data, e-commerce – purchases at department/ grocery stores – Bank/Credit Card transactions  Computers have become cheaper and more powerful  Competitive Pressure is Strong – Provide better, customized services for an edge (e.g. in Customer Relationship Management) © Tan,Steinbach, Kumar Introduction to Data Mining 4/18/2004 2

Text from page-3

Why Mine Data? Scientific Viewpoint  Data collected and stored at enormous speeds (GB/hour) – remote sensors on a satellite – telescopes scanning the skies – microarrays generating gene expression data – scientific simulations generating terabytes of data   Traditional techniques infeasible for raw data Data mining may help scientists – in classifying and segmenting data – in Hypothesis Formation

Text from page-4

Mining Large Data Sets - Motivation    There is often information “hidden” in the data that is not readily evident Human analysts may take weeks to discover useful information Much of the data is never analyzed at all 4,000,000 3,500,000 The Data Gap 3,000,000 2,500,000 2,000,000 1,500,000 Total new disk (TB) since 1995 1,000,000 Number of analysts 500,000 0 1995 1996 1997 1998 1999 ©From: Tan,Steinbach, R. Grossman, Kumar C. Kamath, V. Kumar, Introduction “Data Mining to Data for Mining Scientific and Engineering Applications” 4/18/2004 4

Lecture Notes