Data Mining- Concepts and Techniques Slides for .ppt
《Data Mining- Concepts and Techniques Slides for .ppt》由会员分享,可在线阅读,更多相关《Data Mining- Concepts and Techniques Slides for .ppt(34页珍藏版)》请在麦多课文档分享上搜索。
1、October 4, 2018,Data Mining: Concepts and Techniques,1,Data Mining: Concepts and Techniques Slides for Textbook Chapter 1 ,Jiawei Han and Micheline Kamber Department of Computer Science University of Illinois at Urbana-Champaign www.cs.uiuc.edu/hanj,October 4, 2018,Data Mining: Concepts and Techniqu
2、es,2,Data Mining: Concepts and Techniques,October 4, 2018,Data Mining: Concepts and Techniques,3,Acknowledgements,This set of slides started with Hans tutorial for UCLA Extension course in February 1998 Other subsequent contributors: Dr. Hongjun Lu (Hong Kong Univ. of Science and Technology) Graduat
3、e students from Simon Fraser Univ., Canada, notably Eugene Belchev, Jian Pei, and Osmar R. Zaiane Graduate students from Univ. of Illinois at Urbana-Champaign,October 4, 2018,Data Mining: Concepts and Techniques,4,CS497JH Schedule (Fall 2002),Chapter 1. Introduction W1:L1 Chapter 2. Data pre-process
4、ing W4: L1-2 Homework # 1 distribution (SQLServer2000) Chapter 3. Data warehousing and OLAP technology for data mining W2:L1-2, W3:L1-2 Homework # 2 distribution Chapter 4. Data mining primitives, languages, and system architectures W5: L1 Chapter 5. Concept description: Characterization and compari
5、son W5: L2, W6: L1 Chapter 6. Mining association rules in large databases W6:L2, W7:L1-L21, W8: L1 Homework #3 distribution Chapter 7. Classification and prediction W8:L2, W9: L2, W10:L1 Midterm W9: L1 Chapter 8. Clustering analysis W10:L2, W11: L1-2 Homework #4 distribution Chapter 9. Mining comple
6、x types of data W12: L1-2, W13:L1-2 Chapter 10. Data mining applications and trends in data mining W14: L1 Research/Development project presentation (W14-W15 + final exam period) Final Project Due,October 4, 2018,Data Mining: Concepts and Techniques,5,Where to Find the Set of Slides?,Book page: (MS
7、PowerPoint files): www.cs.uiuc.edu/hanj/dmbook Updated course presentation slides (.ppt): www-courses.cs.uiuc.edu/cs497jh/ Research papers, DBMiner system, and other related information: www.cs.uiuc.edu/hanj or ,October 4, 2018,Data Mining: Concepts and Techniques,6,Chapter 1. Introduction,Motivatio
8、n: Why data mining? What is data mining? Data Mining: On what kind of data? Data mining functionality Are all the patterns interesting? Classification of data mining systems Major issues in data mining,October 4, 2018,Data Mining: Concepts and Techniques,7,Necessity Is the Mother of Invention,Data e
9、xplosion problem Automated data collection tools and mature database technology lead to tremendous amounts of data accumulated and/or to be analyzed in databases, data warehouses, and other information repositories We are drowning in data, but starving for knowledge! Solution: Data warehousing and d
10、ata mining Data warehousing and on-line analytical processing Miing interesting knowledge (rules, regularities, patterns, constraints) from data in large databases,October 4, 2018,Data Mining: Concepts and Techniques,8,Evolution of Database Technology,1960s: Data collection, database creation, IMS a
11、nd network DBMS 1970s: Relational data model, relational DBMS implementation 1980s: RDBMS, advanced data models (extended-relational, OO, deductive, etc.) Application-oriented DBMS (spatial, scientific, engineering, etc.) 1990s: Data mining, data warehousing, multimedia databases, and Web databases
12、2000s Stream data management and mining Data mining with a variety of applications Web technology and global information systems,October 4, 2018,Data Mining: Concepts and Techniques,9,What Is Data Mining?,Data mining (knowledge discovery from data) Extraction of interesting (non-trivial, implicit, p
13、reviously unknown and potentially useful) patterns or knowledge from huge amount of data Data mining: a misnomer? Alternative names Knowledge discovery (mining) in databases (KDD), knowledge extraction, data/pattern analysis, data archeology, data dredging, information harvesting, business intellige
14、nce, etc. Watch out: Is everything “data mining”? (Deductive) query processing. Expert systems or small ML/statistical programs,October 4, 2018,Data Mining: Concepts and Techniques,10,Why Data Mining?Potential Applications,Data analysis and decision support Market analysis and management Target mark
15、eting, customer relationship management (CRM), market basket analysis, cross selling, market segmentation Risk analysis and management Forecasting, customer retention, improved underwriting, quality control, competitive analysis Fraud detection and detection of unusual patterns (outliers) Other Appl
16、ications Text mining (news group, email, documents) and Web mining Stream data mining DNA and bio-data analysis,October 4, 2018,Data Mining: Concepts and Techniques,11,Market Analysis and Management,Where does the data come from? Credit card transactions, loyalty cards, discount coupons, customer co
17、mplaint calls, plus (public) lifestyle studies Target marketing Find clusters of “model” customers who share the same characteristics: interest, income level, spending habits, etc. Determine customer purchasing patterns over time Cross-market analysis Associations/co-relations between product sales,
18、 & prediction based on such association Customer profiling What types of customers buy what products (clustering or classification) Customer requirement analysis identifying the best products for different customers predict what factors will attract new customers Provision of summary information mul
19、tidimensional summary reports statistical summary information (data central tendency and variation),October 4, 2018,Data Mining: Concepts and Techniques,12,Corporate Analysis & Risk Management,Finance planning and asset evaluation cash flow analysis and prediction contingent claim analysis to evalua
20、te assets cross-sectional and time series analysis (financial-ratio, trend analysis, etc.) Resource planning summarize and compare the resources and spending Competition monitor competitors and market directions group customers into classes and a class-based pricing procedure set pricing strategy in
21、 a highly competitive market,October 4, 2018,Data Mining: Concepts and Techniques,13,Fraud Detection & Mining Unusual Patterns,Approaches: Clustering & model construction for frauds, outlier analysis Applications: Health care, retail, credit card service, telecomm. Auto insurance: ring of collisions
22、 Money laundering: suspicious monetary transactions Medical insurance Professional patients, ring of doctors, and ring of references Unnecessary or correlated screening tests Telecommunications: phone-call fraud Phone call model: destination of the call, duration, time of day or week. Analyze patter
23、ns that deviate from an expected norm Retail industry Analysts estimate that 38% of retail shrink is due to dishonest employees Anti-terrorism,October 4, 2018,Data Mining: Concepts and Techniques,14,Other Applications,Sports IBM Advanced Scout analyzed NBA game statistics (shots blocked, assists, an
24、d fouls) to gain competitive advantage for New York Knicks and Miami Heat Astronomy JPL and the Palomar Observatory discovered 22 quasars with the help of data mining Internet Web Surf-Aid IBM Surf-Aid applies data mining algorithms to Web access logs for market-related pages to discover customer pr
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- DATAMININGCONCEPTSANDTECHNIQUESSLIDESFORPPT

链接地址:http://www.mydoc123.com/p-372902.html