A Formal Approach to Finding Explanations for Database .ppt
A Formal Approach to Finding Explanations for Database Queries
Sudeepa Roy, Dan Suciu
University of Washington, Seattle

We need to understand “Big Data”
(ref. Big data whitepaper, Jagadish et al., 2011-12)
[Diagram: D1, D2, D3; Data Analysis System]
1. Acquire Data
2. Prepare Data
   Clean, Extract Feature, Integrate
4. Run Queries
5. Plot Graphs
6. Ask Questions!
Do you have an explanation?

Sample Questions
Dataset: Pre-processed DBLP + Affiliation data
Disclaimer: Not all authors have affiliation info
- Why is there a peak for #sigmod papers from industry during 2000-06, while #academia papers kept increasing?
- Why is #SIGMOD papers #PODS papers in UK?
Explanations by our approach at the end

Ideal goal: Why Causality
“What was the cause of the observation?”
Not simple association or correlation
e.g. People having headache drink coffee
- Does coffee cause headache?
- Does headache lead to drinking coffee?

But, causality is hard
- Has been studied for many years (Hume 1748)
- Extensive study in AI over the last decade by Judea Pearl using the notion of intervention: X is a cause of Y, if removal of X also removes Y keeping other conditions unchanged
- Needs controlled experiments
- Not always possible with a database

Realistic Database-y goal: Why Explanation
Causality             | Explanation
Controlled Experiment | Input database and observed query outputs
Causal Paths          | PK-FK constraints and their generalization
Intervention          | Remove input tuples, query output should change
Top Causes            | Top explanations will change the output in the expected direction to a greater extent

Previous/Related Work
Causality in databases: Meliou et al. '10, Meliou et al. '11
- Pearl's notion of causality and intervention
- Causal structure from input to output by lineage
- Cause = individual input tuples, not predicates
- No inherent causal structure in input data
Explanations in databases
- Explaining outliers in aggregate queries: Wu-Madden '13
- Specific applications (Map-Reduce, Access log, User Rating, ...): e.g. Khoussainova et al. '12, Fabbri et al. '12, Das et al. '11
- Informally use intervention
- Explanation = predicate
- Mostly single table, no join
Other related topics
- Provenance, deletion propagation: e.g. Green et al. '07, Buneman et al. '01
- Missing answer/Why-Not: e.g. Herschel et al. '09, Huang et al. '10, Chapman-Jagadish '09
- Finding causal structure/data mining: e.g. Silverstein et al. '00
- OLAP: e.g. Sarawagi-Sathe '01
Upcoming VLDB 2014 Tutorial “Causality and Explanations in Databases”: Alexandra Meliou, Sudeepa Roy, Dan Suciu
This work
- Formal framework of explanations (= predicates) and theoretical analysis
  - causal structure within input data, independent of queries or user questions
  - allow multiple tables and joins
- Optimizations and Evaluation
  - find top explanations using data cube

Outline
- Framework
- Causal Paths and Intervention
- Computing Intervention
- Optimization: Ranking Explanations by Data Cube
- Evaluation
- Future Work

Input and Output
Input
- Toy DBLP database
- Run Group-By Queries and Plot
- Output Plot
- User question: Numerical expression E, Direction: high/low
  E = (q1/q3) / (q2/q4), Direction = high
  Why is q1/q3 higher than q2/q4?
e.g. q1:
    select count(distinct x.pubid)
    from Author x, Authored y, Publication z
    where x.id = y.id and y.pubid = z.pubid
      and z.venue = 'SIGMOD'
      and 2000 <= z.year and z.year <= 2004
      and x.domain = 'com'
These values will vary for q2, q3, q4.
Output
- Explanation(s): Predicate on attributes, e.g.
  [name = JG]
  [name = JG and inst = C.edu]
  [name = JG and year = 2007]
  Note: attr from multiple tables
- E should change when the database is “intervened” with the explanation

Causal Paths by Foreign Key Constraints
Causal path from X to Y: removing X removes Y
Analogy in DB: Foreign key constraints an
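The intervention idea from the "Input and Output" slide can be sketched on a tiny in-memory database. This is only an illustration under stated assumptions, not the authors' implementation: the toy rows, the year ranges and domains chosen for q2, q3 and q4, and the helper names (build_toy_db, count_papers, expression_e, intervene) are all hypothetical. The count uses y.pubid because the assumed Author table has no pubid column, and the intervention is simplified to deleting the matching Author tuples together with the Authored tuples that reference them.

```python
import sqlite3

def build_toy_db():
    """Create a hypothetical toy DBLP database (all rows are invented)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE Author(id INTEGER PRIMARY KEY, name TEXT, inst TEXT, domain TEXT);
        CREATE TABLE Publication(pubid INTEGER PRIMARY KEY, year INTEGER, venue TEXT);
        CREATE TABLE Authored(id INTEGER, pubid INTEGER);
    """)
    conn.executemany("INSERT INTO Author VALUES (?, ?, ?, ?)", [
        (1, "JG", "C.com", "com"), (2, "RR", "M.com", "com"),
        (3, "CM", "U.edu", "edu"), (4, "DS", "W.edu", "edu"),
    ])
    conn.executemany("INSERT INTO Publication VALUES (?, ?, 'SIGMOD')", [
        (1, 2001), (2, 2002), (3, 2003), (4, 2008), (5, 2009), (6, 2010),
    ])
    conn.executemany("INSERT INTO Authored VALUES (?, ?)", [
        (1, 1), (1, 2), (2, 3), (2, 4), (3, 3), (3, 5), (4, 6), (4, 1),
    ])
    return conn

# Same shape as q1 on the slide, with the year range and domain as parameters.
COUNT_SQL = """
    SELECT COUNT(DISTINCT y.pubid)
    FROM Author x, Authored y, Publication z
    WHERE x.id = y.id AND y.pubid = z.pubid
      AND z.venue = 'SIGMOD'
      AND ? <= z.year AND z.year <= ?
      AND x.domain = ?
"""

def count_papers(conn, first_year, last_year, domain):
    return conn.execute(COUNT_SQL, (first_year, last_year, domain)).fetchone()[0]

def expression_e(conn):
    """E = (q1/q3) / (q2/q4); the parameters of q2..q4 are assumptions."""
    q1 = count_papers(conn, 2000, 2004, "com")
    q2 = count_papers(conn, 2007, 2011, "com")
    q3 = count_papers(conn, 2000, 2004, "edu")
    q4 = count_papers(conn, 2007, 2011, "edu")
    return (q1 / q3) / (q2 / q4)

def intervene(conn, author_name):
    """Simplified intervention for the predicate [name = author_name]:
    delete the matching authors and the Authored rows that reference them."""
    conn.execute(
        "DELETE FROM Authored WHERE id IN (SELECT id FROM Author WHERE name = ?)",
        (author_name,),
    )
    conn.execute("DELETE FROM Author WHERE name = ?", (author_name,))

conn = build_toy_db()
before = expression_e(conn)
intervene(conn, "JG")
after = expression_e(conn)
print(before, after)
```

On this invented data E falls from 3.0 to 1.0 once the tuples for [name = JG] are removed. Since the user's direction is "high", a predicate whose removal lowers E this much would rank as a strong candidate explanation in the sense described on the slides.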