Directed Acyclic Graphs.ppt
《Directed Acyclic Graphs.ppt》由会员分享,可在线阅读,更多相关《Directed Acyclic Graphs.ppt(100页珍藏版)》请在麦多课文档分享上搜索。
1、1,Directed Acyclic Graphs,David A. Bessler Texas A&M UniversityNovember 20, 2002Universidad Internacional del Ecuador Quito, Ecuador,2,Outline,Introduction Causal Forks Inverted Causal Forks D-separation Markov Property The Adjustment Problem Policy Modeling PC Algorithm,3,Outline Continued,Example:
2、 Traffic Fatalities Correlation and Partial Correlation Forecasting Traffic Fatalities More Examples: US Money, Prices and Income World Stock Markets Conclusion,4,Motivation,Oftentimes we are uncertain about which variables are causal in a modeling effort. Theory may tell us what our fundamental cau
3、sal variables are in a controlled system; however, it is common that our data may not be collected in a controlled environment. In fact we are rarely involved with the collection of our data.,5,Observational Data,In the case where no experimental control is present in the generation of our data, suc
4、h data are said to be observational (non-experimental) and usually secondary, not collected explicitly for our purpose but rather for some other primary purpose.,6,Use of Theory,Theory is a good potential source of information about direction of causal flow. However, theory usually invokes the ceter
5、is paribus condition to achieve results. Data are usually observational (non-experimental) and thus the ceteris paribus condition may not hold. We may not ever know if it holds because of unknown variables operating on our system (see Malinvauds econometric text).,7,Experimental Methods,If we do not
6、 know the “true“ system, but have an approximate idea that one or more variables operate on that system, then experimental methods can yield appropriate results. Experimental methods work because they use randomization, random assignment of subjects to alternative treatments, to account for any addi
7、tional variation associated with the unknown variables on the system.,8,Directed Graphs Can Be Used To Represent Causation,Directed graphs help us assign causal flows to a set of observational data.The problem under study and theory suggests certain variables ought to be related, even if we do not k
8、now exactly how; i.e. we dont know the “true“ system.,9,Causal Models Are Well Represented By Directed Graphs,One reason for studying causal models, represented here as X Y, is to predict the consequences of changing the effect variable (Y) by changing the cause variable (X). The possibility of mani
9、pulating Y by way of manipulating X is at the heart of causation. Hausman (1998, page 7) writes: “Causation seems connected to intervention and manipulation: One can use causes to wiggle their effects.”,10,We Need More Than Algebra To Represent Cause,Linear algebra is symmetric with respect to the e
10、qual sign. We can re-write y = a + bx as x = -a/b +(1/b)y. Either form is legitimate for representing the information conveyed by the equation. A preferred representation of causation would be the sentence x y, or the words: “if you change x by one unit you will change y by b units, ceteris paribus.
11、” The algebraic statement suggests a symmetry that does not hold for causal statements.,11,Arrows Carry the Information,An arrow placed with its base at X and head at Y indicates X causes Y: X Y. By the words “X causes Y” we mean that one can change the values of Y by changing the values of X.Arrows
12、 indicate a productive or genetic relationship between X and Y. Causal Statements are asymmetric: x y is not consistent with y x.,12,Problems with Predictive Definitions of Cause,Definition of the word “cause” that focus on prediction alone, without distinguishing between intervention (first) and su
13、bsequent realization, may mistakenly label as causal variables that are associated only through an omitted variable. Prediction is one attribute of the word “cause.” We must be careful not to make it the only attribute (more or less a summary of Bunge 1959).,13,Granger-type Causality,For example, Gr
14、anger-type causality (Granger 1980) focuses solely on prediction, without considering intervention. If we can predict Y better by using past values of X than by not using past values of X , then X Granger-causes Y.The consequences of such focus is to open oneself up to the frustration of unrealized
15、expectations by attempting policy on the wrong set of variables.,14,Graph,A graph is an ordered triple .V is a non-empty set of vertices (variables). M is a non-empty set of marks (symbols attached to the end of undirected edges).E is a set of ordered pairs. Each member of E is called an edge.,15,Ve
16、rtices are variables; Edges are lines,Vertices connected by an edge are said to be adjacent. If we have a set of vertices A,B,C,D the undirected graph contains only undirected edges (e.g., A B). A directed graph contains only directed edges:C D.,16,Directed Acyclic Graphs (DAGs),A directed acyclic g
17、raph is a directed graph that contains no directed cyclic paths. An acyclic graph has no path that leads away from a variable only to return to that same variable. The path A B C A is labeled “cyclic” as here we move from A to B, but then return to A by way of C.,17,Graphs and Probabilities of Varia
18、bles,Directed acyclic graphs are pictures (illustrations) for representing conditional independence as given by the recursive decomposition:n Pr(v1,v2 vn-1,vn ) = Pr( vi | pai )i=1 where Pr is the probability of vertices (variables) v1, v2, v3, . vn and pai the realization of some subset of the vari
19、ables that precede (come before in a causal sense) vi in order (v1, v2, v3, . vn), and the symbol represents the product operation, with index of operation denoted below (start) and above (finish) the symbol. Think of pai as the parent of variable i.,18,D-Separation,Let X, Y and Z be three disjoint
20、subsets of variables in a directed acylic graph G, and let p be any path between a vertex variable in X and a vertex variable in Y, where by path we mean any succession of edges, regardless of their directions. Z is said to block p if there is a vertex w on p satisfying one of the following: (i) w h
21、as converging arrows along p, and neither w nor any of its descendants are on Z or (ii) w does not have converging arrows along p, and w is in Z. Furthermore, Z is said to d-separate X from Y on graph G, written (X Y | Z)G , if and only if Z blocks every path from a vertex variable in X to a vertex
22、variable in Y.,19,Graphs and D-Separation,Geiger, Verma and Pearl (1990) show that there is a one-to-one correspondence between the set of conditional independencies, X Y | Z, implied by the above factorization and the set of triples, X, Y, Z, that satisfy the d-separation criterion in graph G. If G
23、 is a directed acyclic graph with vertex set V, if A and B are in V and if H is also in V, then G linearly implies the correlation between A and B conditional on H is zero if and only if A and B are d-separated given H.,20,Colliders (Inverted Fork),Consider three variables (vertices): A, B and C. A
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- DIRECTEDACYCLICGRAPHSPPT
