Biological networksConstruction andAnalysis.ppt
《Biological networksConstruction andAnalysis.ppt》由会员分享,可在线阅读,更多相关《Biological networksConstruction andAnalysis.ppt(47页珍藏版)》请在麦多课文档分享上搜索。
1、Biological networks Construction and Analysis,Recap,Gene regulatory networks Transcription Factors: special proteins that function as “keys” to the “switches” that determine whether a protein is to be produced Gene regulatory networks try to show this “key-product” relationship and understand the re
2、gulatory mechanisms that govern the cell.We went over a simple algorithm for detecting significant patterns in these networks,Other networks?,Apart from regulation there are other events in a cell that require interaction of biological molecules Other types of molecular interactions that can be obse
3、rved in a cell enzyme ligand enzyme: a protein that catalyzes, or speeds up, a chemical reaction ligand: extracellular substance that binds to receptors metabolic pathways protein protein cell signaling pathways proteins interact physically and form large complexes for cell processes,Pathways are in
4、ter-linked,Signalling pathway,Genetic network,Metabolic pathway,STIMULUS,Interactions Pathways Network,A collection of interactions defines a network Pathways are subsets of networks All pathways are networks of interactions, however not all networks are pathways! Difference in the level of annotati
5、on or understanding We can define a pathway as a biological network that relates to a known physiological process or complete function,The “interactome”,The complete wiring of a proteome. Each vertex represents a protein. Each edge represents an “interaction” between two proteins.,An edge between tw
6、o proteins if.,The proteins interact physically and form large complexes The proteins are enzymes that catalyze two successive chemical reactions in a pathway One of the proteins regulates the expression of the other,Sources for interaction data,Literature: research labs have been conducting small-s
7、cale experiments for many years! Interaction dabases: MIPS (Munich Information center for Protein Sequences) BIND (Biomolecular Network Interaction Database) GRID (General Repository for Interaction Datasets) DIP (Database of Interacting Proteins) Experiments: Y2H (yeast two-hybrid method) APMS (aff
8、inity purification coupled with mass spectrometry),These methods provide the ability to perform genome/proteome-scale experiments. For yeast: 50,000 unique interactions involving 75% of known open reading frames (ORFs) of yeast genome However, for C. elegans they provide relatively small coverage of
9、 the genome with 5600 interactions. Problems with high-throughput experiments: Low quality, false positives, false negatives Fraction of biologically relevant interactions: 30%-50% (Deane et al. 2002),Solution:,User other indirect data sources to create a probabilistic protein network. Other sources
10、 include: Genome data: Existence of genes in multiple organisms Locations of the genes Bio-image data Gene Ontology annotations Microarray experiments Sub-cellular localization data,Probabilistic network approach,Each “interaction” link between two proteins has a posterior probability of existence,
11、based on the quality of supporting evidence.,Bayesian Network approach,Jansen et al. (2003) Science. Lee et al. (2004) Science. Combine individual probabilities of likelihood computed for each data source into a single likelihood (or probability) Naive Bayes: Assume independence of data sources Comb
12、ine likelihoods using simple multiplication,Bayesian Approach,A scalar score for a pair of genes is computed separately for each information source. Using gold positives (known interacting pairs) and gold negatives (known non-interacting pairs) interaction likelihoods for each information source is
13、computed. The product of likelihoods can be used to combine multiple information sources Assumption: A score from a source is independent from a score from another source.,Computing the likelihoods,Partition the pair scores of an information source into bins and provide likelihoods for score-ranges
14、E.g. Using the microarray information source and using Pearson correlation for scoring protein pairs you may get scores between -1 and 1. You want to know what is the likelihood of interaction for a protein pair that gets a Pearson correlation of 0.6.,Partitioning the scores,Computing the likelihood
15、,P(Interaction | Score) / P (Interaction)L = -P(Interaction | Score) / P (Interaction)Example,Protein interaction networks,Large scale (genome wide networks):,ProNet (Asthana et al.)Yeast3,112 nodes12,594 edges,Analyzing Protein Networks,Predict members of a partially known protein complex/pathway.
16、Infer individual genes functions on the basis of linked neighbors. Find strongly connected components, clusters to reveal unknown complexes. Find the best interaction path between a source and a target gene.,Simple analysis,The network can be thresholded to reveal clusters of interacting proteins,Co
17、mplex/Pathway membership problem,E.g., C. elegans cell death (apoptosis) pathway Identified 50 genes involved in the pathway. Are there other genes involved in the pathway? Biologists would like to know: Which genes (out of 15K genes) should be tested in the RNAi screens next?,Complex/pathway member
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- BIOLOGICALNETWORKSCONSTRUCTIONANDANALYSISPPT
