BCB 444-544.ppt
《BCB 444-544.ppt》由会员分享,可在线阅读,更多相关《BCB 444-544.ppt(68页珍藏版)》请在麦多课文档分享上搜索。
1、BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,BCB 444/544,Lecture 28Gene Prediction - finish itPromoter Prediction #28_Oct29,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Mon Oct 29 - Lecture 28Promoter & Regulatory Element Prediction Chp 9 - pp 113 - 126Wed Oct 30 - Lecture 29Phylogenetic
2、s Basics Chp 10 - pp 127 - 141Thurs Oct 31 - Lab 9 Gene & Regulatory Element PredictionFri Oct 30 - Lecture 29Phylogenetic Tree Construction Methods & Programs Chp 11 - pp 142 - 169,Required Reading (before lecture),BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Assignments & Announcements,Mon O
3、ct 29 - HW#5 - will be posted todayHW#5 = Hands-on exercises with phylogenetics and tree-building softwareDue: Mon Nov 5 (not Fri Nov 1 as previously posted),BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,BCB 544 “Team“ Projects,Last week of classes will be devoted to ProjectsWritten reports due
4、: Mon Dec 3 (no class that day)Oral presentations (20-30) will be: Wed-Fri Dec 5,6,7 1 or 2 teams will present during each class periodSee Guidelines for Projects posted online,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,BCB 544 Only: New Homework Assignment,544 Extra#2 Due: PART 1 - ASAPPART
5、 2 - meeting prior to 5 PM Fri Nov 2Part 1 - Brief outline of Project, email to Drena & Michaelafter response/approval, then: Part 2 - More detailed outline of projectRead a few papers and summarize status of problemSchedule meeting with Drena & Michael to discuss ideas,BCB 444/544 F07 ISU Dobbs #28
6、- Promoter Prediction,Seminars this Week,BCB List of URLs for Seminars related to Bioinformatics:http:/www.bcb.iastate.edu/seminars/index.htmlNov 1 Thurs - BBMB Seminar 4:10 in 1414 MBB Todd Yeates UCLA TBA -something cool about structure and evolution?Nov 2 Fri - BCB Faculty Seminar 2:10 in 102 ScI
7、 Bob Jernigan BBMB, ISU Control of Protein Motions by Structure,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Chp 8 - Gene Prediction,SECTION III GENE AND PROMOTER PREDICTIONXiong: Chp 8 Gene PredictionCategories of Gene Prediction Programs Gene Prediction in Prokaryotes Gene Prediction in Euka
8、ryotes,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Computational Gene Prediction: Approaches,Ab initio methods Search by signal: find DNA sequences involved in gene expression Search by content: Test statistical properties distinguishing coding from non-coding DNA Similarity-based methods Dat
9、abase search: exploit similarity to proteins, ESTs, cDNAs Comparative genomics: exploit aligned genomes Do other organisms have similar sequence? Hybrid methods - best,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Computational Gene Prediction: Algorithms,Neural Networks (NNs) (more on these la
10、ter)e.g., GRAILLinear discriminant analysis (LDA) (see text)e.g., FGENES, MZEFMarkov Models (MMs) & Hidden Markov Models (HMMs) e.g., GeneSeqer - uses MMs GENSCAN - uses 5th order HMMs - (see text)HMMgene - uses conditional maximum likelihood (see text),This is a new slide,BCB 444/544 F07 ISU Dobbs
11、#28- Promoter Prediction,Signals Search,Approach: Build models (PSSMs, profiles, HMMs, ) and search against DNA. Detected instances provide evidence for genes,This is a new slide,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Content Search,Observation: Encoding a protein affects statistical pro
12、perties of DNA sequence: Nucleotide.amino acid distribution GC content (CpG islands, exon/intron) Uneven usage of synonymous codons (codon bias) Hexamer frequency - most discriminative of these for identifying coding potentialMethod: Evaluate these differences (coding statistics) to differentiate be
13、tween coding and non-coding regions,This is a new slide,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Human Codon Usage,This is a new slide,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Predicting Genes based on Codon Usage Differences,Algorithm: Process sliding window Use codon frequencie
14、s to compute probability of coding versus non-coding Plot log-likelihood ratio:,This is a new slide,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,In different genomes: Translate DNA into all 6 reading frames and search against proteins (TBLASTX,BLASTX, etc.)Within same genome: Search with EST/c
15、DNA database (EST2genome, BLAT, etc.).Problems: Will not find “new” or RNA genes (non-coding genes). Limits of similarity are hard to define Small exons might be overlooked,Similarity-Based Methods: Database Search,This is a new slide,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Similarity-Bas
16、ed Methods: Comparative Genomics,Idea: Functional regions are more conserved than non-functional ones; high similarity in alignment indicates geneAdvantages: May find uncharacterized or RNA genes Problems: Finding suitable evolutionary distance Finding limits of high similarity (functional regions),
17、This is a new slide,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Human-Mouse Homology,Comparison of 1196 orthologous genes Sequence identity between genes in human vs mouse Exons: 84.6% Protein: 85.4% Introns: 35% 5 UTRs: 67% 3 UTRs: 69%,This is a new slide,BCB 444/544 F07 ISU Dobbs #28- Promo
18、ter Prediction,Thanks to Volker Brendel, ISU for the following Figs & Slides,Slightly modified from:BSSI Genome Informatics Module http:/www.bioinformatics.iastate.edu/BBSI/course_desc_2005.html#moduleBV Brendel vbrendeliastate.edu,Brendel et al (2004) Bioinformatics 20: 1157,BCB 444/544 F07 ISU Dob
19、bs #28- Promoter Prediction,Perform pairwise alignment with large gaps in one sequence (due to introns) Align genomic DNA with cDNA, ESTs, protein sequences Score semi-conserved sequences at splice junctions Using Bayesian probability model & 1st order MMScore coding constraints in translated exons
20、Using Bayesian model,Spliced Alignment Algorithm,GeneSeqer - Brendel et al.- ISU,http:/deepc2.psi.iastate.edu/cgi-bin/gs.cgi,Brendel et al (2004) Bioinformatics 20: 1157 http:/bioinformatics.oxfordjournals.org/cgi/content/abstract/20/7/1157,Brendel 2005,BCB 444/544 F07 ISU Dobbs #28- Promoter Predic
21、tion,i: ith position in sequence : avg information content over all positions 20 nt from splice site : avg sample standard deviation of ,Splice Site Detection,Do DNA sequences surrounding splice “consensus“ sequences contribute to splicing signal?,YES,Brendel 2005,BCB 444/544 F07 ISU Dobbs #28- Prom
22、oter Prediction,Information Content vs Position,Which sequences are exons & which are introns?How can you tell?,Brendel 2005,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Markov Model for Spliced Alignment,Brendel 2005,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Evaluation of Splice Site
23、 Prediction,Fig 5.11 Baxevanis & Ouellette 2005,This is a new slide,TP = positive instance correctly predicted as positive FP = negative instance incorrectly predicted as positive TN = negative instance correctly predicted as negative FN = positive instance incorrectly predicted as negative,Right!,B
24、CB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Evaluation of Predictions,Normalized specificity:,Specificity:,Misclassification rates:,Coverage,Sensitivity:,Predicted Positives,True Positives,False Positives,Recall,Do not memorize this!,BCB 444/544 F07 ISU Dobbs #28- Promoter Prediction,Evaluatio
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- BCB444544PPT
