Alignment methods.ppt
《Alignment methods.ppt》由会员分享,可在线阅读,更多相关《Alignment methods.ppt(32页珍藏版)》请在麦多课文档分享上搜索。
1、Alignment methods,Introduction to global and local sequence alignment methods Global : Needleman-Wunch Local : Smith-Waterman Database Search BLAST FASTA,Why search sequence databases?,I have just sequenced something. What is known about the thing I sequenced? I have a unique sequence. Is there simi
2、larity to another gene that has a known function? I found a new protein in a lower organism. Is it similar to a protein from another species?,Perfect Searches,First “hit” should be an exact match. Next “hits” should contain all of the genes that are related to your gene (homologs) Next “hits” should
3、 be similar but are not homologs,How does one achieve the “perfect search”?,Comparison Matrices (PAM vs. BLOSUM) Database Search Algorithms Databases Search Parameters Expect Value-change threshold for score reporting Translation-of DNA sequence into protein Filtering-remove repeat sequences,Alignme
4、nt Algorithms,Global : Needleman-Wunch Local : Smith-Watermann These two dynamic programming alignment algorithm are guaranteed to give OPTIMAL alignments But O(m*n) quadraticSkip to Scoring Matrixes,Alignment Methods,Learning objectives-Understand the principles behind the Needleman-Wunsch method o
5、f alignment. Understand how software operates to optimally align two sequences,Needleman-Wunsch Method (1970),Output: An alignment of two sequences is represented by three lines The first line shows the first sequence The third line shows the second sequence. The second line has a row of symbols. Th
6、e symbol is a vertical bar wherever characters in the two sequences match, and a space where ever they do not. Dots may be inserted in either sequence to represent gaps.,Needleman-Wunsch Method (cont. 1),For example, the two hypothetical sequencesabcdefghajklmabbdhijkcould be aligned like this abcde
7、fghajklm| | | | abbd.hijk As shown, there are 6 matches, 2 mismatches, and one gap of length 3.,Needleman-Wunsch Method (cont. 2),The alignment is scored according to a payoff matrix $payoff = match = $match,mismatch = $mismatch,gap_open = $gap_open,gap_extend = $gap_extend ;For correct operation, m
8、atch must be positive, and the other entries must be negative.,Needleman-Wunsch Method (cont. 3),Example Given the payoff matrix $payoff = match = 4,mismatch = -3,gap_open = -2,gap_extend = -1 ;,Needleman-Wunsch Method (cont. 4),The sequences abcdefghajklmabbdhijk are aligned and scored like this a
9、b c d e f g h a j k l m| | | | | | a b b d . . . h i j kmatch 4 4 4 4 4 4 mismatch -3 -3gap_open -2gap_extend -1-1-1 for a total score of 24-6-2-3 = 13.,Needleman-Wunsch Method (cont. 5),The algorithm guarantees that no other alignment of these two sequences has a higher score under this payoff matr
10、ix.,Needleman-Wunsch Method (cont. 6) Dynamic Programming,Potential difficulty. How does one come up with the optimal alignment in the first place? We now introduce the concept of dynamic programming (DP).DP can be applied to a large search space that can be structured into a succession of stages su
11、ch that:1) the initial stage contains trivial solutions to sub-problems2) each partial solution in a later stage can be calculated by recurring on only a fixed number of partial solutions in an earlier stage.3) the final stage contains the overall solution.,Three steps in Dynamic Programming,1. Init
12、ialization2 Matrix fill or scoring3. Traceback and alignment,Two sequences will be aligned.GAATTCAGTTA (sequence #1) GGATCGA (sequence #2)A simple scoring scheme will be usedSi,j = 1 if the residue at position I of sequence #1 is the same as the residue at position j of the sequence #2 (called match
13、 score)Si,j = 0 for mismatch scorew = gap penalty,Initialization step: Create Matrix with M + 1 columns and N + 1 rows. First row and column filled with 0.,Matrix fill step: Each position Mi,j is defined to be the MAXIMUM score at position i,j Mi,j = MAXIMUM Mi-1, j-1 + si,j (match or mismatch in th
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- ALIGNMENTMETHODSPPT
