Cancer Classification with Data-dependent Kernels.ppt
《Cancer Classification with Data-dependent Kernels.ppt》由会员分享,可在线阅读,更多相关《Cancer Classification with Data-dependent Kernels.ppt(35页珍藏版)》请在麦多课文档分享上搜索。
1、2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,1,Cancer Classification with Data-dependent Kernels,Anne Ya Zhang (with Xue-wen Chen & Huilin Xiong) EECS & ITTC University of Kansas,2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,2,Outline,Intr
2、oduction Data-dependent Kernel Results Conclusion,2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,3,Cancer facts,Cancer is a group of many related diseases Cells continue to grow and divide and do not die when they should. Changes in the genes that control normal cell gro
3、wth and death. Cancer is the second leading cause of death in the United States Cancer causes 1 of every 4 deaths NIH estimate overall costs for cancer in 2004 at $189.8 billion ($64.9 billion for direct medical cost) Cancer types Breast cancer, Lung cancer, Colon cancer, Death rates vary greatly by
4、 cancer type and stage at diagnosis,2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,4,Motivation,Why do we need to classify cancers? The general way of treating cancer is to: Categorize the cancers in different classes Use specific treatment for each of the classes Tradit
5、ional way to classify cancers Morphological appearanceNot accurate! Enzyme-based histochemical analyses. Immunophenotyping. Cytogenetic analysis.Complicated & needs highly specialized laboratories,2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,5,Motivation,Why traditiona
6、l ways are not enough ? There exists some tumors in the same class with completely different clinical courses May be more accurate classification is needed Assigning new tumors to known cancer classes is not easy e.g. assigning an acute leukemia tumor to one of the AML (acute myeloid leukemia) ALL (
7、acute lymphoblastic leukemia),2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,6,DNA Microarray-based Cancer Diagnosis,Cancer is caused by changes in the genes that control normal cell growth and death. Molecular diagnostics offer the promise of precise, objective, and sys
8、tematic cancer classification These tests are not widely applied because characteristic molecular markers for most solid tumors have to be identified. Recently, microarray tumor gene expression profiles have been used for cancer diagnosis.,2018/10/10,DIMACS Workshop on Machine Learning Techniques in
9、 Bioinformatics,7,Microarray,A microarray experiment monitors the expression levels for thousands of genes simultaneously. Microarray techniques will lead to a more complete understanding of the molecular variations among tumors, hence to a more reliable classification.,2018/10/10,DIMACS Workshop on
10、 Machine Learning Techniques in Bioinformatics,8,Microarray,Microarray analysis allows the monitoring of the activities of thousands of genes over many different conditions. From a machine learning point of view,The large volume of the data requires the computational aid in analyzing the expression
11、data.,2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,9,Machine learning tasks in cancer classification,There are three main types of machine learning problems associated with cancer classification: The identification of new cancer classes using gene expression profiles T
12、he classification of cancer into known classes The identifications of “marker” genes that characterize the different cancer classes In this presentation, we focus on the second type of problems.,2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,10,Project Goals,To develop a
13、 more systematic machine learning approach to cancer classification using microarray gene expression profiles.Use an initial collection of samples belonging to the known classes of cancer to create a “class predictor” for new, unknown, samples.,2018/10/10,DIMACS Workshop on Machine Learning Techniqu
14、es in Bioinformatics,11,Challenges in cancer classification,Gene expression data are typically characterized by high dimensionality (i.e. a large number of genes) small sample sizeCurse of dimensionality!,Methods Kernel techniques Data resampling Gene selection,AML,2018/10/10,DIMACS Workshop on Mach
15、ine Learning Techniques in Bioinformatics,12,Outline,Introduction Data-dependent Kernel Results Conclusion,2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,13,Data-dependent kernel model,Optimizing the data-dependent kernel is to choose the coefficient vector,Data dependen
16、t,2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,14,Optimizing the kernel,Criterion for kernel optimizationMaximum class separability of the training data in the kernel-induced feature space,2018/10/10,DIMACS Workshop on Machine Learning Techniques in Bioinformatics,15,T
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- CANCERCLASSIFICATIONWITHDATADEPENDENTKERNELSPPT

链接地址:http://www.mydoc123.com/p-379287.html