End-to-end Data-flow Parallelism for Throughput Optimization .ppt
《End-to-end Data-flow Parallelism for Throughput Optimization .ppt》由会员分享,可在线阅读,更多相关《End-to-end Data-flow Parallelism for Throughput Optimization .ppt(24页珍藏版)》请在麦多课文档分享上搜索。
1、End-to-end Data-flow Parallelism for Throughput Optimization in High-speed Networks,Esma Yildirim Data Intensive Distributed Computing Laboratory University at Buffalo (SUNY) Condor Week 2011,Motivation,Data grows larger hence the need for speed to transfer it Technology develops with the introducti
2、on of high-speed networks and complex computer architectures which are not fully utilized yet Still many questions are out in the uncertainty,I can not receive the speed I am supposed to get from the network,I have a 10G high-speed network and supercomputers connecting. Why do I still get under 1G t
3、hroughput?,I cant wait for a new protocol to replace the current ones, why cant I get high throughput with what I have at hand?,OK, may be I am asking too much but I want to get optimal settings to achieve maximal throughput,I want to get high throughput without congesting the traffic too much. How
4、can I do it in the application level?,2,Introduction,Users of data-intensive applications need intelligent services and schedulers that will provide models and strategies to optimize their data transfer jobs Goals: Maximize throughput Minimize model overhead Do not cause contention among users Use m
5、inimum number of end-system resources,3,Introduction,Current optical technology supports 100 G transport hence, the utilization of network brings a challenge to the middleware to provide faster data transfer speeds Achieving multiple Gbps throughput have become a burden over TCP-based networks Paral
6、lel streams can solve the problem of network utilization inefficiency of TCP Finding the optimal number of streams is a challenging task With faster networks end-systems have become the major source of bottleneck CPU, NIC and Disk Bottleneck We provide models to decide on the optimal number of paral
7、lelism and CPU/disk stripes,4,Outline,Stork Overview End-system Bottlenecks End-to-end Data-flow Parallelism Optimization Algorithm Conclusions and Future Work,5,Stork Data Scheduler,Implements state-of-the art models and algorithms for data scheduling and optimization Started as part of the Condor
8、project as PhD thesis of Dr. Tevfik Kosar Currently developed at University at Buffalo and funded by NSF Heavily uses some Condor libraries such as ClassAds and DaemonCore,6,Stork Data Scheduler (cont.),Stork v.2.0 is available with enhanced features http:/www.storkproject.org Supports more than 20
9、platforms (mostly Linux flavors) Windows and Azure Cloud support planned soon The most recent enhancement: Throughput Estimation and Optimization Service,7,End-to-end Data Transfer,Method to improve the end-to-end data transfer throughput Application-level Data Flow Parallelism Network level paralle
10、lism (parallel streams) Disk/CPU level parallelism (stripes),8,Network Bottleneck,Step1: Effect of Parallel Streams on Disk-to-disk Transfers Parallel streams can improve the data throughput but only to a certain extent Disk speed presents a major limitation. Parallel streams may have an adverse eff
11、ect if the disk speed upper limit is already reached,9,Disk Bottleneck,Step2: Effect of Parallel Streams on Memory-to-memory Transfers and CPU Utilization Once disk bottleneck is eliminated, parallel streams improve the throughput dramatically Throughput either becomes stable or falls down after rea
12、ching its peak due to network or end-system limitations. Ex:The network interface card limit(10G) could not be reached (e.g.7.5Gbps-internode),10,CPU Bottleneck,Step3: Effect of Striping and Removal of CPU Bottleneck Striped transfers improves the throughput dramatically Network card limit is reache
13、d for inter-node transfers(9Gbps),11,Prediction of Optimal Parallel Stream Number,Throughput formulation : Newtons Iteration Model a , b and c are three unknowns to be solved hence 3 throughput measurements of different parallelism level (n) are needed Sampling strategy: Exponentially increasing par
14、allelism levels Choose points not close to each other Select points that are power of 2: 1, 2, 4, 8, , 2k Stop when the throughput starts to decrease or increase very slowly comparing to the previous level Selection of 3 data points From the available sampling points For every 3-point combination, c
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- ENDTOENDDATAFLOWPARALLELISMFORTHROUGHPUTOPTIMIZATIONPPT

链接地址:http://www.mydoc123.com/p-374411.html