Toward Loosely Coupled Programming on Petascale Systems.ppt
《Toward Loosely Coupled Programming on Petascale Systems.ppt》由会员分享,可在线阅读,更多相关《Toward Loosely Coupled Programming on Petascale Systems.ppt(35页珍藏版)》请在麦多课文档分享上搜索。
1、Toward Loosely Coupled Programming on Petascale Systems,Presenter: Sora Choe,Introduction37 Requirements812 Implementation1317 Microbenchmarks Performance1823 Loosely Coupled Applications.2431 DOCKS and MARS Conclusion and Future Work3234,Index,Emerging petascale computing systems incorporate high-s
2、peed, low-latency interconnects Designed to support tightly coupled parallel computations Most of applications running on these have a SPMD structure Implemented by using MPI for interprocess communication Goal: enable the use of petascale computing systems for task-parallel applications,Introductio
3、n,Problem Space,Many tasks that can be individually scheduled on many different computing resources across multiple administrative boundaries to achieve some larger application goal Emphasis on using much large numbers of computing resources over short periods of time to accomplish many computationa
4、l tasks Primary metrics are in seconds e.g. FLOPS, tasks/sec, MB/sec I/O rates,Many-Task Computing(MTC),MTC applications can be executed efficiently on todays supercomputers A set of problems that must be overcome to make loosely coupled programming practical on emerging petascale architecture Local
5、 resource manager scalability and granularity Efficient utilization of the raw hardware Shared file system contention Application scalability IBM Blue Gene/P supercomputer(also known as Intrepid) Processors = cores = CPUs,Hypothesis,The I/O subsystem of peta. systems offers unique capabilities neede
6、d by MTC applications The cost to manage and run on peta. systems like the BG/P is less than that of conventional clusters or Grids Large-scale systems inevitably have utilization issues Some apps are so demanding that only peta. systems have enough compute power to get results in a reasonable timef
7、rame, or to leverage new opportunities,Why Peta. Sys. For MTC Apps? ( 4 motivating factors),For large-scale and loosely coupled apps to efficiently execute on petascale systems, which are traditionally HPC systems Required mechanisms Multi-level scheduling Efficient task dispatch Extensive use of ca
8、ching to minimize shared infrastructure such as file systems and interconnects,Requirements,Essential because LRM(Cobalt) on BG/P works at a granularity of pset Pset: a group of 64 quad-core compute nodes and one I/O node Allocate compute resources from Cobalt at the pset granularity, and then make
9、these resources available to apps at a single processor core granularity Made possible through Falkon and its resource provisioning mechanism,Multi-Level Scheduling,Overhead of scheduling and starting resources Compute nodes are powered off when not in use and must be booted when allocated to a job
10、Since compute nodes dont have local disks, the boot-up process involves reading the lightweight IBM compute node kernel(Linux-based ZeptoOS kernel image, specifically) from a shared file system Multi-level scheduling reduces it to insignificant overhead over many jobs,Multi-Level Scheduling(cont.),S
11、treamlined task submission framework Falkons specialization leading higher performance LRMs for reservation, policy-based scheduling, accounting, etc. Client frameworks(workflow sys. or distributed scripting systems) for recovery, data staging, job dependency management, etc.2534 tasks/sec in a Linu
12、x cluster 3186 tasks/sec on the SiCortex3071 tasks/sec on the BG/PVS 0.522 jobs/sec on traditional LRMs like Condor or PBS,Efficient Task Dispatch,Compute nodes on BG/P have a shared file system(GPFS) and local file system implemented in RAM(ramdisk) For better app. scalability, Extensive caching of
13、 app. data using ramdisk LFS Minimizing the use of shared file systems Simple caching scheme is employed for Static data : app. Binaries, libraries, common input cached at all compute nodes Dynamic data : input data specific for a single data cached on one compute node,Extensive Use of Caching,Swift
14、 and Falkon Swift enables scientific workflows through a data-flow-based functional parallel programming model Falkon light-weight task execution dispatcher for optimized task throughput and efficiency Extensions to get Falkon to work on BG/P Static Resource Provisioning Alternative Implementations
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- TOWARDLOOSELYCOUPLEDPROGRAMMINGONPETASCALESYSTEMSPPT

链接地址:http://www.mydoc123.com/p-373426.html