The TAU Performance Technology for Complex Parallel Systems
The TAU Performance Technology for Complex Parallel Systems
(Performance Analysis Bring Your Own Code Workshop, NRL Washington D.C.)
Sameer Shende, Allen D. Malony, Robert Bell
University of Oregon
{sameer, malony, bertie}@cs.uoregon.edu

Outline
- Motivation
- Part I: Instrumentation
- Part II: Measurement
- Part III: Analysis Tools
- Conclusion

TAU Performance System Framework
- Tuning and Analysis Utilities
- Performance system framework for scalable parallel and distributed high-performance computing
- Targets a general complex system computation model
  - nodes / contexts / threads
  - Multi-level: system / software / parallelism
  - Measurement and analysis abstraction
- Integrated toolkit for performance instrumentation, measurement, analysis, and visualization
  - Portable, configurable performance profiling/tracing facility
  - Open software approach
- University of Oregon, LANL, FZJ Germany
- http://www.cs.uoregon.edu/research/paracomp/tau

TAU Performance System Architecture
(figure; surviving label: paraprof)

TAU Analysis
- Parallel profile analysis
  - pprof: parallel profiler with text-based display
  - paraprof: graphical, scalable, parallel profile analysis and display
- Trace analysis and visualization
  - Trace merging and clock adjustment (if necessary)
  - Trace format conversion (ALOG, SDDF, VTF, Paraver)
  - Trace visualization using Vampir (Pallas/Intel)

Pprof Output (ESMF CoupledFlowSolver)
- IBM AIX
- F95, C++, C, MPI
- Profile - Node - Context - Thread
- Events - code - MPI

Terminology Example
For routine "int main( )":
- Exclusive time: 100-20-50-20 = 10 secs
- Inclusive time: 100 secs
- Calls: 1 call
- Subrs (no. of child routines called): 3
- Inclusive time/call: 100 secs

    int main( )
    { /* takes 100 secs */
        f1(); /* takes 20 secs */
        f2(); /* takes 50 secs */
        f1(); /* takes 20 secs */
        /* other work */
    }
    /* Time can be replaced by counts */

Performance Analysis and Visualization
- Analysis of parallel profile and trace measurement
- Parallel profile analysis
  - ParaProf
  - Cube Profile Browser (UTK, FZJ)
  - Profile generation from trace data
- Performance data management framework (PerfDMF)
- Parallel trace analysis
  - Translation to VTF 3.0 and EPILOG
  - Integration with VNG (Technical University of Dresden)
- Online parallel analysis and visualization

TAU's ParaProf Framework Architecture
- Portable, extensible, and scalable tool for profile analysis
- Try to offer "best of breed" capabilities to analysts
- Build as profile analysis framework for extensibility

Profile Manager Window
- Structured AMR toolkit (SAMRAI++), LLNL

Paraprof: CoupledFlowApp (ESMF) on 4 Nodes

Paraprof Mean Profile (4 nodes)

Individual Node (0) Profile in Paraprof

MPI Routines

Text Profile Window

k-Level Callpath Implementation in TAU
- TAU maintains a performance event (routine) callstack
- Profiled routine (child) looks in
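
The arithmetic on the Terminology Example slide (exclusive time = inclusive time minus time spent in child routines) can be sketched in C as follows. This is a minimal illustration only: the `RoutineProfile` struct and the functions `exclusive_secs`, `inclusive_secs_per_call`, `slide_example`, and `print_profile` are hypothetical names invented here, not part of TAU or of pprof's output format. The numbers are the ones given on the slide.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical per-routine record for illustration; not TAU's internal layout. */
typedef struct {
    const char *name;
    double inclusive_secs; /* time in the routine plus everything it calls */
    double child_secs;     /* summed inclusive time of its direct child calls */
    int calls;             /* number of times the routine was called */
    int subrs;             /* number of child routine calls it made */
} RoutineProfile;

/* Exclusive time = inclusive time minus time spent in child routines. */
double exclusive_secs(const RoutineProfile *p)
{
    assert(p->inclusive_secs >= p->child_secs);
    return p->inclusive_secs - p->child_secs;
}

/* Inclusive time per call; returns 0 if the routine was never called. */
double inclusive_secs_per_call(const RoutineProfile *p)
{
    return p->calls > 0 ? p->inclusive_secs / p->calls : 0.0;
}

/* The slide's example: main() takes 100 secs and calls f1, f2, f1. */
RoutineProfile slide_example(void)
{
    RoutineProfile p = { "int main( )", 100.0, 20.0 + 50.0 + 20.0, 1, 3 };
    return p;
}

/* Print one row using the slide's metric names. */
void print_profile(const RoutineProfile *p)
{
    printf("%s: exclusive %.0f secs, inclusive %.0f secs, calls %d, "
           "subrs %d, inclusive/call %.0f secs\n",
           p->name, exclusive_secs(p), p->inclusive_secs,
           p->calls, p->subrs, inclusive_secs_per_call(p));
}
```

Calling `print_profile` on the value returned by `slide_example()` reports the slide's figures for `int main( )`. As the slide notes, time can be replaced by counts, so the same subtraction applies to any metric that is accumulated inclusively.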