Transient Fault Tolerance via Dynamic Process-Level .ppt
《Transient Fault Tolerance via Dynamic Process-Level .ppt》由会员分享,可在线阅读,更多相关《Transient Fault Tolerance via Dynamic Process-Level .ppt(12页珍藏版)》请在麦多课文档分享上搜索。
1、Transient Fault Tolerance via Dynamic Process-Level Redundancy,Alex Shye, Vijay Janapa Reddi, Tipp Moseley and Daniel A. ConnorsUniversity of Colorado at Boulder Department of Electrical and Computer Engineering DRACO Architecture Research GroupWorkshop on Binary Instrumentation and Applications San
2、 Jose, CA 10.22.2006,Outline,IntroductionBackground/TerminologySoftware-centric Fault DetectionProcess-Level RedundancyExperimental ResultsConclusion,Introduction,Process technology trends Single transistor error rate expected to stay close to constant Number of transistors is increasing exponential
3、ly with each generationTransient faults will be a problem for microprocessors!Hardware Approaches Specialized redundant hardware, redundant multi-threading Software Approaches Compiler solutions: instruction duplication, control flow checking Low-cost, flexible alternative but higher overheadGoal: L
4、everage available hardware parallelism in SMT and CMP machines to improve the performance of software transient fault tolerance,Background/Terminology,Types of transient faults (based upon outcome) Benign Faults Silent Data Corruption (SDC) Detected Unrecoverable Error (DUE) True DUE False DUESphere
5、 of Replication (SoR) Indicates the scope of fault detection and containment Input Replication Output Comparison,Software-centric Fault Detection,Most previous approaches are hardware-centric Even compiler approaches (e.g. EDDI, SWIFT) Software-centric able to leverage strengths of a software approa
6、ch Correctness is defined by software output Ability to see larger scope effect of a fault Ignore benign faults,Processor,Cache,Memory,Devices,Application,Libraries,Operating System,Hardware-centric Fault Detection,Software-centric Fault Detection,Software SoR,Hardware SoR,Process-Level Redundancy (
7、PLR),System Call Emulation Unit Creates redundant processes Barrier synchronize at all system calls Enforces SoR with input replication and output comparison Emulates system calls to guarantee determinism among all processes Detects and recovers from transient faults,App,Libs,App,Libs,App,Libs,SysCa
8、ll Emulation Unit,Operating System,Watchdog Alarm,Master Processonly processallowed to perform system I/O,Redundant Processesidentical address space,file descriptors, etc.not allowed to performsystem I/O,Watchdog Alarmoccasionally a processwill hangset at beginning of barriersynchronization to ensur
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- TRANSIENTFAULTTOLERANCEVIADYNAMICPROCESSLEVELPPT

链接地址:http://www.mydoc123.com/p-373452.html