Celia Russell, Stephen Pickles and Mike JonesCombining .ppt
《Celia Russell, Stephen Pickles and Mike JonesCombining .ppt》由会员分享,可在线阅读,更多相关《Celia Russell, Stephen Pickles and Mike JonesCombining .ppt(27页珍藏版)》请在麦多课文档分享上搜索。
1、Celia Russell, Stephen Pickles and Mike JonesCombining Data Workshop ESRC Research Methods Programme Manchester, December 18, 2002,SAMD,Seamless Access to Multiple DatasetsA ESRC/DTI e-Science demonstrator project http:/www.sve.man.ac.uk/Research/AtoZ/SAMD,Supercomputing, Visualization & eScience,2,
2、SAMD,Seamless Access to Multiple DatasetsA project to demonstrate the benefits of applying e-Science grid technologies to an ordinary social science query We solve a genuine problem from the UK academic social science community - a multivariate analysis using a complex mathematical algorithm Based o
3、n a major social science databank, the Office for National Statistics Time Series Data, hosted at MIMAS,Supercomputing, Visualization & eScience,3,The problem,Published as Sensier, M., Osborn D.R. and cal N. (2002) Asymmetric Interest Rate Effects for the UK Real Economy , Oxford Bulletin of Economi
4、cs and Statistics, Volume 64, September 2002, n4The research query looks at the effect interest rate changes had on Gross Domestic Product in the UK over the period 1960 2000,Supercomputing, Visualization & eScience,4,Interest Rates in the UK,Supercomputing, Visualization & eScience,5,UK GDP quarter
5、ly changes,Supercomputing, Visualization & eScience,6,The Model,Where y is the quarterly change in GDP and z is the quarterly change in interest rates,Supercomputing, Visualization & eScience,7,Before SAMD,Supercomputing, Visualization & eScience,8,e-Science Grid,Supercomputing, Visualization & eSci
6、ence,9,SAMD Methodology,We built a mini demonstrator grid for SAMD by: Grid-enabling the NS Time Series Databank Parallelising the code to represent the HPC facilities Using Grid protocols for data transfer Creating a graphical user interface that included a single sign-on It all worked, and cut the
7、 data collection and analysis time down to around 8 minutes.,Supercomputing, Visualization & eScience,10,Extending SAMD,The approach and methods of SAMD are applicable to more general social science applications involving data collection and analysis More efficient handling of datasets data is moved
8、 to where its needed, not just to web browser The single sign-on for all databanks means users can cross search datasets and perform cross analyses of multiple datasets from different providers Grants access to high performance computing facilities on the grid without the user having to learn how to
9、 use them Can automate routine enquiries Cuts the time taken to run computing intensive problems by a factor of around 100,Supercomputing, Visualization & eScience,11,Scaling up with the Grid,E-Science Grids allow the social scientist to scale up their quantitative research by: Including many more d
10、ata points in their analysis Developing more complex models incorporating more variables Dropping assumptions Visualising data Creating new communities and collaborations Exploring new types of analyses,SAMD Architecture,Supercomputing, Visualization & eScience,13,Motivation,Web-based access to soci
11、o-economic datasets such as Office of National Statistics Time series data has lead to greatly increased use, but:- No standard authentication or authorisation too many usernames and passwords to remember To automate search and retrieval, can only emulate navigation through “screen scraping“ breaks
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- CELIARUSSELL STEPHENPICKLESANDMIKEJONESCOMBININGPPT

链接地址:http://www.mydoc123.com/p-379406.html