ISO/TR 14873:2013 Information and documentation - Statistics and quality issues for web archiving
TECHNICAL REPORT ISO/TR 14873
First edition 2013-12-01

Information and documentation - Statistics and quality issues for web archiving
Information et documentation - Statistiques et indicateurs de qualité pour l'archivage du web

Reference number ISO/TR 14873:2013(E)

COPYRIGHT PROTECTED DOCUMENT
© ISO 2013
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below or ISO's member body in the country of the requester.

ISO copyright office
Case postale 56, CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland

Contents

Foreword
Introduction
1 Scope
2 Terms and definitions
3 Methods and purposes of Web archiving
3.1 Collecting methods
3.2 Access and description methods
3.3 Preservation methods
3.4 Legal basis for Web archiving
3.5 Additional reasons for Web archiving
4 Statistics
4.1 General
4.2 Statistics for collection development
4.3 Collection characterization
4.4 Collection usage
4.5 Web archive preservation
4.6 Measuring the costs of Web archiving
5 Quality indicators
5.1 General
5.2 Limitations
5.3 Description
6 Usage and benefits
6.1 General
6.2 Intended usage and readers
6.3 Benefits for user groups
6.4 Use of proposed statistics by user groups
6.5 Web archiving process with related performance indicators
Bibliography

Foreword

ISO (the International Organization for Standardization) is a worldwide federation of national standards bodies (ISO member bodies). The work of preparing International Standards is normally carried out through ISO technical committees. Each member body interested in
a subject for which a technical committee has been established has the right to be represented on that committee. International organizations, governmental and non-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization.

The procedures used to develop this document and those intended for its further maintenance are described in the ISO/IEC Directives, Part 1. In particular the different approval criteria needed for the different types of ISO documents should be noted. This document was drafted in accordance with the editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).

Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of any patent rights identified during the development of the document will be in the Introduction and/or on the ISO list of patent declarations received (see www.iso.org/patents).

Any trade name used in this document is information given for the convenience of users and does not constitute an endorsement.

For an explanation on the meaning of ISO specific terms and expressions related to conformity assessment, as well as information about ISO's adherence to the WTO principles in the Technical Barriers to Trade (TBT) see the following URL: Foreword - Supplementary information

The committee responsible for this document is ISO/TC 46, Information and documentation, Subcommittee SC 8, Quality - Statistics and performance evaluation.

Introduction

This Technical Report was developed in response
to a worldwide demand for guidelines on the management and evaluation of Web archiving activities and products.

Web archiving refers to the activities of selecting, capturing, storing, preserving and managing access to snapshots of Internet resources over time. It started at the end of the 1990s, based on the vision that an archive of Internet resources would become a vital record for research, commerce and government in the future. Internet resources are regarded as part of the cultural heritage and therefore preserved like printed heritage publications. Many institutions involved in Web archiving see this as an extension of their long standing mission of preserving their national heritage, and this is endorsed and enabled in many countries by legislative frameworks such as legal deposit.

There is a wide range of resources available on the Internet, including text, image, film, sound and other multimedia formats. In addition to interlinked Web pages, there are newsgroups, newsletters, blogs and interactive services such as games, made available using various transfer and communication protocols.

Web archives bring together copies of Internet resources, collected automatically by harvesting software, usually at regular intervals. The intention is to replay the resources including the inherent relations, for example by means of hypertext links, as much as possible as they were in their original environment. The primary goal of Web archiving is to preserve a record of the Web in perpetuity, as closely as possible to its original form, for various academic, professional and private purposes.

Web archiving is a recent but expanding activity which continuously requires new approaches and tools in order to stay in sync with rapidly evolving Web technology. Determined by the strategic importance perceived by the archiving institution, means available and sometimes legal requirements, diverse approaches have been taken to archive Internet resources, ranging from capturing individual Web pages to entire top-level domains. From an organisational perspective, Web archiving is also at different levels of maturity. While it has become a business as usual activity in some organisations, others have just initiated experimental programmes to explore the challenge.

Depending on the scale and purpose of collection, a distinction can be made between two broad categories of Web archiving strategy: bulk harvesting and selective harvesting. Large scale bulk harvesting, such as national domain harvesting, is intended to capture a snapshot of an entire domain (or a subset of it). Selective harvesting is performed on a much smaller scale, is more focused and undertaken more frequently, often based on criteria such as theme, event, format (e.g. audio or video files) or agreement with content owners. A key difference between the two strategies lies in the level of quality control, the evaluation of harvested Websites to determine whether pre-defined quality standards are being attained. The scale of domain harvesting makes it impossible to carry out any manual visual comparison between the harvested and the live version of the resource, which is a common quality assurance method in selective harvesting.

This Technical Report aims to demonstrate how Web archives, as