BS ISO 28500-2017 Information and documentation WARC file format《信息和文献工作 WARC文件格式》.pdf
《BS ISO 28500-2017 Information and documentation WARC file format《信息和文献工作 WARC文件格式》.pdf》由会员分享,可在线阅读,更多相关《BS ISO 28500-2017 Information and documentation WARC file format《信息和文献工作 WARC文件格式》.pdf(36页珍藏版)》请在麦多课文档分享上搜索。
1、Information and documentation WARC file formatBS ISO 28500:2017BSI Standards PublicationWB11885_BSI_StandardCovs_2013_AW.indd 1 15/05/2013 15:06 ISO 2017Information and documentation WARC file formatInformation et documentation Format de fichier WARCINTERNATIONAL STANDARDISO28500Second edition2017-0
2、8Reference numberISO 28500:2017(E)National forewordThis British Standard is the UK implementation of ISO 28500:2017. It supersedes BS ISO 28500:2009, which is withdrawn.The UK participation in its preparation was entrusted to Technical Committee IDT/2/7, Computer applications in Information and Docu
3、mentation.A list of organizations represented on this committee can be obtained on request to its secretary.This publication does not purport to include all the necessary provisions of a contract. Users are responsible for its correct application. The British Standards Institution 2017 Published by
4、BSI Standards Limited 2017ISBN 978 0 580 95168 8ICS 35.240.30Compliance with a British Standard cannot confer immunity from legal obligations.This British Standard was published under the authority of the Standards Policy and Strategy Committee on 30 September 2017.Amendments/corrigenda issued since
5、 publicationDate Text affectedBRITISH STANDARDBS ISO 28500:2017 ISO 2017Information and documentation WARC file formatInformation et documentation Format de fichier WARCINTERNATIONAL STANDARDISO28500Second edition2017-08Reference numberISO 28500:2017(E)BS ISO 28500:2017ISO 28500:2017(E)ii ISO 2017 A
6、ll rights reservedCOPYRIGHT PROTECTED DOCUMENT ISO 2017, Published in SwitzerlandAll rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on the inter
7、net or an intranet, without prior written permission. Permission can be requested from either ISO at the address below or ISOs member body in the country of the requester.ISO copyright officeCh. de Blandonnet 8 CP 401CH-1214 Vernier, Geneva, SwitzerlandTel. +41 22 749 01 11Fax +41 22 749 09 47copyri
8、ghtiso.orgwww.iso.orgBS ISO 28500:2017ISO 28500:2017(E)ii ISO 2017 All rights reservedCOPYRIGHT PROTECTED DOCUMENT ISO 2017, Published in SwitzerlandAll rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form or by any means, elect
9、ronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below or ISOs member body in the country of the requester.ISO copyright officeCh. de Blandonnet 8 CP 401CH-1214 Vernier
10、, Geneva, SwitzerlandTel. +41 22 749 01 11Fax +41 22 749 09 47copyrightiso.orgwww.iso.orgISO 28500:2017(E)Foreword vIntroduction vi1 Scope . 12 Normative references 13 Terms, definitions and abbreviated terms 24 File and record model . 35 Named fields . 55.1 General . 55.2 WARC-Record-ID (mandatory)
11、 . 55.3 Content-Length (mandatory) . 55.4 WARC-Date (mandatory) . 65.5 WARC-Type (mandatory) . 65.6 Content-Type . 65.7 WARC-Concurrent-To . 75.8 WARC-Block-Digest . 75.9 WARC-Payload-Digest . 75.10 WARC-IP-Address . 85.11 WARC-Refers-To . 85.12 WARC-Refers-To-Target-URI 85.13 WARC-Refers-To-Date 85
12、.14 WARC-Target-URI 95.15 WARC-Truncated 95.16 WARC-Warcinfo-ID 95.17 WARC-Filename 95.18 WARC-Profile . 105.19 WARC-Identified-Payload-Type . 105.20 WARC-Segment-Number . 105.21 WARC-Segment-Origin-ID 105.22 WARC-Segment-Total-Length 106 WARC record types 116.1 General 116.2 warcinfo . 116.3 respon
13、se 116.3.1 General. 116.3.2 http and https schemes 126.3.3 Other URI schemes 126.4 resource . 126.4.1 General. 126.4.2 http and https schemes 126.4.3 ftp scheme 126.4.4 dns scheme 136.4.5 Other URI schemes 136.5 request 136.5.1 General. 136.5.2 http and https schemes 136.5.3 Other URI schemes 136.6
14、metadata . 136.7 revisit . 146.7.1 General. 146.7.2 Profile: Identical Payload Digest . 146.7.3 Profile: Server Not Modified . 156.7.4 Other profiles .15 ISO 2017 All rights reserved iiiContents PageBS ISO 28500:2017ISO 28500:2017(E)6.8 conversion . 156.9 continuation . 167 Record segmentation 168 W
15、ARC file name, size and compression 16Annex A (informative) Use cases for writing WARC records .18Annex B (informative) Examples of WARC records .21Annex C (informative) WARC file size and name recommendations 24Annex D (informative) Compression recommendations 25Bibliography .26iv ISO 2017 All righ
16、ts reservedBS ISO 28500:2017ISO 28500:2017(E)6.8 conversion . 156.9 continuation . 167 Record segmentation 168 WARC file name, size and compression 16Annex A (informative) Use cases for writing WARC records .18Annex B (informative) Examples of WARC records .21Annex C (informative) WARC file size and
17、 name recommendations 24Annex D (informative) Compression recommendations 25Bibliography .26iv ISO 2017 All rights reserved ISO 28500:2017(E)ForewordISO (the International Organization for Standardization) is a worldwide federation of national standards bodies (ISO member bodies). The work of prepar
18、ing International Standards is normally carried out through ISO technical committees. Each member body interested in a subject for which a technical committee has been established has the right to be represented on that committee. International organizations, governmental and non-governmental, in li
19、aison with ISO, also take part in the work. ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization.The procedures used to develop this document and those intended for its further maintenance are described in the ISO/IEC Di
20、rectives, Part 1. In particular the different approval criteria needed for the different types of ISO documents should be noted. This document was drafted in accordance with the editorial rules of the ISO/IEC Directives, Part 2 (see www .iso .org/ directives).Attention is drawn to the possibility th
21、at some of the elements of this document may be the subject of patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of any patent rights identified during the development of the document will be in the Introduction and/or on the ISO list of patent d
22、eclarations received (see www .iso .org/ patents).Any trade name used in this document is information given for the convenience of users and does not constitute an endorsement.For an explanation on the voluntary nature of standards, the meaning of ISO specific terms and expressions related to confor
23、mity assessment, as well as information about ISOs adherence to the World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT) see the following URL: www .iso .org/ iso/ foreword .html.This document was prepared by Technical Committee ISO/TC 46, Information and documentation,
24、 Subcommittee 4, Technical interoperability.This second edition cancels and replaces the first edition (ISO 28500:2009), which has been technically revised. ISO 2017 All rights reserved vBS ISO 28500:2017ISO 28500:2017(E)IntroductionWebsites and web pages emerge and disappear from the World Wide Web
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
10000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- BSISO285002017INFORMATIONANDDOCUMENTATIONWARCFILEFORMAT 信息 文献 工作 WARC 文件格式 PDF

链接地址:http://www.mydoc123.com/p-586892.html