1、ISO/IEC 15445:2000(E) ISO-HTML Page 1 of 23 file:/J:MiscellaneousISO%2FIEC+15445%3B2000+%28E%29%2DHTML+document.htm 8/1/00 ISO/IEC 15445:2000(E) First edition 2000-05-15Information technology Document description and processing languages HyperText Markup Language (HTML) Contents l Foreword l Introdu
2、ction l 1 Scope l 2 Conformance 2.1 Conforming documents 2.2 Validating systems 2.3 Conforming systems 2.4 Character set conformance l 3 Normative references l 4 Definitions l 5 Symbols and abbreviations l 6 Requirements l 7 Use of the referenced text l 8 General provisions 8.1 Byte order l 9 Invoca
3、tion 9.1 Document type declaration 9.2 Architectural support declaration 9.3 Comments in the DTD l 10 Availability of the DTD l 11 Style l 12 Comments in HTML l Annex A SGML declaration l Annex B Entities, element types and attributes l Bibliography ISO/IEC 2000 All rights reserved. Unless otherwise
4、 specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISOs member body in the country of the requester. ISO copyright off
5、ice ISO/IEC 15445:2000(E) ISO-HTML Page 2 of 23 file:/J:MiscellaneousISO%2FIEC+15445%3B2000+%28E%29%2DHTML+document.htm 8/1/00Case postale 56 - CH-1211 Geneva 20 Tel. + 41 22 749 01 11 Fax + 41 22 734 10 79 E-mail copyrightiso.ch Web www.iso.ch Printed in Switzerland Foreword ISO (the International
6、Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by th
7、e respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work. International Stan
8、dards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 3. In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1. Draft International Standards adopted by the joint technical committee are circulated to national b
9、odies for voting. Publication as an International Standard requires approval by at least 75 % of the national bodies casting a vote. Attention is drawn to the possibility that some of the elements of this International Standard may be the subject of patent rights. ISO and IEC shall not be held respo
10、nsible for identifying any or all such patent rights. International Standard ISO/IEC 15445 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology, Subcommittee SC 34, Document description languages. JTC 1/SC 34 has worked on this project in close cooperation with the World W
11、ide Web Consortium. This International Standard makes normative reference to the W3C Recommendation for HTML 4.0 amended by the H TML 4.0 Specification Errata. Annexes A and B form a normative part of this International Standard. Introduction The HyperText Markup Language (HTML) is an application of
12、 the International Standard ISO 8879 - Standard Generalized Markup Language (SGML). It provides a simple way of structuring hypertext documents and of placing references in one document which point to another. This International Standard is a refinement of the World Wide Web Consortiums (W3Cs) Recom
13、mendation for HTML 4.0: it provides further rules to condition and refine the use of the W3C Recommendation in a way which emphasizes the use of stable and mature features, and represents accepted SGML practice. Documents which conform to this ISO/IEC 15445:2000(E) ISO-HTML Page 3 of 23 file:/J:Misc
14、ellaneousISO%2FIEC+15445%3B2000+%28E%29%2DHTML+document.htm 8/1/00 International Standard also conform to the strict DTD provided by the W3C Recommendation for HTML 4.0. This International Standard makes a clear and important distinction between conforming systems and validating systems. A conformin
15、g system operates correctly when handling documents which conform to this International Standard, but is not required to operate correctly when the documents do not conform. A validating system is more powerful: it detects all SGML and HTML errors in a document. Frequently browsers are conforming sy
16、stems whereas authoring tools check for validity. This International Standard does not define error handling procedures. NOTE: International Standard ISO/IEC 15445 assumes that the reader is familiar with International Standard ISO 8879 - Standard Generalized Markup Language (SGML). Readers who are
17、not experts in SGML may wish to consult the Users Guide to ISO/IEC 15445. The Guide provides more accessible descriptions of the SGML techniques that are used, for example, in the document preparation process. 1 Scope The scope of this International Standard is a conforming application of ISO 8879,
18、SGML. This International Standard describes the way in which the HTML language specified by the following clauses in the W3C Recommendation for HTML 4.0 shall be used, and does so by identifying all the differences between the HTML language specified by the W3C Recommendation for HTML 4.0 and the HT
19、ML language defined by this International Standard: l Clause 2: Introduction to HTML 4.0. l Clause 5: HTML Document Representation. l Clause 6: Basic HTML data types, excluding subclauses 6.5, 6.14 and 6.16. l Clause 7: The global structure of an HTML document. l Clause 8: Language information and t
20、ext direction. l Clause 9: Text. l Clause 10: Lists, excluding subclause 10.4. l Clause 11: Tables. l Clause 12: Links. l Clause 13: Objects, Images and Applets, excluding subclauses 13.4 and 13.7. l Clause 14: Style Sheets, excluding subclause 14.2.2. l Subclause 15.2.1: Font style element types, e
21、xcluding the ; W3C 15.2.1, W3C 15.2.1, W3C 15.2.1, W3C 15.2.1 and W3C 15.2.1 element types. l Subclause 15.3: Rules: the HR element type. l Clause 17: Forms. l Clause 24: Character entity references in HTML 4.0. l Clause “References“: Normative references and informative references. The scope exclud
22、es any material in the W3C Recommendation for HTML 4.0 not listed in this clause. ISO/IEC 15445:2000(E) ISO-HTML Page 4 of 23 file:/J:MiscellaneousISO%2FIEC+15445%3B2000+%28E%29%2DHTML+document.htm 8/1/00 2 Conformance This International Standard distinguishes between conforming documents, validatin
23、g systems, conforming systems and character set conformance. 2.1 Conforming documents A document which conforms to this International Standard shall a. Be a conforming HTML document consisting of a required document type declaration, followed by a single document instance, contained within an W3C 7.
24、3 document element. The document type declaration may be surrounded by white space consisting of RS, RE, SPACE, TAB and HTML comments. The document instance may also be followed by such white space. b. Meet the requirements of this International Standard. 2.2 Validating systems An HTML system is a v
25、alidating HTML system if a. It is a validating SGML parser as defined by ISO 8879 subclause 15.4; and b. It is able to process any conforming HTML document; and c. It finds and reports an HTML error if one exits; and d. It does not report an HTML error where none exists. 2.2.1 Documentation of valid
26、ating systems Validating systems conforming to this International Standard shall display the following identification text prominently and in the national language of the documentation: a. In a prominent location in the front matter of publications (normally the title and cover pages), b. On identif
27、ying displays of programs, c. In promotional and training material. The HTML validating system identification text is: An HTML validating system conforming to International Standard ISO/IEC 15445HyperText Markup Language, and International Standard ISO 8879 Standard Generalized Markup Language (SGML
28、). NOTE: The validating system identification text is copyrighted by the ISO/IEC, but may be used without further permission or further reference to the ISO/IEC. 2.3 Conforming systems A conforming HTML system is an HTML system which is able to process all documents ISO/IEC 15445:2000(E) ISO-HTML Pa
29、ge 5 of 23 file:/J:MiscellaneousISO%2FIEC+15445%3B2000+%28E%29%2DHTML+document.htm 8/1/00 conforming to this standard. 2.3.1 Documentation of conforming HTML systems Conforming systems shall display the following identification text prominently and in the national language of the documentation: a. I
30、n a prominent location in the front matter of publications (normally the title and cover pages), b. On identifying displays of programs, c. In promotional and training material. The HTML conforming system identification text is: An HTML system conforming to International Standard ISO/IEC 15445 Hyper
31、Text Markup Language. The documentation shall not claim or imply that the system may be used to validate HTML documents. 2.4 Character set conformance The SGML declaration provided with this International Standard calls for the use of ISO/IEC 10646 Universal Multiple-Octet Coded Character Set (UCS).
32、 ISO/IEC 10646 specifies a large number of facilities from which different selections may be made to suit individual applications. As a result it is only practicable to envisage limited conformance to ISO/IEC 10646 as defined in this subclause. Under limited conformance, the following is required: a
33、. When the characters described by ISO/IEC 10646 are used, they shall be implemented with the meanings and coded representation specified in ISO/IEC 10646. b. If a server is unable to express a document using the character set supported by the user agent, it should instead deliver a document in a li
34、mited character set and explain the problem to the user agent. c. Code positions that are either reserved for registration or reserved for future standardization shall not be used. d. No registered escape sequence shall be used with a meaning other than that defined by ISO/IEC 10646. The UTF-1 trans
35、formation format of ISO/IEC 10646, registered by IANA as ISO-10646- UTF-1, has been removed from ISO/IEC 10646 and should not be used. 3 Normative references The following normative documents contain provisions which, through reference in this text, constitute provisions of this International Standa
36、rd. For dated references, subsequent amendments to, or revisions of, any of these publications do not apply. However, parties to ISO/IEC 15445:2000(E) ISO-HTML Page 6 of 23 file:/J:MiscellaneousISO%2FIEC+15445%3B2000+%28E%29%2DHTML+document.htm 8/1/00 agreements based on this International Standard
37、are encouraged to investigate the possibility of applying the most recent editions of the normative documents indicated below. For undated references, the latest edition of the normative document referred to applies. Members of ISO and IEC maintain registers of currently valid International Standard
38、s. This International Standard refers normatively to: l ISO 8879, Information processing Text and office systems Standard Generalized Markup Language (SGML) l ISO/IEC 10744, Hypermedia/Time-based Structuring Language (HyTime) NOTE 1: ISO/IEC 10744 HyTime provides the techniques required to allow HTM
39、L to be used as a base architecture for other SGML applications.l Those clauses and subclauses of the W3C Recommendation for HTML 4.0 after amendment by the H TML 4.0 Specification Errata which are listed in the scope clause of this International Standard. NOTE 2: The W3C Recommendation consists of
40、the current version along with any amendments required to correct the errata (known errors in the specification). Although some of the errors may have been corrected in the current version, the Recommendation to which this International Standard refers corrects all known errors.NOTE 3: The W3C Recom
41、mendation contains further normative references and defines their application. 4 Definitions For the purposes of this International Standard, the definitions given in ISO 8879:1986 and the following definitions apply: 4.1 Browser A user agent whose main function is to present documents to a user. 4.
42、2 Character (Source: RFC1866) An atom of information, for example a letter or a digit. Graphic characters have associated glyphs, whereas control characters have associated processing semantics. 4.3 Character encoding scheme (Source: RFC1866) A function whose domain is the set of sequences of octets
43、, and whose range is the set of sequences of characters from a character repertoire; that is, a sequence of octets and a character encoding scheme determining a sequence of characters. 4.4 Character repertoire (Source: RFC1866) A finite set of characters; eg. the range of a coded character set. 4.5
44、Code position (Source: RFC1866) An integer in the domain of a coded character set. The coded character set maps the code position to a character. 4.6 Coded character set ISO/IEC 15445:2000(E) ISO-HTML Page 7 of 23 file:/J:MiscellaneousISO%2FIEC+15445%3B2000+%28E%29%2DHTML+document.htm 8/1/00 (Source
45、: RFC1866) A function whose domain is a subset of the integers and whose range is a character repertoire; that is, for some set of integers (usually of the form 0, 1, 2, ., N-1), a coded character set and an integer in that set determine a character. Conversely, a character and a coded character set
46、 determine the characters code position (or, in rare cases, a few code positions). 4.7 CRLF (Source: RFC2046) The sequence of the two ISO/IEC 646:1991 characters CR (13) and LF (10) which, taken together, in this order, denote a line break. 4.8 Form data set (Source: RFC1866) A sequence of name/valu
47、e pairs; the names given by an HTML document and the values given by the user. 4.9 Fragment identifier (Source: RFC1866) The portion of an HREF attribute value following the # character. 4.10 HTML browser A browser which presents HTML documents. 4.11 HTML document A document structured in accordance
48、 with this International Standard. 4.12 Hyperlink A relationship between two anchors, called the source and the target. The link goes from the source to the target. The source is also known as the tail, and the target is also known as the destination or head. 4.13 User Agent (in the World Wide Web)
49、A software or hardware device which accepts user input and presents to the user the World Wide Webs interpretation of that input. 5 Symbols and abbreviated terms The following symbols and abbreviated terms are used in this International Standard: 5.1 HTML HyperText Markup Language. Pertaining to this standard. 5.2 HTML 4.0 A Recommendation for the HyperText Markup Language developed by the World Wide Web Consortium. 5.3 HTTP IETF RFC2068 Hypertext Transfer Protocol. 5.4 IANA Internet Assigned Numbers Authority. 5.