KS X 6030:2010
Extensible Markup Language (XML)
2010-12-28
http://www.kats.go.kr

Standards search: http://www.standard.go.kr
Established: 2003-12-06
Revised: 2010-12-28
Notice No. 2010-0643
Inquiries: 02-509-7262, http://www.kats.go.kr
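Contents

Foreword
Introduction
1 Scope
1.1 Origin and goals
1.2 Terminology
2 Normative references
3 Documents
3.1 Well-formed XML documents
3.2 Characters
3.3 Common syntactic constructs
3.4 Character data and markup
3.5 Comments
3.6 Processing instructions
3.7 CDATA sections
3.8 Prolog and document type declaration
3.9 Standalone document declaration
3.10 White space handling
3.11 End-of-line handling
3.12 Language identification
4 Logical structures
4.1 Start-tags, end-tags, and empty-element tags
4.2 Element type declarations
4.3 Attribute-list declarations
5 Physical structures
5.1 Character and entity references
5.2 Entity declarations
5.3 Parsed entities
5.4 XML processor treatment of entities and references
5.5 Construction of entity replacement text
5.6 Predefined entities
5.7 Notation declarations
5.8 Document entity
6 Conformance
6.1 Validating and non-validating processors
6.2 Using XML processors
7 Notation
Annex A
Annex B XML and SGML
Annex C Expansion of entity and character references
Annex D Deterministic content models
Annex E Autodetection of character encodings
E.1 Detection without external encoding information
E.2 Priorities in the presence of external encoding information
Annex F W3C XML Working Group
Annex G W3C XML Core Working Group
Annex H Production notes
Annex I Suggestions for XML Names

Foreword

This standard is based on Extensible Markup Language (XML) 1.1, published by the World Wide Web Consortium (W3C) in 2006.

KS X 6030:2010 has the following annexes:
Annex A
Annex B XML and SGML
Annex C Expansion of entity and character references
Annex D Deterministic content models
Annex E Autodetection of character encodings
Annex F W3C XML Working Group
Annex G W3C XML Core Working Group
Annex H Production notes
Annex I Suggestions for XML Names

Introduction

The Extensible Markup Language (XML) is a subset of SGML that is completely described in this standard. Its goal is to enable generic SGML to be served, received, and processed on the WWW in the way that is now possible with HTML. XML has been designed for ease of implementation and for interoperability with both SGML and HTML.

A list of current W3C publications and the latest revision of the W3C technical report can be found in the W3C technical reports index (http://www.w3.org/TR/).

This standard specifies a syntax created by subsetting an existing, widely used international text-processing standard (Standard generalized markup language, ISO 8879:1986(E), as amended and corrected) for use on the WWW. Translations of the W3C XML specification are listed at http://www.w3c.org/2003/03/Translations/byTechnology?technology=xml.

The W3C document is a W3C Recommendation. The fifth edition is not a new version of XML. It incorporates the changes dictated by the accumulated errata (http://www.w3c.org/XML/xml-V10-4e-errata) to the fourth edition (Fourth Edition of XML 1.0, dated 16 August 2006). In particular, erratum E09 relaxes the restrictions on element and attribute names, providing in XML 1.0 the major benefit previously achievable only with XML 1.1. As a consequence, many documents that were not well-formed under earlier editions are now well-formed, and previously invalid documents that use the newly allowed name characters (for example, in ID attributes) are now valid.

The errata list for the fifth edition is available at http://www.w3.org/XML/xml-V10-5e-errata. An implementation report is available at http://www.w3.org/XML/2008/01/xml10-5e-implementation.html.

Extensible Markup Language (XML) 1.1

1 Scope

XML (Extensible Markup Language) describes a class of data objects called XML documents and partially describes the behavior of computer programs that process them. XML is an application profile, or restricted form, of SGML, the Standard Generalized Markup Language (KS X ISO 8879). By construction, XML documents are conforming SGML documents.

XML documents are made up of storage units called entities, which contain either parsed or unparsed data. Parsed data is made up of characters, some of which form character data, and some of which form markup. Markup encodes a description of the document's storage layout and logical structure. XML provides a mechanism to impose constraints on the storage layout and logical structure.

Definition: A software module called an XML processor is used to read XML documents and provide access to their content and structure.
Definition: It is assumed that an XML processor is doing its work on behalf of another module, called the application.
This standard describes the required behavior of an XML processor in terms of how it must read XML data and the information it must provide to the application.

1.1 Origin and goals

XML was developed by an XML Working Group (originally known as the SGML Editorial Review Board) formed under the auspices of the World Wide Web Consortium (W3C) in 1996. It was chaired by Jon Bosak, with the active participation of an XML SIG (Special Interest Group), previously known as the SGML Working Group, also organized by the W3C. The membership of the XML Working Group is given in Annex G. Dan Connolly served as the Working Group's contact with the W3C.

The design goals for XML are:
a) XML shall be straightforwardly usable over the Internet.
b) XML shall support a wide variety of applications.
c) XML shall be compatible with SGML.
d) It shall be easy to write programs that process XML documents.
e) The number of optional features in XML is to be kept to the absolute minimum, ideally zero.
f) XML documents should be human-legible and reasonably clear.
g) The XML design should be prepared quickly.
h) The design of XML shall be formal and concise.
i) XML documents shall be easy to create.
j) Terseness in XML markup is of minimal importance.

This standard, together with associated standards (Unicode and ISO/IEC 10646 for characters, IETF BCP 47 and IANA LANGCODES for language identification tags), provides all the information necessary to understand XML version 1.0 and to construct computer programs that process it.

1.2 Terminology

The terminology used to describe XML documents is defined in the body of this standard. The key words MUST, MUST NOT, REQUIRED, SHALL, SHALL NOT, SHOULD, SHOULD NOT, RECOMMENDED, MAY, and OPTIONAL are to be interpreted as described in IETF RFC 2119. In addition, the terms defined below are used in building those definitions and in describing the actions of an XML processor.

1.2.1 error
A violation of the rules of this standard; results are undefined. Unless otherwise specified, failure to observe a prescription indicated by one of the key words MUST, REQUIRED, MUST NOT, SHALL, and SHALL NOT is an error. Conforming software MAY detect and report an error and MAY recover from it.

1.2.2 fatal error
An error that a conforming XML processor MUST detect and report to the application. After encountering a fatal error, the processor MAY continue processing the data to search for further errors and MAY report such errors to the application. To support correction of errors, the processor MAY make unprocessed data from the document (with intermingled character data and markup) available to the application. Once a fatal error is detected, however, the processor MUST NOT continue normal processing (that is, it MUST NOT continue to pass character data and information about the document's logical structure to the application in the normal way).

1.2.3 at user option
Conforming software MAY or MUST (depending on the modal verb in the sentence) behave as described; if it does, it MUST provide users a means to enable or disable the behavior described.

1.2.4 validity constraint
A rule that applies to all valid XML documents. Violations of validity constraints are errors; they MUST, at user option, be reported by validating XML processors.

1.2.5 well-formedness constraint
A rule that applies to all well-formed XML documents. Violations of well-formedness constraints are fatal errors.

1.2.6 match
(Of strings or names:) The two strings or names being compared are identical. Characters with multiple possible representations in ISO/IEC 10646 (for example, characters with both precomposed and base-plus-diacritic forms) match only if they have the same representation in both strings. No case folding is performed.
(Of strings and rules in the grammar:) A string matches a grammatical production if it belongs to the language generated by that production.
(Of content and content models:) An element matches its declaration when it conforms in the fashion described in the constraint "VC: Element Valid".

1.2.7 for compatibility
Marks a sentence describing a feature of XML included solely to ensure that XML remains compatible with SGML.

1.2.8 for interoperability
Marks a sentence describing a non-binding recommendation included to increase the chances that XML documents can be processed by the existing installed base of SGML processors (those that predate the amendment of KS X ISO 8879 SGML for XML).

2 Normative references

IANA CHARSETS, Internet Assigned Numbers Authority. Official Names for Character Sets, ed. Keld Simonsen et al. (ftp://ftp.isi.edu/in-notes/iana/assignments/character-sets)

IETF RFC 2119, IETF (Internet Engineering Task Force). RFC 2119: Key words for use in RFCs to Indicate Requirement Levels. Scott Bradner, 1997. (http://www.ietf.org/rfc/rfc2119.txt)

IETF BCP 47, IETF (Internet Engineering Task Force). BCP 47, consisting of RFC 4646: Tags for Identifying Languages, and RFC 4647: Matching of Language Tags. A. Phillips, M. Davis. 2006. (ftp://ftp.isi.edu/in-notes/bcp/bcp47.txt)

IETF RFC 3986, IETF (Internet Engineering Task Force). RFC 3986: Uniform Resource Identifier (URI): Generic Syntax. T. Berners-Lee, R. Fielding, L. Masinter. 2005. (http://www.ietf.org/rfc/rfc3986.txt)

ISO/IEC 10646, ISO (International Organization for Standardization). ISO/IEC 10646-1:2000. Information technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 1: Architecture and Basic Multilingual Plane, and ISO/IEC 10646-2:2001. Information technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 2: Supplementary Planes, as, from time to time, amended, replaced by a new edition or expanded by the addition of new parts. Geneva: International Organization for Standardization. (http://www.iso.org/iso/home.htm)

ISO/IEC 10646:2000, ISO (International Organization for Standardization). ISO/IEC 10646-1:2000. Information technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 1: Architecture and Basic Multilingual Plane. Geneva: International Organization for Standardization, 2000.

Unicode, The Unicode Consortium. The Unicode Standard, Version 5.0.0, defined by: The Unicode Standard, Version 5.0 (Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0).

UnicodeNormal, The Unicode Consortium. Unicode normalization forms. Mark Davis and Martin Durst. 2008. (http://unicode.org/reports/tr15/)

3 Documents

Definition: A data object is an XML document if it is well-formed, as defined in this standard. In addition, the XML document is valid if it meets certain further constraints.

Each XML document has both a logical and a physical structure. Physically, the document is composed of units called entities. An entity may refer to other entities to cause their inclusion in the document. A document begins in a "root" or document entity. Logically, the document is composed of declarations, elements, comments, character references, and processing instructions, all of which are indicated in the document by explicit markup. The logical and physical structures MUST nest properly, as described in 5.3.2 "Well-formed parsed entities".

3.1 Well-formed XML documents

Definition: A textual object is a well-formed XML document if:
a) Taken as a whole, it matches the production labeled document.
b) It meets all the well-formedness constraints given in this standard.
c) Each of the parsed entities referenced directly or indirectly within the document is well-formed.

[1] document ::= prolog element Misc*

Matching the document production implies that:
a) It contains one or more elements.
b) Definition: There is exactly one element, called the root or document element, no part of which appears in the content of any other element. For all other elements, if the start-tag is in the content of another element, the end-tag is in the content of the same element. More simply stated, the elements, delimited by start-tags and end-tags, nest properly within each other.

Definition: As a consequence, for each non-root element C in the document, there is one other element P in the document such that C is in the content of P but is not in the content of any other element that is in the content of P. P is referred to as the parent of C, and C as a child of P.

3.2 Characters

Definition: A parsed entity contains text, a sequence of characters, which may represent markup or character data.

Definition: A character is an atomic unit of text as specified by ISO/IEC 10646:2000 (ISO/IEC 10646). Legal characters are tab, carriage return, line feed, and the legal characters of Unicode and ISO/IEC 10646. XML processors MUST accept any character in the range specified for Char.

[2] Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    /* any Unicode character, excluding the surrogate blocks, FFFE, and FFFF */

The mechanism for encoding character code points into bit patterns may vary from entity to entity. All XML processors MUST accept the UTF-8 and UTF-16 encodings of Unicode. The mechanisms for signaling which of the two is in use, or for bringing other encodings into play, are discussed in 5.3.3 "Character encoding in entities".

Note: Document authors are encouraged to avoid "compatibility characters", as defined in section 2.3 of Unicode. The characters in the following ranges are also discouraged. They are either control characters or permanently undefined Unicode characters:

[#x7F-#x84], [#x86-#x9F], [#xFDD0-#xFDEF],
[#x1FFFE-#x1FFFF], [#x2FFFE-#x2FFFF], [#x3FFFE-#x3FFFF],
[#x4FFFE-#x4FFFF], [#x5FFFE-#x5FFFF], [#x6FFFE-#x6FFFF],
[#x7FFFE-#x7FFFF], [#x8FFFE-#x8FFFF], [#x9FFFE-#x9FFFF],
[#xAFFFE-#xAFFFF], [#xBFFFE-#xBFFFF], [#xCFFFE-#xCFFFF],
[#xDFFFE-#xDFFFF], [#xEFFFE-#xEFFFF], [#xFFFFE-#xFFFFF],
[#x10FFFE-#x10FFFF].

3.3 Common syntactic constructs

S (white space) consists of one or more space (#x20) characters, carriage returns, line feeds, or tabs.

[3] S ::= (#x20 | #x9 | #xD | #xA)+

Note: The presence of #xD in the above production is maintained purely for backward compatibility. As explained in 3.11 "End-of-line handling", all #xD characters literally present in an XML document are either removed or replaced by #xA characters before any other processing is done. The only way to get a #xD character to match this production is to use a character reference in an entity value literal.

An Nmtoken (name token) is any mixture of name characters.

Definition: A Name is an Nmtoken with a restricted set of initial characters. Disallowed initial characters for Names include digits, diacritics, the full stop, and the hyphen.

Names beginning with the string "xml", or with any string that would match (('X'|'x') ('M'|'m') ('L'|'l')), are reserved for standardization in this or future versions of this standard.

Note: The Namespaces in XML Recommendation (XML Names) assigns a meaning to names containing colon characters. Authors should therefore not use the colon in XML names except for namespace purposes, but XML processors must accept the colon as a name character.

Note that COLON, HYPHEN-MINUS, FULL STOP (period), LOW LINE (underscore), and MIDDLE DOT are explicitly permitted.

The ASCII symbols and punctuation marks are excluded from names because they are more useful as delimiters in contexts where XML names are used outside XML documents. The character #x037E, GREEK QUESTION MARK, is excluded because when normalized it becomes a semicolon (;), which could change the meaning of entity references.

[4]  NameStartChar ::= ":" | [A-Z] | "_" | [a-z] | [#xC0-#xD6] | [#xD8-#xF6] | [#xF8-#x2FF] | [#x370-#x37D] | [#x37F-#x1FFF] | [#x200C-#x200D] | [#x2070-#x218F] | [#x2C00-#x2FEF] | [#x3001-#xD7FF] | [#xF900-#xFDCF] | [#xFDF0-#xFFFD] | [#x10000-#xEFFFF]
[4a] NameChar ::= NameStartChar | "-" | "." | [0-9] | #xB7 | [#x0300-#x036F] | [#x203F-#x2040]
[5]  Name ::= NameStartChar (NameChar)*
[6]  Names ::= Name (#x20 Name)*
[7]  Nmtoken ::= (NameChar)+
[8]  Nmtokens ::= Nmtoken (#x20 Nmtoken)*

Note: The Names and Nmtokens productions are used to define the validity of tokenized attribute values after normalization (see 4.3.1).

Literal data is any quoted string not containing the quotation mark used as a delimiter for that string. Literals are used for specifying the content of internal entities (EntityValue), the values of attributes (AttValue), and external identifiers (SystemLiteral). Note that a SystemLiteral can be parsed without scanning for markup.

[9]  EntityValue ::= '"' ([^%&"] | PEReference | Reference)* '"' | "'" ([^%&'] | PEReference | Reference)* "'"

Attribute specification                 a is NMTOKENS                  a is CDATA
a="&d;&d;A&a;&#x20;&a;B&da;"            A #x20 B                       #x20 #x20 A #x20 #x20 #x20 B #x20 #x20
a="&#xd;&#xd;A&#xa;&#xa;B&#xd;&#xa;"    #xD #xD A #xA #xA B #xD #xA    #xD #xD A #xA #xA B #xD #xA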
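
The Char, S, NameStartChar, NameChar, Name, Names, Nmtoken and Nmtokens productions given earlier map almost one-to-one onto regular-expression character classes. The Python sketch below is one possible transcription, offered as an illustration rather than as part of the standard. The function names (is_name, is_names, is_nmtoken, is_nmtokens, is_white_space, all_legal_chars, is_reserved_name) are hypothetical helpers chosen for this example, and the code assumes Python 3 strings, which index by code point.

```python
import re

# Character-class bodies transcribed from the NameStartChar and NameChar
# productions; each range mirrors one alternative in the grammar.
NAME_START_CHAR = (
    ":A-Z_a-z"
    "\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u02FF"
    "\u0370-\u037D\u037F-\u1FFF"
    "\u200C-\u200D\u2070-\u218F\u2C00-\u2FEF"
    "\u3001-\uD7FF\uF900-\uFDCF\uFDF0-\uFFFD"
    "\U00010000-\U000EFFFF"
)
NAME_CHAR = NAME_START_CHAR + r"\-.0-9" + "\u00B7\u0300-\u036F\u203F-\u2040"

NAME = f"[{NAME_START_CHAR}][{NAME_CHAR}]*"   # Name production
NMTOKEN = f"[{NAME_CHAR}]+"                   # Nmtoken production

NAME_RE = re.compile(NAME)
NAMES_RE = re.compile(f"{NAME}(?: {NAME})*")            # single #x20 separators
NMTOKEN_RE = re.compile(NMTOKEN)
NMTOKENS_RE = re.compile(f"{NMTOKEN}(?: {NMTOKEN})*")   # single #x20 separators

# Char production: tab, LF, CR, and the three permitted code point ranges.
CHAR_RE = re.compile(
    "[\t\n\r\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]*"
)
# S production: one or more of space, tab, CR, LF.
S_RE = re.compile("[\u0020\t\r\n]+")


def is_name(text: str) -> bool:
    """True if text matches the Name production."""
    return NAME_RE.fullmatch(text) is not None


def is_names(text: str) -> bool:
    """True if text matches the Names production."""
    return NAMES_RE.fullmatch(text) is not None


def is_nmtoken(text: str) -> bool:
    """True if text matches the Nmtoken production."""
    return NMTOKEN_RE.fullmatch(text) is not None


def is_nmtokens(text: str) -> bool:
    """True if text matches the Nmtokens production."""
    return NMTOKENS_RE.fullmatch(text) is not None


def is_white_space(text: str) -> bool:
    """True if text matches the S production."""
    return S_RE.fullmatch(text) is not None


def all_legal_chars(text: str) -> bool:
    """True if every code point in text matches the Char production."""
    return CHAR_RE.fullmatch(text) is not None


def is_reserved_name(text: str) -> bool:
    """True if a Name starts with 'xml' in any letter case (reserved)."""
    return is_name(text) and text[:3].lower() == "xml"
```

For example, is_name("1abc") is False because a digit is not a NameStartChar, while is_nmtoken("1abc") is True because Nmtoken places no restriction on the first character. Building the classes as plain strings keeps each range visibly aligned with the grammar, at the cost of recompiling nothing at call time; a hand-written range table would work equally well.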