ECMA 114-2000 8-Bit Single-Byte Coded Graphic Character Sets Latin Arabic Alphabet (2nd Edition).pdf
Standard ECMA-114
2nd Edition - December 2000

Standardizing Information and Communication Systems
Phone: +41 22 849.60.00 - Fax: +41 22 849.60.01 - URL: http://www.ecma.ch - Internet: helpdesk@ecma.ch

8-Bit Single-Byte Coded Graphic Character Sets
Latin/Arabic Alphabet

Brief History

The adoption of Standard ECMA-6 (ISO 646) in 1965 as the agreed international 7-bit code for information interchange has led to the development of many national, international and application-oriented versions of this code which have been in wide use for quite some time.

These versions had a number of limitations generally inherent to the size of the code:
- they did not provide all graphic characters which may be needed,
- for some characters, specially for accented letters, it was necessary to resort to BACKSPACE sequences, which created problems when processing data containing such composite characters,
- interchange among different versions was practically limited to the 82 common graphic characters.

With the advent of 8-bit coding it was possible to increase the number of graphic characters. ISO 6937/2, for example, provided a character set covering the requirements of most languages based on the Latin alphabet. This character set, although well suited for text communication, was difficult to use for processing as some graphic characters were represented by one and others by two bit combinations. Thus, the need was recognized for coded graphic character sets, each of which:
- is the same for all users of a given area,
- provides single-byte coding of all graphic characters thus permitting easy processing,
- takes into account character sets used in the industry.

Since 1982 the urgency of the need for an 8-bit single-byte coded character set was recognized in ECMA as well as in ANSI/X3L2 and numerous working papers were exchanged between the two groups. In February 1984 ECMA TC1 submitted to ISO/TC97/SC2 (which has become ISO/IEC JTC 1/SC2 in 1987) a proposal for such a coded character set. At its meeting of April 1984 SC2 decided to propose a new item of work for this topic. Technical discussions during and after this meeting led TC1 to adopt the coding scheme proposed by X3L2. International Standard ISO/IEC 8859-1 is based on this joint ANSI/ECMA proposal. ECMA published its corresponding Standard ECMA-94 in March 1985.

After this first publication, the work of ECMA TC1 on further coded graphic character sets has led to
the following results:

i. The present Standard ECMA-114 for a Latin/Arabic coded graphic character set. In developing this ECMA Standard TC1 closely co-operated with the relevant groups and committees of ASMO, the Arab Organization for Standardization and Metrology, of ATU, the Arab Telecommunication Union, and of different Arabic countries. This 2nd Edition has been developed to keep it fully aligned with the new edition of ISO/IEC 8859-6.

ii. The second edition of Standard ECMA-94 comprising four coded graphic character sets for the Latin script, identified as Latin Alphabets No. 1 to No. 4. These alphabets have a number of characters in common, in particular those allocated to columns 02 to 07. These four Latin Alphabets have been submitted to ISO/IEC and JTC 1 and have become Parts 1 to 4 of ISO/IEC 8859.

iii. A series of ECMA Standards for coded graphic character sets comprising those characters of the Latin Alphabets allocated to columns 02 to 07 and characters of another script for multiple-language applications. These ECMA Standards cover the Cyrillic, Greek and Hebrew scripts. These ECMA Standards ECMA-113, ECMA-118 and ECMA-121, resp., have become Parts 5, 7 and 8, resp., of ISO/IEC 8859.

iv. Latin Alphabets No. 5 and No. 6 have been published as ECMA-128 and ECMA-144, resp. They have become Parts 9 and 10, resp., of ISO/IEC 8859.

This ECMA Standard has been adopted as 2nd Edition of Standard ECMA-114 by the ECMA General Assembly of December 2000.

Table of contents

1 Scope
2 Conformance
2.1 Conformance of information interchange
2.2 Conformance of devices
2.2.1 Device description
2.2.2 Originating devices
2.2.3 Receiving devices
3 References
4 Definitions
4.1 bit combination
4.2 byte
4.3 character
4.4 code table
4.5 coded character set; code
4.6 coded-character-data-element (CC-data-element)
4.7 graphic character
4.8 graphic symbol
4.9 position
5 Notation, code table and names
5.1 Notation
5.2 Layout of the code table
5.3 Names and meanings
5.3.1 SPACE (SP)
5.3.2 NO-BREAK SPACE (NBSP)
5.3.3 SOFT HYPHEN (SHY)
6 Specification of the coded character set
6.1 Characters of the set and their coded representation
6.2 Code table
7 Identification of the character set
7.1 Identification according to ECMA-35 and ECMA-43
7.2 Identification using the ISO International register of coded character sets to be used with escape sequences
Annex A - Coverage of languages
Annex B - Main differences between the first edition and this second edition of ECMA-114
Annex C - Bibliography
Annex D - Identification according to ISO/IEC 8824-1 (ASN.1)

1 Scope

This ECMA Standard specifies a set of 146 coded graphic characters identified as the Latin/Arabic alphabet.

This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages:

Arabic, English and Latin.

Some of the characters in this set are combining characters (see clause 6).

This set of coded graphic characters may be regarded as a version of an 8-bit code according to Standard ECMA-35 or Standard ECMA-43 at level 1.

This ECMA Standard may not be used with any other ECMA Standards for 8-bit single-byte coded graphic character sets. If coded characters from more than one ECMA Standard are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of Standard ECMA-43 at level 2 or level 3.

The coded characters in this set may be used in conjunction with coded control functions selected from ECMA-48. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6).

NOTE
This ECMA Standard is not intended for use with Telematic services defined by ITU-T. If information coded according to this ECMA Standard is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.

2 Conformance

2.1 Conformance of information interchange

A coded-character-data-element (CC-data-element) within coded information for interchange is in conformance with this ECMA Standard if all the coded representations of graphic characters within that CC-data-element conform to the requirements of clause 6.

2.2 Conformance of devices

A device is in conformance with this ECMA Standard if it conf
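
The figure of 146 coded graphic characters given in clause 1, and the interchange rule of clause 2.1, can be explored with a short Python sketch. It assumes that Python's built-in "iso8859_6" codec reflects the allocation of clause 6, which is not reproduced in this excerpt; the Brief History states only that this edition is aligned with ISO/IEC 8859-6. The names graphic_bytes, GRAPHIC, UNALLOCATED and conforms are hypothetical helpers introduced for illustration, not terms of the Standard.

```python
# Sketch only: uses Python's "iso8859_6" codec as a stand-in for the
# clause 6 code table, which this excerpt does not include (assumption).

CODEC = "iso8859_6"

# Graphic areas of an 8-bit code: columns 02-07 (without 0x7F) and 10-15.
GRAPHIC_AREA = list(range(0x20, 0x7F)) + list(range(0xA0, 0x100))


def graphic_bytes():
    """Return the byte values in the graphic areas that the codec can decode."""
    found = []
    for value in GRAPHIC_AREA:
        try:
            bytes([value]).decode(CODEC)
        except UnicodeDecodeError:
            continue  # position is unallocated in the codec's table
        found.append(value)
    return found


GRAPHIC = frozenset(graphic_bytes())
UNALLOCATED = frozenset(GRAPHIC_AREA) - GRAPHIC


def conforms(data: bytes) -> bool:
    """Simplified reading of clause 2.1: no byte may sit on an unallocated
    graphic position. Bytes outside the graphic areas (control functions)
    are ignored rather than validated."""
    return not any(b in UNALLOCATED for b in data)


if __name__ == "__main__":
    print(len(GRAPHIC))  # number of allocated graphic positions
    sample = "\u0633\u0644\u0627\u0645".encode(CODEC)  # four Arabic letters
    print(len(sample), conforms(sample))  # one byte per character
    print(conforms(b"\xa1"))  # 0xA1 is unallocated in the codec's table
```

Because every graphic character occupies exactly one byte, the encoded sample has the same length as the source string, which is the processing property the Brief History contrasts with ISO 6937/2.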