ECMA 151-1991 Data Compression for Information Interchange - Adaptive Coding with Embedded《信息交换用数据压缩 带有嵌入式字典的自适应编码》.pdf
《ECMA 151-1991 Data Compression for Information Interchange - Adaptive Coding with Embedded《信息交换用数据压缩 带有嵌入式字典的自适应编码》.pdf》由会员分享,可在线阅读,更多相关《ECMA 151-1991 Data Compression for Information Interchange - Adaptive Coding with Embedded《信息交换用数据压缩 带有嵌入式字典的自适应编码》.pdf(20页珍藏版)》请在麦多课文档分享上搜索。
1、Standard ECMA-151June 1991Published in electronic form in September 1999Standardizing Information and Communication SystemsPhone: +41 22 849.60.00 - Fax: +41 22 849.60.01 - URL: http:/www.ecma.ch - Internet: helpdeskecma.chData Compression forInformation Interchange -Adaptive Coding with EmbeddedDic
2、tionary - DCLZ Algorithm.Standard ECMA-151June 1991Standardizing Information and Communication SystemsPhone: +41 22 849.60.00 - Fax: +41 22 849.60.01 - URL: http:/www.ecma.ch - Internet: helpdeskecma.chMB ECMA-151.DOC 06-09-99 14,04Data Compression forInformation Interchange -Adaptive Coding with Em
3、beddedDictionary - DCLZ Algorithm.Brief HistoryIn the past decades ECMA have published numerous ECMA Standards for magnetic tapes, magnetic tape cassettes andcartridges, as well as for optical disk cartridges. Those media developed recently have a very high physical recording density.In order to mak
4、e an optimal use of the resulting data capacity, compression algorithms have been designed which allow areduction of the number of bits required for the representation of user data in coded form.In future, these compression algorithms will be registered by an International Registration Authority to
5、be set up by ISO/IEC.The registration will consist in allocating to each registered algorithm a numerical identifier which will be recorded on themedium and, thus, indicate which compression algorithm(s) has been used.The present ECMA Standard is the first of a forthcoming series of ECMA Standards f
6、or compression algorithms. It has beencontributed to ISO/IEC for adoption as an International Standard under the fast-track procedure.This ECMA Standard has been adopted by the ECMA General Assembly of June 1991.- i -Table of contents1Scope 12 Conformance 13 Reference 14 Conventions 15 Algorithm Ide
7、ntifier 16 DCLZ Compression Algorithm 16.1 Overview 16.2 Principle of operation 16.2.1 Compilation of the dictionary 16.2.2 Frozen dictionary 26.2.3 Resetting the dictionary to the empty state 26.2.4 Boundaries 26.2.5 Re-creation of the dictionary 26.3 Code Values 26.3.1 Control Codes 36.3.2 Encoded
8、 Bytes 36.3.3 Dictionary Codes 36.4 Codewords 37 Bibliography 4Appendix A - Example of a Generic DCLZ Algorithm 5Appendix B - Example of Code Values output for a given Input Stream 9- ii -.1ScopeThis ECMA Standard specifies a lossless compression algorithm to reduce the number of bits required torep
9、resent information coded by means of 8-bit bytes. This algorithm is known as DCLZ (Data Compressionaccording to Lempel and Ziv).This ECMA Standard specifies neither the strategy for resetting the dictionary nor that for freezing it, asthese are implementation-dependent.This algorithm is particularly
10、 useful when information has to be recorded on an interchangeable medium. Itsuse is not limited to this application.2 ConformanceA compression algorithm shall be in conformance with this Standard if its output data stream satisfies therequirements of clause 6.3 ReferenceInternational Register of Pro
11、cessing Algorithms (to be established).4 ConventionsNumbers in this Standard are expressed in decimal notation.5 Algorithm IdentifierThe numeric identifier of this algorithm in the International Register is 32.6 DCLZ Compression Algorithm6.1 OverviewThe DCLZ compression algorithm shall accept inform
12、ation input, in the form of a stream of 8-bit databytes, and shall output Codewords, in the form of a stream of bits which are organised into 8-bit bytes.The algorithm shall identify repetition of byte strings in the input stream and shall exclude suchredundancy from the output stream.With many type
13、s of information generated, transmitted or recorded by electronic information processingsystems and equipment, the degree of repetition in data is sufficiently high to permit the output stream tocontain significantly fewer bits than the input stream. Under degenerate circumstances, however, theoutpu
14、t stream may contain more bits than the input stream. The actual ratio of the numbers of bits isdependent on the characteristics of the actual input data stream.Compression by this algorithm is lossless, i.e. it is possible to restore exactly the original representation ofdata by means of a compleme
15、ntary decompression algorithm.The algorithm contains features which aid its implementation in data storage and retrieval equipmentwhich handles, in a sequential manner, data records of varying length.6.2 Principle of operationThe fundamental principle of operation is the compilation of a dictionary
16、of strings of bytes which occurin the input stream, the use of that dictionary to detect repetition, and the generation of a Codeword foreach repeated string. The Codeword expresses a Code Value which is the reference to the dictionary entryfor the repeated string.6.2.1 Compilation of the dictionary
17、The algorithm shall examine the input stream and shall search for the first occurrence of a unique pairor a unique string. A unique pair is a 2-byte string which has not yet been allocated a dictionary entry.A unique string of n bytes (n 2) is one which has not yet been allocated a dictionary entry;
18、 however,- 2 -the first n-1 bytes shall have been already allocated a dictionary entry. The maximum length of a stringfor which a dictionary entry can be allocated shall be 128 bytes.Upon encountering a unique pair, the algorithm shall output a Codeword which expresses the CodeValue for the first by
19、te of the pair. Upon encountering a unique string of n bytes, the algorithm shalloutput a Codeword which expresses the Code Value for the first n-1 bytes of the string.It shall then enter the unique pair or unique string into the dictionary and assign the next unused CodeValue to the entry, provided
20、 that the dictionary is not frozen (see 6.2.2) and that n does not exceed 128.Starting with the 2nd byte of the current unique pair or the last byte of the current unique string, thealgorithm shall then continue to examine the input stream and search for the next unique pair or uniquestring.6.2.2 Fr
21、ozen dictionaryThe dictionary shall be considered frozen in the following cases:- all available Code Values have been assigned,- the implementation of the algorithm has decided not to enter a unique pair or a unique string into thedictionary, for example because the search for free space in the dict
22、ionary takes too much time.In the frozen state no further dictionary entries shall be made. The only means by which the dictionarymay be removed from the frozen state is by being reset to the empty state (see also 6.3.1.1).6.2.3 Resetting the dictionary to the empty statePrior to the commencement of
23、 operation of the algorithm, the dictionary shall be reset to the empty state(see also 6.3.1.2).The algorithm is also permitted to reset the dictionary to this empty state at any time, provided that allbytes which have been input to the algorithm have been expressed by Codewords.The algorithm may, f
24、or example, choose to reset the dictionary if the current degree of compression isnot adequate because the current dictionary entries do not reflect the current repetition characteristics ofthe input stream to a sufficient extent.6.2.4 BoundariesWithin the input stream, natural boundaries may exist
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
10000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- ECMA1511991DATACOMPRESSIONFORINFORMATIONINTERCHANGEADAPTIVECODINGWITHEMBEDDED 信息 交换 数据压缩 带有 嵌入式 字典 自适应

链接地址:http://www.mydoc123.com/p-704626.html