1、 Copyright 2011 by THE SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS 3 Barker Avenue, White Plains, NY 10601 (914) 761-1100 Approved October 3, 2011 Table of Contents Page Foreword . 3 Intellectual Property 3 Introduction 3 1 Scope . 4 2 Conformance Notation . 4 3 Normative References . 4 4 Ove
2、rview (Informative) 5 5 Notation 5 5.1 Abbreviations . 5 5.2 Definition of Terminology . 6 5.3 Mnemonics . 7 6 MPEG-2 Transport Stream Encoding 7 6.1 Signaling of VC-4 Elementary Streams . 7 6.2 Encapsulation of VC-4 Elementary Streams 136.3 T-STD Buffer Model for VC-4 Streams 187 MPEG-2 Program Str
3、eam Encoding 237.1 Signaling of VC-4 Elementary Streams . 237.2 Encapsulation of VC-4 Elementary Streams 247.3 P-STD Buffer Model for VC-4 Streams 24Annex A Graphics of Syntax for this SMPTE Recommeded Practice (Informative) 29 A.1 Introduction 29 A.2 PES Packet 29 A.3 TS Program Map Section . 30A.4
4、 Program Stream Map . 31Annex B Bibliography (Informative) . 32Page 1 of 32 pages SMPTE RP 2058-3:2011 SMPTE RECOMMENDED PRACTICE VC-4 Bitstream Transport Encodings SMPTE RP 2058-3:2011 Page 2 of 32 pages Table of Figures Figure 1 Transport Stream System Target Decoder for VC-4 streams . 19Figure 2
5、Program Stream System Target Decoder for VC-4 streams 25Figure A.1 PES packet syntax diagram for SMPTE elementary stream . 29Figure A.2 TS Program Map Section diagram for VC-4 elementary stream. 30 Figure A.3 Program Stream Map diagram for VC-4 elementary stream . 31Table of Tables Table 1 Syntax fo
6、r registration descriptor 8Table 2 Lists of values applicable to the subdescriptor_tag 8Table 3 Syntax for the profile and level sub-descriptor . 9Table 4 Syntax for the alignment sub-descriptor . 9Table 5 Alignment type for VC-4 elementary stream 10Table 6 Syntax for the buffer size sub-descriptor
7、10Table 7 Syntax for the scalable extension sub-descriptor . 11Table 8 Lists of values applicable to the frame_field_coding_mode/base_layer_frame_field_coding_mode . 12Table 9 Elements and notations for T-STD model 19Table 10 Elements and notations for P-STD model 26SMPTE RP 2058-3:2011 Page 3 of 32
8、 pages Foreword SMPTE (the Society of Motion Picture and Television Engineers) is an internationally-recognized standards developing organization. Headquartered and incorporated in the United States of America, SMPTE has members in over 80 countries on six continents. SMPTEs Engineering Documents, i
9、ncluding Standards, Recommended Practices, and Engineering Guidelines, are prepared by SMPTEs Technology Committees. Participation in these Committees is open to all with a bona fide interest in their work. SMPTE cooperates closely with other standards-developing organizations, including ISO, IEC an
10、d ITU. SMPTE Engineering Documents are drafted in accordance with the rules given in Part XIII of its Administrative Practices. SMPTE RP 2058-3 was prepared by Technology Committee 10E on Essence. Intellectual Property SMPTE draws attention to the fact that it is claimed that compliance with this Re
11、commended Practice may involve the use of one or more patents or other intellectual property rights (collectively, “IPR“). The Society takes no position concerning the evidence, validity, or scope of this IPR. Each holder of claimed IPR has assured the Society that it is willing to License all IPR i
12、t owns, and any third party IPR it has the right to sublicense, that is essential to the implementation of this Recommended Practice to those (Members and non-Members alike) desiring to implement this Recommended Practice under reasonable terms and conditions, demonstrably free of discrimination. Ea
13、ch holder of claimed IPR has filed a statement to such effect with SMPTE. Information may be obtained from the Director, Standards or that a certain course of action is preferred but not necessarily required; or that (in the negative form) a certain possibility or course of action is deprecated but
14、not prohibited. The keywords “may“ and “need not“ indicate courses of action permissible within the limits of the document. The keyword “reserved” indicates a provision that is not defined at this time, shall not be used, and may be defined in the future. The keyword “forbidden” indicates “reserved”
15、 and in addition indicates that the provision will never be defined in the future. Unless otherwise specified, the order of precedence of the types of normative information in this document shall be as follows: Normative prose shall be the authoritative definition; Tables shall be next; followed by
16、formal languages; then figures; and then any other language forms. 3 Normative References Note: All references in this document to other SMPTE documents use the current numbering style (e.g. SMPTE ST 421:2006) although, during a transitional phase, the document as published (printed or PDF) may bear
17、 an older designation (such as SMPTE 421M-2006). Documents with the same root number (e.g. 421) and publication year (e.g. 2006) are functionally identical. SMPTE RP 2058-3:2011 Page 5 of 32 pages The following standards contain provisions which, through reference in this text, constitute provisions
18、 of this recommended practice. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this recommended practice are encouraged to investigate the possibility of applying the most recent edition of the standards indicat
19、ed below. SMPTE ST 421:2006, Television VC-1 Compressed Video Bitstream Format and Decoding Process Amendment 1:2007 to SMPTE ST 421:2006 Amendment 2:2011 to SMPTE ST 421:2006 SMPTE ST 2058-1:2011, VC- 4 Layered Video Extension Bitstream Format and Decoding Process SMPTE RP 227-2010, VC-1 Bitstream
20、Transport Encodings ISO/IEC 13818-1:2007, Information Technology Generic Coding of Moving Pictures and Associated Audio Information: Systems (3rdEdition) ISO/IEC 13818-1:2007/Amd3:2009, Transport of Scalable Video over Recommendation ITU-T H.222.0|ISO/IEC 13818-1 4 Overview (Informative) This docume
21、nt depends on ISO/IEC 13818-1:2007 and ISO/IEC 13818-1:2007/Amd3:2009 and defines extensions that enable the encapsulation and signaling of SMPTE Layered Video Extension (VC-4) elementary streams. This results in implementations compatible with ISO/IEC 13818-1:2007 and ISO/IEC 13818-1:2007/Amd3:2009
22、. The bitstream specified by the SMPTE Layered Video Extension (VC-4) carries residual signals that when integrated with the video content from the base layer can provide enhanced quality as well as enhanced fidelity in bit depth, resolution and color difference. Syntax elements specifying the contr
23、ol parameters for the picture reconstruction or integration process are also carried in the same bitstreams. The 8-bit 4:2:0 pictures can be pictures decoded, for example, from SMPTE ST 421 compliant bitstreams as well as content decoded from bitstreams specified by other video coding standards, suc
24、h as ITU-T H.262 (MPEG-2) or ITU-T H.264 (MPEG-4 AVC), etc. The 8-bit 4:2:0 pictures are referred to as base layer pictures in this document. The residual bitstreams can be decoded independently from the base layer picture decoding process. This document specifies how to map SMPTE Layered Video Exte
25、nsion (VC-4) elementary streams to MPEG-2 Transport Streams and how to map SMPTE Layered Video Extension (VC-4) elementary streams to MPEG-2 Program Streams. In addition, this document describes how to encapsulate SMPTE Layered Video Extension (VC-4) elementary streams to packetized elementary strea
26、ms (PESs). The definition and operation of the hypothetical buffer model for SMPTE Layered Video Extension (VC-4) elementary streams are specified as ISO/IEC 13818-1:2007 and ISO/IEC 13818-1:2007/Amd3:2009 for other video coding standards. The document employs the similar method to SMPTE RP 227 (VC-
27、1 Bitstream Transport Encodings) to map the SMPTE Layered Video Extension (VC-4) bitstream to MPEG-2 System protocols. The document also pursues the consistency with the previous SMPTE document. 5 Notation 5.1 Abbreviations This section provides a list of acronyms used in this document. Those are de
28、fined in ISO/IEC 13818-1. CRC Cyclic Redundancy Check DTS Decoding Time Stamp SMPTE RP 2058-3:2011 Page 6 of 32 pages ES Elementary Stream ESCR Elementary Stream Clock Reference PAT Program Association Table PCR Program Clock Reference PES Packetized Elementary Stream PID Packet Identifier PMT Progr
29、am Map Table PS Program Stream PSI Program Specific Information PSM Program Stream Map P-STD Program Stream System Target Decoder PTS Presentation Time Stamp PU Presentation Unit SCR System Clock Reference VC-4 Shortened Name of SMPTE Layered Video Extension TS Transport Stream T-STD Transport Strea
30、m System Target Decoder 5.2 Definition of Terminology 5.2.1 Data Access Unit Definition The following definition for an access unit shall be applicable for both MPEG-2 Transport Stream and MPEG-2 Program Stream. An access unit is defined to be a coded representation of a single picture in a VC-4 ele
31、mentary stream. An access unit shall include all coded data for a picture and any flushing bits that follow it, up to not including the start code of the next access unit. The start code of the next access unit shall be the byte of a sequence start code, frame start code or field start code. If a pi
32、cture is not preceded by a sequence header, the access unit shall start with the picture start code. Otherwise the access unit shall start with the first byte of sequence start code. 5.2.2 Base Layer A picture that is decoded from the base layer stream, specified by other video coding standards, suc
33、h as ITU-T H.262 (MPEG-2) or ITU-T H.264 (MPEG-4 AVC), etc, and is used in reconstructing a new picture when integrating with the residual picture decoded from the residual layer stream. The synchronization of the base layer picture and other residual layer streams is expected to be signaled in the
34、system layer. 5.2.3 Base Layer Stream A sequence of bits that forms the representation of coded picture at the base layer consists of one or more video sequences which are specified by other video coding standards, such as ITU-T H.262 (MPEG-2) or ITU-T H.264 (MPEG-4 AVC), etc. 5.2.4 Enhancement Laye
35、r A residual layer has at least one lower layer. The lowest layer is a base layer. The number of enhancement layer should be greater than or equal to one and each enhancement layer is a residual layer. 5.2.5 Enhancement Layer Stream A VC-4 residual enhancement layer stream consists of one or more VC
36、-4 residual sequences. A VC-4 residual enhancement layer stream may also result from re-assembling enhancement residual sub-bitstreams. SMPTE RP 2058-3:2011 Page 7 of 32 pages 5.2.6 VC-4 Elementary Stream The VC-4 coding may generate multiple enhancement layer streams. Each enhancement layer stream
37、shall be an elementary stream and conform to the profile and the level defined in Annex D of SMPTE ST 2058-1. A scalable video stream set or a scalable video service consists of the video sub-stream from base layer and these multiple enhancement layer streams. The VC-4 elementary stream is defined a
38、s one enhancement layer stream among multiple enhancement layer streams coded by SMPTE ST 2058-1. A VC-4 elementary stream is also called a VC-4 video sub-stream. This elementary stream consists of the data which are associated with the same LAYER_ID defined in SMPTE ST 2058-1. 5.2.7 VC-4 Video Sub-
39、Stream A VC-4 video sub-stream is one stream among multiple enhancement layer streams coded by SMPTE ST 2058-1. This sub-stream is treated as an elementary stream, and has a unique PID or elementary_PID. 5.2.8 Video Sub-Stream A video sub-stream is defined as one elementary stream among multiple ele
40、mentary streams which make a scalable video stream set. The video sub-stream from base layer is specified by other video coding standards, such as ITU-T H.262 (MPEG-2) or ITU-T H.264 (MPEG-4 AVC), etc. Rest enhancement video sub-streams are from SMPTE-LVE coding. Each video sub-stream is assigned by
41、 hierarchy_layer_index of hierarchy_descriptor(). 5.3 Mnemonics The following mnemonics are defined to describe the different data types used in the coded bitstream: uimsbf Unsigned integer, most-significant bit first. 6 MPEG-2 Transport Stream Encoding The bitstream syntax is defined in separate ta
42、bles that use a C-like pseudo-code notation to expose the sequential order of data fields in the stream. 6.1 Signaling of VC-4 Elementary Streams MPEG-2 elementary streams carrying VC-4 residual elementary stream data shall be signaled in a PMT as defined in ISO/IEC 13818-1. The syntax and semantics
43、 of these signaling parameters are specified in this section. 6.1.1 Stream Type The stream_type value in the PMT describing a VC-4 elementary stream should be set to 0xEB. This value indicates that this elementary stream is considered to be a privately-handled stream. The scope of this private value
44、 shall be captured by means of the registration_descriptor() of ISO/IEC 13818-1. 6.1.2 Registration Descriptor for VC-4 At least one MPEG-2 registration_descriptor() shall be present in the inner descriptor loop of the MPEG-2 program element listed in the TS_program_map_section() corresponding to th
45、e VC-4 elementary stream. The syntax and semantics for this descriptor appears in Table 1 and in the subsequent text. SMPTE RP 2058-3:2011 Page 8 of 32 pages Table 1 Syntax for registration descriptor Syntax Bits Format registration_descriptor() descriptor_tag descriptor_length format_identifier for
46、(i = 0; i 0x80) base_layer_format_identifier base_layer_frame_field_coding_mode reserved reserved 8 4 4 4 4 8 14 4 14 8 8 4 4 16 0x05 uimsbf uimsbf uimsbf uimsbf 0xFF uimsbf uimsbf uimsbf uimsbf uimsbf uimsbf 0xF 0xFFFF subdescriptor_tag This 8-bit field shall be set to 0x05 to identify this data st
47、ructure as the scalability extension sub-descriptor. layer_id This 4-bit field shall identify the layer of the VC-4 stream associated with this descriptor among all enhancement layer streams. This value shall be LAYER_ID_SEQ in the sequence header, LAYER_ID in_ the picture header or LAYER_ID_SLICE i
48、n the slice header of this VC-4 elementary stream. frame_field_coding_mode This 4-bit field shall specify this video elementary stream is coded with which coding mode among the interlaced field mode, the interlaced frame mode, the interlaced frame/field mixture mode or the progressive frame mode. Ta
49、ble 8 defines values for the frame_field_coding_mode. sub_layer_id_min This 4-bit field shall indicate the minimum value of the QUALITY_LAYER_ID of this VC-4 elementary stream. sub_layer_id_max This 4-bit field shall indicate the maximum value of the QUALITY_LAYER_ID of this VC-4 elementary stream. SMPTE RP 2058-3:2011 Page 12 of 32 pages Table 8 Lists of values applicable to the frame_field_coding_mode/base_layer_frame_field_coding_