SMPTE RDD 42-2016 MDA Program Specification.pdf
《SMPTE RDD 42-2016 MDA Program Specification.pdf》由会员分享,可在线阅读,更多相关《SMPTE RDD 42-2016 MDA Program Specification.pdf(31页珍藏版)》请在麦多课文档分享上搜索。
1、 Copyright 2016 by THE SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS 3 Barker Avenue, White Plains, NY 10601 (914) 761-1100 Approved August 23, 2016 The attached document is a Registered Disclosure Document prepared by the proponent identified below. It has been examined by the appropriate SMPT
2、E Technology Committee and is believed to contain adequate information to satisfy the objectives defined in the Scope, and to be technically consistent. This document is NOT a Standard, Recommended Practice or Engineering Guideline, and does NOT imply a finding or representation of the Society. Ever
3、y attempt has been made to ensure that the information contained in this document is accurate. Errors in this document should be reported to the proponent identified below, with a copy to engsmpte.org. All other inquiries in respect of this document, including inquiries as to intellectual property r
4、equirements that may be attached to use of the disclosed technology, should be addressed to the proponent identified below. Proponent contact information: Scott Smyers DTS, Inc. 130 Knowles Drive, Suite B Los Gatos, CA 95032 Email: Page 1 of 31 pages SMPTE REGISTERED DISCLOSURE DOCUMENT MDA Program
5、 Specification SMPTE RDD 42:2016 SMPTE RDD 42:2016 Page 2 of 31 pages Table of Contents Page Conventions . 3 Introduction (Informative) 3 1 Scope . 7 2 Normative References 7 3 Timeline . 7 4 Audio Object 7 5 Coordinate System 7 6 Object Model . 8 7 URI Constants 20 8 Basic Data Types 20 9 Reference
6、 Renderer 21 Bibliography (Informative) 31 SMPTE RDD 42:2016 Page 3 of 31 pages Conventions All sections are normative, unless otherwise indicated. Pseudo-code and property names use font style courier new. The expressions MAY, NEED NOT, SHALL, SHALL NOT, SHOULD, and SHOULD NOT indicate normative be
7、havior ISO/EIC Directives, Part 2. VERBAL FORM SEMANTICS MAY It is allowed NEED NOT It is not required that SHALL Is required that SHALL NOT Is required to be not SHOULD It is recommended that SHOULD NOT It is not recommended that Introduction (Informative) The MDA Program, or simply Program hereaft
8、er, is a self-contained object-based audio program. As such it consists of a collection of audio objects, each combining an audio waveform with metadata. The metadata indicates, for instance, when the object occurs on the Program timeline or where it is positioned within the soundfield. It is used t
9、o control the mapping of the audio object waveform to output loudspeakers at playback. SMPTE RDD 42:2016 Page 4 of 31 pages Figure 1 Sample Program. Only a subset of all Fragment properties are shown Figure 1 depicts a Program that consists of 4 audio objects: Dialog, FX1, FX2 and LFE. While the LFE
10、 object exists from t = 0 to t = T3, the dialog object exists only from t = 0 to t = T2, and the FX2 and FX2 objects from t = T1 to t = T3. The Program object model allows for any number of audio objects to overlap at any point in time, and an audio object can be as short as a sample or as long as t
11、he program. Each audio object is assigned an identifier that is unique within the scope of the Program. The metadata associated with each object is divided into Fragments, each corresponding to a period of time during which the metadata is static. To simplify the Program structure, Fragment boundari
12、es are aligned. Two kinds of Fragments, and hence audio objects, are defined: F X 2 o bjectF X 1 o bjectGr o up Id=1Dial og objectOb jec tF rag mentID = 1re n der ingException = CassetLo cator = AAAOb jec tF rag mentID = 1re n der ingException = CassetLo cator = AAAOb jec tF rag mentID = 2po sit ion
13、 = (1,0 .1 ,0 )assetLo cator = BBBOb jec tF rag mentID = 2po si tion = ( 1, 0 ,0)assetLo cator = BBBLF E ob je ctLF EF r agmentID = 4assetLo cator = DDDLF EF r agmentID = 4assetLo cator = DDDLF EF r agmentID = 4assetLo cator = DD DOb jec tF rag mentID = 3po sit ion = (1, - 0. 1,0 )assetLo cator = CC
14、COb jec tF rag mentID = 3po si tion = ( 1, 0 ,0)assetLo cator = CCCGr o up Id=1Asse t (URI = AA A) Asse t ( URI = BBB ) Asse t (URI = C CC) Asse t (URI = DDD ) t = 0 t = T1t = T2t = T3P rogram ti me li neSMPTE RDD 42:2016 Page 5 of 31 pages An ObjectFragment corresponds to an object associated with
15、a spatial locus. For instance, the position of the FX1 and FX2 objects in Figure 1 changes from t = T1 to t = T3. This spatial locus is used to determine the loudspeakers that will output the waveform associated with the object. It is also possible to instruct that an object waveform be routed throu
16、gh a specific loudspeaker, if present. An LFEFragment corresponds to an object whose waveform is intended for routing to a Low Frequency Effect (LFE) channel, and is therefore not associated with a spatial location. Each Fragment references a sequence of audio samples, i.e. the object waveform, with
17、in an underlying asset identified using a Uniform Resource Identifier (URI). Depending on applications, the asset can be carried alongside the Fragment metadata or be remote. Multiple Fragments can reference the same audio samples within a single asset. As illustrated by the FX1 and FX2 objects, Fra
18、gments can be combined into a Group, which logically groups the two ObjectFragments and contains metadata common to them . The object model also allows Fragments to be combined into a Switch, which indicates that only one of the Fragments is rendered at any given time. Groups and Switches are recurs
19、ive entities that can themselves contain Groups and Switches. From the perspective of the object model, Fragments, Groups and Switches are all subclasses of the Entity class, which represents arbitrary entities of the Program timeline. In order to specify unambiguously how object metadata is used to
20、 map object waveforms to loudspeaker outputs, i.e. rendered, a Reference Renderer is fully specified in Section 9. The Reference Renderer uses the Vector Base Amplitude Panning formalism (VBAP), which was introduced by Pulkki et al. and has since been extensively studied. VBAP is an extension of the
21、 familiar tangent law for pair-wise panning to three-dimensional speaker configurations. Specifically, given a loudspeaker triplet on the unit sphere and a point source object located within the spherical triangle defined by the loudspeakers, the contribution of the object waveform to each of the lo
22、udspeaker is determined by the coordinates of the object within the linear basis formed by the three loudspeakers (see Figure 2). Objects with a finite extent can be rendered as a collection of point sources. More complex loudspeaker configurations can be decomposed into multiple speaker triplets. F
23、igure 2 Rendering audio objects using VBAP. The shaded areas show the relative output power at each of the speakers and are determined by expressing the object vector in the basis formed by the three loudspeakers. S1S2S3OSMPTE RDD 42:2016 Page 6 of 31 pages To support a range of applications within
24、its stated scope, the Program object model is designed to be flexible, e.g. the number of simultaneous Fragments is not limited, and offers multiple extension points. Applications are therefore expected to constrain or extend the object model to suit their specific requirements. Similarly, this spec
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
10000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- SMPTERDD422016MDAPROGRAMSPECIFICATIONPDF
