1、 Table of Contents Page 1 Scope. 3 2 Normative references 3 3 Definitions, symbols and abbreviations . 4 4 Data structure of Content stored on an SSM Card 5 4.1 Overview 5 4.2 Content Structure. 5 4.3 File Directory 6 4.4 Files 7 5 Structure of MXF file for an SSM Card 9 5.1 Partition Packs . 9 5.2
2、Header Metadata . 9 6 Video Essence file .13 6.1 Video data format13 6.2 DV-DIF Compound data .14 7 Audio Essence file .17 7.1 Audio data format17 7.2 Sound Essence data.18 8 Clip Metadata file (XML) 20 8.1 XML version and document encoding information23 8.2 Namespace.23 8.3 Clip Structural metadata
3、 .23 8.4 Clip Descriptive metadata .37 9 MXF Descriptive metadata implementation (informative)50 9.1 DM Framework50 9.2 DM Tracks, DM Sequence, DM Segments.50 9.3 Universal Label for Descriptive Metadata Scheme51 9.4 Clip Metadata mapping to DMS-1.51 Annex A Header Metadata Set for MXF file (Informa
4、tive) 56 Annex B Structures of application specific files (Normative) 64 Annex C Document flow (Informative) 80 Annex D Specification of Bitmap file (Normative) .81 Annex E Specification of WAVE file (Normative)84 Annex F Bibliography (Informative)86 Page 1 of 86 pages RP 2002-2006 SMPTE RECOMMENDED
5、 PRACTICE Content Specification on Solid State Media Card for DV/DV-Based Essence Copyright 2006 by THE SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS 3 Barker Avenue, White Plains, NY 10601 (914) 761-1100 Approved September 15, 2006 RP 2002-2006 Page 2 of 86 pages Foreword SMPTE (the Society of
6、 Motion Picture and Television Engineers) is an internationally recognized standard developing organization. Headquartered and incorporated in the United States of America, SMPTE has members in over 80 countries on six continents. SMPTEs Engineering Documents, including Standards, Recommended Practi
7、ces and Engineering Guidelines, are prepared by SMPTEs Technology Committees. Participation in these Committees is open to all with a bona fide interest in their work. SMPTE cooperates closely with other standards-developing organizations, including ISO, IEC and ITU. SMPTE Engineering Documents are
8、drafted in accordance with the rules given in Part XIII of its Administrative Practices. Technology Committee W25 prepared SMPTE Recommended Practice RP 2002. Introduction The evolution of compression technology in the field of television has enabled new television signal processing devices. This in
9、cludes new methods and applications for storing captured television signals. Standardization efforts by the Society of Motion Picture and Television Engineers (SMPTE) has led to the development of a new file format for audio-visual applications, known as MXF (Material Exchange Format), which may be
10、used in all stages of audio-visual signal processing. The combination of these two technologies, signal compression and file format, are enabling the use of IT technology in network-based facilities, providing a new environment for flexible tools for Content creation, processing, storage and transfe
11、r. The purpose of this document is to provide implementers and users with the specification of Content on a Solid State Media (SSM) Card allowing for interoperability and file interchange with other devices that usually edit or process the Content after it has been captured. This document defines th
12、e specification of Content on an SSM Card. The Content consists of Essence and Metadata, and the structure of the content as well as detailed specifications of the Essence and the Metadata are described in this document. The Essence data consists of a video MXF file and multiple audio MXF files. The
13、 MXF file format is intended to contain not only essence (picture and sound) but associated Metadata as well. The video and audio essence data on an SSM Card are stored in separate MXF files using the OP-Atom operational pattern as specified in SMPTE 390M. Metadata stored in the Header of the MXF fi
14、les contains the MXF Structural Metadata which describes the structure or parameters of the video and audio essence and includes optional MXF Descriptive Metadata which provides additional information about the essences. In addition, Metadata is also described using XML (eXtensible Markup Language)
15、that is stored in a separate Clip Metadata file. Descriptions using XML are human readable, computer compatible and easily extensible. The Clip Metadata file contains the Clip Structural Metadata which is the same information as required in the MXF Structural Metadata mentioned above, describing the
16、 structure or parameters of the video and audio essence. The Clip Metadata file also contains the Clip Descriptive Metadata which describes additional information about the essences. There is a complementary SMPTE document (SMPTE RP 2006) which describes the physical and electrical interfaces of the
17、 storage media (SSM Card) to ensure interoperability at the storage level. Users who are interested in additional information on MXF are encouraged to begin with SMPTE Engineering Guideline EG 41 which provides the basic explanation of the MXF file format. RP 2002-2006 Page 3 of 86 pages 1 Scope Thi
18、s document defines the specification of Content on an SSM (Solid State Media) Card defined in SMPTE RP 2006 for interoperability and interchange of this Content with other devices. The Content consists of Essence and Metadata and this document specifies the structure of Content as well as detailed s
19、pecifications of Essence and Metadata. The Essence data consists of a video MXF file and audio MXF files each of which uses operational pattern OP Atom as specified in SMPTE 390M. The video MXF file contains DV/DV-based compressed video conforming to SMPTE 383M, and multiple audio MXF files contain
20、one individual channel of audio data conforming to SMPTE 382M. The Metadata is stored in the header metadata of the MXF files and additionally stored in a separate metadata file that is defined using XML (eXtensible Markup Language). 2 Normative references The following standards contain provisions,
21、 which through reference in this text constitute provisions of this standard. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this standard are encouraged to investigate the possibility of applying the most rece
22、nt edition of the standards indicated below. SMPTE 12M-1999, Television, Audio and Film Time and Control Code SMPTE 314M-2005, Television Data Structure for DV-Based Audio, Data and Compressed Video 25 and 50 Mb/s SMPTE 330M-2004, Television Unique Material Identifier (UMID) SMPTE 336M-2001, Televis
23、ion Data Encoding Protocol using Key-Length-Value SMPTE 377M-2004, Television Material Exchange Format (MXF) File Format Specification SMPTE 379M-2004, Television Material Exchange Format (MXF) Generic Container Format SMPTE 382M, Television Material Exchange Format (MXF) Mapping AES3 and Broadcast
24、Wave Audio into the MXF Generic Container SMPTE 383M-2004, Television Material Exchange Format (MXF) Mapping DV-DIF Data to the MXF Generic Container SMPTE 390M-2004, Television Material Exchange Format (MXF) Specialized Operational Pattern “Atom” (Simplified Representation of a Single Item) SMPTE R
25、P 2006-2006, Solid State Media (SSM) Card Specification AES3-2003, Serial Transmission Format for two Channel Linearly Represented Digital Audio Data IEC 61834-2 (1998-08), Recording Helical-Scan Digital Video Cassette Recording System using 6,35 mm Magnetic Tape for Consumer Use (525-60, 625-50, 11
26、25-60 and 1250-50 Systems) Part 2: SD Format for 525-60 and 625-50 Systems ISO/IEC 646:1991, Information Technology ISO 7-Bit Coded Character Set for Information Interchange ISO/IEC 14496-2:2004, Information Technology Coding of Audio-Visual Objects Part 2: Visual ISO/IEC 14496-3:2005, Information T
27、echnology Coding of Audio-Visual Objects Part 3: Audio RP 2002-2006 Page 4 of 86 pages ISO/IEC 14496-12:2005, Information Technology Coding of Audio-Visual Objects Part 12: ISO Base Media File Format Extensible Markup Language (XML) 1.0 (Third Edition), W3C Recommendation, 04 February 2004 Namespace
28、s in XML, W3C Recommendation, 14 January 1999 XML Schema Part 1: Structures Second Edition, W3C Recommendation, 28 October 2004 XML Schema Part 2: Datatypes, W3C Recommendation, 28 October 2004 3 Definitions, symbols and abbreviations For symbol definition and abbreviations about MXF, the reader is
29、encouraged to study the SMPTE 377M and SMPTE EG 41. This document uses same terms defined in the mentioned standards and in other referenced documents. Content Content comprises video or audio essence and any Metadata. Clip Content Clip Content indicates the Content which contains the video or audio
30、 essence and related Metadata in one Clip. Clip Metadata Metadata which is stored in a Clip Metadata file using XML Clip Descriptive Metadata Metadata which is stored in a Clip Metadata file using XML and describes additional information about the video and audio essence Clip Structural Metadata Met
31、adata which is stored in a Clip Metadata file using XML and describes the structure or parameters of the video and audio essence. MXF Descriptive Metadata Metadata which is stored in a MXF file using the KLV coding and describes additional information about the video and audio essence. MXF Structura
32、l Metadata Metadata which is stored in a MXF file using the KLV coding and describes the structure or parameters of the video and audio essence. WORD unsigned short integer DWORD unsigned long integer LONG signed long integer RP 2002-2006 Page 5 of 86 pages 4 Data structure of Content stored on an S
33、SM Card 4.1 Overview This section describes the data structure of Content on an SSM Card. The SSM Card format is defined in SMPTE RP 2006. This document defines the components of the Content, the concept of the Metadata structure, the directory structure, the files within each directory, and the nam
34、ing rules of the files. 4.2 Content Structure The content on an SSM Card shall consist of separate files of Essence data and Metadata, structured as Clip Contents. The Essence data shall consist of one video MXF file and up to 16 individual audio MXF files each containing a single audio channel. The
35、 Metadata shall be stored in the Header Metadata of the MXF files, and shall be additionally stored in a Clip metadata file. The metadata stored in the Header Metadata part of the MXF files comprises MXF Structural Metadata which describes the structure and parameters of the video and audio essence,
36、 and optional MXF Descriptive Metadata which provides additional information about the Content. Keeping Metadata in the header of a MXF file provides great benefits in file exchange. The relationship between video/audio essence and Metadata can be always maintained by wrapping the essence data and a
37、ssociated Metadata together in one file. In addition, the KLV coded Metadata structure, based on the SMPTE Metadata dictionary, guarantees the interoperability of Metadata items between different systems. However, for direct editing capability using a fast random access media like a SSM Card, having
38、 Clip Metadata file outside a MXF file gives benefits for users. MXF metadata editing after capture, requires addition or modification of embedded MXF metadata items. This editing may sometimes result in the re-creation of MXF essence files. The provision of an external Clip Metadata structure provi
39、des direct editing capability of Metadata items without changes to the essence files. For ease of editing, an external Clip Metadata file is defined as part of the Content Data Structure on a SSM Card. A portion of the Clip Metadata file contains Clip Structural Metadata which corresponds to the MXF
40、 Structural Metadata in the header portion of the MXF files. Another portion contains the Clip Descriptive Metadata which provides additional information, in human readable form, regarding the stored Clip Content. Figure 4.1 illustrates the general structure of a Clip Content as stored on the SSM Ca
41、rd. RP 2002-2006 Page 6 of 86 pages Figure 4.1 General Structure of Content on an SSM Card 4.3 File Directory All SSM Content files shall be organized on the SSM Card in a Directory structure that is shown in Figure 4.2. The Directory hierarchy of the Clip Content shall be “/” and this shall represe
42、nt the root of all stored Content on the SSM Card. 4.3.1 Contents directory The “CONTENTS” directory shall be located under the root directory “/” and shall contain the “CLIP” metadata directory, “VIDEO” essence directory, and “AUDIO” essence directory. The “CONTENTS” directory may optionally contai
43、n “ICON”, “VOICE” and “PROXY” directories. 4.3.2 Clip directory The “CLIP” directory shall contain all Clip Metadata files that store the Clip Structural and Descriptive Metadata and is located under the “CONTENTS” directory. 4.3.3 Video directory The “VIDEO” directory shall contain all video essenc
44、e files including the MXF Header Metadata. 4.3.4 Audio directory The “AUDIO” directory shall contain all audio essence files including the MXF Header Metadata. 4.3.5 File directory for optional files The “ICON” directory shall contain all the Thumbnail files, the “VOICE” directory shall contain all
45、the Voice Memo files and the “PROXY” directory shall contain all the Proxy files. Definitions of the Thumbnail, Voice memo and Proxy files are included in Annex B. Clip ContentVideo Essence MXF Structural MetadataMXF Descriptive MetadataClip Structural MetadataClip Descriptive MetadataAudio Essence
46、MXF Structural MetadataMXF Descriptive MetadataVideo MXF fileAudio MXF fileClip Metadata file (XML)Descriptive MetadataSourceRP 2002-2006 Page 7 of 86 pages Figure 4.2 An example of Directory Tree for Clip (0001AB) and (0002CD) with two Audio channels 4.4 Files All Essence files in the same Clip Con
47、tent shall have the same duration. The structure of MXF files is defined in section 5. 4.4.1 Video Essence file A Video Essence file shall comply with the MXF file structure using the DV-DIF mapping defined in SMPTE 383M. The Video Essence file is defined in section 6. /CONTENTSVIDEO0001AB.MXF0002CD
48、.MXF0001AB00.MXFAUDIO0001AB01.MXF0002CD00.MXF0002CD01.MXFCLIP0001AB.XML0002CD.XMLICON (Thumbnail)0001AB.BMP0002CD.BMPVOICE0001AB00.WAV0001AB01.WAVPROXY0001AB.MP4Optional Normative Annex B0001AB.BINRP 2002-2006 Page 8 of 86 pages 4.4.2 Audio Essence file Each Audio Essence file shall comply with the
49、MXF file structure using the AES 3 audio mapping defined in SMPTE 382. Each Audio Essence file shall map a single channel of AES3 audio. The maximum number of the Audio Essence files per Clip shall be 16 channels. The Audio Essence file is defined in section 7. 4.4.3 Clip Metadata file A Clip Metadata file shall consist of Clip Structural Metadata and Clip Descriptive Metadata. Both Clip Metadata types shall be described with an XML language and contained in a single file using XML Schema syntax and