ECMA TR 105-2012 A Shaped Noise File Representative of Speech (1st Edition).pdf
ECMA TR/105
1st Edition / December 2012

A Shaped Noise File Representative of Speech

COPYRIGHT PROTECTED DOCUMENT
© Ecma International 2012

Contents

1 Scope
2 References
3 Terms and definitions
4 Abbreviations
5 Spectrum
6 Crest Factor
Annex A (informative) Basis of Target Spectrum
Annex B (informative) Basis of Target Crest Factor

Introduction

Determination of headphone acoustic output for compliance with product safety regulations is described in EN 50332,
which in turn references IEC 60268. Together, these documents describe three major characteristics of a recorded file that is to be used when measuring this output. These three characteristics are the spectrum, the crest factor, and the recording level. The spectrum is specified relative to pink noise, which has a flat spectrum when using constant percentage bandwidth filters, specifically third-octave filters out to 20 kHz.

Use of a shaped noise file is attractive because it can be described mathematically and has characteristics that are essentially the same considering any portion of the file beyond some reasonably short time scale. This means that long averaging times are not necessary, and a stable measurement can be made quickly. A purely mathematical description also means that the file can be generated by anyone, rather than relying on specific “golden” recordings.

Because EN 50332 is concerned with hearing safety, the crest factor is quite aggressive to encompass the behavior of certain types of music. However, in other cases, such as for power consumption testing, a noise file more representative of the typical behavior rather than the upper limit is desired. In addition, different content types, such as speech, are also of interest, for example listening to an audiobook or a podcast.

This Ecma Technical Report has been adopted by the General Assembly of December 2012.

“COPYRIGHT NOTICE

© 2012 Ecma International

This document may be copied, published and distributed to others, and certain derivative works of it may be prepared, copied, published, and distributed, in whole or in part, provided that the above copyright notice and this Copyright License and Disclaimer are included on all such copies and derivative works. The only derivative works that are permissible under this Copyright License and Disclaimer are:

(i) works which incorporate all or portion of this document for the purpose of providing commentary or explanation (such as an annotated version of the document),
(ii) works which incorporate all or portion of this document for the purpose of incorporating features that provide accessibility,
(iii) translations of this document into languages other than English and into different formats and
(iv) works by making use of this specification in standard conformant products by implementing (e.g. by copy and paste wholly or partly) the functionality therein.

However, the content of this document itself may not be modified in any way, including by removing the copyright notice or references to Ecma International, except as required to translate it into languages other than English or into a different format.

The official version of an Ecma International document is the English language version on the Ecma International website. In the event of discrepancies between a translated version and the official version, the official version shall govern.

The limited permissions granted above are perpetual and will not
be revoked by Ecma International or its successors or assigns.

This document and the information contained herein is provided on an “AS IS” basis and ECMA INTERNATIONAL DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY OWNERSHIP RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.”

A Shaped Noise File Representative of Speech

1 Scope

This Technical Report describes a digital shaped pink noise file representative of speech in two main characteristics, namely the spectrum and the crest factor. The spectrum is defined in third-octave band levels relative to pink noise up to the 8 kHz band, which provides a sufficient bandwidth for speech. The crest factor is defined at a 30 second time scale. The recording level of the file is not specified, and should be adjusted to match the amplitude, at the output of the headphone, of a typical audiobook or podcast when played on the device under test.

This file is not meant to replace the existing file defined in EN 50332-1 for hearing safety.

2 References

For dated references, only the edition cited applies. For undated references, the latest edition of the referenced document (including any amendments) applies.

EN 50332-1, Sound system equipment - Headphones and earphones associated with portable audio equipment - Maximum sound pressure level measurement methodology and limit considerations

IEC 60268-1, Sound system equipment

3 Terms and definitions

For the purposes of this document, the following terms and definitions apply.

3.1
crest factor (CF)
Crest factor is the ratio of the largest absolute value in a time-varying signal to the root-mean-square (RMS) value of the signal. In this Technical Report, it will be expressed in decibels, and calculated by the following equation, where x is the time-varying amplitude.

\[ \mathrm{CF} = 20 \log_{10} \left( \frac{\max |x|}{x_{\mathrm{RMS}}} \right) \tag{1} \]

4 Abbreviations

CF    crest factor
RMS   root-mean-square
SPL   sound pressure level
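As an illustration of the crest factor definition in 3.1, the following is a minimal sketch in Python, not part of the Technical Report. The function name `crest_factor_db` and the two test signals are assumptions chosen for the example; the calculation itself is equation (1) applied to a list of samples.

```python
import math


def crest_factor_db(samples):
    """Crest factor in dB per equation (1): 20*log10(max|x| / RMS(x)).

    `samples` is any non-empty sequence of numbers (hypothetical helper,
    not defined by the Technical Report).
    """
    if not samples:
        raise ValueError("samples must be non-empty")
    peak = max(abs(s) for s in samples)
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    if rms == 0.0:
        raise ValueError("crest factor is undefined for an all-zero signal")
    return 20.0 * math.log10(peak / rms)


# Two reference signals with well-known crest factors.
n = 1600
# Sine wave covering exactly 10 periods: peak/RMS = sqrt(2), about 3.01 dB.
sine = [math.sin(2.0 * math.pi * 10 * k / n) for k in range(n)]
# Square wave alternating between +1 and -1: peak equals RMS, so 0 dB.
square = [1.0 if k % 2 == 0 else -1.0 for k in range(n)]

cf_sine = crest_factor_db(sine)
cf_square = crest_factor_db(square)
print(f"sine:   {cf_sine:.2f} dB")
print(f"square: {cf_square:.2f} dB")
```

For a real measurement the same calculation would be applied to a window of the noise file at the time scale the report specifies (30 seconds), rather than to these short synthetic signals.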
5 Spectrum

The spectrum is derived by taking the average spectrum of the TIMIT speech corpus files [2], which contain several hours of speech by 630 speakers of both genders, recorded at a sample rate of 16 kHz. A spline fit is then used to smooth the result. See Annex A for details on the derivation.

The target spectrum, relative to pink noise, is listed in Table 1 by third-octave band, and is based on a 16 kHz sample rate. The tolerance on the spectrum is the same as that given in IEC 60268-1.

NOTE 1 Some analysis programs may not report a level for the 8 kHz band if the signal does not fill the entire band, as is the case with a sample rate of 16 kHz.

NOTE 2 If the file is synthesized using a wider bandwidth, care must be taken when downsampling to 16 kHz, because the low-pass filter typically used to avoid aliasing will greatly alter the level of the resulting 8 kHz band. If a wider bandwidth is used during synthesis, the spectrum defined below may be extended at a slope of -2 dB per band. However, the final file must be limited to only the bands specified in Table 1 and a sample rate of 16 kHz.

Table 1 - Spectrum Relative to Pink Noise

Frequency (Hz)    Relative SPL (dB)
20                -49,8
25
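NOTE 1 above can be made concrete with a short calculation. The sketch below, which is not part of the Technical Report, computes third-octave band edges and compares the 8 kHz band with the Nyquist frequency of a 16 kHz file. It assumes the base-ten band definition (exact centre frequencies of 1000 x 10^(n/10) Hz), which the report does not state explicitly; the function name `third_octave_band` is likewise hypothetical.

```python
SAMPLE_RATE_HZ = 16_000
NYQUIST_HZ = SAMPLE_RATE_HZ / 2


def third_octave_band(n):
    """Return (lower edge, centre, upper edge) in Hz for band index n.

    Assumes base-ten third-octave bands, with n = 0 centred on 1 kHz.
    """
    centre = 1000.0 * 10.0 ** (n / 10.0)
    half_width = 10.0 ** (1.0 / 20.0)  # ratio from centre to either edge
    return centre / half_width, centre, centre * half_width


# Band indices -17..9 correspond to the nominal 20 Hz .. 8 kHz bands.
bands = {n: third_octave_band(n) for n in range(-17, 10)}

low_8k, centre_8k, high_8k = bands[9]
print(f"8 kHz band: {low_8k:.1f} Hz to {high_8k:.1f} Hz "
      f"(exact centre {centre_8k:.1f} Hz)")
print(f"Nyquist frequency at {SAMPLE_RATE_HZ} Hz sample rate: {NYQUIST_HZ:.0f} Hz")

# Fraction of the band's linear bandwidth that lies below Nyquist.
filled = (NYQUIST_HZ - low_8k) / (high_8k - low_8k)
print(f"Portion of the 8 kHz band below Nyquist: {filled:.0%}")
```

Under this assumption the upper edge of the 8 kHz band lies above 8 kHz, so a 16 kHz file can only partly fill that band, which is why some analysis programs decline to report a level for it.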