ASA S3 50-2013 American National Standard Method for Evaluation of the Intelligibility of Text-to-Speech Synthesis Systems (Includes Access to Additional Content).pdf
ANSI/ASA S3.50-2013

AMERICAN NATIONAL STANDARD

Method for Evaluation of the Intelligibility of Text-to-Speech Synthesis Systems

Secretariat: Acoustical Society of America

Approved on May 6, 2013 by: American National Standards Institute, Inc.

Abstract

This Standard is to be used for testing the speech intelligibility of text-to-speech systems, providing a measure of human listeners' recovery of words that correspond to the intended phonemic content of speech created by the system. Listeners are tasked to record the words or sentences they hear. Scoring may be either at the word or segment level. A normalized edit distance of the response from the intended message is the measure of the system's speech intelligibility. This Standard specifies methods for selecting test material, which may depend on the purpose and constraints of the test. The Standard also specifies methods for selecting and training the listeners; for designing, controlling, and reporting the test conditions; and for analyzing and reporting the test results. The Standard also provides background material, important for designing the test. Informative software is provided to assist the user in creating stimuli and scoring the test results. Use of the software is not mandatory.

AMERICAN NATIONAL STANDARDS ON ACOUSTICS

The Acoustical Society of America (ASA) provides the Secretariat for Accredited Standards Committees S1 on Acoustics, S2 on Mechanical Vibration and Shock, S3 on Bioacoustics, S3/SC 1 on Animal Bioacoustics, and S12 on Noise. These committees have wide representation from the technical community (manufacturers, consumers, trade associations, organizations with a general interest, and government representatives). The Standards are published by the Acoustical Society of America as American National Standards after approval by their respective Standards Committees and the American National Standards Institute (ANSI).

These standards are developed and published as a public service to provide standards useful to the public, industry, and consumers, and to Federal, State, and local governments.

Each of the Accredited Standards Committees (operating in accordance with procedures approved by ANSI) is responsible for developing, voting upon, and maintaining or revising its own Standards. The ASA Standards Secretariat administers Committee organization and activity and provides liaison between the Accredited Standards Committees and ANSI. After the Standards have been produced and adopted by the Accredited Standards Committees, and approved as American National Standards by ANSI, the ASA Standards Secretariat arranges for their publication and distribution.

An American National Standard implies a consensus of those substantially concerned with its scope and provisions. Consensus is established when, in the judgment of the ANSI Board of Standards Review, substantial agreement has been reached by directly and materially affected interests. Substantial agreement means much more than a simple majority, but not necessarily unanimity. Consensus requires that all views and objections be considered and that a concerted effort be made towards their resolution.

The use of an American National Standard is completely voluntary. Their existence does not in any respect preclude anyone, whether he or she has approved the Standards or not, from manufacturing, marketing, purchasing, or using products, processes, or procedures not conforming to the Standards.

NOTICE: This American National Standard may be revised or withdrawn at any time. The procedures of the American National Standards Institute require that action be taken periodically to reaffirm, revise, or withdraw this Standard.

Acoustical Society of America
ASA Secretariat
35 Pinelawn Road, Suite 114E
Melville, New York 11747-3177
Telephone: 1 (631) 390-0215
Fax: 1 (631) 390-0217
E-mail: asastds@aip.org

© 2013 by Acoustical Society of America. This Standard may not be reproduced in whole or in part in any form for sale, promotion, or any commercial purpose, or any purpose not falling within the provisions of the U.S. Copyright Act of 1976, without prior written permission of the publisher. For permission, address a request to the Standards Secretariat of the Acoustical Society of America.

© 2013 Acoustical Society of America. All rights reserved.

Contents

1 Scope
2 Normative references
3 Terms and definitions
4 Description of a text-to-speech synthesis system
5 General guidance for experimental design and testing
6 Requirements (Methods)
6.1 TTS system description and specification
6.2 Listeners
6.3 Selection and design of test materials
6.4 Intelligibility test procedures
6.5 Measurements and analysis of results
Annex A (informative) Rationale for the recommendations concerning intelligibility test materials
A.1 Introduction
A.2 Acoustic cues to linguistic units vary from context to context
A.3 Systems vary in the algorithms they use and the types of errors they produce
A.4 Conclusion
Annex B (normative) Methodological considerations for stimuli and responses: Considerations for test material containing names and nonsense words
B.1 Stimuli preparation
B.2 Response scoring
Annex C (informative) Example software to create stimuli and score results in conformity with the method described in ANSI/ASA S3.50-2013
C.1 Disclaimer
C.2 Example software
Bibliography

Figures

Figure 1 Block diagram of a typical TTS system. This Standard primarily evaluates processing below the dotted line.
Figure A.1 Spectrograms of (a) Miss Peak, (b) Miss Beak, and (c) misspeak

Tables

Table A.1 Sample responses for one listener to fake ill
Table A.2 Sample responses for one listener to dock, cat, dock, bird
Table A.3 Sample responses for one listener to Jupiter eyebrows
Table C.1 An example grammar for the susgen program showing sentence frames with Part of Speech (POS) tags, and the total number of syllables in the non-variable content of each frame.
Table C.2 Example lexicon entries. Each row specifies a word, the POS tag to which that word can be assigned within grammar frames, and a syllable count.

Foreword

[This Foreword is for information only, and is not a part of the American National Standard ANSI/ASA S3.50-2013 American National Standard Method for Evaluation of the Intelligibility of Text-to-Speech Synthesis Systems. As such, this Foreword may contain material that has not been subjected to public review or a consensus process. In addition, it does not contain requirements necessary for conformance to the Standard.]

This Standard comprises a part of a group of definitions, standards, and specifications for use in bioacoustics. It was developed and approved by Accredited Standards Committee S3, Bioacoustics, under its approved operating procedures. Those procedures have been accredited by the American National Standards Institute (ANSI). The Scope of Accredited Standards Committee S3 is as follows: Standards, specifications, methods of measurement and test, and termin
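
The Abstract names a normalized edit distance between the listener's response and the intended message as the intelligibility measure. The Python sketch below illustrates that idea at the word level. It is not the Standard's informative software, and the excerpt does not state how the Standard normalizes the distance: dividing by the length of the longer sequence is an assumption made here so that the score falls between 0 and 1. The function names and the sample response "duck" are hypothetical.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences.

    Insertions, deletions, and substitutions each cost 1.
    """
    prev = list(range(len(hyp) + 1))  # distances from an empty ref prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting the first i items of ref
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def normalized_edit_distance(intended, response):
    """Word-level edit distance divided by the longer word count.

    ASSUMPTION: this normalization is illustrative only; the Standard's
    own definition is not given in this excerpt.
    """
    ref = intended.lower().split()
    hyp = response.lower().split()
    longest = max(len(ref), len(hyp))
    if longest == 0:
        return 0.0
    return edit_distance(ref, hyp) / longest


# Hypothetical listener response with one misheard word out of four.
score = normalized_edit_distance("dock cat dock bird", "dock cat duck bird")
print(score)
```

A score of 0 means the response matches the intended message exactly, and larger values mean more words were missed, added, or misheard. Because `edit_distance` accepts any sequence, segment-level scoring could be sketched the same way by passing lists of segment symbols instead of words.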
Link: http://www.mydoc123.com/p-450487.html