欢迎来到麦多课文档分享! | 帮助中心 海量文档,免费浏览,给你所需,享你所想!
麦多课文档分享
全部分类
  • 标准规范>
  • 教学课件>
  • 考试资料>
  • 办公文档>
  • 学术论文>
  • 行业资料>
  • 易语言源码>
  • ImageVerifierCode 换一换
    首页 麦多课文档分享 > 资源分类 > PDF文档下载
    分享到微信 分享到微博 分享到QQ空间

    ETSI TR 126 943-2016 Digital cellular telecommunications system (Phase 2+) Universal Mobile Telecommunications System (UMTS) LTE Recognition performance evaluations of codecs for S.pdf

    • 资源ID:736877       资源大小:165.75KB        全文页数:22页
    • 资源格式: PDF        下载积分:10000积分
    快捷下载 游客一键下载
    账号登录下载
    微信登录下载
    二维码
    微信扫一扫登录
    下载资源需要10000积分(如需开发票,请勿充值!)
    邮箱/手机:
    温馨提示:
    如需开发票,请勿充值!快捷下载时,用户名和密码都是您填写的邮箱或者手机号,方便查询和重复下载(系统自动生成)。
    如需开发票,请勿充值!如填写123,账号就是123,密码也是123。
    支付方式: 支付宝扫码支付    微信扫码支付   
    验证码:   换一换

    加入VIP,交流精品资源
     
    账号:
    密码:
    验证码:   换一换
      忘记密码?
        
    友情提示
    2、PDF文件下载后,可能会被浏览器默认打开,此种情况可以点击浏览器菜单,保存网页到桌面,就可以正常下载了。
    3、本站不支持迅雷下载,请使用电脑自带的IE浏览器,或者360浏览器、谷歌浏览器下载即可。
    4、本站资源下载后的文档和图纸-无水印,预览文档经过压缩,下载后原文更清晰。
    5、试题试卷类文档,如果标题没有明确说明有答案则都视为没有答案,请知晓。

    ETSI TR 126 943-2016 Digital cellular telecommunications system (Phase 2+) Universal Mobile Telecommunications System (UMTS) LTE Recognition performance evaluations of codecs for S.pdf

    1、 ETSI TR 1Digital cellular telecoUniversal Mobile TelRecognition perffor Speech(3GPP TR 26.9floppy3TECHNICAL REPORT 126 943 V13.0.0 (2016communications system (Phaelecommunications System (LTE; rformance evaluations of codc Enabled Services (SES) .943 version 13.0.0 Release 1316-01) hase 2+); (UMTS)

    2、; o ecs 13) ETSI ETSI TR 126 943 V13.0.0 (2016-01)13GPP TR 26.943 version 13.0.0 Release 13Reference RTR/TSGS-0426943vd00 Keywords GSM,LTE,UMTS ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N 348 623 562 00017 - NAF 742 C Ass

    3、ociation but non lucratif enregistre la Sous-Prfecture de Grasse (06) N 7803/88 Important notice The present document can be downloaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The content of any electronic and/or pr

    4、int versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept

    5、on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http:/portal.etsi.org/tb/status/status.asp If you fi

    6、nd errors in the present document, please send your comment to one of the following services: https:/portal.etsi.org/People/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfi

    7、lm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. European Telecommunications Standards Institute 2016. All rights res

    8、erved. DECTTM, PLUGTESTSTM, UMTSTMand the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members. 3GPPTM and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned b

    9、y the GSM Association. ETSI ETSI TR 126 943 V13.0.0 (2016-01)23GPP TR 26.943 version 13.0.0 Release 13Intellectual Property Rights IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly a

    10、vailable for ETSI members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards“, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web s

    11、erver (https:/ipr.etsi.org/). Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may be

    12、come, essential to the present document. Foreword This Technical Report (TR) has been produced by ETSI 3rd Generation Partnership Project (3GPP). The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be inte

    13、rpreted as being references to the corresponding ETSI deliverables. The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http:/webapp.etsi.org/key/queryform.asp. Modal verbs terminology In the present document “shall“, “shall not“, “should“, “should not“, “may“, “need n

    14、ot“, “will“, “will not“, “can“ and “cannot“ are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions). “must“ and “must not“ are NOT allowed in ETSI deliverables except when used in direct citation. ETSI ETSI TR 126 943 V13.0.0 (2016-

    15、01)33GPP TR 26.943 version 13.0.0 Release 13Contents Intellectual Property Rights 2g3Foreword . 2g3Modal verbs terminology 2g3Foreword . 4g3Introduction 4g31 Scope 5g32 References 5g33 Abbreviations . 5g34 General . 6g34.1 Project History 6g34.2 Overview of the speech recognition framework for autom

    16、ated voice services work item . 8g34.3 Presentation of the following sections 8g35 Recommendation criteria . 8g35.1 Overview 8g35.2 Scoring on individual databases . 8g35.3 Performance metric over all databases . 9g35.4 Comparisons between codecs . 9g35.4.1 Low data-rate codec comparison 9g35.4.2 Hi

    17、gh data-rate codec comparison 9g35.4.2.1 8 kHz sampling rate 9g35.4.2.2 16 kHz sampling rate 9g35.5 Detailed recommendation comparisons 9g36 Performance evaluation method . 10g36.1 Introduction 10g36.2 Recognition engines . 11g36.2.1 Recognizer for speech codecs based proposals . 11g36.2.2 Training

    18、and testing 11g36.2.3 Recognizer for DSR 11g36.2.4 Training and testing 11g36.3 Usage of VAD for frame dropping . 12g36.4 Codec evaluations. 12g36.4.1 Recognition experiments under error-free channel . 12g36.5 Recognition experiments under channel errors 14g37 Recognition Performance Evaluation Resu

    19、lts 15g3Annex A: Key selection phase documents 19g3Annex B: Change history 20g3History 21g3ETSI ETSI TR 126 943 V13.0.0 (2016-01)43GPP TR 26.943 version 13.0.0 Release 13Foreword This Technical Report has been produced by the 3rdGeneration Partnership Project (3GPP). The contents of the present docu

    20、ment are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows: Version x.y.z where:

    21、 x the first digit: 1 presented to TSG for information; 2 presented to TSG for approval; 3 or greater indicates TSG approved document under change control. y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc. z the third digit is inc

    22、remented when editorial only changes have been incorporated in the document. Introduction SA4 has been working on the selection of a codec to recommend for Speech Enabled Services since October 2002 under the WID for SES 9. The usual process of agreeing “design constrains“ 10, “test and processing p

    23、lan“ 7 and “recommendation criteria“ 8 was followed and completed before evaluating the candidates. Two candidate codecs were proposed and evaluated: 1. ETSI Standard for the DSR Extended Advanced Front-end (ES 202 212) 2. AMR and AMR-WB audio codec The performance evaluations were conducted by two

    24、leading companies in the area of speech recognition, IBM and Scansoft. Results from these evaluations were presented at SA4#30 in February 2004 and are summarised here. The “recommendation criteria“ have been applied and SA4 recommends the DSR codec for Speech Enabled Services. SES codecs are introd

    25、uced in packet switched conversational services in Technical Specifications 26.235 Stage 1“. 2 3GPP TR 22.977: “Feasibility study for speech enabled services“. 3 ETSI ES 202 050: “Distributed Speech Recognition; Advanced Front-end Feature Extraction Algorithm; Compression Algorithm“. 4 ETSI ES 202 2

    26、12: “Distributed Speech Recognition; Extended Advanced Front-end Feature Extraction Algorithm; Compression Algorithm, Back-end Speech Reconstruction Algorithm“. 5 3GPP TS 26.235: “Packet switched conversational multimedia applications; Default codecs“. 6 3GPP TS 26.236: “Packet switched conversation

    27、al multimedia applications; Transport Protocols“. 7 TD S4-030543 “Test and Processing plan for default codec evaluation for speech enabled services (SES)“, SA4 8 TD SP-030440 “Recommendation Criteria for Default Codec for Speech Enabled Services (SES)“, TSG SA. 9 TD SP-020687 WID Codec Work to Suppo

    28、rt Speech Recognition Framework for Automated Voice Services (Rel-6), TSG SA. 10 TD S4-030248 “Design Constraints for default codec for speech enabled services (SES)“, SA4. Note: Annex A lists all the key SA4 SES selection phase documents. Temporary Documents are attached to this specification in a

    29、separate .zip file. 3 Abbreviations For the purposes of the present document, the following abbreviations apply: AFE Advanced Front-end AMR Adaptive Multi-Rate AMR-NB AMR Narrowband AMR-WB AMR Wideband BLER Block Error Rate ETSI ETSI TR 126 943 V13.0.0 (2016-01)63GPP TR 26.943 version 13.0.0 Release

    30、 13DSR Distributed Speech Recognition EDGE Enhanced Data for GSM Evolution ETSI European Telecommunications Standards Institute GSM Global System for Mobile communications SES Speech Enabled Services SNR Signal To Noise Ratio VAD Voice Activity Detector X-AFE eXtended Advanced Front-end 4 General 4.

    31、1 Project History Table 1 below shows the progress and timeline of the project. In particular the creation of permanent documents; identification of candidate codecs and test organisations; running of the performance evaluations by test organisations; selection at SA4; verification; and the approval

    32、 of CRs and TS at SA. Key milestones are highlighted in bold. Table 1: SES project timeline Meeting Status of progress in activities SA4 #23 (30 Sept - 4 Oct 2002) square4 Draft WID and work plan SA4 #24 (11-15 Nov 2002) Permanent documents o Design Constraints V1.0 o Test & Processing Plan V0.8 o R

    33、ecommendation Criteria V0.1 Intermediate deadline on SA4 reflector 31.12.2002 Submission of specification of additional databases as candidate for testing as part of test and processing plan. Intermediate deadline on SA4 reflector 31.12.2002 square4 Any company which would possibly like to submit a

    34、candidate will indicate before 31.12.2002. Later indications will not be considered. SA4 #25 (20-24 Jan 2003) square4 List of testing organisations square4 Permanent documents o Design Constraints V1.1 o Test Plan & Processing Plan V1.0 o Recommendation Criteria V0.3 SA4 #25 bis (24-28 Feb 2003) squ

    35、are4 List of testing organisations (IBM & SpeechWorks) square4 List of candidate codecs (DSR X-AFE & AMR-NB/AMR-WB) square4 Permanent documents o Design Constraints V2.0 o Test Plan & Processing Plan V1.3 ETSI ETSI TR 126 943 V13.0.0 (2016-01)73GPP TR 26.943 version 13.0.0 Release 13o Recommendation

    36、 Criteria V0.3 SA4 SQ SES ad-hoc 1-2 April 2003 Basingstoke, UK square4 Permanent documents o Test & Processing Plan V1.4 o Recommendation Criteria V0.3 SA4 #26 (5-9 May 2003) square4 Permanent documents o Test & Processing Plan V2.0 o Recommendation Criteria V0.6 SA4 #27 (7-11 July 2003) Approval o

    37、f permanent docs o Test & Processing Plan V2.2 o Recommendation Criteria V2.0 ASR vendor evaluations start. Aug 2003 square4 ASR vendors start tests. Deliverables from candidates: (31 October 2003) square4 Fixed point complexity assessment square4 Drafts of new 3GPP TSs (for new codecs), or existing

    38、 specifications for information (codecs already in standards) square4 Justification document of having met the Design Constraints SA4 #29 (24-28 Nov 2003) square4 Preparation for verification square4 Agree verification plan by correspondence (19 Dec) square4 Complete any legal agreements (NDAs) that

    39、 are needed (15 Feb) square4 Verification labs to obtain any databases needed (15 Feb) Informative speech quality listening tests square4 Nokia and Ericsson to supply listening test speech files to Motorola (5thDec) square4 Motorola to process listening test speech files supplied by Nokia and Ericss

    40、on (15 Jan) square4 Nokia and Ericsson conduct listening tests Completion of ASR vendor evaluations (31 Jan 2004) square4 Results from ASR vendor evaluations to ETSI representative SA4 #30 (23-27 Feb 2004) SES Selection meeting square4 Results from evaluator tests available square4 Make recommendati

    41、on square4 Prepare TSs for approval SA#23 ETSI ETSI TR 126 943 V13.0.0 (2016-01)83GPP TR 26.943 version 13.0.0 Release 13square4 Prepare CRs for approval SA#23 SES Verification (1 March) square4 Verification of selected codec (ST-Micro). square4 Discussion of results of verification conference call

    42、March. SA #23 (15-17 March 2004) square4 TSs for information square4 CRs for information SA4 #31 (17-21 May 2004) Verification report SA #24 (7-10 June 2004) square4 TSs approval (TS 26.243) CRs approval (TS 26.235 & TS 26.236) 4.2 Overview of the speech recognition framework for automated voice ser

    43、vices work item The work item covered the evaluation of candidate codecs for use in a speech recognition framework for automated voice services. The 3GPP speech recognition framework enables the use of conventional codecs (e.g. AMR) or DSR optimised codecs to distribute in the network the speech eng

    44、ines that process speech input or generate speech output. The aim of the work item is, through objective evaluation, to recommend a single codec for speech enabled services based on a speech recognition framework. 4.3 Presentation of the following sections The following sections provide a summary of

    45、 the Selection Phase test results, including the results of the objective performance measurements, and a record of other relevant information for the selected candidate algorithm. - Section 5 describes the Recommendation Criteria defined for the Selection Phase - Section 6 defines the means used to

    46、 measure the performance of each of the candidates - Section 7 summarises the recognition evaluation results 5 Recommendation criteria 5.1 Overview The set of databases used for the evaluations are defined in the Test and Processing Plan 7. Each of these databases contains different types of speech

    47、material covering a variety of tasks, environments and languages. Recommendation was based on a score obtained from the recognition performance measured on each of these different databases. Section 5.3 describes how the scores from all the individual databases are combined using a weighting table.

    48、5.2 Scoring on individual databases For each database the reference performance is measured as the word error rate obtained from the ASR vendor“s system. This is the performance obtained from a state-of-the-art system from the ASR vendor assuming a transparent channel. The performance (word error ra

    49、te) on a given database is also measured with the ASR vendors system for a codec under test as described in the test and processing plan 7. ETSI ETSI TR 126 943 V13.0.0 (2016-01)93GPP TR 26.943 version 13.0.0 Release 13Scoring for tests performed with channel BLER were also computed in a similar way. Note that only BLER of 1% and 3% were considered as part of the recommendation criteria8. 5.3 Performance metric over all databases The overall performance was determined by averaging the absolute word error rate using the weightings


    注意事项

    本文(ETSI TR 126 943-2016 Digital cellular telecommunications system (Phase 2+) Universal Mobile Telecommunications System (UMTS) LTE Recognition performance evaluations of codecs for S.pdf)为本站会员(proposalcash356)主动上传,麦多课文档分享仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知麦多课文档分享(点击联系客服),我们立即给予删除!




    关于我们 - 网站声明 - 网站地图 - 资源地图 - 友情链接 - 网站客服 - 联系我们

    copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
    备案/许可证编号:苏ICP备17064731号-1 

    收起
    展开