
    ETSI TS 126 094-2018 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE Mandatory speech codec speech processing func_1.pdf

    • Resource ID: 741743       File size: 281.29 KB        Pages: 26
    • Format: PDF

ETSI TS 126 094 V15.0.0 (2018-07)

Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; Mandatory speech codec speech processing functions; Adaptive Multi-Rate (AMR) speech codec; Voice Activity Detector (VAD) (3GPP TS 26.094 version 15.0.0 Release 15)

TECHNICAL SPECIFICATION

Reference: RTS/TSGS-0426094vf00
Keywords: GSM, LTE, UMTS

ETSI
650 Route des Lucioles, F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00  Fax: +33 4 93 65 47 16
Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88

Important notice

The present document can be downloaded from: http://www.etsi.org/standards-search

The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx

If you find errors in the present document, please send your comment to one of the following services: https://portal.etsi.org/People/CommiteeSupportStaff.aspx

Copyright Notification

No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media.

© ETSI 2018. All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are trademarks of ETSI registered for the benefit of its Members. 3GPP™ and LTE™ are trademarks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. oneM2M logo is protected for the benefit of its Members. GSM® and the GSM logo are trademarks registered and owned by the GSM Association.

Intellectual Property Rights

Essential patents

IPRs essential or potentially essential to normative deliverables may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (https://ipr.etsi.org/).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document.

Trademarks

The present document may include trademarks and/or tradenames which are asserted and/or registered by their owners. ETSI claims no ownership of these except for any which are indicated as being the property of ETSI, and conveys no right to use or reproduce any trademark and/or tradename. Mention of those trademarks in the present document does not constitute an endorsement by ETSI of products, services or organizations associated with those trademarks.

Foreword

This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP).

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables.

The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http://webapp.etsi.org/key/queryform.asp.

Modal verbs terminology

In the present document "shall", "shall not", "should", "should not", "may", "need not", "will", "will not", "can" and "cannot" are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions).

"must" and "must not" are NOT allowed in ETSI deliverables except when used in direct citation.

Contents

Intellectual Property Rights
Foreword
Modal verbs terminology
Foreword
1 Scope
2 References
3 Technical Description of VAD Option 1
3.1 Definitions, symbols and abbreviations
3.1.1 Definitions
3.1.2 Symbols
3.1.2.1 Variables
3.1.2.2 Constants
3.1.2.3 Functions
3.1.3 Abbreviations
3.2 General
3.3 Functional description
3.3.1 Filter bank and computation of sub-band levels
3.3.2 Pitch detection
3.3.3 Tone detection
3.3.4 Correlated Complex Signal Analysis (and detection)
3.3.5 VAD decision
3.3.5.1 Hangover addition

3.3.5.2 Background noise estimation
4 Technical Description of VAD Option 2
4.1 Definitions, symbols and abbreviations
4.1.1 Definitions
4.1.2 Symbols
4.1.2.1 Variables
4.1.2.2 Constants
4.1.2.3 Functions
4.1.3 Abbreviations
4.2 General
4.3 Functional description
4.3.1 Frequency Domain Conversion
4.3.2 Channel Energy Estimator
4.3.3 Channel SNR Estimator
4.3.4 Voice Metric Calculation
4.3.5 Frame SNR and Long-Term Peak SNR Calculation
4.3.6 Negative SNR Sensitivity Bias
4.3.7 VAD Decision
4.3.8 Spectral Deviation Estimator
4.3.9 Sinewave Detection
4.3.10 Background Noise Update Decision
4.3.10 Background Noise Estimate Update
5 Computational details
Annex A (informative): Change history
History

Foreword

This Technical Specification has been produced by the 3rd Generation Partnership Project (3GPP).

The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows:

Version x.y.z

where:

x  the first digit:
   1  presented to TSG for information;
   2  presented to TSG for approval;
   3  or greater indicates TSG approved document under change control.
y  the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc.
z  the third digit is incremented when editorial only changes have been incorporated in the document.
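
The version rule above can be read mechanically. The following is only an illustrative sketch, not part of the specification: the function name `describe_version` and the returned dictionary layout are hypothetical, and rejecting a first digit below 1 is an assumption, since the rule does not cover that case.

```python
def describe_version(version: str) -> dict:
    """Interpret a 3GPP-style version string "x.y.z" using the Foreword rule."""
    x, y, z = (int(part) for part in version.split("."))

    if x == 1:
        status = "presented to TSG for information"
    elif x == 2:
        status = "presented to TSG for approval"
    elif x >= 3:
        status = "TSG approved document under change control"
    else:
        # Assumption: the Foreword does not describe x < 1, so reject it here.
        raise ValueError("first digit below 1 is not covered by the rule")

    return {
        "status": status,
        "substantive_changes": y,  # incremented for changes of substance
        "editorial_changes": z,    # incremented for editorial-only changes
    }
```

For the version of the present document, `describe_version("15.0.0")` reports an approved document under change control with both change counters at zero.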

1 Scope

The present document specifies two alternatives for the Voice Activity Detector (VAD) to be used in the Discontinuous Transmission (DTX) as described in [3]. Implementors of mobile station and infrastructure equipment conforming to the AMR specifications can choose which of the two VAD options to implement. There are no interoperability factors associated with this choice.

The requirements are mandatory on any VAD to be used either in User Equipment (UE) or Base Station Systems (BSSs) that utilize the AMR speech codec.

2 References

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.

- References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific.
- For a specific reference, subsequent revisions do not apply.
- For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.

[1] 3GPP TS 26.073: "Adaptive Multi-Rate (AMR); ANSI C source code".
[2] 3GPP TS 26.090: "Transcoding functions".
[3] 3GPP TS 26.093: "Source Controlled Rate operation".
[4] ITU, The International Telecommunications Union, Blue Book, Vol. III, Telephone Transmission Quality, IXth Plenary Assembly, Melbourne, 14-25 November, 1988, Recommendation G.711, Pulse code modulation (PCM) of voice frequencies.

3 Technical Description of VAD Option 1

3.1 Definitions, symbols and abbreviations

3.1.1 Definitions

For the purposes of the present document, the following terms and definitions apply:

frame: time interval of 20 ms corresponding to the time segmentation of the speech transcoder
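
To make the frame definition concrete, here is a minimal sketch of cutting a sample buffer into 20 ms frames. It assumes the 160-sample frame length that the constants list of this document gives as L_FRAME, which together with the 20 ms duration implies 8000 samples per second. The helper names `sample_rate_hz` and `split_into_frames` are hypothetical, and dropping a trailing partial frame is an assumption made only for this illustration.

```python
L_FRAME = 160   # samples per speech frame, as listed among the constants
FRAME_MS = 20   # frame duration in milliseconds, from the definition above


def sample_rate_hz() -> int:
    """Sampling rate implied by 160 samples per 20 ms frame."""
    return L_FRAME * 1000 // FRAME_MS


def split_into_frames(samples: list) -> list:
    """Return consecutive L_FRAME-sample frames.

    Assumption: a trailing partial frame is dropped rather than padded.
    """
    last_start = len(samples) - L_FRAME
    return [samples[i:i + L_FRAME] for i in range(0, last_start + 1, L_FRAME)]
```

With 400 input samples this yields two complete frames and discards the remaining 80 samples.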

3.1.2 Symbols

For the purposes of the present document, the following symbols apply.

3.1.2.1 Variables

bckr_est[n]: background noise estimate
burst_count: counts length of a speech burst, used by VAD hangover addition
hang_count: hangover counter, used by VAD hangover addition
complex_hang_count: hangover counter, used by CAD hangover addition
complex_hang_timer: hangover initiator, used for Complex Activity Estimation
lagcount: pitch detection counter
level[n]: signal level
new_speech: pointer of the speech encoder, points to a buffer containing the last received samples of a speech frame [2]

noise_level: average level of the background noise estimate
oldlagcount: lagcount of the previous frame
pitch: flag indicating presence of a periodic signal
complex_warning: flag indicating the presence of a complex signal
best_corr_hp: normalized and limited value from maximum HP filtered correlation vector
corr_hp: filtered best_corr_hp values
pow_sum: power of the input frame
s(i): samples of the input frame
snr_sum: measure between input frame and noise estimate
stat_count: stationarity counter
stat_rat: measure indicating stationarity
T_op[n]: open-loop lags [2]
t0: autocorrelation maxima calculated by the open-loop pitch analysis [2]
t1: signal power related to the autocorrelation maxima t0 [2]
tone: flag indicating the presence of a tone
vad_thr: VAD threshold
VAD_flag: Boolean VAD flag
vadreg: intermediate VAD decision
complex_low: intermediate complex signal decisions
complex_high: intermediate complex signal decisions

3.1.2.2 Constants

ALPHA_UP1: constant for updating noise estimate (see clause 3.3.5.2)
ALPHA_DOWN1: constant for updating noise estimate (see clause 3.3.5.2)
ALPHA_UP2: constant for updating noise estimate (see clause 3.3.5.2)
ALPHA_DOWN2: constant for updating noise estimate (see clause 3.3.5.2)
ALPHA3: constant for updating noise estimate (see clause 3.3.5.2)
ALPHA4: constant for updating average signal level (see clause 3.3.5.2)
ALPHA5: constant for updating average signal level (see clause 3.3.5.2)
BURST_LEN_HIGH_NOISE: constant for controlling VAD hangover addition (see clause 3.3.5.1)
BURST_LEN_LOW_NOISE: constant for controlling VAD hangover addition (see clause 3.3.5.1)
COEFF3: coefficient for the filter bank (see clause 3.3.1)
COEFF5_1: coefficient for the filter bank (see clause 3.3.1)
COEFF5_2: coefficient for the filter bank (see clause 3.3.1)
HANG_LEN_HIGH_NOISE: constant for controlling VAD hangover addition (see clause 3.3.5.1)
HANG_LEN_LOW_NOISE: constant for controlling VAD hangover addition (see clause 3.3.5.2)
HANG_NOISE_THR: constant for controlling VAD hangover addition (see clause 3.3.5.2)
L_FRAME: size of a speech frame, 160
L_NEXT: length for the lookahead of the speech encoder, 40
LTHRESH: threshold for pitch detection (see clause 3.3.2)
NOISE_MAX: maximum value for noise estimate (see clause 3.3.5.2)
NOISE_MIN: minimum value for noise estimate (see clause 3.3.5.2)
NTHRESH: threshold for pitch detection (see clause 3.3.2)
POW_PITCH_THR: threshold for pitch detection (see clause 3.3.5)
POW_COMPLEX_THR: threshold for complex detection (see clause 3.3.5)
STAT_COUNT: threshold for stationary detection (see clause 3.3.5.2)
CAD_MIN_STAT_COUNT: minimum threshold after complex warning
STAT_THR: threshold for stationary detection (see clause 3.3.5.2)
STAT_THR_LEVEL: threshold for stationary detection (see clause 3.3.5.2)
TONE_THR: threshold for tone detection (see clause 3.3.3)
VAD_P1: constant of computation for VAD threshold (see clause 3.3.5.2)
VAD_POW_LOW: constant for controlling VAD hangover addition (see clause 3.3.5.1)
VAD_SLOPE: constant of computation for VAD threshold (see clause 3.3.5)
VAD_THR_HIGH: constant of computation for VAD threshold (see clause 3.3.5)
CVAD_THRESH_ADAPT_HIGH: constant for updating complex_high
CVAD_THRESH_ADAPT_LOW: constant for updating complex_low
CVAD_THRESH_HANG: constant for updating complex_hang_timer
CVAD_HANG_LIMIT: constant for initiating complex_hang_count
CVAD_HANG_LENGTH: constant for resetting complex_hang_count

3.1.2.3 Functions

+: addition
-: subtraction
*: multiplication
/: division
| x |: absolute value of x
AND: Boolean AND
OR: Boolean OR
Summation: $\sum_{n=a}^{b} x(n) = x(a) + x(a+1) + \dots + x(b-1) + x(b)$
MIN(x,y): $\mathrm{MIN}(x,y) = \begin{cases} x, & x \le y \\ y, & y < x \end{cases}$
MAX(x,y): $\mathrm{MAX}(x,y) = \begin{cases} x, & x \ge y \\ y, & y > x \end{cases}$

3.1.3 Abbreviations

For the purposes of the present document, the following abbreviations apply:

ANSI: American National Standards Institute
DTX: Discontinuous Transmission
VAD: Voice Activity Detector
CAD: Complex Activity Detection
CNG: Comfort Noise Generation

3.2 General

The function of the VAD algorithm is to indicate whether each 20 ms frame contains signals that should be transmitted, i.e. speech, music or information tones. The output of the VAD algorithm is a Boolean flag (VAD_flag) indicating presence of such signals.

3.3 Functional description

The block diagram of the VAD algorithm is depicted in figure 1. The VAD algorithm uses parameters of the speech encoder to compute the Boolean VAD flag (VAD_flag).

Samples of the input frame (s(i)) are divided into sub-bands, and the level of the signal in each band (level[n]) is calculated.

The inputs to the pitch detection function are the open-loop lags (T_op[n]), which are calculated by the open-loop pitch analysis of the speech encoder. The pitch detection function computes a flag (pitch) which indicates presence of pitch.

The tone detection function calculates a flag (tone), which indicates presence of an information tone. Tones are detected based on the pitch gain of the open-loop pitch analysis. The pitch gain is estimated using autocorrelation values (t0 and t1) received from the pitch analysis.

The Complex Signal Detection function calculates a flag (complex_warning), which indicates presence of a correlated complex signal such as music. Correlated complex signals are detected based on analysis of the correlation vector available in the open-loop pitch analysis.

The VAD decision function estimates background noise leve
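
The tone detection idea described above (a flag derived from a pitch gain that is estimated from t0 and t1) can be sketched as follows. This is a hedged illustration only: treating the pitch gain as the ratio t0 / t1 is an assumption drawn from the variable descriptions (t0 as the autocorrelation maximum, t1 as the related signal power), the function names are hypothetical, and the default threshold is a placeholder that merely stands in for TONE_THR, whose value is not given in this part of the text.

```python
def pitch_gain(t0: float, t1: float) -> float:
    """Estimate pitch gain as autocorrelation maximum over signal power.

    Assumption: gain = t0 / t1; a non-positive power yields a gain of 0.0.
    """
    if t1 <= 0.0:
        return 0.0
    return t0 / t1


def tone_flag(t0: float, t1: float, tone_thr: float = 0.65) -> bool:
    """Set the tone flag when the estimated pitch gain exceeds the threshold.

    tone_thr is a placeholder for TONE_THR, not a value taken from the text.
    """
    return pitch_gain(t0, t1) > tone_thr
```

A strongly periodic frame, where t0 is close to t1, sets the flag, while a frame with weak correlation or zero power does not.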

