欢迎来到麦多课文档分享! | 帮助中心 海量文档,免费浏览,给你所需,享你所想!
麦多课文档分享
全部分类
  • 标准规范>
  • 教学课件>
  • 考试资料>
  • 办公文档>
  • 学术论文>
  • 行业资料>
  • 易语言源码>
  • ImageVerifierCode 换一换
    首页 麦多课文档分享 > 资源分类 > PPT文档下载
    分享到微信 分享到微博 分享到QQ空间

    AudioDB- Scalable approximate nearest-neighbor search with .ppt

    • 资源ID:378713       资源大小:1.64MB        全文页数:30页
    • 资源格式: PPT        下载积分:2000积分
    快捷下载 游客一键下载
    账号登录下载
    微信登录下载
    二维码
    微信扫一扫登录
    下载资源需要2000积分(如需开发票,请勿充值!)
    邮箱/手机:
    温馨提示:
    如需开发票,请勿充值!快捷下载时,用户名和密码都是您填写的邮箱或者手机号,方便查询和重复下载(系统自动生成)。
    如需开发票,请勿充值!如填写123,账号就是123,密码也是123。
    支付方式: 支付宝扫码支付    微信扫码支付   
    验证码:   换一换

    加入VIP,交流精品资源
     
    账号:
    密码:
    验证码:   换一换
      忘记密码?
        
    友情提示
    2、PDF文件下载后,可能会被浏览器默认打开,此种情况可以点击浏览器菜单,保存网页到桌面,就可以正常下载了。
    3、本站不支持迅雷下载,请使用电脑自带的IE浏览器,或者360浏览器、谷歌浏览器下载即可。
    4、本站资源下载后的文档和图纸-无水印,预览文档经过压缩,下载后原文更清晰。
    5、试题试卷类文档,如果标题没有明确说明有答案则都视为没有答案,请知晓。

    AudioDB- Scalable approximate nearest-neighbor search with .ppt

    1、Thursday, November 13, 2008,ASA 156: Statistical Approaches for Analysis of Music and Speech Audio Signals,AudioDB: Scalable approximate nearest-neighbor search with automatic radius-bounded indexing,Michael A. Casey Digital Musics Dartmouth College, Hanover, NH,Scalable Similarity,8M tracks in comm

    2、ercial collection PByte of multimedia data Require passage-level retrieval ( 2 bars) Require scalable nearest-neighbor methods,Specificity,Partial track retrieval Alternate versions: remix, cover, live, album Task is mid-high specificity,Example: remixing,Original Track Remix 1 Remix 2 Remix 3,Audio

    3、 Shingles, concatenate l frames of m dimensional features,A shingle is defined as:,Shingles provide contextual information about features Originally used for Internet search engines: Andrei Z. Broder, Steven C. Glassman, Mark S. Manasse, Geoffrey Zweig: “Syntactic Clustering of the Web”. Computer Ne

    4、tworks 29(8-13): 1157-1166 (1997) Related to N-grams, overlapping sequences of featuresApplied to audio domain by Casey and Slaney : Casey, M. Slaney, M. “The Importance of Sequences in Musical Similarity”, in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, 2006. ICASSP 2006,Audio

    5、Shingle Similarity,Audio Shingle Similarity, a query shingle drawn from a query track Q, database of audio tracks indexed by (n), a database shingle from track n,Shingles are normalized to unit vectors, therefore:,For shingles with M dimensions (M=l.m); m=12, 20; l=30,40,Open source: google: “audioD

    6、B” Management of tracks, sequences, salience Automatic indexing parameters OMRAS2, Yahoo!, AWAL, CHARM, more Web-services interface (SOAP / JSON) Implementation of LSH for large N 1B 1-10 ms whole-track retrieval from 1B vectors,AudioDB: Shingle Nearest Neighbor Search,AudioDB: Shingle Nearest Neigh

    7、bor Search,Whole-track similarity,Often want to know which tracks are similar Similarity depends on specificity of task Distortion / filtering / re-encoding (high) Remix with new audio material (mid) Cover song: same song, different artist (mid),Whole-track resemblance: radius-bounded search,Compute

    8、 the number of shingle collisions between two tracks:,Whole-track resemblance: radius-bounded search,Compute the number of shingle collisions between two tracks:,Requires a threshold for considering shingles to be relatedNeed a way to estimate relatedness (threshold) for data set,Statistical approac

    9、hes to modeling distance distributions,Distribution of minimum distances,Database: 1.4 million shingles. The left bump is the minimum between 1000 randomly selected query shingles and this database. The right bump is a small sampling (1/98 000 000) of the full histogram of all distances.,Radius-boun

    10、ded retrieval performance: cover song (opus task),Performance depends critically on xthresh, the collision thresholdWant to estimate xthresh automatically from unlabelled data,Order Statistics,Minimum-value distribution is analytic Estimate the distribution parameters Substitute into minimum value d

    11、istribution Define a threshold in terms of FP rate This gives an estimate of xthresh,Estimating xthresh from unlabelled data,Use theoretical statistics Null Hypothesis: H0: shingles are drawn from unrelated tracks Assume elements i.i.d., normally distributed M dimensional shingles, d effective degre

    12、es of freedom: Squared distance distribution for H0,ML for background distribution,Likelihood for N data points (distances squared)d = effective degrees of freedomM = shingle dimensionality,Background distribution parameters,Likelihood for N data points (distances squared)d = effective degrees of fr

    13、eedomM = shingle dimensionality,Minimum value over N samples,Minimum value distribution of unrelated shingles,Estimate of xthresh, false positive rate,Unlabelled data experiment,Unlabelled data set Known to contain: cover songs (same work, different performer) Near duplicate recordings (misattributi

    14、on, encoding) Estimate background distance distribution Estimate minimum value distribution Set xthresh so FP rate is = 1% Whole-track retrieval based on shingle collisions,Cover song retrieval,Scaling,Locality sensitive hashing Trade-off approximate NN for time complexity 3 to 4 orders of magnitude

    15、 speed-up No noticeable degradation in performance For optimal radius threshold,LSH,Remix retrieval via LSH,Current deployment,Large commercial collections AWAL 100,000 tracks Yahoo! 2M+ tracks, related song classifier AudioDB: open-source, international consortium of developers Google: “audioDB”,Co

    16、nclusions,Radius-bounded retrieval model for tracks Shingles preserve temporal information, high d Implements mid-to-high specificity search Optimal radius threshold from order statistics null hypothesis: shingles are drawn from unrelated tracks LSH requires radius bound, automatic estimate Scales to 1B shingles+ using LSH,Thanks,Malcolm Slaney, Yahoo! Research Inc. Christophe Rhodes, Goldsmiths, U. of London Michela Magas, Goldsmiths, U. of London Funding: EPSRC: EP/E02274X/1,


    注意事项

    本文(AudioDB- Scalable approximate nearest-neighbor search with .ppt)为本站会员(ownview251)主动上传,麦多课文档分享仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知麦多课文档分享(点击联系客服),我们立即给予删除!




    关于我们 - 网站声明 - 网站地图 - 资源地图 - 友情链接 - 网站客服 - 联系我们

    copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
    备案/许可证编号:苏ICP备17064731号-1 

    收起
    展开