Abstract
  1. Principal Mel-Spectrum Components (Feature)
  2. Temporal Pooling Functions (Model)
  3. Single Hidden Layer Neural Network, i.e. a Multi-Layer Perceptron, MLP (Classifier)

Audio Preprocessing
    Feature: PMSC (Principal Mel-Spectrum Components)
  1. Original Data: 30 s clips, 22.05 kHz, mono, WAV
  2. Process Steps:
    1. DFT (spectral domain)
      We compute DFTs over windows of 1024 samples on audio at 22.05 kHz (i.e. roughly 46 ms per window) with a frame step of 512 samples (50% overlap between consecutive windows).
    2. Mel-Compression
      We run the spectral amplitudes through a set of 256 mel-scaled triangular filters to obtain a set of spectral energy bands.
    3. Principal Component Analysis whitening (PCA whitening)
      We compute the principal components of a random sub-sample of the training set. To obtain features with unit variance, we divide each component by the square root of its eigenvalue (i.e. multiply it by the inverse square root of the eigenvalue). This is PCA whitening.
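The three preprocessing steps above can be sketched in NumPy as follows. This is a minimal illustration, not the authors' implementation: the Hann window, the mel formula (2595 * log10(1 + f/700)), the relative eigenvalue cutoff `eps`, and all function names are assumptions that do not come from the notes.

```python
import numpy as np

SR = 22050     # sample rate from the notes (22.05 kHz)
N_FFT = 1024   # DFT window length in samples (about 46 ms)
HOP = 512      # frame step in samples
N_MELS = 256   # number of mel-scaled triangular filters

def hz_to_mel(f):
    # Assumed mel formula; the notes do not say which variant is used.
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels=N_MELS, n_fft=N_FFT, sr=SR):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)  # DFT bin frequencies in Hz
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sr / 2.0), n_mels + 2))
    lo, mid, hi = edges[:-2, None], edges[1:-1, None], edges[2:, None]
    rising = (freqs - lo) / (mid - lo)
    falling = (hi - freqs) / (hi - mid)
    return np.clip(np.minimum(rising, falling), 0.0, None)  # (n_mels, n_bins)

def mel_spectrum(signal):
    """Steps 1 and 2: framed DFT amplitudes projected onto the mel filters."""
    n_frames = 1 + (len(signal) - N_FFT) // HOP
    idx = np.arange(N_FFT)[None, :] + HOP * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(N_FFT)          # Hann window is an assumption
    amplitudes = np.abs(np.fft.rfft(frames, axis=1))  # (n_frames, N_FFT // 2 + 1)
    return amplitudes @ mel_filterbank().T            # (n_frames, N_MELS)

def fit_pca_whitening(X, eps=1e-10):
    """Step 3: principal components scaled by 1 / sqrt(eigenvalue)."""
    mean = X.mean(axis=0)
    cov = np.cov(X - mean, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    keep = eigvals > eps * eigvals.max()   # drop numerically empty directions
    W = eigvecs[:, keep] / np.sqrt(eigvals[keep])
    return mean, W

def pmsc(X, mean, W):
    """Project mel-spectrum frames onto the whitened principal components."""
    return (X - mean) @ W
```

In use, `fit_pca_whitening` would be run once on mel-spectrum frames sampled from the training set, and `pmsc` would then be applied to every clip with the same `mean` and `W`.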
Model
    PFC (Pooled Features Classifier)
  1. Pooling Operation
    The model applies a given set of pooling functions (how many is left open in these notes) to the PMSC features over time windows, and sends the pooled features to a classifier: an MLP with one hidden layer of 2000 units, sigmoid activation, L2 weight decay and a cross-entropy cost.
  2. Classify
    Each pooling window is treated as one training example for the classifier. To classify a clip, the classifier's predictions are averaged over all the windows of that clip (the decision rule applied to the averaged prediction is not specified in these notes).
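A rough sketch of the pool-then-average flow described above. The pooling set (mean, max, min, variance), the window length of 64 frames, the sigmoid output layer, and every function name here are illustrative assumptions; the notes do not fix any of them, and the MLP weights are passed in rather than trained.

```python
import numpy as np

# Assumed pooling set; the notes leave the actual functions open.
POOLING_FUNCTIONS = (np.mean, np.max, np.min, np.var)

def pool_windows(features, window=64, step=64):
    """Cut (n_frames, n_dims) features into windows and pool each one over time."""
    pooled = []
    for start in range(0, len(features) - window + 1, step):
        chunk = features[start:start + window]
        # Each pooling function collapses the time axis; results are concatenated.
        pooled.append(np.concatenate([f(chunk, axis=0) for f in POOLING_FUNCTIONS]))
    return np.array(pooled)  # (n_windows, n_dims * len(POOLING_FUNCTIONS))

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def mlp_forward(x, W1, b1, W2, b2):
    """Single hidden layer with sigmoid units; sigmoid outputs are an assumption."""
    hidden = sigmoid(x @ W1 + b1)
    return sigmoid(hidden @ W2 + b2)

def predict_clip(features, params, window=64, step=64):
    """Average the per-window predictions to get one prediction per clip."""
    per_window = mlp_forward(pool_windows(features, window, step), *params)
    return per_window.mean(axis=0)
```

During training, each row returned by `pool_windows` would be one example that carries the label of its clip; `predict_clip` only covers the averaging step.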
Tasks
  1. Classification (train/test task)
    The MLP outputs an affinity prediction for each class (the pooling step treats each pooling window as a training example).
  2. Tagging
    1. Affinity
      The affinity scores for a song are thus directly the outputs of the MLP.
    2. Binary Classification
      Binarize the affinity scores with the threshold that maximizes the F1-score on the validation set.
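The threshold search for the binary case can be sketched as below, for a single tag. Trying every observed affinity as a candidate threshold and predicting positive when the affinity is at or above it are assumptions; the notes only say that the threshold maximizes F1 on the validation set. The function names are hypothetical.

```python
def f1_score(y_true, y_pred):
    """F1 for binary labels; returns 0.0 when there are no true positives."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def best_threshold(affinities, labels):
    """Scan each observed affinity as a threshold and keep the best F1."""
    best_t, best_f1 = None, -1.0
    for t in sorted(set(affinities)):
        preds = [a >= t for a in affinities]
        score = f1_score(labels, preds)
        if score > best_f1:
            best_t, best_f1 = t, score
    return best_t, best_f1

# Validation-set affinities and ground-truth labels for one tag (toy values).
threshold, f1 = best_threshold([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1])
```

The chosen `threshold` would then be applied unchanged to the test-set affinities for that tag.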
Tools
  1. Theano: Theano is a numerical computation library for Python. In Theano, computations are expressed using a NumPy-like syntax and compiled to run efficiently on either CPU or GPU architectures.
    

