There is some debate in the community regarding the use of the DCT, instead of directly using the log Mel fiterbank features, particularly for deep neural network based acoustic models. Some research groups, like Google, use filterbanks (fbanks) while Kaldi mostly uses MFCCs, especially in its TDNN chain models. Since filterbank energies are correlated and cannot be used directly with a Gaussian mixture with diagonal covariance, we apply a discrete cosine transform (DCT) to decorrelate them.

Here is Dan Povey’s take on this:

The reason we use MFCC is because they are more easily compressible, being decorrelated; we dump them to disk with compression to 1 byte per coefficient. But we dump all the coefficients, so it’s equivalent to filterbanks times a full-rank matrix, no information is lost.

参考:A note on MFCCs and delta features (desh2608.github.io)

最新文章

  1. 【React】启动dva脚手架
  2. 使用duplicate target database ... from active database复制数据库
  3. web程序调试方法
  4. flask学习
  5. 六种排序的C++实现
  6. 关于三目运算符与if语句的效率与洛谷P2704题解
  7. 如何在IOS开发中在自己的framework中添加.bunble文件
  8. 服务器部署_linuix下 一台nginx 多域名
  9. KeyDown,KeyPress 和KeyUp
  10. PHP字符串替换函数strtr()
  11. 考试必备神器-真题园手机客户端Android版1.1正式上线啦,欢迎大家下载使用!
  12. PHP学习笔记7-JSON数据操作
  13. 转:C++与JAVA语言区别
  14. Java提取URL某个参数的值
  15. 转 Java虚拟机5:Java垃圾回收(GC)机制详解
  16. DefWindowProc是一个会产生消息的函数
  17. 《Linux内核分析》 第五节 扒开系统调用的三层皮(下)
  18. get the code of function in matlab
  19. IBM MQ + WebSphere + Spring JMS配置方法
  20. python中while循环和for循环的定义和详细的使用方法

热门文章

  1. Spring MVC 笔记2
  2. vite 运行或打包出现内存溢出的解决方案
  3. PHP 网页 apache24+php8 yii basic
  4. linux查看java堆栈信息_linux进程堆栈大小
  5. IsNotEmpty和isNotBlank的区别
  6. vscode plugin - jenkins jack使用方法
  7. 【DM论文阅读杂记】复杂社区网络
  8. Mysql之迂回连接术
  9. 莫凡Python 3
  10. Python列表等长度分割