论文笔记：语音情感识别（三）手工特征+CRNN

时间 2020-12-30

原文原文链接

论文笔记：语音情感识别（三）手工特征+CRNN 一：Emotion Recognition from Human Speech Using Temporal Information and Deep Learning（2018 InterSpeech）（1）分帧加窗，每一帧采用的特征向量为eGeMAPS特征集中的20个特征，每个utterance使用裁剪和padding的做法使得定长512帧，所