Abstract
This study proposes a method for recognizing emotions from speech using Mel spectrograms and MFCC features which capture the spectral features of speech signals. To predict emotions from the extracted features from the dataset, Convolutional Neural Networks (CNNs) and finetune pre-trained models are used. Pre-trained models are fine-tuned with some optimizations and one-dimensional convolutional neural network is constructed. The results demonstrate that the proposed method achieved an accuracy rate of over 80% in predicting emotions from speech and show the effectiveness of the approach in a comparative manner.
Translated title of the contribution | Speech-Based Emotion Analysis Using Log-Mel Spectrograms and MFCC Features |
---|---|
Original language | Turkish |
Title of host publication | 31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9798350343557 |
DOIs | |
Publication status | Published - 2023 |
Event | 31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023 - Istanbul, Turkey Duration: 5 Jul 2023 → 8 Jul 2023 |
Publication series
Name | 31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023 |
---|
Conference
Conference | 31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023 |
---|---|
Country/Territory | Turkey |
City | Istanbul |
Period | 5/07/23 → 8/07/23 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.