Özet
This study proposes a method for recognizing emotions from speech using Mel spectrograms and MFCC features which capture the spectral features of speech signals. To predict emotions from the extracted features from the dataset, Convolutional Neural Networks (CNNs) and finetune pre-trained models are used. Pre-trained models are fine-tuned with some optimizations and one-dimensional convolutional neural network is constructed. The results demonstrate that the proposed method achieved an accuracy rate of over 80% in predicting emotions from speech and show the effectiveness of the approach in a comparative manner.
| Tercüme edilen katkı başlığı | Speech-Based Emotion Analysis Using Log-Mel Spectrograms and MFCC Features |
|---|---|
| Orijinal dil | Türkçe |
| Ana bilgisayar yayını başlığı | 31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023 |
| Yayınlayan | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Elektronik) | 9798350343557 |
| DOI'lar | |
| Yayın durumu | Yayınlandı - 2023 |
| Etkinlik | 31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023 - Istanbul, Türkiye Süre: 5 Tem 2023 → 8 Tem 2023 |
Yayın serisi
| Adı | 31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023 |
|---|
???event.eventtypes.event.conference???
| ???event.eventtypes.event.conference??? | 31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023 |
|---|---|
| Ülke/Bölge | Türkiye |
| Şehir | Istanbul |
| Periyot | 5/07/23 → 8/07/23 |
Bibliyografik not
Publisher Copyright:© 2023 IEEE.
Keywords
- MFCC
- log-Mel spectrogram
- machine learning
- neural networks
- speech emotion recognition
Parmak izi
Log-Mel Spektrogramlari ve MFCC Özellikleri Kullanilarak Konuşma Tabanli Duygu Analizi' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.Alıntı Yap
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver