Log-Mel Spektrogramlari ve MFCC Özellikleri Kullanilarak Konuşma Tabanli Duygu Analizi

Translated title of the contribution: Speech-Based Emotion Analysis Using Log-Mel Spectrograms and MFCC Features

Ahmet Kemal Yetkin, Hatice Köse

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study proposes a method for recognizing emotions from speech using Mel spectrograms and MFCC features which capture the spectral features of speech signals. To predict emotions from the extracted features from the dataset, Convolutional Neural Networks (CNNs) and finetune pre-trained models are used. Pre-trained models are fine-tuned with some optimizations and one-dimensional convolutional neural network is constructed. The results demonstrate that the proposed method achieved an accuracy rate of over 80% in predicting emotions from speech and show the effectiveness of the approach in a comparative manner.

Translated title of the contributionSpeech-Based Emotion Analysis Using Log-Mel Spectrograms and MFCC Features
Original languageTurkish
Title of host publication31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350343557
DOIs
Publication statusPublished - 2023
Event31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023 - Istanbul, Turkey
Duration: 5 Jul 20238 Jul 2023

Publication series

Name31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023

Conference

Conference31st IEEE Conference on Signal Processing and Communications Applications, SIU 2023
Country/TerritoryTurkey
CityIstanbul
Period5/07/238/07/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Fingerprint

Dive into the research topics of 'Speech-Based Emotion Analysis Using Log-Mel Spectrograms and MFCC Features'. Together they form a unique fingerprint.

Cite this