Abstract
Speech Emotion Recognition (SER) refers to accurately predicting human emotions from their speech. The ability to predict emotions through speech signals is a motivating factor in achieving Human-Computer Interaction (HCI). This paper contains a comparative study of the existing research on speech emotion models. It makes use of the RAVDESS and SAVEE dataset containing audio input. The study of speech emotion recognition is made on SVM, CNN, KNN, MLP, Decision Tree, XGBoost, and Random Forest models. This paper presents a comparative analysis of the models highlighting the accuracy, F1 Score, bar plots, and loss graphs of the same. The paper also highlights the significant future areas for study in speech emotion recognition.
Original language | English |
---|---|
Title of host publication | 2023 International Conference on Computational Intelligence, Communication Technology and Networking, CICTN 2023 |
Publisher | IEEE |
ISBN (Electronic) | 9798350338027 |
DOIs | |
Publication status | Published - 7 Jun 2023 |
Externally published | Yes |
Event | 2023 International Conference on Computational Intelligence, Communication Technology and Networking - Ghaziabad, India Duration: 20 Apr 2023 → 21 Apr 2023 http://www.cictn.abes.ac.in/ |
Conference
Conference | 2023 International Conference on Computational Intelligence, Communication Technology and Networking |
---|---|
Abbreviated title | CICTN |
Country/Territory | India |
City | Ghaziabad |
Period | 20/04/23 → 21/04/23 |
Internet address |