Speech Emotion Recognition: A Survey

AUTHORS

Swarna Kuchibhotla, Associate Professor, Department of CSE, Koneru Lakshmaiah Education Foundation, Guntur

ABSTRACT

Speech recognition, the identification and conversion of spoken words into text, is a long-established application whose performance and quality have improved considerably. These gains have led to research on emotion recognition from spoken language, that is, determining the emotion conveyed by an utterance, with applications in human-robot interaction. Emotions can be recognized more effectively by combining speech processing, artificial intelligence techniques, and linguistic semantics. Systems are trained to detect emotions from spoken utterances. This paper surveys work on speech emotion recognition from its early history to the present, including experimental results. The survey covers the work of different researchers and the features, classifiers, and other components they used. It is organized into three categories: the databases involved, the features used to represent speech, and the classification schemes. The survey also presents conclusions on the performance and limitations of current speech emotion recognition.

 

KEYWORDS

Emotion, Features, Classification Techniques, Speech Recognition

CITATION

  • APA:
    Kuchibhotla, S. (2019). Speech Emotion Recognition: A Survey. International Journal of Multimedia and Ubiquitous Engineering, 14(2), 15-22. http://dx.doi.org/10.21742/IJMUE.2019.14.2.03
  • Harvard:
    Kuchibhotla, S. (2019). "Speech Emotion Recognition: A Survey". International Journal of Multimedia and Ubiquitous Engineering, 14(2), pp.15-22. doi:http://dx.doi.org/10.21742/IJMUE.2019.14.2.03
  • IEEE:
    [1] S. Kuchibhotla, "Speech Emotion Recognition: A Survey", International Journal of Multimedia and Ubiquitous Engineering, vol.14, no.2, pp.15-22, Nov. 2019
  • MLA:
    Kuchibhotla, Swarna. "Speech Emotion Recognition: A Survey". International Journal of Multimedia and Ubiquitous Engineering, vol.14, no.2, Nov. 2019, pp.15-22, doi:http://dx.doi.org/10.21742/IJMUE.2019.14.2.03

ISSUE INFO

  • Volume 14, No. 2, 2019
  • ISSN(p):1975-0080
  • ISSN(o):2652-1954
  • Published:Nov. 2019
