This paper discusses speech emotion recognition (SER) technologies and compares the efficacy of various classification methods, including support vector machines (SVM), C5.0, and a combination of both (SVM-C5.0). The study analyzes emotional states using a range of features such as energy, zero crossing rate (ZCR), pitch, and mel-frequency cepstral coefficients (MFCC), demonstrating that the SVM-C5.0 method outperforms other methods by 5.5% to 8.9% depending on the number of emotion states. The findings also highlight the challenges in SER, emphasizing the importance of emotional databases and feature extraction in developing effective recognition systems.
Related topics: