The document discusses a study on improving automatic speech recognition (ASR) systems for the Dari language using deep neural networks. It employs feature extraction through Mel-frequency cepstral coefficients (MFCCs) and compares the performance of multiple deep learning models, achieving an impressive average accuracy of 98.365%. The research addresses challenges such as dialect variability and dataset scarcity, aiming to enhance interaction for Dari speakers with modern technology.
Related topics: