The paper reviews the development of speech corpora for automatic speech recognition in Indian languages, highlighting the scarcity of resources compared with English. It discusses projects and initiatives across India aimed at creating speech databases in languages such as Hindi, Marathi, Kannada, and Punjabi. The paper emphasizes the importance of these resources for advancing language technology and calls for a centralized authority to manage and distribute the corpora.