This paper discusses the identification and classification of named entities (NER) in Indian languages, specifically Hindi, Bengali, and Telugu, using the Hidden Markov Model (HMM). The authors present their results, including accuracy and performance metrics, highlighting the challenges of performing NER in Indian languages compared to English, due to factors like inflection and resource availability. The study concludes that with increased training, the accuracy of NER improves significantly, achieving F-measure scores of 96%, 98%, and 98.6% for Hindi, Bengali, and Telugu respectively.
Related topics: