This document presents an efficient feature selection model for Igbo text. It analyzes the structure of Igbo language and designs a feature selection system for an intelligent Igbo text-based system. The system collects Igbo text documents, performs preprocessing including normalization and tokenization, represents the text with n-gram models, and selects the most relevant features using mean TF-IDF. The results show that a bigram model best represents Igbo text given its compounding nature and provides more relevant features.
Related topics: