This paper presents a novel two-stage feature selection method for text categorization that utilizes information gain, principal component analysis (PCA), and genetic algorithms to optimize classification performance by reducing dimensionality. The proposed methodology ranks terms by their importance, applies feature selection and extraction techniques, and utilizes classifiers such as k-nearest neighbor (KNN) and C4.5 decision trees to analyze data sets. The approach improves computational efficiency and categorization accuracy by focusing only on the most relevant features, significantly reducing complexity.