The document presents a semi-automatic method for generating ground truth labels using unsupervised clustering combined with limited manual labeling, aimed at improving handwritten character recognition. It describes various feature representation techniques and evaluates the performance of clustering and labeling strategies on datasets like MNIST and Lampung. The results demonstrate competitive classification performance with minimal human intervention, emphasizing the efficiency of the proposed approach.