This document presents an approach to automatic face naming from weakly labeled images, where the only supervision comes from the captions that accompany the images. Two methods are proposed for learning discriminative affinity matrices: regularized low-rank representation (RLRR) and ambiguously supervised structural metric learning (ASML). Both exploit the caption-based supervision, and the affinity matrices they produce are fused to improve the accuracy of face naming. Extensive experiments show that the proposed scheme outperforms existing methods on real-world datasets.
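
To make the fusion step concrete, the following is a minimal sketch of combining two affinity matrices. The text above does not state the fusion rule, so a simple convex combination is assumed here; the function name `fuse_affinities`, the weight `alpha`, and the toy matrices are hypothetical and chosen only for illustration.

```python
import numpy as np


def fuse_affinities(a_rlrr, a_asml, alpha=0.5):
    """Fuse two face-to-face affinity matrices by a weighted sum.

    Assumption: fusion is a convex combination controlled by `alpha`.
    This is an illustrative choice, not necessarily the rule used in
    the original work.
    """
    a_rlrr = np.asarray(a_rlrr, dtype=float)
    a_asml = np.asarray(a_asml, dtype=float)
    if a_rlrr.ndim != 2 or a_rlrr.shape[0] != a_rlrr.shape[1]:
        raise ValueError("affinity matrices must be square")
    if a_rlrr.shape != a_asml.shape:
        raise ValueError("affinity matrices must have the same shape")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")

    fused = (1.0 - alpha) * a_rlrr + alpha * a_asml
    # Symmetrize so that affinity(i, j) equals affinity(j, i).
    return 0.5 * (fused + fused.T)


# Toy affinities between three faces (made-up values).
a_rlrr = np.array([[1.0, 0.8, 0.1],
                   [0.8, 1.0, 0.2],
                   [0.1, 0.2, 1.0]])
a_asml = np.array([[1.0, 0.6, 0.3],
                   [0.6, 1.0, 0.0],
                   [0.3, 0.0, 1.0]])

fused = fuse_affinities(a_rlrr, a_asml, alpha=0.5)
```

With `alpha=0` the result reduces to the RLRR affinities alone and with `alpha=1` to the ASML affinities alone, so in this sketch the weight controls how much each source of evidence contributes.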