This document presents a novel method for the automatic evaluation of machine translation that relies on part-of-speech (POS) information rather than on synonyms or paraphrasing. The proposed metric, named hLEPOR, uses a weighted harmonic mean to score the similarity between machine translations and human references, and it correlates better with human judgments than established metrics such as BLEU and METEOR. The authors emphasize its potential for accurate and concise translation evaluation, while acknowledging limitations related to language dependency and parsing accuracy.
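To make the weighted harmonic mean concrete, here is a minimal Python sketch of how several sub-scores in [0, 1] could be combined into one value. The function name, the example scores, and the weights are illustrative assumptions; the summary does not specify the metric's actual factors or weight settings, so this should not be read as the authors' implementation.

```python
def weighted_harmonic_mean(scores, weights):
    """Combine scores using sum(w) / sum(w / s).

    `scores` and `weights` are hypothetical inputs for illustration:
    each score is a sub-metric value in [0, 1], each weight is >= 0.
    """
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("at least one weight must be positive")
    # A zero score with positive weight drives the harmonic mean to zero.
    if any(s == 0 and w > 0 for s, w in zip(scores, weights)):
        return 0.0
    return total_weight / sum(w / s for s, w in zip(scores, weights) if w > 0)


# Illustrative values only: three made-up sub-scores with made-up weights.
example = weighted_harmonic_mean([0.9, 0.8, 0.6], [2, 1, 1])
```

Unlike an arithmetic mean, the harmonic mean is pulled toward the lowest sub-score, so a candidate translation cannot offset one very weak factor with a strong one.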