The paper presents an unsupervised algorithm for improving keyword extraction from multilingual texts by utilizing the average term frequency-inverse document frequency (tf-idf) across multiple languages. The proposed method evaluates keyword significance not only within the same language but against multilingual contexts, thereby enhancing extraction accuracy to 91.3%. The study outlines a structured approach of eight steps, starting from data retrieval to keyword evaluation, demonstrating the effectiveness of multilingual keyword extraction in data retrieval systems.
Related topics: