This document surveys various cross-language information retrieval (CLIR) techniques, focusing on the process of retrieving documents in one language based on queries made in another. It emphasizes the importance of CLIR in today's multilingual internet landscape and categorizes the techniques into dictionary-based, corpus-based, machine translation-based, and ontology-based methods. The paper highlights advantages and disadvantages of each approach, aiming to improve information retrieval effectiveness across multiple languages.
Related topics: