This document proposes a project called TIE (Text Information Exploitation) to go beyond information extraction from text and towards exploiting text to deduce new knowledge. The objectives are to aggregate extracted information to find causal links and other insights not explicitly stated in source texts. The methodology involves using domain ontologies to integrate information extraction techniques from real-world texts. The goal is to semi-automate knowledge discovery for applications like analyzing business reports. The project brings together experts in knowledge acquisition, computational linguistics, machine learning and information retrieval to address open research issues and apply text mining to scenario of mining annual reports.
Related topics: