This document discusses a project that aims to extract semantic metadata from biodiversity literature through automatic text mining in order to enhance search capabilities. The project will transform the Biodiversity Heritage Library (BHL) into a next-generation digital library by applying techniques like text mining, machine learning, and social media to generate semantic annotations for entities, types, and relations. This semantic metadata will allow for more precise searching of BHL's collection compared to current keyword-based search, helping users discover relevant information despite ambiguity in searches.
Related topics: