This document discusses moving from unstructured to structured data in journalism. It provides examples of tools and projects that use machine learning and data processing to help journalists report the news more efficiently. These include tools from the New York Times, BBC, and Washington Post that help with tasks like entity extraction and knowledge mapping. One example discussed in more detail is the processing of the Panama Papers leak, which involved sorting, indexing, and analyzing over 11 million documents to build a structured database for investigative reporting.
Related topics: