DIADEM is an unsupervised web data extraction system that uses domain-specific annotators and ontologies to extract structured data from websites. It analyzes pages to discover templates and uses machine learning to classify elements. The system includes OPAL for analyzing forms, AMBER for extracting records from pages, and OXPath for navigating websites through form filling. A prototype of DIADEM was able to generate 15k-150k facts per page within 30-60 seconds by leveraging over 500 rules and 200 predicates.