The document discusses an approach called TableMiner for semantic interpretation of tables using partial data. TableMiner performs two tasks: column classification and cell disambiguation. It uses incremental inference on a sample of the table data rather than requiring all data. Several sample selection methods are evaluated, including sorting by name length, duplicate content, and feature representation size. The results show TableMiner can achieve comparable accuracy to using all data, while processing less data and reducing the search space for disambiguation. This demonstrates machines can learn effectively from partial data samples for semantic table interpretation.
Related topics: