This document summarizes Matt Lease's research into crowdsourcing at the University of Texas at Austin. It discusses how crowdsourcing platforms like Amazon Mechanical Turk have been used for labeling data and user studies. It also addresses some of the challenges with crowdsourcing like workflow design, handling sensitive data, regulation, fraud and ethics. Finally, it outlines several of Lease's studies on improving data quality when using noisy crowd labels for tasks like classification, ranking and repeated labeling.
Related topics: