This document summarizes three temporal reasoning datasets: MCTACO, TORQUE, and TRACIE.
[1] MCTACO is a multiple choice dataset for temporal commonsense understanding with 13k question-answer pairs about the duration, order, typical time, frequency, and stationarity of events. It was created using crowdsourcing.
[2] TORQUE is a reading comprehension dataset with over 20k temporal ordering questions about events in text. It uses natural language to annotate relationships between events, addressing limitations of prior work. The questions were generated and answered through crowdsourcing.
[3] TRACIE focuses on implicit events and uses distant supervision to generate temporal relation instances between