The document discusses problems with Hadoop/MapReduce frameworks for processing big data. It proposes an alternative called TABID (Time-Aware Big Data) that uses data streaming sketches to allow jobs to process data in its natural time order across multiple cores. This helps address issues like MapReduce being limited to key-value operations and not handling heterogeneous workloads well. It provides examples of how TABID could detect superspreaders in network traffic using sketches more efficiently than MapReduce.
Related topics: