Mário Almeida presented on making YARN highly available. YARN is not currently highly available as it has single points of failure. The presentation proposed storing application states in NDB MySQL Cluster to enable failure recovery. It described implementing an NDB state store for YARN and benchmarking it against HDFS and Zookeeper. Results showed NDB outperformed the others in throughput. Future work would implement a stateless architecture and study overhead of writing to NDB.
Related topics: