os big data distributed systems spark introduction process management graph processing process synchronization cpu scheduling nosql main memory virtual memory graphx file system implementation mapreduce p2p spark stream consensus paxos cloud scala rdd spanner hive shark newsql megastore nosql databases dynamo gfs distributed file system dht gossip epidemic algorithms chubby clock spotify bittorrent cdn cloud computing nist hdfs flink stratosphere security protection linux module io file system interface storage deadlocks threads mesos resource management yarn powergraph graphlab dstream seep stream processing time
See more