Apache Flume is a tool for collecting large amounts of streaming data from various sources and transporting it to a centralized data store like HDFS. It reliably delivers events from multiple data sources to destinations such as HDFS or HBase. Flume uses a simple and flexible architecture based on streaming data flows, with reliable delivery of events guaranteed through a system of agents, channels, and sinks.