Hadoop is a framework for distributed storage and processing of large datasets across clusters of computers. It provides HDFS for distributed file storage and MapReduce for processing and analyzing data in parallel. Key attributes of Hadoop include fault tolerance, scalability, and distributed processing to allow for storage and analysis of very large datasets beyond the capabilities of single machines.