Rackspace generates hundreds of gigabytes of email log data daily from over 600 servers. They struggled to efficiently store and query this data using various relational database approaches. They implemented a Hadoop/MapReduce system where logs are streamed to HDFS in real-time and indexed using Lucene/Solr. MapReduce jobs run every 10 minutes to build indexes, which are compressed and stored in HDFS. Solr instances then merge and serve indexes, enabling fast log searching within hours. This scalable system meets their needs to troubleshoot issues and gain insights from massive and growing log data.