The document discusses Southwest Power Pool's initial steps towards creating a data lake. It describes:
- Storing historical and real-time data in volumes that exceeded initial expectations, with around 50% of it used infrequently
- Conducting a proof-of-concept evaluation of three vendors for offloading the less frequently used data while keeping it accessible through SQL with minimal changes to existing queries
- Choosing BigInsights for its ability to meet these requirements, its support for existing Netezza functions, and its federated queries across Netezza and BigInsights
- The multi-phase vision to eventually incorporate more data types and workloads while improving performance, security, and governance
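
The offload-and-federate idea in the list above can be illustrated with a small, self-contained sketch. It does not use Netezza or BigInsights: two in-memory SQLite databases stand in for the warehouse and the offload store, and every table name, column name, and data value is made up for illustration. The point is only to show how a single view spanning both stores lets an existing query run with little or no change.

```python
import sqlite3


def build_demo() -> sqlite3.Connection:
    """Create a 'hot' store and a 'cold' store, plus one view over both.

    Assumption: SQLite is a stand-in here; the real systems named in the
    summary are not involved, and all names and rows are hypothetical.
    """
    conn = sqlite3.connect(":memory:")           # plays the warehouse
    conn.execute("ATTACH DATABASE ':memory:' AS cold")  # plays the offload store

    conn.execute("CREATE TABLE main.readings_hot (day TEXT, mw REAL)")
    conn.execute("CREATE TABLE cold.readings_archive (day TEXT, mw REAL)")

    # Recent, frequently used rows stay in the hot store.
    conn.executemany(
        "INSERT INTO main.readings_hot VALUES (?, ?)",
        [("2024-06-01", 410.0), ("2024-06-02", 395.5)],
    )
    # Older, rarely used rows are offloaded to the cold store.
    conn.executemany(
        "INSERT INTO cold.readings_archive VALUES (?, ?)",
        [("2019-06-01", 350.0)],
    )

    # A TEMP view may reference tables in any attached database, so it can
    # present both stores under the name existing queries already use.
    conn.execute(
        """
        CREATE TEMP VIEW readings AS
        SELECT day, mw FROM main.readings_hot
        UNION ALL
        SELECT day, mw FROM cold.readings_archive
        """
    )
    return conn


def query_all(conn: sqlite3.Connection) -> tuple:
    """Run an 'existing' query unchanged against the combined view."""
    return conn.execute("SELECT COUNT(*), MIN(day) FROM readings").fetchone()


if __name__ == "__main__":
    print(query_all(build_demo()))
```

In this sketch the query text never mentions where a row lives, which mirrors the stated goal of minimal changes to existing queries.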