This document summarizes research on optimizing queries for big data analytics. It discusses how organizations use different databases with varied data models to store and query big data. The main focus is improving query performance by having a query framework that can detect optimized data copies created by data engineers and execute queries against these copies. The framework uses the Apache Calcite query optimizer which rewrites queries to use optimized copies when possible based on a cost model. An evaluation on real taxi trip data showed the approach improved query response times.