The document presents a framework for optimizing data systems and statistical methodologies in distributed data analysis, balancing four kinds of cost: computational, statistical, transmission, and infrastructure. It discusses how to integrate data system design with statistical analysis and frames the resulting trade-offs between efficiency and quality of inference as a multi-objective optimization problem. The workshop emphasizes collaboration across disciplines to address the complexities and realistic costs of scalable data analysis.