This document outlines a tutorial on data science for software engineering. It discusses several key issues:
1. The importance of talking to users to understand their needs and domain knowledge when conducting data science projects.
2. Challenges with software engineering data and models, including a lack of generalizability between projects and a need to understand the domain.
3. Best practices for data science projects, including suspecting the data quality, taking a cyclic approach of getting feedback from users, and automating analyses for repeatability.
The tutorial will cover additional topics like dealing with lack of local data, improving data and models, and privacy issues. The overall message is that success in applying data science to software engineering