This document outlines tasks for a working group to analyze and integrate genomic data sets for the individual NA12878. The goals are to:
1. Develop a strategy to analyze individual data sets and a plan to integrate the data and form consensus variant calls and confidence estimates.
2. Inventory existing NA12878 data sources and characterize them.
3. Define quality filters for reads, runs, and lanes based on existing large-scale studies.
4. Compile the first set of filtered NA12878 data from various sources by the end of September.
5. Run existing variant calling pipelines on the compiled data set using references genomes.