1. The document presents methods to reduce false positive variants in whole genome sequencing data, including logistic regression-based variant filtering and an ensemble approach using multiple variant calling algorithms.
2. The methods were tested on whole genome sequencing data from an extended family sequenced with two platforms. Variant calls agreed by both platforms were considered true positives to evaluate the methods.
3. The results show the proposed methods significantly reduced false negative rates compared to filtering based on genotype quality scores, while maintaining similar false discovery rates. The ensemble approach was especially effective at prioritizing true positive de novo mutations.