The document summarizes efforts to expand the Genome in a Bottle (GIAB) small variant benchmark using long and linked reads. Key points:
1) PacBio CCS and 10X Genomics data were used to add variants to the benchmark, mostly in regions difficult to map with short reads. This expanded coverage of variants and reference bases.
2) An initial evaluation found the majority of false positives and false negatives in tested variant callsets were correct in the benchmark, suggesting errors were in the callsets rather than the benchmark.
3) Refinements to the benchmark were identified, including excluding certain regions, to improve accuracy for the next version. The expanded benchmark improves evaluation of variant callers in difficult genomic