1. Region CP022323.1:983,532-1,047,672 in DA163 has a CNV.
2. This sample also has a good number of long inserts.
3. After repeat masking, are either the CNV or the long inserts affected?
4. For the long-insert reads, where are the ends located (in the current sample)?
5. Make sure that GATK does not consider soft-clipped bases.
6. Look into the BWA settings for soft clipping: either turn it off entirely or decrease the amount of soft clipping allowed.
7. Pull out heterozygous regions that lie between homozygous regions: do they show a CNV relative to their context?
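
A rough sketch for items 4 to 6: one possible way to count soft-clipped and long-insert reads inside the region, working directly on SAM text lines (for example, the output of `samtools view`). The function names and the `max_insert` cutoff of 1000 bp are assumptions made for illustration, not values from these notes; choose the cutoff from the library's actual insert-size distribution.

```python
import re

# One CIGAR operation: a length followed by an op code (SAM spec).
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def parse_region(region):
    """Split 'contig:start-end' (commas allowed) into (contig, start, end)."""
    contig, _, span = region.rpartition(":")
    start, end = (int(x.replace(",", "")) for x in span.split("-"))
    return contig, start, end


def soft_clips(cigar):
    """Return (leading, trailing) soft-clipped base counts for a CIGAR string."""
    # Hard clips (H) can sit outside soft clips, so drop them first.
    ops = [(int(n), op) for n, op in CIGAR_OP.findall(cigar) if op != "H"]
    if not ops:
        return 0, 0
    lead = ops[0][0] if ops[0][1] == "S" else 0
    trail = ops[-1][0] if len(ops) > 1 and ops[-1][1] == "S" else 0
    return lead, trail


def summarize(sam_lines, region, max_insert=1000):
    """Count reads starting in `region`, and how many are clipped or long-insert.

    max_insert is a hypothetical cutoff on |TLEN|, not taken from the notes.
    """
    contig, start, end = parse_region(region)
    counts = {"reads": 0, "soft_clipped": 0, "long_insert": 0}
    for line in sam_lines:
        if line.startswith("@"):  # skip SAM header lines
            continue
        f = line.rstrip("\n").split("\t")
        # SAM columns: 2 = RNAME, 3 = POS (1-based), 5 = CIGAR, 8 = TLEN
        rname, pos, cigar, tlen = f[2], int(f[3]), f[5], int(f[8])
        if rname != contig or not (start <= pos <= end) or cigar == "*":
            continue
        counts["reads"] += 1
        if any(soft_clips(cigar)):
            counts["soft_clipped"] += 1
        if abs(tlen) > max_insert:
            counts["long_insert"] += 1
    return counts
```

Running `summarize` once on the current alignments and again after changing the BWA clipping settings or applying repeat masking would give a quick before/after comparison for the CNV region.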