A heuristic method for fast and accurate phasing and imputation of single-nucleotide polymorphism data in bi-parental plant populations.

Journal: TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik
Published Date:

Abstract

Key message: A new, fast and accurate method for phasing and imputation of SNP chip genotypes within diploid bi-parental plant populations.

This paper presents a new heuristic method for phasing and imputation of genomic data in diploid plant species. Our method, called AlphaPlantImpute, explicitly leverages features of plant breeding programmes to maximise the accuracy of imputation. These features are a small number of parents, which can be inbred and usually have high-density genomic data, and few recombinations separating parents from focal individuals genotyped at low density (i.e. descendants that are the imputation targets). AlphaPlantImpute works in roughly three steps. First, it identifies informative low-density genotype markers in parents. Second, it tracks the inheritance of parental alleles and haplotypes to focal individuals at informative markers. Finally, it uses this low-density information as anchor points to impute focal individuals to high density. We tested the imputation accuracy of AlphaPlantImpute in simulated bi-parental populations across different scenarios. We also compared its accuracy to that of existing software called PlantImpute. In general, the imputation accuracy of AlphaPlantImpute was equal to or better than that of PlantImpute. The computational time and memory requirements of AlphaPlantImpute were tiny compared to those of PlantImpute. For example, accuracy of imputation was 0.96 for a scenario where both parents were inbred and genotyped at 25,000 markers per chromosome and a focal F individual was genotyped with 50 markers per chromosome. This scenario had a maximum memory requirement of 0.08 GB and took 37 s to complete.
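The three steps described in the abstract can be illustrated with a deliberately simplified sketch. This is not the AlphaPlantImpute implementation. It assumes two fully inbred parents coded 0 or 2 at every high-density marker, a single focal individual coded 0, 1 or 2 at a low-density subset of markers, no genotyping errors, and a naive rule that leaves a marker unimputed when its flanking anchors disagree. All function and variable names are hypothetical.

```python
from typing import Dict, List, Optional

def informative_markers(parent_a: List[int], parent_b: List[int],
                        low_density_idx: List[int]) -> List[int]:
    """Step 1 (sketch): low-density markers where the inbred parents differ."""
    return [i for i in low_density_idx if parent_a[i] != parent_b[i]]

def parent_b_copies(parent_b: List[int], focal_low: Dict[int, int],
                    anchors: List[int]) -> Dict[int, int]:
    """Step 2 (sketch): copies of the parent-B allele (0, 1 or 2) per anchor."""
    copies = {}
    for i in anchors:
        g = focal_low.get(i)
        if g is None:          # focal genotype missing at this marker
            continue
        # If parent B is 2 the dosage counts B alleles directly, else invert it.
        copies[i] = g if parent_b[i] == 2 else 2 - g
    return copies

def impute(parent_a: List[int], parent_b: List[int],
           copies: Dict[int, int]) -> List[Optional[int]]:
    """Step 3 (sketch): fill high-density markers from flanking anchors."""
    anchors = sorted(copies)
    imputed: List[Optional[int]] = []
    for m in range(len(parent_a)):
        if parent_a[m] == parent_b[m]:
            # Parents identical: the genotype is known whatever was inherited.
            imputed.append(parent_a[m])
            continue
        left = max((i for i in anchors if i <= m), default=None)
        right = min((i for i in anchors if i >= m), default=None)
        if left is None and right is None:
            k = None
        elif left is None:
            k = copies[right]
        elif right is None:
            k = copies[left]
        else:
            # Disagreeing anchors suggest a recombination in between.
            k = copies[left] if copies[left] == copies[right] else None
        if k is None:
            imputed.append(None)
        else:
            imputed.append(((2 - k) * parent_a[m] + k * parent_b[m]) // 2)
    return imputed

def impute_focal(parent_a, parent_b, focal_low):
    """Run the three sketch steps for one focal individual."""
    anchors = informative_markers(parent_a, parent_b, sorted(focal_low))
    copies = parent_b_copies(parent_b, focal_low, anchors)
    return impute(parent_a, parent_b, copies)

# Toy data: 8 high-density markers, focal genotyped at markers 0, 3, 6 and 7.
parent_a = [0, 0, 2, 0, 2, 0, 0, 2]
parent_b = [2, 0, 0, 2, 2, 2, 0, 0]
focal_low = {0: 2, 3: 2, 6: 0, 7: 1}
print(impute_focal(parent_a, parent_b, focal_low))
```

In this toy example, marker 6 is not informative because both parents carry the same genotype, so only markers 0, 3 and 7 act as anchors. Marker 5 lies between anchors that imply different numbers of parent-B alleles, so the sketch leaves it missing rather than guessing where the recombination occurred.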

Authors

  • Serap Gonen
    The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK.
  • Valentin Wimmer
    KWS SAAT SE, Grimsehlstr. 31, 37574, Einbeck, Germany.
  • R Chris Gaynor
    The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK.
  • Ed Byrne
    KWS-UK Ltd, 56 Church Street, Thriplow, Hertfordshire, SG8 7RE, UK.
  • Gregor Gorjanc
    The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK.
  • John M Hickey
    The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK. john.hickey@roslin.ed.ac.uk.