HISS: Snakemake-based workflows for performing SMRT-RenSeq assembly, AgRenSeq and dRenSeq for the discovery of novel plant disease resistance genes

Abstract Background In the ten years since the initial publication of the RenSeq protocol, the method has proved to be a powerful tool for studying disease resistance in plants and providing target genes for breeding programmes. Since the initial publication of the methodology, it has continued to b...

Full description

Saved in:
Bibliographic Details
Main Authors: Thomas M. Adams, Moray Smith, Yuhan Wang, Lynn H. Brown, Micha M. Bayer, Ingo Hein
Format: Article
Language:English
Published: BMC 2023-05-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-023-05335-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823861591194468352
author Thomas M. Adams
Moray Smith
Yuhan Wang
Lynn H. Brown
Micha M. Bayer
Ingo Hein
author_facet Thomas M. Adams
Moray Smith
Yuhan Wang
Lynn H. Brown
Micha M. Bayer
Ingo Hein
author_sort Thomas M. Adams
collection DOAJ
description Abstract Background In the ten years since the initial publication of the RenSeq protocol, the method has proved to be a powerful tool for studying disease resistance in plants and providing target genes for breeding programmes. Since the initial publication of the methodology, it has continued to be developed as new technologies have become available and the increased availability of computing power has made new bioinformatic approaches possible. Most recently, this has included the development of a k-mer based association genetics approach, the use of PacBio HiFi data, and graphical genotyping with diagnostic RenSeq. However, there is not yet a unified workflow available and researchers must instead configure approaches from various sources themselves. This makes reproducibility and version control a challenge and limits the ability to perform these analyses to those with bioinformatics expertise. Results Here we present HISS, consisting of three workflows which take a user from raw RenSeq reads to the identification of candidates for disease resistance genes. These workflows conduct the assembly of enriched HiFi reads from an accession with the resistance phenotype of interest. A panel of accessions both possessing and lacking the resistance are then used in an association genetics approach (AgRenSeq) to identify contigs positively associated with the resistance phenotype. Candidate genes are then identified on these contigs and assessed for their presence or absence in the panel with a graphical genotyping approach that uses dRenSeq. These workflows are implemented via Snakemake, a python-based workflow manager. Software dependencies are either shipped with the release or handled with conda. All code is freely available and is distributed under the GNU GPL-3.0 license. Conclusions HISS provides a user-friendly, portable, and easily customised approach for identifying novel disease resistance genes in plants. It is easily installed with all dependencies handled internally or shipped with the release and represents a significant improvement in the ease of use of these bioinformatics analyses.
format Article
id doaj-art-c7d21a50de1541a7bbd43aad155285e3
institution Kabale University
issn 1471-2105
language English
publishDate 2023-05-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj-art-c7d21a50de1541a7bbd43aad155285e32025-02-09T12:56:54ZengBMCBMC Bioinformatics1471-21052023-05-0124111010.1186/s12859-023-05335-8HISS: Snakemake-based workflows for performing SMRT-RenSeq assembly, AgRenSeq and dRenSeq for the discovery of novel plant disease resistance genesThomas M. Adams0Moray Smith1Yuhan Wang2Lynn H. Brown3Micha M. Bayer4Ingo Hein5Department of Cell and Molecular Sciences, The James Hutton InstituteDepartment of Cell and Molecular Sciences, The James Hutton InstituteDivision of Plant Sciences, School of Life Sciences, University of DundeeDivision of Plant Sciences, School of Life Sciences, University of DundeeDepartment of Information and Computational Sciences, The James Hutton InstituteDepartment of Cell and Molecular Sciences, The James Hutton InstituteAbstract Background In the ten years since the initial publication of the RenSeq protocol, the method has proved to be a powerful tool for studying disease resistance in plants and providing target genes for breeding programmes. Since the initial publication of the methodology, it has continued to be developed as new technologies have become available and the increased availability of computing power has made new bioinformatic approaches possible. Most recently, this has included the development of a k-mer based association genetics approach, the use of PacBio HiFi data, and graphical genotyping with diagnostic RenSeq. However, there is not yet a unified workflow available and researchers must instead configure approaches from various sources themselves. This makes reproducibility and version control a challenge and limits the ability to perform these analyses to those with bioinformatics expertise. Results Here we present HISS, consisting of three workflows which take a user from raw RenSeq reads to the identification of candidates for disease resistance genes. These workflows conduct the assembly of enriched HiFi reads from an accession with the resistance phenotype of interest. A panel of accessions both possessing and lacking the resistance are then used in an association genetics approach (AgRenSeq) to identify contigs positively associated with the resistance phenotype. Candidate genes are then identified on these contigs and assessed for their presence or absence in the panel with a graphical genotyping approach that uses dRenSeq. These workflows are implemented via Snakemake, a python-based workflow manager. Software dependencies are either shipped with the release or handled with conda. All code is freely available and is distributed under the GNU GPL-3.0 license. Conclusions HISS provides a user-friendly, portable, and easily customised approach for identifying novel disease resistance genes in plants. It is easily installed with all dependencies handled internally or shipped with the release and represents a significant improvement in the ease of use of these bioinformatics analyses.https://doi.org/10.1186/s12859-023-05335-8SMRT-AgRenSeq-ddRenSeqHiFi sequencingSnakemakeHigh-throughputPlant disease resistance
spellingShingle Thomas M. Adams
Moray Smith
Yuhan Wang
Lynn H. Brown
Micha M. Bayer
Ingo Hein
HISS: Snakemake-based workflows for performing SMRT-RenSeq assembly, AgRenSeq and dRenSeq for the discovery of novel plant disease resistance genes
BMC Bioinformatics
SMRT-AgRenSeq-d
dRenSeq
HiFi sequencing
Snakemake
High-throughput
Plant disease resistance
title HISS: Snakemake-based workflows for performing SMRT-RenSeq assembly, AgRenSeq and dRenSeq for the discovery of novel plant disease resistance genes
title_full HISS: Snakemake-based workflows for performing SMRT-RenSeq assembly, AgRenSeq and dRenSeq for the discovery of novel plant disease resistance genes
title_fullStr HISS: Snakemake-based workflows for performing SMRT-RenSeq assembly, AgRenSeq and dRenSeq for the discovery of novel plant disease resistance genes
title_full_unstemmed HISS: Snakemake-based workflows for performing SMRT-RenSeq assembly, AgRenSeq and dRenSeq for the discovery of novel plant disease resistance genes
title_short HISS: Snakemake-based workflows for performing SMRT-RenSeq assembly, AgRenSeq and dRenSeq for the discovery of novel plant disease resistance genes
title_sort hiss snakemake based workflows for performing smrt renseq assembly agrenseq and drenseq for the discovery of novel plant disease resistance genes
topic SMRT-AgRenSeq-d
dRenSeq
HiFi sequencing
Snakemake
High-throughput
Plant disease resistance
url https://doi.org/10.1186/s12859-023-05335-8
work_keys_str_mv AT thomasmadams hisssnakemakebasedworkflowsforperformingsmrtrenseqassemblyagrenseqanddrenseqforthediscoveryofnovelplantdiseaseresistancegenes
AT moraysmith hisssnakemakebasedworkflowsforperformingsmrtrenseqassemblyagrenseqanddrenseqforthediscoveryofnovelplantdiseaseresistancegenes
AT yuhanwang hisssnakemakebasedworkflowsforperformingsmrtrenseqassemblyagrenseqanddrenseqforthediscoveryofnovelplantdiseaseresistancegenes
AT lynnhbrown hisssnakemakebasedworkflowsforperformingsmrtrenseqassemblyagrenseqanddrenseqforthediscoveryofnovelplantdiseaseresistancegenes
AT michambayer hisssnakemakebasedworkflowsforperformingsmrtrenseqassemblyagrenseqanddrenseqforthediscoveryofnovelplantdiseaseresistancegenes
AT ingohein hisssnakemakebasedworkflowsforperformingsmrtrenseqassemblyagrenseqanddrenseqforthediscoveryofnovelplantdiseaseresistancegenes