TY - JOUR
T1 - Using bioinformatics to investigate functional diversity
T2 - A case study of MHC diversity in koalas
AU - Silver, Luke W
AU - McLennan, Elspeth A
AU - Beaman, Julian
AU - Burke da Silva, Karen
AU - Timms, Peter
AU - Hogg, Carolyn J
AU - Belov, Katherine
PY - 2024/12
Y1 - 2024/12
N2 - Conservation genomics can greatly improve conservation outcomes of threatened populations, including those impacted by disease. Understanding diversity within immune gene families, including the major histocompatibility complex (MHC) and toll-like receptors (TLR), is important due to the role they play in disease resilience and susceptibility. With recent advancements in sequencing technologies and bioinformatic tools, the cost of generating high-quality sequence data has significantly decreased and made it possible to investigate diversity across entire gene families in large numbers of individuals compared to investigating only a few genes or a few populations previously. Here, we use the koala as a case study for investigating functional diversity across populations. We utilised previous target enrichment data and 438 whole genomes to firstly, determine the level of sequencing depth required to investigate MHC diversity and, secondly, determine the current level of diversity in MHC genes in koala populations. We determined for low complexity, conserved genes such as TLR genes 10 × sequencing depth is sufficient to reliably genotype more than 90% of variants, whereas for complex genes such as the MHC greater than 20 × and preferably 30 × sequencing depth is required. We used whole genome data to identify 270 biallelic SNPs across 24 MHC genes as well as copy number variation (CNV) within class I and class II genes and conduct supertype analysis. Overall, we have provided a bioinformatic workflow for investigating variation in a complex immune gene family from whole genome sequencing data and determined current levels of diversity within koala MHC genes.
AB - Conservation genomics can greatly improve conservation outcomes of threatened populations, including those impacted by disease. Understanding diversity within immune gene families, including the major histocompatibility complex (MHC) and toll-like receptors (TLR), is important due to the role they play in disease resilience and susceptibility. With recent advancements in sequencing technologies and bioinformatic tools, the cost of generating high-quality sequence data has significantly decreased and made it possible to investigate diversity across entire gene families in large numbers of individuals compared to investigating only a few genes or a few populations previously. Here, we use the koala as a case study for investigating functional diversity across populations. We utilised previous target enrichment data and 438 whole genomes to firstly, determine the level of sequencing depth required to investigate MHC diversity and, secondly, determine the current level of diversity in MHC genes in koala populations. We determined for low complexity, conserved genes such as TLR genes 10 × sequencing depth is sufficient to reliably genotype more than 90% of variants, whereas for complex genes such as the MHC greater than 20 × and preferably 30 × sequencing depth is required. We used whole genome data to identify 270 biallelic SNPs across 24 MHC genes as well as copy number variation (CNV) within class I and class II genes and conduct supertype analysis. Overall, we have provided a bioinformatic workflow for investigating variation in a complex immune gene family from whole genome sequencing data and determined current levels of diversity within koala MHC genes.
KW - Whole genome resequencing
KW - TLR
KW - Functional genomics
KW - Bioinformatics
KW - MHC
KW - Koalas
UR - http://www.scopus.com/inward/record.url?scp=85205710129&partnerID=8YFLogxK
UR - http://purl.org/au-research/grants/ARC/LP210100450
U2 - 10.1007/s00251-024-01356-6
DO - 10.1007/s00251-024-01356-6
M3 - Article
SN - 0093-7711
VL - 76
SP - 381
EP - 395
JO - Immunogenetics
JF - Immunogenetics
IS - 5-6
ER -