Identification of the Bovine Arachnomelia Mutation by Massively Parallel Sequencing Implicates Sulfite Oxidase (SUOX) in Bone Development
Arachnomelia is a monogenic recessive defect of skeletal development in cattle. The causative mutation was previously mapped to a ∼7 Mb interval on chromosome 5. Here we show that array-based sequence capture and massively parallel sequencing technology, combined with the typical family structure in livestock populations, facilitates the identification of the causative mutation. We re-sequenced the entire critical interval in a healthy partially inbred cow carrying one copy of the critical chromosome segment in its ancestral state and one copy of the same segment with the arachnomelia mutation, and we detected a single heterozygous position. The genetic makeup of several partially inbred cattle provides extremely strong support for the causality of this mutation. The mutation represents a single base insertion leading to a premature stop codon in the coding sequence of the SUOX gene and is perfectly associated with the arachnomelia phenotype. Our findings suggest an important role for sulfite oxidase in bone development.
Published in the journal:
Identification of the Bovine Arachnomelia Mutation by Massively Parallel Sequencing Implicates Sulfite Oxidase (SUOX) in Bone Development. PLoS Genet 6(8): e32767. doi:10.1371/journal.pgen.1001079
Category:
Research Article
doi:
https://doi.org/10.1371/journal.pgen.1001079
Summary
Arachnomelia is a monogenic recessive defect of skeletal development in cattle. The causative mutation was previously mapped to a ∼7 Mb interval on chromosome 5. Here we show that array-based sequence capture and massively parallel sequencing technology, combined with the typical family structure in livestock populations, facilitates the identification of the causative mutation. We re-sequenced the entire critical interval in a healthy partially inbred cow carrying one copy of the critical chromosome segment in its ancestral state and one copy of the same segment with the arachnomelia mutation, and we detected a single heterozygous position. The genetic makeup of several partially inbred cattle provides extremely strong support for the causality of this mutation. The mutation represents a single base insertion leading to a premature stop codon in the coding sequence of the SUOX gene and is perfectly associated with the arachnomelia phenotype. Our findings suggest an important role for sulfite oxidase in bone development.
Introduction
Arachnomelia is a genetic disease in cattle characterized by skeletal abnormalities. Affected calves are usually stillborn with a spidery appearance and an abnormally shaped skull (Figure 1). The bones of the limbs are prolonged (dolichostenomelia) with marked thinning of the diaphyses that fracture easily in the course of forced birth assistance. Additional dysmorphic features are variable, e.g. defects of the vertebral column and sometimes cardiac malformations [1]–[4]. Initially, it was assumed that the pathogenesis of bovine arachnomelia resembles that of human Marfan syndrome, which is caused by mutations in the FBN1 gene [1]. However, the identification of other cattle with a mutation in the FBN1 gene established that arachnomelia is phenotypically distinct from Marfan syndrome [5]. Arachnomelia affected calves lack some typical Marfan features such as joint laxity and aortic root dilation [3]. Bovine arachnomelia is inherited as a monogenic autosomal recessive trait with complete penetrance [2], [4]. Carrier animals do not present any clinical signs. An outbreak of arachnomelia with hundreds of cases occurred during the 1980s after extensive world-wide usage of a highly selected artificial insemination sire in the international Brown Swiss cattle population [2]. In the past few years, a comparable outbreak of arachnomelia occurred in German Fleckvieh cattle [4]. Different genes seem to be responsible for the arachnomelia disease in these two cattle breeds, as there is no relationship between the founder animals and recently two independent loci were genetically mapped to different chromosomes [6], [7]. We mapped the arachnomelia mutation in Brown Swiss cattle to a 7.19 Mb interval on bovine chromosome (BTA) 5 [6]. Due to the lack of suitable candidates the genetic basis of arachnomelia is not understood. Therefore, the spontaneous cattle arachnomelia mutants provide unique resources to gain further insights into the biology of bone development.
Array enrichment and next-generation sequencing technology can be used to rapidly sequence targeted subsets of the genome [8], [9]. Sequence capture enrichment has already successfully been used to sequence the coding portion of the human genome [10]–[12] and also large genomic intervals [13], [14]. This technology offers great potential for positional cloning projects where the mapping resolution may be limited, e.g. in the case of recent mutations. Thus megabase sized regions can now be re-sequenced efficiently. Furthermore, the unique family structures in livestock populations greatly facilitate the discrimination of the causative variant from the many neutral polymorphisms that have to be expected from such re-sequencing projects. In this report we applied this approach to identify the causative mutation for bovine arachnomelia.
Results/Discussion
We selected two individuals for an array-based sequence capture and massively parallel sequencing approach to re-sequence the entire critical interval. Our design included one arachnomelia affected calf assumed to be homozygous across the entire sequence interval including the causative variant. The other animal chosen for re-sequencing was a partially inbred non-affected cow. Based on pedigree and marker data this non-affected cow was identical-by-descent for the critical segment of BTA 5, except for the causative arachnomelia mutation, which we predicted to reside only on her paternally derived chromosome (Figure 2). We chose this cow for complete re-sequencing of the entire critical interval, as the detection of a heterozygous polymorphism in this animal should reveal the causative mutation.
We enriched the ∼3.5 Mb non-repetitive sequence within the critical interval of BTA 5 in the two animals and collected about 30 million illumina reads per animal. After the alignment to the reference sequence, the mean coverage was 44-fold for the affected animal and 15-fold for the non-affected animal. The difference was most likely due to technical variations during the hybridization. The depth of coverage was variable across the targeted interval, similar to what had been described previously [15]. However, the two animals showed a similar distribution of the gaps across the targeted interval (Figure S1). In the affected calf 96% of the enriched bases had at least 4-fold coverage compared to 91% in the control sample. We called a homozygous variant when the respective position had ≥4-fold coverage and the observed difference between the experimental reads and the reference sequence occurred at ≥75% frequency. For the calling of a heterozygote variant we applied a threshold of ≥15-fold sequence coverage and a variant allele frequency between 25% and 75% based on published recommendations [16]. Using these criteria we recovered a total of 6,025 putative variants between the reads and the reference for the affected calf (4,848 homozygous and 1,177 heterozygous), and 4,318 putative variants for the control cow (3,818 homozygous and 500 heterozygous, Table 1). The high number of seemingly heterozygous variants detected in two animals supposed to be completely or almost completely homozygous underscores the challenges of aligning short reads from a complex mammalian genome to a draft quality reference sequence. About three-fourths of the heterozygous variant calls were located within an olfactory receptor (OR) gene cluster encompassing a 2.2 Mb segment with a total of 136 annotated loci [17]. The read coverage within this region was significantly higher than the average (60/37-fold) indicating segmental duplications. Other factors contributing to the high number of putative heterozygous variants may have been sequencing errors, non-specific hybridization of DNA during the enrichment, or amplification-mediated artifacts (i.e. polymerase errors during library preparation).
Due to the recessive inheritance and the lethal effect of the mutation we hypothesized that most likely a loss of function mutation affecting the coding sequence of a gene would be responsible for arachnomelia. Therefore, we subsequently concentrated on variants that were located within the coding sequences or within the splice sites of the annotated genes in the targeted region of the bovine genome. A total of 79 predicted homozygous variants were located within coding sequences or adjacent splice sites of the affected calf (Table 1, Table S1). The comparison between the affected calf and the non-affected cow revealed that 68 of these variants had identical homozygous mutant genotypes in the control cow and could thus be excluded as causative variants. Eight variants had no illumina coverage in the control cow and were subsequently found to be homozygous mutant by Sanger sequencing and thus also excluded. The three remaining variants were putative heterozygous variants in the control cow with 10, 12, and 27-fold coverage, respectively. We validated these three potential variants by Sanger sequencing and found that two of them were false positives as the affected calf and the control cow shared identical homozygous genotypes for these variants. For the remaining variant we confirmed that the control cow was indeed heterozygous, whereas the affected calf was homozygous mutant compared to the reference sequence. This variant was a 1 bp insertion located in exon 4 of the bovine SUOX gene (c.363–364insG, Figure 3). We found perfect concordance between the presence of this insertion and the arachnomelia phenotype (Table 2). All 16 affected calves were homozygous mutant and all 11 available mothers for these animals were heterozygous. Genomic DNA samples of 25 artificial insemination carrier sires, which had recorded arachnomelia offspring and which were related to the assumed founder, were tested and 23 of them were also heterozygous. For each of the two remaining suspected carriers only one single arachnomelia suspicious calf was recorded by the breeding organization and the diagnoses of these calves had not been confirmed by veterinarians. Therefore, we think that these two reported arachnomelia calves represented phenocopies and their sires are indeed free of the arachnomelia mutation. Our material included Beautician, son of the assumed founder Lilason, who was responsible for spreading the mutation into the global Brown Swiss population. We confirmed that Beautician is heterozygous for the SUOX mutation supporting the hypothesis for the origin of the causative mutation. Three acknowledged carrier bulls, which had the same genetic constellation as the non-affected inbred cow of our mutation analysis and were homozygous for all tested markers in the critical interval, were also found to be heterozygous for the SUOX mutation (Figure S2). None of 309 unrelated healthy Brown Swiss cattle had the homozygous mutant genotype, but 10 of them were presumed carriers. Thus the allele frequency of the deleterious insertion within this sample of unrelated Brown Swiss cattle was 1.6%, which is about half of the frequency that was estimated 20 years ago [18]. We screened a genetically diverse panel of animals from 15 breeds widely used in commercial cattle production to confirm that the identified mutation does not occur outside the Brown Swiss population. None of the 150 chromosomes in this sample showed the causative insertion (Table 2).
The c.363–364insG insertion is predicted to result in a frameshift beginning with amino acid residue 124 in the bovine SUOX protein sequence (p.Ala124GlyfsX42). While it is unclear whether the mutant protein of 164 residues is actually expressed, with more than 75% of the normal SUOX protein missing, it is very unlikely that the mutant protein fulfills any physiological function. Due to the frameshift and the premature stop codon, any mutant protein produced will contain 42 altered amino acids, and could potentially interfere with normal cellular function. The SUOX gene encodes the molybdohemoprotein sulfite oxidase, a terminal enzyme in the oxidative degradation pathway of sulfur-containing amino acids. Each monomer of the dimeric SUOX enzyme consists of three domains, the N-terminal heme domain, the central molybdenum domain and a C-terminal domain [19]. Deficiency of this enzyme in humans usually leads to recessive inherited sulfocysteinuria (OMIM 272300) characterized by major neurological abnormalities and early death [19]–[22]. The more severe cases, characterized by frequent seizures and death within a few days of birth, result from a complete SUOX loss of function [20], [22]. A single case of human sulfocysteinuria caused by a 1 bp deletion of human SUOX leading to a truncated protein has been reported, where some dysmorphic skeletal features were diagnosed in addition to severe neurodevelopmental anomalies [22].
The arachnomelia phenotype in cattle shows more severe skeletal defects and neonatal lethality compared to the human sulfocysteinuria patients. Another genetic disorder of human sulfur metabolism associated with a bone phenotype (arachnodactyly) is caused by CBS mutations resulting in cystathionine beta-synthase deficiency [23].
In cattle there are two virtually identical arachnomelia phenotypes in Brown Swiss cattle and in German Fleckvieh cattle. The mutation in German Fleckvieh was mapped to BTA 23, a region where the MOCS1 gene encoding molybdenum cofactor synthesis 1 is located [7]. A candidate causative mutation for German Fleckvieh arachnomelia has recently been identified in the bovine MOCS1 gene [Johannes Buitkamp, personal communication]. The MOCS1 protein is required for the synthesis of the molybdopterin cofactor, which forms the active site in SUOX. The involvement of SUOX and MOCS1 in the same biochemical pathway mutually supports the causality of these mutations in Brown Swiss and German Fleckvieh cattle.
We think that we have established the causality of the SUOX mutation for arachnomelia in Brown Swiss cattle based on the following arguments: (1) the perfect association of the SUOX mutation to the arachnomelia phenotype, (2) the obvious functional impact of a frameshift mutation on the encoded protein, (3) four non-affected inbred animals, which were identical-by-descent for all tested markers across the critical interval, were heterozygous at the SUOX mutation. The recognition of these four inbred animals was a key element in our discovery and illustrates the potential of livestock specific population structures for genetic research. This study represents one of the first successful applications of microarray-based enrichment of megabase-sized genomic regions followed by massively parallel sequencing to unravel the causative mutation underlying a Mendelian trait. This technology significantly reduces the time and resources required for mutation identification by abrogating the need for high resolution genetic mapping and thousands of Sanger sequencing reactions.
In summary, we have successfully applied a sequence capture strategy to identify SUOX as the causative gene for bovine arachnomelia, and thereby discovered an essential role for this gene during bone development. The knowledge of the causative mutation will allow direct genetic testing of Brown Swiss cattle and the elimination of this fatal genetic disease from the breeding population. This study highlights the enormous potential of spontaneous mutants in domestic animals to gain further insights into mammalian biology.
Materials and Methods
Sequence capture enrichment
The bovine genome assembly Btau 4.0 was used for all analyses. A custom tiling 385k sequence capture array targeting the arachnomelia region (BTA 5, 57,285,788–64,478,535 bp) was designed and manufactured by Roche NimbleGen. The reference sequence contained 179 gaps with a total of 49,558 bp (0.7% of the target sequence). The array was designed using NimbleGen's standard 15-mer frequency masking to minimize repeat content within capture probes. The probe spacing, tiling overlap, and probe length were determined using proprietary algorithms (NimbleGen). For the sequence capture library construction a total of 20 µg high-molecular weight genomic DNA was sheared to yield approximately 400 bp fragments using an ultrasound device and purified on QIAquick columns (QIAGEN). The genomic DNA was polished and repaired using T4 DNA polymerase and T4 PNK (Fermentas). Illumina adapters were added to each genomic DNA sample using T4 DNA ligase (Fermentas). Adapter ligated samples were purified and amplification competency was assessed by PCR with primers complementary to the ligated adapters and finally evaluated by agarose gel electrophoresis. Array hybridization was executed using an X1 mixer (Roche NimbleGen) and the NimbleGen Hybridization System for 3 days at 42°C following the manufacturer's recommended conditions. Human Cot-1 DNA (Invitrogen/Life Technologies) was used at a mass ratio of 5:1 vs. the library. Arrays were washed using the recommended protocol (Roche NimbleGen Arrays User's Guide v2.0). The captured molecules were eluted from the slides with elution reagent using a NimbleGen Elution Station. Eluted molecules were dried by centrifugation under vacuum, rehydrated and PCR amplified for 18 cycles with Phusion polymerase (Finnzymes). Enrichment of samples was assessed by quantitative PCR comparison to the same samples prior to hybridization. Following evaluation by agarose electrophoresis and purification, the amplified capture libraries were processed into sequencing libraries for the illumina GAII. A total of 33,676,855/31,356,905 single reads of different length (36 or 76 bp) comprising 2,245,122,900/2,241,558,140 bases raw data were produced for case and control, respectively.
Read mapping and variant calling
Repetitive elements of the 7.19 Mb genome sequence were masked using the Repeatmasker software and a total of 3,542,013 bp single copy sequence was used as reference for short read mapping. The SeqMan NGen v2.0 software (DNASTAR) was used for assembly with a minimal match percentage of 97%, a minimal match size of 19 nt, and a maximal coverage of 200. A total number of 4,984,147/1,440,459 reads were assembled for the case and the control, respectively. We required a minimal coverage of 4-fold for variant detection and ≥75% of the variant allele for calling a homozygous mutant genotype. Variants falling within the first 10 nt adjacent of masked repetitive sequences were excluded due to obvious alignment inconsistencies (e.g. high coverage, probably due to incomplete masking at the ends of repeats). The post-assembly processing of the variant data was carried out using the R statistical software package [24].
Sanger re-sequencing
Some variants were genotyped by re-sequencing of targeted PCR products using Sanger sequencing technology. PCR products were amplified using AmpliTaqGold360Mastermix (Applied Biosystems). PCR products were directly sequenced on an ABI 3730 capillary sequencer (Applied Biosystems) after treatment with exonuclease I and shrimp alkaline phosphatase. Sequence data were analyzed with Sequencher 4.9 (GeneCodes).
Supporting Information
Zdroje
1. RieckGW
SchadeW
1975 Die Arachnomelie (Spinnengliedrigkeit), ein neues erbliches letales Missbildungssyndrom des Rindes. Dtsch Tierärztl Wochenschr 82 342 347
2. KönigH
GaillardC
ChavazJ
HunzikerF
TontisA
1987 Prüfung von Schweizer Braunvieh-Bullen auf das vererbte Syndrom der Arachnomelie und Arthrogrypose (SAA) durch Untersuchung der Nachkommen im Fetalstadium. Tierärztl Umsch 42 692 697
3. TestoniS
GentileA
2004 Arachnomelia in four Italian Brown calves. Vet Rec 155 372
4. BuitkampJ
LuntzB
EmmerlingR
ReichenbachHD
WeppertM
2008 Syndrome of arachnomelia in Simmental cattle. BMC Vet Res 4 39
5. SingletonAC
MitchellAL
ByersPH
PotterKA
PaceJM
2005 Bovine model of Marfan syndrome results from an amino acid change (c.3598G >A, p.E1200K) in a calcium-binding epidermal growth factor-like domain of fibrillin-1. Hum Mutat 25 348 352
6. DrögemüllerC
RossiM
GentileA
TestoniS
JörgH
2009 Arachnomelia in Brown Swiss cattle maps to Chromosome 5. Mamm Genome 20 53 59
7. BuitkampJ
KühnC
SemmerJ
GötzKU
2009 Assignment of the locus for arachnomelia syndrome to bovine chromosome 23 in Simmental cattle. Anim Genet online early doi:10.1111/j.1365-2052.2009.01933.x
8. AlbertTJ
MollaMN
MuznyDM
NazarethL
WheelerD
2007 Direct selection of human genomic loci by microarray hybridization. Nat. Methods 4 903 905
9. OkouDT
SteinbergKM
MiddleC
CutlerDJ
AlbertTJ
ZwickME
2007 Microarray-based genomic selection for high-throughput resequencing. Nat Methods 4 907 909
10. HodgesE
XuanZ
BalijaV
KramerM
MollaMN
2007 Genome-wide in situ exon capture for selective resequencing. Nat Genet 39 1522 1527
11. NgSB
TurnerEH
RobertsonPD
FlygareSD
BighamAW
2009 Targeted capture and massively parallel sequencing of 12 human exomes. Nature 461 272 276
12. Ng SB BuckinghamKJ
LeeC
BighamAW
TaborHK
2010 Exome sequencing identifies the cause of a mendelian disorder. Nat Genet 42 30 35
13. VolpiL
RoversiG
ColomboEA
LeijstenN
ConcolinoD
2010 Targeted next-generation sequencing appoints c16orf57 as clericuzio-type poikiloderma with neutropenia gene. Am J Hum Genet 86 72 76
14. RehmanAU
MorellRJ
BelyantsevaIA
KhanSY
BogerET
2010 Targeted Capture and Next-Generation Sequencing Identifies C9orf75, Encoding Taperin, as the Mutated Gene in Nonsyndromic Deafness DFNB79. Am J Hum Genet 86 378 388
15. D'AscenzoM
MeachamC
KitzmanJ
MiddleC
KnightJ
2009 Mutation discovery in the mouse using genetically guided array capture and resequencing. Mamm Genome 20 424 436
16. BentleyDR
BalasubramanianS
SwerdlowHP
SmithGP
MiltonJ
2008 Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456 53 59
17. ElsikCG
TellamRL
WorleyKC
GibbsRA
Bovine Genome Sequencing and Analysis Consortium 2009 The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science 324 522 528
18. FuschiniE
FriesR
StockerH
1992 Malformations and genetic disorders in cattle: Registration, frequencies and control of heriditary diseases in the A.I. derived Braunvieh population [in German]. Wien Tierärztl Mschr 79 161 165
19. KiskerC
SchindelinH
PachecoA
WehbiWA
GarrettRM
1997 Molecular basis of sulfite oxidase deficiency from the structure of sulfite oxidase. Cell 91 973 983
20. JohnsonJL
CoyneKE
GarrettRM
ZabotMT
DorcheC
2002 Isolated sulfite oxidase deficiency: identification of 12 novel SUOX mutations in 10 patients. Hum Mutat 20 74
21. GarrettRM
JohnsonJL
GrafTN
FeigenbaumA
RajagopalanKV
1998 Human sulfite oxidase R160Q: identification of the mutation in a sulfite oxidase-deficient patient and expression and characterization of the mutant enzyme. Proc Natl Acad Sci USA 95 6394 6398
22. SeidahmedMZ
AlyamaniEA
RashedMS
SaadallahAA
AbdelbasitOB
2005 Total truncation of the molybdopterin/dimerization domains of SUOX protein in an Arab family with isolated sulfite oxidase deficiency. Am J Med Genet 136A 205 209
23. KrausJP
1994 Komrower Lecture. Molecular basis of phenotype expression in homocystinuria. J Inherit Metab Dis 17 383 390
24. R Development Core Team 2009 R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0
Štítky
Genetika Reprodukčná medicínaČlánok vyšiel v časopise
PLOS Genetics
2010 Číslo 8
- Je „freeze-all“ pro všechny? Odborníci na fertilitu diskutovali na virtuálním summitu
- Gynekologové a odborníci na reprodukční medicínu se sejdou na prvním virtuálním summitu
Najčítanejšie v tomto čísle
- Identification of the Bovine Arachnomelia Mutation by Massively Parallel Sequencing Implicates Sulfite Oxidase (SUOX) in Bone Development
- Common Inherited Variation in Mitochondrial Genes Is Not Enriched for Associations with Type 2 Diabetes or Related Glycemic Traits
- A Model for Damage Load and Its Implications for the Evolution of Bacterial Aging
- Did Genetic Drift Drive Increases in Genome Complexity?