Distinct Genetic Architectures for Male and Female Inflorescence Traits of Maize
We compared the genetic architecture of thirteen maize morphological traits in a large population of recombinant inbred lines. Four traits from the male inflorescence (tassel) and three traits from the female inflorescence (ear) were measured and studied using linkage and genome-wide association analyses and compared to three flowering and three leaf traits previously studied in the same population. Inflorescence loci have larger effects than flowering and leaf loci, and ear effects are larger than tassel effects. Ear trait models also have lower predictive ability than tassel, flowering, or leaf trait models. Pleiotropic loci were identified that control elongation of ear and tassel, consistent with their common developmental origin. For these pleiotropic loci, the ear effects are larger than tassel effects even though the same causal polymorphisms are likely involved. This implies that the observed differences in genetic architecture are not due to distinct features of the underlying polymorphisms. Our results support the hypothesis that genetic architecture is a function of trait stability over evolutionary time, since the traits that changed most during the relatively recent domestication of maize have the largest effects.
Published in the journal:
Distinct Genetic Architectures for Male and Female Inflorescence Traits of Maize. PLoS Genet 7(11): e1002383. doi:10.1371/journal.pgen.1002383
Category:
Research Article
doi:
https://doi.org/10.1371/journal.pgen.1002383
Introduction
The genetic architecture of a complex trait is defined by the number, effect size, frequency, and gene action of the quantitative trait loci (QTL) that affect it. A comparison of studies from flies, mice, and humans shows that genetic architecture is remarkably consistent among these species, with many loci of small additive effect [1]. Distributions of QTL effect sizes are strikingly similar among different classes of mouse traits including behavior, biochemistry, immunology, and metabolism [2]. Similar results have been obtained in maize for flowering time, leaf morphology, and disease resistance traits [3]–[5]. Despite many well-powered genome-wide association studies (GWAS) of height variation in humans, no single polymorphism explaining even 1% of the variance in adult height has been found [6]–[9].
Fisher [10] provides a simple theoretical justification for these observations. For a well-adapted organism close to its fitness optimum, only small effects can increase fitness. Orr [11] showed that regardless of the distance from the fitness optimum, the expected distribution of effect sizes progressively fixed during adaptation is exponential, with a small number of large-effect loci fixed first, followed by progressively larger numbers of loci with smaller effects becoming fixed. The genetic architecture of intraspecific variation consists of many loci with small effects because loci with larger effects tend to be only briefly polymorphic.
A few traits exposed to strong, recent selection show distinct genetic architectures not characterized by many loci of small additive effect. For inbred dogs, three loci explain 38% of the variance in body weight among diverse breeds [12], and a single nucleotide polymorphism (SNP) at the IGF2 locus in pigs explains 15–30% of the variance in muscle mass [13]. In a cross between chicken populations recurrently-selected for high and low body weight, an epistatic network of four major loci explains 45% of the difference between parents [14]. Independent populations of anadromous stickleback fish that became trapped in freshwater lakes subsequently lost their armor plating through mutational changes at a single major locus [15]. The Fisher-Orr model predicts segregation of such large effects between populations exposed to divergent selective pressures, but not within a population exposed to directional selection.
Mating system also appears to influence genetic architecture. Flowering time QTL effects are much larger in the inbreeding species Arabidopsis thaliana than in maize, an outcrosser [16]. Inbreeding might allow isolated populations to fix large-effect mutations in response to divergent selective pressures, as the dog, chicken, and fish examples suggest. However, mating system differences cannot account for differences in genetic architecture between traits within an organism.
Plant and animal domesticates provide opportunities to compare genetic architecture between selected and unselected traits in populations exposed to the same demographic effects [17], [18]. Maize (Zea mays ssp. mays) was domesticated from teosinte (Zea mays ssp. parviglumis) 5,000 to 10,000 years ago in southwest Mexico [19]. Beadle [20] suggested that 4–5 recessive mutations underlie maize domestication as one out of every five hundred F2 progeny from a maize-teosinte cross appear maize-like. Two of these mutations have been identified: teosinte branched1 (tb1) causes an increase in apical dominance and reduction of lateral branching and teosinte glume architecture (tga1) causes “release” of the nutritious grain from bony, enclosing glumes [21], [22]. Other remarkable changes that occurred during maize domestication have yet to be fully explained.
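The arithmetic behind Beadle's estimate can be checked directly. The following is a minimal sketch, assuming unlinked loci and assuming that a maize-like F2 plant must be homozygous for the maize allele at every locus; the function name is illustrative and does not come from the cited work.

```python
from fractions import Fraction


def parental_fraction(n_loci: int) -> Fraction:
    """Expected fraction of F2 progeny homozygous for one parent's
    allele at each of n_loci unlinked loci (assumption: 1/4 per locus)."""
    return Fraction(1, 4) ** n_loci


if __name__ == "__main__":
    for n in (4, 5):
        f = parental_fraction(n)
        print(f"{n} loci: 1 in {f.denominator}")
```

Under these assumptions, the observed frequency of about one in five hundred falls between the expectations for four and five loci.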
Maize is a monoecious plant with an apical male inflorescence, the tassel, and an axillary female inflorescence, the ear (Figure 1). Maize and teosinte tassels are relatively similar, but dissimilarity between maize and teosinte ears fueled historical controversy about whether one could have evolved from the other [23], [24] until molecular data provided irrefutable evidence that maize evolved from teosinte [19]. Teosinte “ears” are small, occupy the lateral positions of a primary lateral branch, and have two rows of kernels. Maize ears are large, occupy the apical position of a primary lateral branch, and have from eight to over twenty rows of kernels. Although maize tassels are clearly different from teosinte tassels, the maize ear stands out as a monument of morphological evolution under human selection.
The maize tassel and ear, despite their differences, share a common developmental origin and are nearly indistinguishable from each other during early development. Tassel and ear become distinct through the formation of long branch primordia and the abortion of female floral organs in the tassel, and through the abortion of male floral organs in the ear. Several mutant phenotypes support a close developmental relationship between tassel and ear. Branches are usually only found in the tassel, but a number of mutations produce branched ear phenotypes [25]. Tasselseed phenotypes are characterized by failure to abort female development in the tassel, and can be induced by mutation or epigenetic change [25], [26]. Because the underlying genetic control of maize tassel and ear development is so similar, human selection for ear morphology may have indirectly changed the morphology of the tassel as well.
In this study we compare the genetic architecture of thirteen maize morphological traits, including seven inflorescence traits reported here and three leaf and three flowering traits reported previously [3], [4]. The four tassel and three ear traits were measured over eight environments in the maize nested association mapping (NAM) population, a set of 4892 recombinant inbred lines (RILs) derived from 26 biparental families that capture much of the genetic diversity of maize [27]. These RILs are ∼97% homozygous, show little evidence for segregation distortion or inter-chromosomal linkage disequilibrium (LD), and have been genotyped with 836 markers for an average of one marker every ∼1.3 cM [28]. Two methods were used to detect QTL: linkage mapping across the 26 families (joint linkage) and a GWAS approach that incorporates polymorphism data from 1.6 million maize SNPs [4], [29]. The NAM population has been recently studied for flowering, leaf, and disease-resistance traits [3]–[5], revealing genetic architectures characterized by many loci of small additive effect. Maize inflorescence traits have distinct genetic architectures characterized by larger QTL effect sizes. Increased effect size in maize inflorescences is caused by many hundreds of polymorphisms with larger effects and a deficiency of small-effect inflorescence polymorphisms. Ear traits have the largest effects and also show lower model predictive abilities. The close developmental relationship between male and female maize inflorescences allows us to infer from our results that genetic architecture may vary independently of genetic control, providing new evidence for how selection affects the genetic architecture of complex traits.
Results
Maize inflorescence variation in a large, diverse population of recombinant inbred lines
The four tassel traits and three ear traits measured are shown in Figure 1 and in Table 1. Traits measured in units of counts, branch number (BN) and ear row number (ERN), were not normally distributed and required Box-Cox transformation. Traits measured in units of length, tassel length (TL), spike length (SL), length of the branch zone (BZ), cob length (CL), and cob diameter (CD), were normally distributed. Broad-sense heritabilities ranged from 0.87 to 0.93, within the range of heritabilities reported previously for flowering and leaf traits. Correlations between phenotypes from temperate and tropical growing environments were high for all inflorescence traits, so a single best linear unbiased predictor (BLUP) was calculated for each trait over all locations.
Genomic regions controlling maize inflorescence variation
We mapped loci controlling maize inflorescence variation using joint linkage and GWAS analyses in the NAM population of ∼5000 RILs as described previously [3]–[5]. Table 2 presents the major differences between these analyses, and full results are presented in Tables S1 and S2. In brief, the joint linkage analysis used 836 markers, whereas the GWAS analysis incorporated genetic information from over 1.6 million SNPs genotyped in the 27 parental lines. Joint linkage QTL were fit as marker-by-family terms, meaning that 26 separate effects were fit for each QTL [3], whereas GWAS SNPs are biallelic. A single joint linkage model was developed for the entire genome, whereas GWAS models were fit for each chromosome separately. For the GWAS analysis, a subsampling procedure was used to assign a resample model inclusion probability (RMIP) value for each SNP ranging from 0 to 1, representing the proportion of subsamples in which that SNP was selected [30]. High correlations were observed between trait heritabilities, the number of joint linkage QTL detected, and the number of GWAS SNPs detected across the seven inflorescence traits (Table 1). GWAS analysis confirms the presence of all QTL detected by joint linkage analysis, and often splits a multiallelic QTL into two or more biallelic loci. The specific families assigned to carry a given QTL are often different between the two analyses.
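The RMIP idea can be illustrated with a short sketch. This is not the pipeline used in the study: the subsample fraction, the number of subsamples, and the `select` callback (standing in for whatever model-selection step is run on each subsample) are all assumptions made for illustration.

```python
import random
from collections import Counter


def rmip(lines, select, n_subsamples=100, fraction=0.8, seed=0):
    """Resample model inclusion probability for each SNP.

    lines        -- sequence of line identifiers (hypothetical)
    select       -- callable taking a subsample of lines and returning
                    the SNP ids chosen by model selection (hypothetical)
    n_subsamples -- number of subsamples drawn (assumed value)
    fraction     -- share of lines drawn without replacement (assumed value)
    """
    rng = random.Random(seed)
    pool = list(lines)
    k = max(1, round(fraction * len(pool)))
    counts = Counter()
    for _ in range(n_subsamples):
        subsample = rng.sample(pool, k)
        # Count each SNP at most once per subsample.
        counts.update(set(select(subsample)))
    return {snp: c / n_subsamples for snp, c in counts.items()}


if __name__ == "__main__":
    # Toy selector: "snpA" is always chosen; "snpB" only when line 0 is sampled.
    def toy_select(sub):
        return {"snpA", "snpB"} if 0 in sub else {"snpA"}

    print(rmip(range(50), toy_select))
```

A SNP selected in every subsample receives an RMIP of 1, whereas a SNP whose selection depends on a few particular lines receives a lower value.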
Inflorescence traits have larger QTL effects than flowering and leaf traits
We compared effects between ear, tassel, leaf, and flowering traits, and found that ear effects are largest and flowering effects are smallest in both joint linkage and GWAS analyses (Figure 2). Joint linkage analysis produces many more small effects than GWAS analysis as an artifact of the model fitting process, which assigns a separate effect to all 26 families at each QTL. Since most QTL are not present in all families, many of these effects are near zero. To compare effects among traits, the absolute value of each effect was scaled by the total heritable variation for that trait. Total heritable variation was calculated as the standard deviation of the trait BLUPs among a set of 282 diverse maize lines (this set includes the 27 parental lines) multiplied by our broad-sense heritability estimate for that trait (Table 1). Using the standard deviation of the trait BLUPs among just the 27 parental lines gave very similar results. Trait heritabilities were not included in an initial scaling process, leading to a modest correlation between heritability and median effect size (r2 = 0.127 for joint linkage and r2 = 0.233 for GWAS). Scaling by the total heritable variation reduced this correlation considerably (r2 = 0.045 for joint linkage and r2 = 0.075 for GWAS, Figures S1 and S2). QTL number varied from 26 to 40 among inflorescence, flowering, and leaf traits for joint linkage analysis. To control for variation in QTL number, we refit a model containing the 26 most significant QTL for all 13 traits and recalculated the effects. Results presented are for recalculated effects. This process did not change the magnitude of differences in effects among trait categories.
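The scaling step described above can be sketched as follows. This is a minimal illustration under stated assumptions: the sample standard deviation is used (the text does not say whether the sample or population form was applied), and the function and argument names are hypothetical.

```python
from statistics import stdev


def scaled_effects(effects, blups, h2):
    """Scale absolute QTL effects by total heritable variation.

    effects -- raw QTL or SNP effects for one trait
    blups   -- trait BLUPs among the diverse reference lines
    h2      -- broad-sense heritability estimate for the trait
    """
    # Total heritable variation = SD of BLUPs x heritability.
    total_heritable = stdev(blups) * h2
    return [abs(e) / total_heritable for e in effects]


if __name__ == "__main__":
    # Toy numbers only; not data from the study.
    print(scaled_effects([-0.5, 0.25], [1, 3, 5], 0.5))
```

Dividing by a trait-specific quantity puts effects for traits measured in different units on a common, unitless scale.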
Our QTL effects are biased by the reference design of the NAM population, in which 26 diverse inbreds were each crossed to a common parent. Since the common parent is the reference point from which all other effects are judged, traits for which the common parent is an outlier, such as ear row number, will have inflated QTL effects in the NAM population. To correct for this bias, we inferred and present results of the full 27×27 matrix of QTL effects between all parental lines rather than using the 26×1 vector of observed QTL effects relative to the common parent (see Materials and Methods). We also regressed median QTL effects on the deviations of the common parent from the mean of the 27 parental lines and found little correlation (r2 = 0.067 for joint linkage and r2 = 0.043 for GWAS; Figures S1 and S2), even though ear row number had the largest deviation and the largest effects. To compare GWAS SNP effect sizes among traits, we calculated the absolute value of each median SNP effect across all subsamples in which that SNP was selected, and scaled this value by the total heritable variation. GWAS SNP number is correlated with heritability (Table 1), so we selected a fixed number of SNPs for each trait, ordered by decreasing RMIP value. Results presented at the bottom of Figure 2 include the top 200 SNPs for each trait; including the top 50, 100, or 500 SNPs yielded very similar results (Figure S3).
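One way to picture the move from effects measured against the common parent to effects between every pair of parents is sketched below. The Materials and Methods are not reproduced here, so this is only an assumption: effects are treated as additive, the common parent is assigned an effect of zero, and the contrast between two parents is taken as the difference of their effects. The function name is hypothetical.

```python
def pairwise_effects(founder_effects):
    """Return the matrix of pairwise contrasts among all parental lines.

    founder_effects -- effects of each diverse founder allele relative
                       to the common parent, at a single QTL
    Index 0 of the result corresponds to the common parent (effect 0).
    """
    effects = [0.0] + list(founder_effects)
    return [[a - b for b in effects] for a in effects]


if __name__ == "__main__":
    # Toy QTL with two founders; real input would have one entry per family.
    for row in pairwise_effects([1.0, -0.5]):
        print(row)
```

In this toy case the largest contrast (1.5) is between the two founders rather than between either founder and the common parent, which is why a summary based only on the reference contrasts can be misleading.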
Inflorescence traits have larger effects than flowering or leaf traits across a range of QTL and SNP frequencies (Figure 3; Kolmogorov-Smirnov test p < 10^−16 for joint linkage and GWAS), and ear traits have larger effects than tassel traits (Kolmogorov-Smirnov test p = 0.004 for joint linkage and p < 10^−15 for GWAS). The few large-effect flowering QTL are contributed by anthesis-silking interval (ASI; Figure S4), and not days to anthesis (DA) or days to silking (DS). Low-frequency GWAS SNPs present in four or fewer families account for nearly all loci with scaled effects above 0.15 (Figure 3). Several high-frequency, large-effect SNPs for ear row number are exceptions. The GWAS SNPs with the largest effects are found at low frequency and have low RMIP values, although there is no overall correlation between frequency and RMIP (Figure S5). In contrast, joint linkage effect sizes show no relationship with frequency. Joint linkage results span a larger range of effect sizes than GWAS results, which likely reflects stacking of linked QTL with effects in the same direction.
Model predictive ability is lower for ear traits
We assessed the predictive value of our GWAS models for each trait by summing the effects of SNPs with RMIP values of 0.05 or greater, weighted by their RMIP values, and calculating predicted values for the 27 parents and the 4892 RILs, with and without the inclusion of a family term (Figure 4). Ear trait models had lower model predictive abilities than all other traits except anthesis-silking interval. Inclusion of the family term always improved predictive ability, and predictive ability was generally higher for parents than for RILs.
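The prediction rule described above, an RMIP-weighted sum of SNP effects with an optional family term, can be sketched as follows. The 0/1 genotype coding, the data layout, and all names are assumptions made for illustration rather than details taken from the study.

```python
def predict(genotype, snps, family_effect=0.0, rmip_threshold=0.05):
    """Predict a trait value from GWAS SNPs.

    genotype       -- dict mapping SNP id to allele count (assumed 0 or 1
                      for inbred lines)
    snps           -- dict mapping SNP id to (effect, rmip)
    family_effect  -- optional family term; 0.0 omits it
    rmip_threshold -- SNPs below this RMIP are ignored
    """
    total = family_effect
    for snp_id, (effect, rmip) in snps.items():
        if rmip >= rmip_threshold:
            # Weight each SNP effect by its inclusion probability.
            total += rmip * effect * genotype.get(snp_id, 0)
    return total


if __name__ == "__main__":
    # Toy values only; not estimates from the study.
    toy_snps = {"s1": (2.0, 0.5), "s2": (-1.0, 0.04), "s3": (1.0, 0.25)}
    toy_line = {"s1": 1, "s2": 1, "s3": 1}
    print(predict(toy_line, toy_snps))
    print(predict(toy_line, toy_snps, family_effect=0.5))
```

Weighting by RMIP shrinks the contribution of SNPs that were selected in only a few subsamples.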
Pleiotropic QTL affect ear and tassel traits
Joint linkage and GWAS analyses yield similar estimates of pleiotropy among 13 diverse maize morphological traits (Figure 5). Pleiotropy was assessed from joint linkage results by fitting the QTL for each trait to every other trait and correlating the resulting vectors of effects across the 26 families (Table S3, [3]). If a QTL has large positive or large negative effects for two traits in many of the same families, the effect vectors will be significantly correlated and pleiotropy will be inferred. For GWAS results, pleiotropy was assessed by averaging SNP effects for each trait in each family, weighted by their RMIP values, in sliding windows across the genome (see Materials and Methods and Table S4). Pleiotropy is observed between developmentally related traits across male and female inflorescences: cob length shows positive pleiotropy with spike length and with tassel length. Pleiotropy is also observed between elongation of vegetative and reproductive organs: leaf length shows positive pleiotropy with cob length, tassel length, spike length, and branch zone length. In addition we observed very strong pleiotropy between days to anthesis and days to silking, and moderate pleiotropy between both leaf length and leaf width with flowering traits. This pattern of pleiotropy has been observed previously using joint linkage results [3], [4] and is corroborated here using GWAS.
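The joint linkage test for pleiotropy rests on correlating two vectors of per-family effects at the same locus. A minimal sketch of that correlation step is given below, assuming a plain Pearson correlation; the significance testing and the GWAS sliding-window variant are not shown, and the toy effect vectors are invented.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length effect vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


if __name__ == "__main__":
    # Toy per-family effects of one QTL on two traits (five families here;
    # the real vectors would have one entry per family).
    trait_a = [0.0, 1.0, -1.0, 0.5, 0.0]
    trait_b = [0.1, 1.8, -2.2, 1.1, -0.1]
    print(round(pearson_r(trait_a, trait_b), 3))
```

A strongly positive correlation indicates that the same families carry alleles that increase both traits, and a strongly negative correlation indicates a trade-off.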
Since ear QTL have larger effects, we reasoned that the subset of QTL for other traits that show evidence of pleiotropy with ear traits might also have larger effects. To address this hypothesis, all pleiotropic GWAS SNPs were grouped according to whether they showed pleiotropy within or between trait categories (tassel, ear, and flowering/leaf; Figure 6). In general there are no differences in QTL effects between types of pleiotropic QTL within a trait category: pleiotropic tassel QTL have similarly-sized effects regardless of whether they are pleiotropic with ear, flowering/leaf, or other tassel QTL. The same pattern is observed for pleiotropic flowering and leaf QTL. The one exception is that ear QTL pleiotropic with flowering/leaf QTL appear slightly smaller than ear QTL pleiotropic with other ear QTL (Kolmogorov-Smirnov test p = 0.005).
When there is shared genetic control between ear traits and other traits, ear effects are larger than effects for other traits. Similarly, when there is shared genetic control between flowering/leaf traits and other traits, flowering/leaf effects are smaller than effects for other traits. Non-pleiotropic QTL are not displayed in Figure 6 but have significantly smaller effects than pleiotropic QTL, suggesting that our power to detect pleiotropy may be greater for QTL with larger effects.
SBP-domain genes are enriched for proximity to tassel branching loci
Induced and spontaneous mutations in many maize genes cause dramatic inflorescence phenotypes (Table 3). We considered these genes to be candidates for our maize inflorescence QTL, and tested them for enriched proximity to our GWAS SNPs for maize inflorescence traits. Two of the genes responsible for changes in inflorescence morphology during maize domestication have also been identified: teosinte glume architecture (tga1) encodes a squamosa-binding-protein (SBP)-domain transcription factor [22] and teosinte branched1 (tb1) encodes a TCP-domain protein [21]. For this reason, annotated SBP-domain and TCP-domain genes in the maize genome were also considered to be candidates and tested for enriched proximity to our GWAS SNPs for maize inflorescence traits. To test for enrichment, we considered only the ten GWAS SNPs with the highest RMIP values for each trait, both to minimize the number of tests and because we assumed that these high-RMIP SNPs would be closely linked to their causal polymorphisms. For each of the three sets of candidates (26 genes identified using induced or spontaneous mutations, 17 SBP genes, and 24 TCP genes), we calculated the genetic distance to the nearest GWAS SNP for each gene and compared these results to a null distribution estimated from 1000 sets of the same number of random genes. For instance, the null distribution for SBP genes was estimated from 1000 sets of 17 random genes. Cloned maize inflorescence mutants showed slight enrichment for proximity to tassel length and spike length loci: GWAS SNPs for both these traits fell within 1 cM of the fea2 and td1 loci (Figure S6-top). SBP-domain genes showed enrichment for proximity to GWAS SNPs for branch number and branch zone length (Figure S6-middle). Overall, three SBP-domain genes are implicated in tassel branching, at 4 Mb on chromosome 2, 205 Mb on chromosome 4, and 139 Mb on chromosome 10 (AGP version 1 coordinates).
The first of these genes corresponds to liguleless1, which lies near a high-RMIP SNP for leaf angle as reported by Tian et al. [4]. SBP genes have no overall enrichment for proximity to GWAS SNPs for leaf angle, however. TCP-domain genes show no significant enrichment for proximity to GWAS SNPs for any trait (Figure S6-bottom). Only the enrichment between SBP-domain genes and branch number survives a Bonferroni correction. Two well-characterized SBP-domain genes were included in our candidate list (tga1 and tsh4), but are not associated with variation in branch number.
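The enrichment test compares candidate genes with random gene sets of the same size. The sketch below illustrates that resampling logic under simplifying assumptions that are not taken from the study: positions lie on a single linear map, the summary statistic is the mean distance to the nearest SNP, and the p-value is the share of random sets at least as close as the candidates. All names and toy positions are hypothetical.

```python
import random


def nearest_distance(pos, snp_positions):
    """Distance from one gene to its nearest GWAS SNP."""
    return min(abs(pos - s) for s in snp_positions)


def mean_nearest(genes, snp_positions):
    """Mean nearest-SNP distance over a set of genes (assumed statistic)."""
    return sum(nearest_distance(g, snp_positions) for g in genes) / len(genes)


def proximity_p_value(candidates, all_genes, snp_positions, n_sets=1000, seed=0):
    """Share of random gene sets lying as close to the SNPs as the candidates."""
    rng = random.Random(seed)
    observed = mean_nearest(candidates, snp_positions)
    hits = 0
    for _ in range(n_sets):
        random_set = rng.sample(all_genes, len(candidates))
        if mean_nearest(random_set, snp_positions) <= observed:
            hits += 1
    return hits / n_sets


if __name__ == "__main__":
    genes = list(range(1000))        # toy gene positions on one map
    snps = [100, 500]                # toy top-RMIP SNP positions
    print(proximity_p_value([101, 498], genes, snps))   # close to SNPs
    print(proximity_p_value([998, 999], genes, snps))   # far from SNPs
```

Drawing random sets of the same size as the candidate set keeps the null distribution comparable regardless of how many candidates are tested.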
Discussion
Low-frequency SNPs with very large effects may represent linked loci
Low-frequency SNPs found in four families or fewer account for most of the largest GWAS effects (Figure 3). Lack of power likely accounts for both the failure to detect small-effect GWAS SNPs at low frequency and the greater proportion of intermediate-frequency GWAS SNPs relative to the null distribution (see [4] Fig. 4). Lack of power does not help explain the over-representation of large-effect SNPs at low frequency, however. Causal variants at low and high frequencies are more likely to be matched by random SNPs. A causal variant present in one or 25 of the 26 families has just 26 possible incidence patterns, whereas a causal variant present in 13 families has over 10 million possible incidence patterns. Our dataset of 1.6 million SNPs is too small to tag all causal variants, and we are far less likely to tag intermediate-frequency than low- or high-frequency variants. We observe large-effect SNPs at low frequency but not at high frequency, however. One explanation is linkage: linked variants with effects in the same direction will more often be combined into a single “synthetic” effect if they are present at low frequency. Low-frequency SNPs with very large effects also have low RMIP values (Figure S5), which supports this explanation: rare recombinant individuals allow separation of linked synthetic loci, but are sampled only intermittently. Because all GWAS SNPs with effects over 0.3 standard deviations in this study are found in a single family, we hypothesize that they result from linked QTL. These large effects explain a small proportion of the total phenotypic variation because their frequencies are low.
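The incidence-pattern counts quoted above are binomial coefficients and can be verified with a few lines of code. The function name is illustrative; the only inputs are the 26 families and the number of families carrying the variant.

```python
from math import comb


def incidence_patterns(n_carrier_families, n_families=26):
    """Number of distinct ways a variant can be present in exactly
    n_carrier_families out of n_families."""
    return comb(n_families, n_carrier_families)


if __name__ == "__main__":
    for k in (1, 13, 25):
        print(k, incidence_patterns(k))
    # prints:
    # 1 26
    # 13 10400600
    # 25 26
```

With roughly 10.4 million possible patterns at 13 families, a panel of 1.6 million SNPs cannot contain a match for every intermediate-frequency pattern, whereas all 26 single-family patterns are easily covered.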
Increased lability of the maize inflorescence
Larger QTL effects may reflect either larger effects of individual causal variants or greater linkage disequilibrium between causal variants with effects in the same direction. The latter phenomenon is expected to be most prevalent for SNPs found in a single family. However, the difference in magnitude between inflorescence and flowering/leaf effects holds true across the entire range of SNP frequencies (Figure 3), suggesting that individual inflorescence variants have larger effects than individual flowering or leaf variants. Also noteworthy is the deficiency of small effects for inflorescence variants, which cannot feasibly be due to linkage. Since many inflorescence traits are pleiotropic with flowering and leaf traits, we assume that many of the same polymorphisms underlie these QTL for different traits. Even in instances of shared genetic control, however, inflorescence effects are larger than flowering/leaf effects, and ear effects are larger than tassel effects (Figure 6). This does not support the scenario that inflorescence polymorphisms are unique, consisting for example of more frame-shifts, premature stop codons, or nonsynonymous substitutions. Rather, these results suggest that the maize inflorescence, and the maize ear in particular, is more labile.
Other traits with distinct effect sizes
Three flowering traits show a disjunct distribution of effect sizes, with days to anthesis (DA) and days to silking (DS) effects much smaller than anthesis-silking interval (ASI) effects (Figure S4). Stabilizing selection over millions of years may have purged Zea populations of large-effect variants for DA and DS due to the fitness cost of flowering too early or late relative to the rest of the population. In contrast, ASI may be a much “younger” trait specific to the apically-dominant architecture of the maize plant. Our scaling procedures may also have inflated effects for ASI. The development and maintenance of inbred lines by self-fertilization strongly selects for synchronous male and female flowering (ASI values close to zero), reducing the total heritable variation in ASI and increasing our scaled ASI QTL effects.
Reduced predictive ability of additive models for ear traits
The utility of GWAS studies is contingent on their ability to predict phenotypes. In this study we show that simple additive models containing several hundred SNPs explain over 50% of the phenotypic variation in a set of 4892 RILs for most of the 13 maize morphological traits (Figure 4). SNP number in these models could probably be reduced considerably without sacrificing predictive ability by removing SNPs in high linkage disequilibrium with each other [5]. Additive model predictions are least accurate for the ear traits (cob length (CL), cob diameter (CD), ear row number (ERN)), and the flowering trait anthesis-silking interval (ASI). To investigate the nature of this apparent non-additivity, we focus on models without a family term (Figure 4-top) that rely solely on GWAS SNPs to explain phenotypic differences within and between families. Most traits show ∼10% greater predictive ability in the parents than in the RILs, but for cob length this difference is dramatic (∼30%). We observe the opposite situation for cob diameter and ear row number: predictive ability is higher in the RILs than in the parents. Here we interpret these observations in terms of interaction effects. For cob length, additive effects detected in the RILs accurately predict parental phenotypes, so we infer that interaction effects are equally likely to enhance or mask a given QTL (their mean effect is close to zero) and they must be common enough to account for a ∼20% drop in predictive ability in the RILs. For cob diameter and ear row number, additive effects detected in the RILs do not predict parental phenotypes, so we infer that parental phenotypes are caused by more complex interaction effects that are seldom recapitulated in the RILs and have little influence on additive effect sizes.
Pleiotropic loci affect elongation of leaves and inflorescences
We observe several pleiotropic relationships consistent with previous developmental genetic work. Negative pleiotropy between spike length (SL) and branch number (BN) indicates a trade-off between the two, consistent with the finding that a given meristem in the maize inflorescence acquires the fate of either a long indeterminate branch or a short determinate spikelet pair [31]. Knowledge of shared developmental networks, not only between ears and tassels but also between the elongation of vegetative and reproductive structures, can help inform the choice of candidate genes. The QTL with pleiotropic effects on leaf length, tassel length, and cob length may involve genes that function in cell elongation throughout the plant, rather than inflorescence-specific developmental genes.
Distinguishing linkage from pleiotropy
In a biparental family, close linkage of genes cannot be distinguished from pleiotropic effects of a single gene. Assessment of pleiotropy in NAM is made possible by testing correlations between vectors of QTL or SNP effects across the 26 families of RILs. This analysis will only detect pleiotropy when the same polymorphism or haplotype is consistently associated with phenotypic effects on different traits. Another less stringent definition of pleiotropy would allow a single gene to control variation in different traits through different polymorphisms. A possible example of this type of pleiotropy is the liguleless1 (lg1) locus, which is associated with variation in both leaf angle [4] and tassel branching (this study). lg1 encodes an SBP-domain transcription factor. The association of lg1 with leaf angle is supported by its mutant phenotype [32], and the association of SBP-domain transcription factors with branching is supported by our results and by studies in rice [33], [34]. Effect estimates for lg1-linked leaf angle and branch number QTL in NAM are not correlated, suggesting that different polymorphisms may be responsible for the effects of lg1 on leaf and tassel traits. Since structural mutations in a gene are more likely to have effects wherever the gene is expressed, lg1-linked variants for leaf angle and branch number might be cis-regulatory variants operating independently of each other in specific tissues [35]. Pleiotropy of this type cannot be distinguished from linkage in our analyses.
Loci controlling natural variation in maize inflorescence traits are distinct from those uncovered using mutagenesis
Only a small degree of overlap is observed between the location of cloned maize inflorescence development genes and SNPs significant for inflorescence traits (Table 3, Figure S6A). Overlap between SBP-domain genes and loci for tassel branch number shows that our analysis has the power to detect such overlap where it does exist (Figure S6B). Most cloned maize inflorescence genes involve loss-of-function alleles generated by transposon or chemical mutagenesis that have obvious phenotypes in mutant screens. Such screens generally cannot uncover mutations for which there is genetic redundancy. Purifying selection may be relaxed for genes with redundant functions, allowing them to accumulate more mutations that change gene function than non-redundant genes. If this is true, then mutagenesis studies may be somewhat biased against the discovery of loci controlling natural variation.
Effects of selection on genetic architecture
The genetic architecture observed for maize inflorescence traits is novel. Very large effect sizes for a few major loci are commonly observed in plant and animal domesticates, including maize-teosinte segregants [36], divergently-selected dog breeds [12] and chicken populations [14]. Fish populations subjected to habitat change [15] demonstrate that these unusual genetic architectures may be caused by natural as well as human selective pressure. These observations are consistent with theoretical predictions of an exponential distribution of effect sizes underlying adaptation [11]. In each case, the number of large-effect QTL is very few, because large effects quickly move a trait close to its fitness optimum. In contrast, the genetic architecture of inflorescence traits within domesticated Zea is characterized by a shift in the entire distribution of effect sizes, with many more effects of intermediate size and many fewer small effects. Although unusual genetic architectures observed in domesticates are sometimes attributed to human preference for novelty, which may preserve unadaptive, large-effect mutations [12], it is difficult to explain how such a preference for novelty could account for the deficiency of small-effect inflorescence QTL.
Maize domestication released cryptic genetic variation for inflorescence traits [37]. For example, ear row number is invariant in teosinte but varies widely in maize, indicating that all genetic variants for ear row number in maize must be cryptic genetic variants in teosinte. Maize inflorescence QTL may have more large effects and fewer small effects because more of them are caused by newly-released cryptic variants. The distribution of effects for cryptic variation could differ from that of old, standing variation for two reasons. First, large effects become fixed or purged more rapidly than small effects [38]. Second, large effects could become smaller through the gradual accumulation of buffering mutations [39]. This is the canalization hypothesis: organisms evolve robustness to genetic and environmental perturbation. Since the maize ear is a relatively recent creation, it has accumulated the least genetic buffering. These scenarios differ in their prediction of the distribution of effects of new mutations: either large-effect mutations keep arising transiently, or the canalized phenotype becomes resilient to large-effect mutations.
Maize ear and tassel traits have distinct genetic architectures even though they have shared genetic control: pleiotropic loci with effects on both tassel and ear show larger effects on the ear. This is the expected pattern if these pleiotropic loci had phenotypic effects in male but not female inflorescences in teosinte. Following maize domestication, they would act as newly-released cryptic variants in maize ears but not tassels. Maize domestication moved the ear from an axillary position to an apical position in the primary branch, which may have brought it under the control of an apical dominance network [40].
The process of domesticating maize from teosinte transformed plant architecture. The long lateral branch of teosinte with multiple, axillary, two-rowed female inflorescences was reshaped into a short, unbranched structure bearing a single, apical, multi-rowed ear. We present evidence that this process also transformed genetic architecture, creating a state of increased genetic lability in the maize ear that humans have cleverly exploited. Because only a few thousand generations have elapsed since the maize ear was created, ear traits still show a larger range of effect sizes than tassel, flowering, and leaf traits, for which maize and teosinte are phenotypically much more similar.
Future advancements in medicine and agriculture will benefit from an improved understanding of the forces that shape the genetic architecture of complex traits. The most rigorous study to date comparing the genetic architecture of traits within a species [2] examined 97 traits in mice and found little variation in effect size (see [1] Figure 2). These traits were predominantly fitness-related and may have stabilized over many millions of years. By comparing a suite of maize morphological traits that have experienced very different selective pressures over the last 5,000 years, we show that effect sizes are inversely proportional to trait stability and that genetic architecture may vary even when there are common underlying genes. We suggest that most large-effect maize ear QTL represent cryptic genetic variants released by the fixation of large-effect domestication mutations. The release of cryptic variation by directional selection might help explain the seemingly inexhaustible genetic variation in long-term selection experiments [41]. Because transgenesis can have large effects, it may also unveil cryptic variants, suggesting that interaction between natural and transgenic variation could impact phenotypes and selection schemes for a variety of domesticated and agricultural organisms.
Materials and Methods
Plant materials and phenotypic evaluations
The creation of the NAM population of RIL families has been described previously [3], [28]. Another publicly-available maize RIL family, the intermated B73-by-Mo17 (IBM) family, was also included in our analyses, for a total of 4892 RILs from 26 biparental families with B73 as a common parent, and a total of 27 parents. Each inflorescence trait was measured in eight environments. All inflorescence traits were measured in Aurora, NY; Clayton, NC; Urbana, IL; Homestead, FL; and Ponce, PR in 2006, and in Aurora, NY in 2007. Tassel traits were additionally measured in Columbia, MO in 2006 and Urbana, IL in 2007. Ear traits were additionally measured in Clayton, NC in 2007 and Aurora, NY in 2008. In each location, each family was represented by 220 rows: 200 rows of RILs and 10 rows of each parent. Data from some RILs were later discarded to bring the total RIL number to 4892 [28].
Phenotypic data transformation and best linear unbiased predictor (BLUP) calculation
Trait transformations were performed using the boxcox function in R with lambda ranging from −10 to +10 in increments of 0.1, where lambda values of 0 and 1 are equivalent to log and linear transformations, respectively. Branch number and ear row number had maximum-likelihood lambda values of 0.3 and 0.4, respectively. Box-Cox transformed values of these traits were used to calculate BLUPs. BLUPs were calculated in SAS using PROC MIXED and a model with location, set(location), family, family*location, and entry(family) as random effects.
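The lambda search described above can be illustrated with a minimal sketch. It is not the R boxcox function used in the study: it is a plain-Python grid search over the standard Box-Cox profile log-likelihood for an intercept-only normal model, and the function names (boxcox_transform, boxcox_loglik, best_lambda) are hypothetical.

```python
import math

def boxcox_transform(x, lam):
    """Box-Cox transform of positive values; lambda = 0 is the log transform."""
    if abs(lam) < 1e-12:
        return [math.log(v) for v in x]
    return [(v ** lam - 1.0) / lam for v in x]

def boxcox_loglik(x, lam):
    """Profile log-likelihood of lambda under a normal model (constants dropped)."""
    n = len(x)
    y = boxcox_transform(x, lam)
    mean_y = sum(y) / n
    var_y = sum((v - mean_y) ** 2 for v in y) / n  # maximum-likelihood variance
    if var_y <= 0.0:
        # Transformed values collapsed numerically; treat this lambda as unusable.
        return float("-inf")
    return -0.5 * n * math.log(var_y) + (lam - 1.0) * sum(math.log(v) for v in x)

def best_lambda(x, lo=-10.0, hi=10.0, step=0.1):
    """Grid search mirroring the -10 to +10 by 0.1 grid described in the text."""
    n_steps = int(round((hi - lo) / step))
    grid = [round(lo + i * step, 10) for i in range(n_steps + 1)]
    return max(grid, key=lambda lam: boxcox_loglik(x, lam))
```

With this parameterization a lambda of 1 only shifts the data by a constant, which is why it corresponds to a linear (untransformed) analysis.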
Genotypic data and joint linkage analysis
The genotypic dataset consisted of 836 markers, representing the subset of 1106 markers that could be placed unambiguously on the physical map, scored on 4892 RILs. Missing data, consisting primarily of markers that were non-informative in particular families, were imputed as previously described [4]. Joint linkage models were obtained in SAS using the stepwise selection procedure in PROC GLMSELECT. The family term was forced into the model, and each of the 836 possible marker-by-family terms was made available for inclusion. Significance levels for entry and exit of model terms were determined by permutation: phenotypic data were permuted against the genotypic data separately within each family, all 836 marker-by-family terms were tested, and the lowest resulting p-value was recorded for each permutation. One thousand permutations were performed, and alpha was set at 0.05.
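The permutation procedure can be sketched as follows. This is an illustration under assumptions, not the SAS workflow used in the study: min_p_value is a hypothetical callback standing in for the scan of all marker-by-family terms, and the threshold is taken as the empirical alpha-quantile of the null minimum p-values.

```python
import random

def permute_within_family(phenotypes, families, rng):
    """Shuffle phenotype values among the lines of each family separately."""
    permuted = list(phenotypes)
    by_family = {}
    for idx, fam in enumerate(families):
        by_family.setdefault(fam, []).append(idx)
    for indices in by_family.values():
        values = [phenotypes[i] for i in indices]
        rng.shuffle(values)
        for i, v in zip(indices, values):
            permuted[i] = v
    return permuted

def permutation_threshold(phenotypes, families, min_p_value,
                          n_perm=1000, alpha=0.05, seed=0):
    """Entry/exit p-value threshold from the null distribution of minimum p-values.

    min_p_value(permuted_phenotypes) is a hypothetical function that tests every
    marker-by-family term and returns the lowest p-value observed.
    """
    rng = random.Random(seed)
    null_min_p = sorted(
        min_p_value(permute_within_family(phenotypes, families, rng))
        for _ in range(n_perm)
    )
    # With 1000 permutations and alpha = 0.05 this is the 50th smallest value.
    k = max(int(round(alpha * n_perm)) - 1, 0)
    return null_min_p[k]
```

Permuting within families keeps family means intact, so the null distribution reflects only marker-trait association within families.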
SNP imputation, projection, and GWAS analysis
Missing SNP data from the maize HapMap project [29] were imputed as previously described [4]. For non-recombinant RIL marker intervals, SNP values of 0 (common parent allele) and 1 (alternate allele) were assigned according to the parental genotype. For recombinant RIL marker intervals, SNP values between 0 and 1 were assigned based on the physical position of the SNP within the interval and assuming a linear relationship between physical and genetic distance. Projection was also tested assuming a linear relationship between “genespace” and genetic distance, but this had very little effect on the results. GWAS models were fit for each chromosome separately. The phenotypes for each chromosome consisted of residuals from a joint linkage model excluding both the family covariate and all QTL on the chromosome under consideration. GWAS genotypes were obtained by scoring 1.6 million SNPs in the 27 parental lines and then “projecting” these genotypes into the progeny RILs. We employed a subsampling procedure wherein 80% of the RILs from each family were sampled without replacement, and forward regression was used to fit SNPs in the presence of the family term using permutation-derived significance thresholds [4]. This process was repeated 100 times to obtain a resample model inclusion probability (RMIP) value for each SNP ranging from 0 to 1, which represents the proportion of samples in which that SNP was selected. Only SNPs with RMIP values greater than or equal to 0.05 were used for further analysis.
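A minimal sketch of SNP projection and RMIP calculation follows. The function names and argument layout are hypothetical, and the interpolation assumes that a crossover is equally likely anywhere along the physical interval, which is one reading of the linear relationship described above.

```python
def project_snp(parent_allele, left_geno, right_geno, left_pos, right_pos, snp_pos):
    """Expected SNP value (0 to 1) for one RIL within one marker interval.

    parent_allele: SNP genotype of the family's alternate parent (0 or 1),
                   with the common parent coded as 0.
    left_geno, right_geno: RIL genotypes at the flanking markers
                   (0 = common parent, 1 = alternate parent).
    left_pos, right_pos, snp_pos: physical positions.
    """
    if left_geno == right_geno:
        # Non-recombinant interval: the whole interval comes from one parent.
        origin = float(left_geno)
    else:
        # Recombinant interval: interpolate linearly on physical position.
        frac = (snp_pos - left_pos) / (right_pos - left_pos)
        origin = left_geno + frac * (right_geno - left_geno)
    return origin * parent_allele

def rmip(selected_per_resample):
    """Proportion of resamples in which each SNP entered the model."""
    n = len(selected_per_resample)
    counts = {}
    for selected in selected_per_resample:
        for snp in set(selected):
            counts[snp] = counts.get(snp, 0) + 1
    return {snp: c / n for snp, c in counts.items()}

def filter_by_rmip(rmip_values, threshold=0.05):
    """Keep SNPs with RMIP greater than or equal to the threshold."""
    return {snp: v for snp, v in rmip_values.items() if v >= threshold}
```

For example, a SNP one quarter of the way into an interval whose left marker carries the common-parent allele and whose right marker carries the alternate allele receives a value of 0.25 when the alternate parent carries the SNP.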
QTL effect sizes
QTL effects for each trait were divided by the standard deviation of BLUP values across a set of 282 diverse maize lines that included the 27 parental lines, and multiplied by the broad-sense heritability estimate for that trait. Since a minimum of 26 QTL were detected for each trait, a 26-QTL model was refit for each trait and used to determine effect sizes. This experiment used a reference design (26 inbred lines were each crossed to a common parent), meaning that QTL effect sizes are potentially biased for traits for which the common parent is an outlier. To circumvent this problem, for each QTL we calculated the predicted effects of all pairwise matings between the 26 parents (e.g., for two parents with effects of +1 and −1 relative to the common parent, the predicted QTL effect size in this family is 2), yielding a total of 325 (26 choose 2) effect sizes for each QTL, or a total of 8450 (26 × 325) QTL effects per trait.
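The pairwise calculation can be sketched in a few lines. The function names are hypothetical, and taking the absolute difference (so that effect sizes are unsigned) is an assumption based on the +1 and −1 example above.

```python
from itertools import combinations

def standardize(effect, sd_diverse, heritability):
    """Scale a raw QTL effect: divide by the SD of BLUPs in the diverse panel,
    then multiply by the broad-sense heritability of the trait."""
    return effect / sd_diverse * heritability

def pairwise_effects(effects_vs_common):
    """Predicted effect size for every pairwise mating between parents, given
    each parent's allele effect relative to the common parent (one QTL)."""
    return [abs(a - b) for a, b in combinations(effects_vs_common, 2)]
```

Because each effect is expressed relative to the common parent, the common-parent term cancels in every difference, which is what removes the bias of the reference design. A vector of 26 parental effects yields 325 pairwise values per QTL.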
Pleiotropy
Pleiotropy between pairs of traits in the joint linkage analysis was evaluated as described previously [3]. Briefly, the QTL model for each trait was applied to every other trait, and correlations between effect estimates were used to detect significant pleiotropic QTL. For each QTL in each pairwise trait comparison, the Pearson correlation coefficient (r) between the two effect vectors of length 26 is significant at p<0.01 if its absolute value exceeds 0.495 (two-tailed t distribution, 24 d.f.). The percentage of shared QTL between two traits is the average of two fractions: the fraction of significant correlations when the model for trait 1 is applied to trait 2, and vice versa. Pleiotropy between pairs of traits in GWAS analysis has not been reported previously. First, the effects of all GWAS SNPs for each trait in each family were weighted by their RMIP values and averaged in sliding windows across the genome, in order to derive a vector of effect estimates for each trait in each window. Results presented here used a 5 cM window size and a 2.5 cM step, but similar results were obtained for larger and smaller windows. Second, for each pair of traits, only windows where the sum of RMIP values for each trait fell above a threshold (RMIP = 0.10 for the results presented) were considered. Finally, significance of Pearson correlation coefficients between effect estimates was calculated as for joint linkage analysis.
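The sliding-window comparison for GWAS results might be sketched as below. The data layout (a list of position, RMIP, per-family effects tuples on a single chromosome), the function names, and the normalization of the weighted average by the summed RMIP are all assumptions made for illustration.

```python
import math

R_CRITICAL = 0.495  # threshold quoted in the text for p < 0.01, 24 d.f.

def pearson_r(x, y):
    """Pearson correlation; returns NaN if either vector is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / math.sqrt(sxx * syy)

def window_effects(snps, start, width):
    """Summed RMIP and RMIP-weighted mean effect per family in [start, start + width).

    snps: list of (position_cM, rmip, effects_by_family) tuples.
    """
    inside = [s for s in snps if start <= s[0] < start + width]
    total = sum(r for _, r, _ in inside)
    if total == 0:
        return 0.0, None
    n_fam = len(inside[0][2])
    mean = [sum(r * eff[f] for _, r, eff in inside) / total for f in range(n_fam)]
    return total, mean

def window_correlations(snps_a, snps_b, length_cM, width=5.0, step=2.5, min_rmip=0.10):
    """Correlate effect vectors of two traits in every window passing the RMIP filter."""
    results = []
    start = 0.0
    while start < length_cM:
        tot_a, eff_a = window_effects(snps_a, start, width)
        tot_b, eff_b = window_effects(snps_b, start, width)
        if tot_a > min_rmip and tot_b > min_rmip:
            results.append((start, pearson_r(eff_a, eff_b)))
        start += step
    return results
```

A window would then be called pleiotropic when the absolute value of its correlation exceeds R_CRITICAL, mirroring the joint linkage test.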
Co-localization of QTL and candidate genes
We considered only the top ten GWAS SNPs for each trait, ordered by decreasing RMIP value, on the assumption that these more robustly-selected SNPs should be more closely linked to the causal variants. To test for significant enrichment, the number of high-RMIP SNPs for a given trait that fell within 0.5, 1, and 2 cM of candidates was compared with a null distribution obtained by selecting an equivalent number of random genes (e.g., 17 random genes for comparison to 17 SBP candidates), calculating their proximity to trait SNPs, and repeating this process 1000 times. Selection of random positions rather than random genes represents a far less stringent test, since genes are clustered in the maize genome.
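The enrichment test can be sketched as a simple resampling loop. Positions are represented here as hypothetical (chromosome, cM) tuples, the function names are illustrative, and the empirical p-value (share of random gene sets with at least as many nearby SNPs as the candidates) is an assumed summary of the comparison with the null distribution.

```python
import random

def count_near(snps, genes, max_dist):
    """Number of trait SNPs lying within max_dist cM of any gene in the set."""
    return sum(
        1 for s_chr, s_pos in snps
        if any(s_chr == g_chr and abs(s_pos - g_pos) <= max_dist
               for g_chr, g_pos in genes)
    )

def enrichment_test(snps, candidates, all_genes, max_dist, n_rep=1000, seed=0):
    """Compare the candidate gene set with equally sized random gene sets.

    Random sets are drawn from annotated gene positions rather than random
    genomic positions, which is the more stringent null described in the text.
    """
    rng = random.Random(seed)
    observed = count_near(snps, candidates, max_dist)
    at_least = 0
    for _ in range(n_rep):
        null_genes = rng.sample(all_genes, len(candidates))
        if count_near(snps, null_genes, max_dist) >= observed:
            at_least += 1
    return observed, at_least / n_rep
```

The test would be run once for each distance cutoff (0.5, 1, and 2 cM).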
Supporting Information
References
1. Flint J, Mackay TFC (2009) Genetic architecture of quantitative traits in mice, flies, and humans. Genome Research 19: 723-733. doi:10.1101/gr.086660.108
2. Valdar W, Solberg LC, Gauguier D, Burnett S, Klenerman P (2006) Genome-wide genetic association of complex traits in heterogeneous stock mice. Nat Genet 38: 879-887. doi:10.1038/ng1840
3. Buckler ES, Holland JB, Bradbury PJ, Acharya CB, Brown PJ (2009) The genetic architecture of maize flowering time. Science 325: 714-718. doi:10.1126/science.1174276
4. Tian F, Bradbury PJ, Brown PJ, Hung H, Sun Q (2011) Genome-wide association study of leaf architecture in the maize nested association mapping population. Nat Genet 43: 159-162. doi:10.1038/ng.746
5. Kump KL, Bradbury PJ, Wisser RJ, Buckler ES, Belcher AR (2011) Genome-wide association study of quantitative resistance to southern leaf blight in the maize nested association mapping population. Nat Genet 43: 163-168. doi:10.1038/ng.747
6. Gudbjartsson DF, Walters GB, Thorleifsson G, Stefansson H, Halldorsson BV (2008) Many sequence variants affecting diversity of adult human height. Nat Genet 40: 609-615. doi:10.1038/ng.122
7. Lettre G, Jackson AU, Gieger C, Schumacher FR, Berndt SI (2008) Identification of ten loci associated with height highlights new biological pathways in human growth. Nat Genet 40: 584-591. doi:10.1038/ng.125
8. Weedon MN, Lango H, Lindgren CM, Wallace C, Evans DM (2008) Genome-wide association analysis identifies 20 loci that influence adult height. Nat Genet 40: 575-583. doi:10.1038/ng.121
9. Allen HL, Estrada K, Lettre G, Berndt SI, Weedon MN (2010) Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature
10. Fisher RA (1930) The genetical theory of natural selection. Oxford University Press, Oxford, UK
11. Orr HA (1998) The population genetics of adaptation: the distribution of factors fixed during adaptive evolution. Evolution 52: 935-949
12. Boyko AR, Quignon P, Li L, Schoenebeck JJ, Degenhardt JD (2010) A Simple Genetic Architecture Underlies Morphological Variation in Dogs. PLoS Biol 8: e1000451. doi:10.1371/journal.pbio.1000451
13. Van Laere AS, Nguyen M, Braunschweig M, Nezer C, Collette C (2003) A regulatory mutation in IGF2 causes a major QTL effect on muscle growth in the pig. Nature 425: 832-836
14. Carlborg Ö, Jacobsson L, Åhgren P, Siegel P, Andersson L (2006) Epistasis and the release of genetic variation during long-term selection. Nat Genet 38: 418-420. doi:10.1038/ng1761
15. Colosimo PF, Peichel CL, Nereng K, Blackman BK, Shapiro MD (2004) The genetic architecture of parallel armor plate reduction in threespine sticklebacks. PLoS Biol 2: E109. doi:10.1371/journal.pbio.0020109
16. Salome PA, Bomblies K, Laitinen RAE, Yant L, Mott R (2011) Genetic Architecture of Flowering Time Variation in Arabidopsis thaliana. Genetics. Available: http://www.genetics.org/cgi/doi/10.1534/genetics.111.126607. Accessed 30 Mar 2011
17. Ross-Ibarra J, Morrell PL, Gaut BS (2007) Plant domestication, a unique opportunity to identify the genetic basis of adaptation. Proc Natl Acad Sci U S A 104 Suppl 1: 8641-8648. doi:10.1073/pnas.0700643104
18. Goddard ME, Hayes BJ (2009) Mapping genes for complex traits in domestic animals and their use in breeding programmes. Nat Rev Genet 10: 381-391. doi:10.1038/nrg2575
19. Matsuoka Y, Vigouroux Y, Goodman MM, Sanchez G J, Buckler E (2002) A single domestication for maize shown by multilocus microsatellite genotyping. Proc Natl Acad Sci U S A 99: 6080-6084. doi:10.1073/pnas.052125199
20. Beadle GW (1978) Teosinte and the origin of maize. Maize breeding and genetics. Section 2. Evolution. Available: http://apps.isiknowledge.com/full_record.do?product=CABI&search_mode=GeneralSearch&qid=1&SID=3A47lK9MCanJdApDb&page=1&doc=4. Accessed 15 Mar 2011
21. Doebley J, Stec A, Hubbard L (1997) The evolution of apical dominance in maize. Nature 386: 485-488. doi:10.1038/386485a0
22. Wang H, Nussbaum-Wagler T, Li B, Zhao Q, Vigouroux Y (2005) The origin of the naked grains of maize. Nature 436: 714-719. doi:10.1038/nature03863
23. Beadle GW (1980) The ancestry of corn. Sci Amer 242: 112-119
24. Mangelsdorf PC (1986) The origin of corn. Sci Amer 254: 80-86
25. Vollbrecht E, Schmidt RJ (2009) Development of the Inflorescences. Handbook of Maize: Its Biology. New York, NY: Springer New York. pp. 13-40. Available: http://www.springerlink.com/content/v14444r6930r16v8/. Accessed 21 Mar 2011
26. Parkinson SE, Gross SM, Hollick JB (2007) Maize sex determination and abaxial leaf fates are canalized by a factor that maintains repressed epigenetic states. Dev Biol 308: 462-473. doi:10.1016/j.ydbio.2007.06.004
27. Yu J, Holland JB, McMullen MD, Buckler ES (2008) Genetic design and statistical power of nested association mapping in maize. Genetics 178: 539-551. doi:10.1534/genetics.107.074245
28. McMullen MD, Kresovich S, Villeda HS, Bradbury P, Li H (2009) Genetic Properties of the Maize Nested Association Mapping Population. Science 325: 737-740. doi:10.1126/science.1174320
29. Gore MA, Chia J-M, Elshire RJ, Sun Q, Ersoz ES (2009) A First-Generation Haplotype Map of Maize. Science 326: 1115-1117. doi:10.1126/science.1177837
30. Valdar W, Holmes CC, Mott R, Flint J (2009) Mapping in Structured Populations by Resample Model Averaging. Genetics 182: 1263-1277. doi:10.1534/genetics.109.100727
31. Bortiri E, Chuck G, Vollbrecht E, Rocheford T, Martienssen R (2006) ramosa2 encodes a LATERAL ORGAN BOUNDARY domain protein that determines the fate of stem cells in branch meristems of maize. Plant Cell 18: 574-585. doi:10.1105/tpc.105.039032
32. Moreno MA, Harper LC, Krueger RW, Dellaporta SL, Freeling M (1997) liguleless1 encodes a nuclear-localized protein required for induction of ligules and auricles during maize leaf organogenesis. Genes Dev 11: 616-628
33. Jiao Y, Wang Y, Xue D, Wang J, Yan M (2010) Regulation of OsSPL14 by OsmiR156 defines ideal plant architecture in rice. Nat Genet 42: 541-544. doi:10.1038/ng.591
34. Miura K, Ikeda M, Matsubara A, Song X-J, Ito M (2010) OsSPL14 promotes panicle branching and higher grain productivity in rice. Nat Genet 42: 545-549. doi:10.1038/ng.592
35. Stern D (2010) Evolution, Development, and the Predictable Genome. Roberts & Company, USA. Available: http://www.publish.csiro.au/nid/223/pid/6186.htm. Accessed 21 Mar 2011
36. Doebley J (2004) The genetics of maize evolution. Annu Rev Genet 38: 37-59. doi:10.1146/annurev.genet.38.072902.092425
37. Lauter N, Doebley J (2002) Genetic variation for phenotypically invariant traits detected in teosinte: implications for the evolution of novel forms. Genetics 160: 333
38. Gillespie JH (2004) Population genetics: a concise guide. Johns Hopkins Univ Pr
39. Gibson G (2009) Decanalization and the origin of complex disease. Nature Reviews Genetics 10: 134-140
40. Iltis HH (1983) From Teosinte to Maize: The Catastrophic Sexual Transmutation. Science 222: 886-894. doi:10.1126/science.222.4626.886
41. Dudley JW, Lambert RJ (2004) 100 Generations of Selection for Oil and Protein in Corn. Plant Breeding Rev 24: 79-110. doi:10.1002/9780470650240.ch5
42. DeLong A, Calderon-Urrea A, Dellaporta SL (1993) Sex determination gene TASSELSEED2 of maize encodes a short-chain alcohol dehydrogenase required for stage-specific floral organ abortion. Cell 74: 757-768
43. McSteen P, Malcomber S, Skirpan A, Lunde C, Wu X (2007) barren inflorescence2 encodes a co-ortholog of the PINOID serine/threonine kinase and is required for organogenesis during inflorescence and vegetative development in maize. Plant Physiol 144: 1000-1011. doi:10.1104/pp.107.098558
44. Bensen RJ, Johal GS, Crane VC, Tossberg JT, Schnable PS (1995) Cloning and characterization of the maize An1 gene. Plant Cell 7: 75-84. doi:10.1105/tpc.7.1.75
45. Peng J, Richards DE, Hartley NM, Murphy GP, Devos KM (1999) Green revolution genes encode mutant gibberellin response modulators. Nature 400: 256-261. doi:10.1038/22307
46. Vollbrecht E, Veit B, Sinha N, Hake S (1991) The developmental gene Knotted-1 is a member of a maize homeobox gene family. Nature 350: 241-243. doi:10.1038/350241a0
47. Chuck G, Meeley RB, Hake S (1998) The control of maize spikelet meristem fate by the APETALA2-like gene indeterminate spikelet1. Genes & Development 12: 1145-1154
48. Bomblies K, Wang R-L, Ambrose BA, Schmidt RJ, Meeley RB (2003) Duplicate FLORICAULA/LEAFY homologs zfl1 and zfl2 control inflorescence architecture and flower patterning in maize. Development 130: 2385-2395
49. Acosta IF, Laparra H, Romero SP, Schmelz E, Hamberg M (2009) tasselseed1 is a lipoxygenase affecting jasmonic acid signaling in sex determination of maize. Science 323: 262-265. doi:10.1126/science.1164645
50. Chuck G, Cigan AM, Saeteurn K, Hake S (2007) The heterochronic maize mutant Corngrass1 results from overexpression of a tandem microRNA. Nat Genet 39: 544-549. doi:10.1038/ng2001
51. Chuck G, Meeley R, Irish E, Sakai H, Hake S (2007) The maize tasselseed4 microRNA controls sex determination and meristem cell fate by targeting Tasselseed6/indeterminate spikelet1. Nat Genet 39: 1517-1521. doi:10.1038/ng.2007.20
52. Veit B, Briggs SP, Schmidt RJ, Yanofsky MF, Hake S (1998) Regulation of leaf initiation by the terminal ear 1 gene of maize. Nature 393: 166-168. doi:10.1038/30239
53. Walsh J, Waters CA, Freeling M (1998) The maize gene liguleless2 encodes a basic leucine zipper protein involved in the establishment of the leaf blade-sheath boundary. Genes & Development 12: 208-218. doi:10.1101/gad.12.2.208
54. Gallavotti A, Zhao Q, Kyozuka J, Meeley RB, Ritter MK (2004) The role of barren stalk1 in the architecture of maize. Nature 432: 630-635. doi:10.1038/nature03148
55. Gallavotti A, Barazesh S, Malcomber S, Hall D, Jackson D (2008) sparse inflorescence1 encodes a monocot-specific YUCCA-like gene required for vegetative and reproductive development in maize. Proceedings of the National Academy of Sciences 105: 15196
56. Taguchi-Shiobara F (2001) The fasciated ear2 gene encodes a leucine-rich repeat receptor-like protein that regulates shoot meristem proliferation in maize. Genes & Development 15: 2755-2766. doi:10.1101/gad.208501
57. Bommert P, Lunde C, Nardmann J, Vollbrecht E, Running M (2005) thick tassel dwarf1 encodes a putative maize ortholog of the Arabidopsis CLAVATA1 leucine-rich repeat receptor-like kinase. Development 132: 1235
58. Whipple CJ, Hall DH, DeBlasio S, Taguchi-Shiobara F, Schmidt RJ (2010) A conserved mechanism of bract suppression in the grass family. The Plant Cell Online 22: 565
59. Vollbrecht E, Springer PS, Goh L, Buckler ES IV, Martienssen R (2005) Architecture of floral branch systems in maize and related grasses. Nature 436: 1119-1126. doi:10.1038/nature03892
60. Chuck G, Whipple C, Jackson D, Hake S (2010) The maize SBP-box transcription factor encoded by tasselsheath4 regulates bract development and the establishment of meristem boundaries. Development 137: 1585-1585. doi:10.1242/dev.052373
61. Satoh-Nagasawa N, Nagasawa N, Malcomber S, Sakai H, Jackson D (2006) A trehalose metabolic enzyme controls inflorescence architecture in maize. Nature 441: 227-230. doi:10.1038/nature04725
62. Chuck G (2002) The Control of Spikelet Meristem Identity by the branched silkless1 Gene in Maize. Science 298: 1238-1241. doi:10.1126/science.1076920
63. Muszynski MG, Dam T, Li B, Shirbroun DM, Hou Z (2006) delayed flowering1 encodes a basic leucine zipper protein that mediates floral inductive signals at the shoot apex in maize. Plant physiology 142: 1523
Article published in the journal PLOS Genetics, 2011, Issue 11