Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2011, Advances in Agronomy
Genomic selection," the ability to select for even complex, quantitative traits based on marker data alone, has arisen from the conjunction of new highthroughput marker technologies and new statistical methods needed to analyze the data. This review surveys what is known about these technologies, with sections on population and quantitative genetic background, DNA marker development, statistical methods, reported accuracies of genomic selection (GS) predictions, prediction of nonadditive genetic effects, prediction in the presence of subpopulation structure, and impacts of GS on long-term gain. GS works by estimating the effects of many loci spread across the genome. Marker and observation numbers therefore need to scale with the genetic map length in Morgans and with the effective population size of the population under GS. For typical crops, the requirements range from at least 200 to at most 10,000 markers and observations. With that baseline, GS can greatly accelerate the breeding cycle while also using marker information to maintain genetic diversity and potentially prolong gain beyond what is possible with phenotypic selection. With the costs of marker technologies continuing to decline and the statistical methods becoming more routine, the results reviewed here suggest that GS will play a large role in the plant breeding of the future. Our summary and interpretation should prove useful to breeders as they assess the value of GS in the context of their populations and resources.
DNA markers have enormous potential to improve the efficiency and precision of conventional plant breeding via marker-assisted selection (MAS). The large number of quantitative trait loci (QTLs) mapping studies for diverse crops species have provided an abundance of DNA marker-trait associations. In this review, we present an overview of the advantages of MAS and its most widely used applications in plant breeding, providing examples from cereal crops. We also consider reasons why MAS has had only a small impact on plant breeding so far and suggest ways in which the potential of MAS can be realized. Finally, we discuss reasons why the greater adoption of MAS in the future is inevitable, although the extent of its use will depend on available resources, especially for orphan crops, and may be delayed in less-developed countries. Achieving a substantial impact on crop improvement by MAS represents the great challenge for agricultural scientists in the next few decades.
Frontiers in genetics, 2016
Genomic selection (GS) is a promising approach exploiting molecular genetic markers to design novel breeding programs and to develop new markers-based models for genetic evaluation. In plant breeding, it provides opportunities to increase genetic gain of complex traits per unit time and cost. The cost-benefit balance was an important consideration for GS to work in crop plants. Availability of genome-wide high-throughput, cost-effective and flexible markers, having low ascertainment bias, suitable for large population size as well for both model and non-model crop species with or without the reference genome sequence was the most important factor for its successful and effective implementation in crop species. These factors were the major limitations to earlier marker systems viz., SSR and array-based, and was unimaginable before the availability of next-generation sequencing (NGS) technologies which have provided novel SNP genotyping platforms especially the genotyping by sequencin...
Agronomy, 2022
Plant geneticists and breeders have used marker technology since the 1980s in quantitative trait locus (QTL) identification. Marker-assisted selection is effective for large-effect QTL but has been challenging to use with quantitative traits controlled by multiple minor effect alleles. Therefore, genomic selection (GS) was proposed to estimate all markers simultaneously, thereby capturing all their effects. However, breeding programs are still struggling to identify the best strategy to implement it into their programs. Traditional breeding programs need to be optimized to implement GS effectively. This review explores the optimization of breeding programs for variety release based on aspects of the breeder’s equation. Optimizations include reorganizing field designs, training populations, increasing the number of lines evaluated, and leveraging the large amount of genomic and phenotypic data collected across different growing seasons and environments to increase heritability estima...
Trends in plant science, 2017
Genomic selection (GS) facilitates the rapid selection of superior genotypes and accelerates the breeding cycle. In this review, we discuss the history, principles, and basis of GS and genomic-enabled prediction (GP) as well as the genetics and statistical complexities of GP models, including genomic genotype×environment (G×E) interactions. We also examine the accuracy of GP models and methods for two cereal crops and two legume crops based on random cross-validation. GS applied to maize breeding has shown tangible genetic gains. Based on GP results, we speculate how GS in germplasm enhancement (i.e., prebreeding) programs could accelerate the flow of genes from gene bank accessions to elite lines. Recent advances in hyperspectral image technology could be combined with GS and pedigree-assisted breeding.
Next Generation Plant Breeding
Major paradigm shift in plant breeding since the availability of molecular marker technology is that mapping and characterizing the genetic loci that control a trait will lead to improved breeding. Often, one of the rationales for cloning of QTL is to develop the "perfect marker" for MAS, perhaps based on a functional polymorphism. In contrast, an advantage of genomic selection is precisely its black box approach to exploiting genotyping technology to expedite genetic progress. This is an advantage in our view because it does not rely on a "breeding by design" engineering approach to cultivar development requiring knowledge of biological function before the creation of phenotypes. Breeders can therefore use genomic selection without the large upfront cost of obtaining that knowledge. In addition, genomic selection can maintain the creative nature of phenotypic selection which couple's random mutation and recombination to sometimes arrive at solutions outside the engineer's scope. Currently, the lion's share of research on genomic selection has been performed in livestock breeding, where effective population size, extent of LD, breeding objectives, experimental design, and other characteristics of populations and breeding programs are quite different from those of crop species. Nevertheless, a great number of findings within this literature are very illuminating for genomic selection in crops and should be studied and built upon by crop geneticists and breeders. The application of powerful, relatively new statistical methods to the problem of high dimensional marker data for genomic selection has been nearly as important to the development of genomic selection as the creation of high-density marker platforms and greater computing power. The methods can be classified by what type of genetic architecture they try to capture.
Journal of Crop Improvement, 2011
Genomic selection (GS) has been implemented in animal and plant species, and is regarded as a useful tool for accelerating genetic gains. Varying levels of genomic prediction accuracy have been obtained in plants, depending on the prediction problem assessed and on several other factors, such as trait heritability, the relationship between the individuals to be predicted and those used to train the models for prediction, number of markers, sample size and genotype  environment interaction (GE). The main objective of this article is to describe the results of genomic prediction in International Maize and Wheat Improvement Center's (CIMMYT's) maize and wheat breeding programs, from the initial assessment of the predictive ability of different models using pedigree and marker information to the present, when methods for implementing GS in practical global maize and wheat breeding programs are being studied and investigated. Results show that pedigree (population structure) accounts for a sizeable proportion of the prediction accuracy when a global population is the prediction problem to be assessed. However, when the prediction uses unrelated populations to train the prediction equations, prediction accuracy becomes negligible. When genomic prediction includes modeling GE, an increase in prediction accuracy can be achieved by borrowing information from correlated environments. Several questions on how to incorporate GS into CIMMYT's maize and wheat programs remain unanswered and subject to further investigation, for example, prediction within and between related biparental crosses. Further research on the quantification of breeding value components for GS in plant breeding populations is required.
2021
ABSTRACTThe selection of informative markers has been studied massively as an alternative to reduce genotyping costs for the genomic selection (GS) application. Low-density marker panels are attractive for GS because they decrease computational time-consuming and multicollinearity beyond more individuals can be genotyped with the same cost. Nevertheless, these inferences are usually made empirically using “static” training sets and populations, which are adequate only to predict a breeding program’s initial cycles but might not for long-term cycles. Moreover, to the best of our knowledge, none of these inferences considered the inclusion of dominance into the GS models, which is particularly important to predict cross-pollinated crops. Therefore, that reveals an important and unexplored topic for allogamous long-term breeding. To achieve this goal, we employed two approaches: the former used empirical maize datasets, and the latter simulations of long-term breeding cycles of phenoty...
Crop Science, 2010
Janick/Plant Breeding Reviews V33, 2010
The Plant Genome, 2015
Prediction accuracy of genomic selection (GS) has been previously evaluated through simulation and cross-validation; however, validation based on progeny performance in a plant breeding program has not been investigated thoroughly. We evaluated several prediction models in a dynamic barley breeding population comprised of 647 six-row lines using four traits differing in genetic architecture and 1536 single nucleotide polymorphism (SNP) markers. The breeding lines were divided into six sets designated as one parent set and five consecutive progeny sets comprised of representative samples of breeding lines over a 5-yr period. We used these data sets to investigate the effect of model and training population composition on prediction accuracy over time. We found little difference in prediction accuracy among the models confirming prior studies that found the simplest model, random regression best linear unbiased prediction (RR-BLUP), to be accurate across a range of situations. In general, we found that using the parent set was sufficient to predict progeny sets with little to no gain in accuracy from generating larger training populations by combining the parent set with subsequent progeny sets. The prediction accuracy ranged from 0.03 to 0.99 across the four traits and five progeny sets. We explored characteristics of the training and validation populations (marker allele frequency, population structure, and linkage disequilibrium, LD) as well as characteristics of the trait (genetic architecture and heritability, H 2). Fixation of markers associated with a trait over time was most clearly associated with reduced prediction accuracy for the mycotoxin trait DON. Higher trait H 2 in the training population and simpler trait architecture were associated with greater prediction accuracy. G enomic selection is touted as a marker-based breeding approach that complements traditional markerassisted selection (MAS) and phenotypic selection. In traditional MAS, favorable alleles or genes for relatively simply inherited traits are mapped and then molecular markers linked to those alleles are used to select individuals to use as parents or to advance from segregating breeding populations (Bernardo, 2008). Marker-assisted selection is more effective than phenotypic selection if the tagged loci account for a large portion of the total genetic variation within the population of selection candidates (Collins et al., 2003; Castro et al., 2003; Xu and Crouch, 2008). The limitation of traditional MAS for highly complex traits is that it captures only a small portion of the total genetic variation because it uses a limited number of selected markers (Lande and Thompson, 1990; Bernardo, 2010). Phenotypic selection is effective on quantitative traits, but is limited to stages in breeding cycles and environments where such traits can be measured effectively, such as for advanced lines in multiple location field trials. Therefore, GS can be strategically implemented in
Journal of Genetic Engineering and Biotechnology, 2021
Background DNA markers improved the productivity and accuracy of classical plant breeding by means of marker-assisted selection (MAS). The enormous number of quantitative trait loci (QTLs) mapping read for different plant species have given a plenitude of molecular marker-gene associations. Main body of the abstract In this review, we have discussed the positive aspects of molecular marker-assisted selection and its precise applications in plant breeding programmes. Molecular marker-assisted selection has considerably shortened the time for new crop varieties to be brought to the market. To explore the information about DNA markers, many reviews have been published in the last few decades; all these reviews were intended by plant breeders to obtain information on molecular genetics. In this review, we intended to be a synopsis of recent developments of DNA markers and their application in plant breeding programmes and devoted to early breeders with little or no knowledge about the D...
Crop Science, 2015
African Journal of …, 2010
Mapping and tagging of agriculturally important genes have been greatly facilitated by an array of molecular markers in crop plants. Marker-assisted selection (MAS) is gaining considerable importance as it would improve the efficiency of plant breeding through precise transfer of ...
1999
indicated that genetic improvement usually accounted for about one-half of the total yield increase, with the This paper was presented as part of the symposium entitled "Postremainder attributed to changes in cultural practices Green Revolution Trends in Crop Yield Potential: Increasing, Stagnant or Greater Resistance to Stress." In this presentation, we have such as increased rates of mineral fertilizers and the use focused on (i) uses of marker technology in determining the genetic of herbicides for weed control and pesticides for control basis of phenotypic expression and the manipulation of phenotypic of insects and diseases. Duvick (1997) suggested that the variation in plants. This included the use of markers in understanding increased grain yielding ability of these widely successful heterosis, in attempts to improve hybrid predictions, in quantitative hybrids was due primarily to improved tolerance of abitrait locus (QTL) identification and mapping, in marker-assisted selecotic and biotic stresses, coupled with the maintenance tion (MAS), and in enhancing breeding success in the development of the ability to maximize yield per plant under nonof improved lines and hybrids; (ii) the role of genomics in developing stress growing conditions. a precise understanding of the genetic basis of phenotypic expression Opportunities for gains resulting from changes in culwhich will then provide more precision in the manipulation of phenotural practices are limited (particularly in the USA and typic variation; and (iii) some attempts to integrate marker technology and genomics into empirical breeding strategies. In addition, we have other developed countries). Therefore, future gains in focused on what has been successful as well as what has fallen short the productivity of most crops may depend almost enof expectations, and have suggested some of the possible reasons for tirely on genetic improvements. In fact, environmental the lack of success. Because of page limitations, we could not include concerns may cause a reduction in the use of agricultural an exhaustive review of the plant literature and have limited many chemicals and fertilizers. Also, many parts of the world of our examples to investigations in maize (Zea mays L). may have limited supplies of such chemicals and plant nutrients. Therefore, plant breeders will need to develop and apply new technology (such as marker-assisted se
2010
The availability of thousands of genome wide molecular markers has made possible the use of genomic selection in plants and animals. However, the evaluation of models for genomic selection in plant breeding populations is very limited. In this study, we provide an overview of several models for genomic selection, whose predictive ability we investigated using two plant data sets. One data set contains the historical phenotypic records of a series of wheat (Triticum aestivum L.) trials and recently generated genomic data. The other data set pertains to international maize (Zea mays L.) trials in which two disease traits (Exserohilum turcicum and Cercospora zeae-maydis) were measured in maize lines evaluated in five international environments. Results showed that models including marker information yield important gains in predictive ability, relative to that of a pedigree-based model––this, with a modest number of markers. Estimates of marker effects were different across environment...
Crop Science, 2009
Agricultural Research & Technology: Open Access Journal, 2018
2021
Reductions of genotyping marker density have been extensively evaluated as potential strategies to reduce the genotyping costs of genomic selection (GS). Low-density marker panels are appealing in GS because they entail lower multicollinearity and computational time-consumption and allow more individuals to be genotyped for the same cost. However, statistical models used in GS are usually evaluated with empirical data, using "static" training sets and populations. This may be adequate for making predictions during a breeding program's initial cycles, but not for the long term. Moreover, to the best of our knowledge, no GS models consider the effect of dominance, which is particularly important for breeding outcomes in cross-pollinated crops. Hence, dominance effects are an important and unexplored issue in GS for long-term programs involving allogamous species. To address it, we employed two approaches: analysis of empirical maize datasets and simulations of long-term ...
Genetics
The availability of dense molecular markers has made possible the use of genomic selection (GS) for plant breeding. However, the evaluation of models for GS in real plant populations is very limited. This article evaluates the performance of parametric and semiparametric models for GS using wheat (Triticum aestivum L.) and maize (Zea mays) data in which different traits were measured in several environmental conditions. The findings, based on extensive cross-validations, indicate that models including marker information had higher predictive ability than pedigree-based models. In the wheat data set, and relative to a pedigree model, gains in predictive ability due to inclusion of markers ranged from 7.7 to 35.7%. Correlation between observed and predictive values in the maize data set achieved values up to 0.79. Estimates of marker effects were different across environmental conditions, indicating that genotype 3 environment interaction is an important component of genetic variability. These results indicate that GS in plant breeding can be an effective strategy for selecting among lines whose phenotypes have yet to be observed.
BMC Bioinformatics, 2014
Background: Genomic selection (GS) promises to improve accuracy in estimating breeding values and genetic gain for quantitative traits compared to traditional breeding methods. Its reliance on high-throughput genome-wide markers and statistical complexity, however, is a serious challenge in data management, analysis, and sharing. A bioinformatics infrastructure for data storage and access, and user-friendly web-based tool for analysis and sharing output is needed to make GS more practical for breeders. Results: We have developed a web-based tool, called solGS, for predicting genomic estimated breeding values (GEBVs) of individuals, using a Ridge-Regression Best Linear Unbiased Predictor (RR-BLUP) model. It has an intuitive web-interface for selecting a training population for modeling and estimating genomic estimated breeding values of selection candidates. It estimates phenotypic correlation and heritability of traits and selection indices of individuals. Raw data is stored in a generic database schema, Chado Natural Diversity, co-developed by multiple database groups. Analysis output is graphically visualized and can be interactively explored online or downloaded in text format. An instance of its implementation can be accessed at the NEXTGEN Cassava breeding database, http://cassavabase.org/solgs.
BMC genomics, 2015
Genomic selection (GS) in forestry can substantially reduce the length of breeding cycle and increase gain per unit time through early selection and greater selection intensity, particularly for traits of low heritability and late expression. Affordable next-generation sequencing technologies made it possible to genotype large numbers of trees at a reasonable cost. Genotyping-by-sequencing was used to genotype 1,126 Interior spruce trees representing 25 open-pollinated families planted over three sites in British Columbia, Canada. Four imputation algorithms were compared (mean value (MI), singular value decomposition (SVD), expectation maximization (EM), and a newly derived, family-based k-nearest neighbor (kNN-Fam)). Trees were phenotyped for several yield and wood attributes. Single- and multi-site GS prediction models were developed using the Ridge Regression Best Linear Unbiased Predictor (RR-BLUP) and the Generalized Ridge Regression (GRR) to test different assumption about tra...
Genetics Selection Evolution, 2013
Background: Arguably, genotypes and phenotypes may be linked in functional forms that are not well addressed by the linear additive models that are standard in quantitative genetics. Therefore, developing statistical learning models for predicting phenotypic values from all available molecular information that are capable of capturing complex genetic network architectures is of great importance. Bayesian kernel ridge regression is a non-parametric prediction model proposed for this purpose. Its essence is to create a spatial distance-based relationship matrix called a kernel. Although the set of all single nucleotide polymorphism genotype configurations on which a model is built is finite, past research has mainly used a Gaussian kernel. Results: We sought to investigate the performance of a diffusion kernel, which was specifically developed to model discrete marker inputs, using Holstein cattle and wheat data. This kernel can be viewed as a discretization of the Gaussian kernel. The predictive ability of the diffusion kernel was similar to that of non-spatial distance-based additive genomic relationship kernels in the Holstein data, but outperformed the latter in the wheat data. However, the difference in performance between the diffusion and Gaussian kernels was negligible. Conclusions: It is concluded that the ability of a diffusion kernel to capture the total genetic variance is not better than that of a Gaussian kernel, at least for these data. Although the diffusion kernel as a choice of basis function may have potential for use in whole-genome prediction, our results imply that embedding genetic markers into a non-Euclidean metric space has very small impact on prediction. Our results suggest that use of the black box Gaussian kernel is justified, given its connection to the diffusion kernel and its similar predictive performance.
PLoS Genetics, 2012
Predicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we report genomic prediction for starvation stress resistance and startle response in Drosophila melanogaster, using ,2.5 million SNPs determined by sequencing the Drosophila Genetic Reference Panel population of inbred lines. We constructed a genomic relationship matrix from the SNP data and used it in a genomic best linear unbiased prediction (GBLUP) model. We assessed predictive ability as the correlation between predicted genetic values and observed phenotypes by cross-validation, and found a predictive ability of 0.23960.008 (0.23060.012) for starvation resistance (startle response). The predictive ability of BayesB, a Bayesian method with internal SNP selection, was not greater than GBLUP. Selection of the 5% SNPs with either the highest absolute effect or variance explained did not improve predictive ability. Predictive ability decreased only when fewer than 150,000 SNPs were used to construct the genomic relationship matrix. We hypothesize that predictive power in this population stems from the SNP-based modeling of the subtle relationship structure caused by long-range linkage disequilibrium and not from population structure or SNPs in linkage disequilibrium with causal variants. We discuss the implications of these results for genomic prediction in other organisms.
Theoretical and Applied Genetics, 2013
LASSO, ridge regression, reproducing kernel Hilbert space, and partial least square regression. The accuracy of the predictions was assessed through the correlation between observed and predicted genetic values by cross validation within each panel and between the two panels. We observed equivalent accuracy among the four predictive models for a given trait, and marked differences were observed among traits. Depending on the trait concerned, within-panel cross validation yielded median correlations ranging from 0.29 to 0.62 in the Reunion Island panel and from 0.11 to 0.5 in the Guadeloupe panel. Cross validation between panels yielded correlations ranging from 0.13 for smut resistance to 0.55 for brix. This level of correlations is promising for future implementations. Our results provide the first validation of genomic selection in sugarcane.
PLoS ONE, 2013
Genome-wide molecular markers are often being used to evaluate genetic diversity in germplasm collections and for making genomic selections in breeding programs. To accurately predict phenotypes and assay genetic diversity, molecular markers should assay a representative sample of the polymorphisms in the population under study. Ascertainment bias arises when marker data is not obtained from a random sample of the polymorphisms in the population of interest. Genotyping-by-sequencing (GBS) is rapidly emerging as a low-cost genotyping platform, even for the large, complex, and polyploid wheat (Triticum aestivum L.) genome. With GBS, marker discovery and genotyping occur simultaneously, resulting in minimal ascertainment bias. The previous platform of choice for whole-genome genotyping in many species such as wheat was DArT (Diversity Array Technology) and has formed the basis of most of our knowledge about cereals genetic diversity. This study compared GBS and DArT marker platforms for measuring genetic diversity and genomic selection (GS) accuracy in elite U.S. soft winter wheat. From a set of 365 breeding lines, 38,412 single nucleotide polymorphism GBS markers were discovered and genotyped. The GBS SNPs gave a higher GS accuracy than 1,544 DArT markers on the same lines, despite 43.9% missing data. Using a bootstrap approach, we observed significantly more clustering of markers and ascertainment bias with DArT relative to GBS. The minor allele frequency distribution of GBS markers had a deficit of rare variants compared to DArT markers. Despite the ascertainment bias of the DArT markers, GS accuracy for three traits out of four was not significantly different when an equal number of markers were used for each platform. This suggests that the gain in accuracy observed using GBS compared to DArT markers was mainly due to a large increase in the number of markers available for the analysis.
PloS one, 2015
The prediction of the flowering time (FT) trait in Brassica napus based on genome-wide markers and the detection of underlying genetic factors is important not only for oilseed producers around the world but also for the other crop industry in the rotation system in China. In previous studies the low density and mixture of biomarkers used obstructed genomic selection in B. napus and comprehensive mapping of FT related loci. In this study, a high-density genome-wide SNP set was genotyped from a double-haploid population of B. napus. We first performed genomic prediction of FT traits in B. napus using SNPs across the genome under ten environments of three geographic regions via eight existing genomic predictive models. The results showed that all the models achieved comparably high accuracies, verifying the feasibility of genomic prediction in B. napus. Next, we performed a large-scale mapping of FT related loci among three regions, and found 437 associated SNPs, some of which represe...
Developing high yielding rice varieties that are tolerant to drought stress is crucial for the sustainable livelihood of rice farmers in rainfed rice cropping ecosystems. Genomic selection (GS) promises to be an effective breeding option for these complex traits. We evaluated the effectiveness of two rather new options in the implementation of GS: trait and environment-specific marker selection and the use of multi-environment prediction models. A reference population of 280 rainfed lowland accessions endowed with 215k SNP markers data was phenotyped under a favorable and two managed drought environments. Trait-specific SNP subsets (28k) were selected for each trait under each environment, using results of GWAS performed with the complete genotype dataset. Performances of single-environment and multi-environment genomic prediction models were compared using kernel regression based methods (GBLUP and RKHS) under two cross validation scenario: availability (CV2) or not (CV1) of phenot...
PLOS ONE
The high concentration of arsenic (As) in rice grains, in a large proportion of the rice growing areas, is a critical issue. This study explores the feasibility of conventional (QTL-based) marker-assisted selection and genomic selection to improve the ability of rice to prevent As uptake and accumulation in the edible grains. A japonica diversity panel (RP) of 228 accessions phenotyped for As concentration in the flag leaf (FL-As) and in the dehulled grain (CG-As), and genotyped at 22,370 SNP loci, was used to map QTLs by association analysis (GWAS) and to train genomic prediction models. Similar phenotypic and genotypic data from 95 advanced breeding lines (VP) with japonica genetic backgrounds, was used to validate related QTLs mapped in the RP through GWAS and to evaluate the predictive ability of across populations (RP-VP) genomic estimate of breeding value (GEBV) for As exclusion. Several QTLs for FL-As and CG-As with a low-medium individual effect were detected in the RP, of which some colocalized with known QTLs and candidate genes. However, less than 10% of those QTLs could be validated in the VP without loosening colocalization parameters. Conversely, the average predictive ability of across populations GEBV was rather high, 0.43 for FL-As and 0.48 for CG-As, ensuring genetic gains per time unit close to phenotypic selection. The implications of the limited robustness of the GWAS results and the rather high predictive ability of genomic prediction are discussed for breeding rice for significantly low arsenic uptake and accumulation in the edible grains. cultivated soil, or in the irrigation water, and in the rice plant . As accumulation in the rice plant is the highest in the roots, followed by the straw, the whole (cargo) grain and the polished (white) grain [2, 5, 6]. Presence of As in the paddy field also affects crop growth and development and, consequently, crop yield .
Heredity
Here, we perform cross-generational GS analysis on coastal Douglas-fir (Pseudotsuga menziesii), reflecting transgenerational selective breeding application. A total of 1321 trees, representing 37 full-sib F 1 families from 3 environments in British Columbia, Canada, were used as the training population for (1) EBVs (estimated breeding values) of juvenile height (HTJ) in the F 1 generation predicting genomic EBVs of HTJ of 136 individuals in the F 2 generation, (2) deregressed EBVs of F 1 HTJ predicting deregressed genomic EBVs of F 2 HTJ, (3) F 1 mature height (HT35) predicting HTJ EBVs in F 2 , and (4) deregressed F 1 HT35 predicting genomic deregressed HTJ EBVs in F 2 . Ridge regression best linear unbiased predictor (RR-BLUP), generalized ridge regression (GRR), and Bayes-B GS methods were used and compared to pedigree-based (ABLUP) predictions. GS accuracies for scenarios 1 (0.92, 0.91, and 0.91) and 3 (0.57, 0.56, and 0.58) were similar to their ABLUP counterparts (0.92 and 0.60, respectively) (using RR-BLUP, GRR, and Bayes-B). Results using deregressed values fell dramatically for both scenarios 2 and 4 which approached zero in many cases. Cross-generational GS validation of juvenile height in Douglas-fir produced predictive accuracies almost as high as that of ABLUP. Without capturing LD, GS cannot surpass the prediction of ABLUP. Here we tracked pedigree relatedness between training and validation sets. More markers or improved distribution of markers are required to capture LD in Douglas-fir. This is essential for accurate forward selection among siblings as markers that track pedigree are of little use for forward selection of individuals within controlled pollinated families.
TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik, 2018
Multi-trait genomic prediction models are useful to allocate available resources in breeding programs by targeted phenotyping of correlated traits when predicting expensive and labor-intensive quality parameters. Multi-trait genomic prediction models can be used to predict labor-intensive or expensive correlated traits where phenotyping depth of correlated traits could be larger than phenotyping depth of targeted traits, reducing resources and improving prediction accuracy. This is particularly important in the context of allocating phenotyping resource in plant breeding programs. The objective of this work was to evaluate multi-trait models predictive ability with different depth of phenotypic information from correlated traits. We evaluated 495 wheat advanced breeding lines for eight baking quality traits which were genotyped with genotyping-by-sequencing. Through different approaches for cross-validation, we evaluated the predictive ability of a single-trait model and a multi-tra...
The usefulness of Genomic Prediction (GP) in crop and livestock breeding programs has led to efforts to develop new and improved GP approaches including non-linear algorithm, such as artificial neural networks (ANN) (i.e. deep learning) and gradient tree boosting. However, the performance of these algorithms has not been compared in a systematic manner using a wide range of GP datasets and models. Using data of 18 traits across six plant species with different marker densities and training population sizes, we compared the performance of six linear and five non-linear algorithms, including ANNs. First, we found that hyperparameter selection was critical for all non-linear algorithms and that feature selection prior to model training was necessary for ANNs when the markers greatly outnumbered the number of training lines. Across all species and trait combinations, no one algorithm performed best, however predictions based on a combination of results from multiple GP algorithms (i.e. ...
Frontiers in Plant Science
Most important food and feed crops in the world belong to the C3 grass family. The future of food security is highly reliant on achieving genetic gains of those grasses. Conventional breeding methods have already reached a plateau for improving major crops. Genomics tools and resources have opened an avenue to explore genome-wide variability and make use of the variation for enhancing genetic gains in breeding programs. Major C3 annual cereal breeding programs are well equipped with genomic tools; however, genomic research of C3 cool-season perennial grasses is lagging behind. In this review, we discuss the currently available genomics tools and approaches useful for C3 cool-season perennial grass breeding. Along with a general review, we emphasize the discussion focusing on forage grasses that were considered orphan and have little or no genetic information available. Transcriptome sequencing and genotype-by-sequencing technology for genome-wide marker detection using next-generation sequencing (NGS) are very promising as genomics tools. Most C3 cool-season perennial grass members have no prior genetic information; thus NGS technology will enhance collinear study with other C3 model grasses like Brachypodium and rice. Transcriptomics data can be used for identification of functional genes and molecular markers, i.e., polymorphism markers and simple sequence repeats (SSRs). Genome-wide association study with NGS-based markers will facilitate marker identification for marker-assisted selection. With limited genetic information, genomic selection holds great promise to breeders for attaining maximum genetic gain of the cool-season C3 perennial grasses. Application of all these tools can ensure better genetic gains, reduce length of selection cycles, and facilitate cultivar development to meet the future demand for food and fodder.
International Journal of Molecular Sciences
Terminal drought is the main stress limiting pea (Pisum sativum L.) grain yield in Mediterranean environments. This study aimed to investigate genotype × environment (GE) interaction patterns, define a genomic selection (GS) model for yield under severe drought based on single nucleotide polymorphism (SNP) markers from genotyping-by-sequencing, and compare GS with phenotypic selection (PS) and marker-assisted selection (MAS). Some 288 lines belonging to three connected RIL populations were evaluated in a managed-stress (MS) environment of Northern Italy, Marchouch (Morocco), and Alger (Algeria). Intra-environment, cross-environment, and cross-population predictive ability were assessed by Ridge Regression best linear unbiased prediction (rrBLUP) and Bayesian Lasso models. GE interaction was particularly large across moderate-stress and severe-stress environments. In proof-of-concept experiments performed in a MS environment, GS models constructed from MS environment and Marchouch da...
PLOS ONE
Genetic progress for forage quality has been poor in alfalfa (Medicago sativa L.), the mostgrown forage legume worldwide. This study aimed at exploring opportunities for markerassisted selection (MAS) and genomic selection of forage quality traits based on breeding values of parent plants. Some 154 genotypes from a broadly-based reference population were genotyped by genotyping-by-sequencing (GBS), and phenotyped for leaf-to-stem ratio, leaf and stem contents of protein, neutral detergent fiber (NDF) and acid detergent lignin (ADL), and leaf and stem NDF digestibility after 24 hours (NDFD), of their dense-planted half-sib progenies in three growing conditions (summer harvest, full irrigation; summer harvest, suspended irrigation; autumn harvest). Trait-marker analyses were performed on progeny values averaged over conditions, owing to modest germplasm × condition interaction. Genomic selection exploited 11,450 polymorphic SNP markers, whereas a subset of 8,494 M. truncatula-aligned markers were used for a genome-wide association study (GWAS). GWAS confirmed the polygenic control of quality traits and, in agreement with phenotypic correlations, indicated substantially different genetic control of a given trait in stems and leaves. It detected several SNPs in different annotated genes that were highly linked to stem protein content. Also, it identified a small genomic region on chromosome 8 with high concentration of annotated genes associated with leaf ADL, including one gene probably involved in the lignin pathway. Three genomic selection models, i.e., Ridge-regression BLUP, Bayes B and Bayesian Lasso, displayed similar prediction accuracy, whereas SVR-lin was less accurate. Accuracy values were moderate (0.3-0.4) for stem NDFD and leaf protein content, modest for leaf ADL and NDFD, and low to very low for the other traits. Along with previous results for the same germplasm set, this study indicates that GBS data can be exploited to improve both quality traits (by genomic selection or MAS) and forage yield.
Frontiers in Plant Science
Genotyping-by-Sequencing (GBS) may drastically reduce genotyping costs compared with single nucleotide polymorphism (SNP) array platforms. However, it may require optimization for specific crops to maximize the number of available markers. Exploiting GBS-generated markers may require optimization, too (e.g., to cope with missing data). This study aimed (i) to compare elements of GBS protocols on legume species that differ for genome size, ploidy, and breeding system, and (ii) to show successful applications and challenges of GBS data on legume species. Preliminary work on alfalfa and Medicago truncatula suggested the greater interest of ApeKI over PstI:MspI DNA digestion. We compared KAPA and NEB Taq polymerases in combination with primer extensions that were progressively more selective on restriction sites, and found greater number of polymorphic SNP loci in pea, white lupin and diploid alfalfa when adopting KAPA with a non-selective primer. This protocol displayed a slight advantage also for tetraploid alfalfa (where SNP calling requires higher read depth). KAPA offered the further advantage of more uniform amplification than NEB over fragment sizes and GC contents. The number of GBS-generated polymorphic markers exceeded 6,500 in two tetraploid alfalfa reference populations and a world collection of lupin genotypes, and 2,000 in different sets of pea or lupin recombinant inbred lines. The predictive ability of GBS-based genomic selection was influenced by the genotype missing data threshold and imputation, as well as by the genomic selection model, with the best model depending on traits and data sets. We devised a simple method for comparing phenotypic vs. genomic selection in terms of predicted yield gain per year for same evaluation costs, whose application to preliminary data for alfalfa and pea in a hypothetical selection scenario for each crop indicated a distinct advantage of genomic selection.
Agronomy
Genomic selection can increase the rate of genetic gain in crops through accumulation of positive alleles and reduce phenotyping costs by shortening the breeding cycle time. We performed genomic prediction for resistance to wheat rusts in tetraploid wheat accessions using three cross-validation with the objective of predicting: (1) rust resistance when individuals are not tested in all environments/locations, (2) the performance of lines across years, and (3) adult plant resistance (APR) of lines with bivariate models. The rationale for the latter is that seedling assays are faster and could increase prediction accuracy for APR. Predictions were derived from adult plant and seedling responses for leaf rust (Lr), stem rust (Sr) and stripe rust (Yr) in a panel of 391 accessions grown across multiple years and locations and genotyped using 16,483 single nucleotide polymorphisms. Different Bayesian models and genomic best linear unbiased prediction yielded similar accuracies for all tra...
Theoretical and Applied Genetics
Efficiency of breeding programs of legume crops such as chickpea, pigeonpea and groundnut has been considerably improved over the past decade through deployment of modern genomic tools and technologies. For instance, next-generation sequencing technologies have facilitated availability of genome sequence assemblies, re-sequencing of several hundred lines, development of HapMaps, high-density genetic maps, a range of marker genotyping platforms and identification of markers associated with a number of agronomic traits in these legume crops. Although marker-assisted backcrossing and marker-assisted selection approaches have been used to develop superior lines in several cases, it is the need of the hour for continuous population improvement after every breeding cycle to accelerate genetic gain in the breeding programs. In this context, we propose a sequence-based breeding approach which includes use of independent or combination of parental selection, enhancing genetic diversity of breeding programs, forward breeding for early generation selection, and genomic selection using sequencing/genotyping technologies. Also, adoption of speed breeding technology by generating 4-6 generations per year will be contributing to accelerate genetic gain. While we see a huge potential of the sequence-based breeding to revolutionize crop improvement programs in these legumes, we anticipate several challenges especially associated with high-quality and precise phenotyping at affordable costs, data analysis and management related to improving breeding operation efficiency. Finally, integration of improved seed systems and better agronomic packages with the development of improved varieties by using sequence-based breeding will ensure higher genetic gains in farmers' fields.
Plants
Genomic selection (GS) is a predictive approach that was built up to increase the rate of genetic gain per unit of time and reduce the generation interval by utilizing genome-wide markers in breeding programs. It has emerged as a valuable method for improving complex traits that are controlled by many genes with small effects. GS enables the prediction of the breeding value of candidate genotypes for selection. In this work, we address important issues related to GS and its implementation in the plant context with special emphasis on tomato breeding. Genomic constraints and critical parameters affecting the accuracy of prediction such as the number of markers, statistical model, phenotyping and complexity of trait, training population size and composition should be carefully evaluated. The comparison of GS approaches for facilitating the selection of tomato superior genotypes during breeding programs is also discussed. GS applied to tomato breeding has already been shown to be feasi...
Horticulture Research
Many studies showed that few degrees above tomato optimum growth temperature threshold can lead to serious loss in production. Therefore, the development of innovative strategies to obtain tomato cultivars with improved yield under high temperature conditions is a main goal both for basic genetic studies and breeding activities. In this paper, a F4 segregating population was phenotypically evaluated for quantitative and qualitative traits under heat stress conditions. Moreover, a genotyping by sequencing (GBS) approach has been employed for building up genomic selection (GS) models both for yield and soluble solid content (SCC). Several parameters, including training population size, composition and marker quality were tested to predict genotype performance under heat stress conditions. A good prediction accuracy for the two analyzed traits (0.729 for yield production and 0.715 for SCC) was obtained. The predicted models improved the genetic gain of selection in the next breeding cy...
Molecular breeding : new strategies in plant improvement, 2018
The increasing cost of energy and finite oil and gas reserves have created a need to develop alternative fuels from renewable sources. Due to its abiotic stress tolerance and annual cultivation, high-biomass sorghum ( L. Moench) shows potential as a bioenergy crop. Genomic selection is a useful tool for accelerating genetic gains and could restructure plant breeding programs by enabling early selection and reducing breeding cycle duration. This work aimed at predicting breeding values via genomic selection models for 200 sorghum genotypes comprising landrace accessions and breeding lines from biomass and saccharine groups. These genotypes were divided into two sub-panels, according to breeding purpose. We evaluated the following phenotypic biomass traits: days to flowering, plant height, fresh and dry matter yield, and fiber, cellulose, hemicellulose, and lignin proportions. Genotyping by sequencing yielded more than 258,000 single-nucleotide polymorphism markers, which revealed pop...
PloS one, 2017
Genomic wide selection is a promising approach for improving the selection accuracy in plant breeding, particularly in species with long life cycles, such as Jatropha. Therefore, the objectives of this study were to estimate the genetic parameters for grain yield (GY) and the weight of 100 seeds (W100S) using restricted maximum likelihood (REML); to compare the performance of GWS methods to predict GY and W100S; and to estimate how many markers are needed to train the GWS model to obtain the maximum accuracy. Eight GWS models were compared in terms of predictive ability. The impact that the marker density had on the predictive ability was investigated using a varying number of markers, from 2 to 1,248. Because the genetic variance between evaluated genotypes was significant, it was possible to obtain selection gain. All of the GWS methods tested in this study can be used to predict GY and W100S in Jatropha. A training model fitted using 1,000 and 800 markers is sufficient to capture...
PloS one, 2016
Genomic selection (GS) is a modern breeding approach where genome-wide single-nucleotide polymorphism (SNP) marker profiles are simultaneously used to estimate performance of untested genotypes. In this study, the potential of genomic selection methods to predict testcross performance for hybrid canola breeding was applied for various agronomic traits based on genome-wide marker profiles. A total of 475 genetically diverse spring-type canola pollinator lines were genotyped at 24,403 single-copy, genome-wide SNP loci. In parallel, the 950 F1 testcross combinations between the pollinators and two representative testers were evaluated for a number of important agronomic traits including seedling emergence, days to flowering, lodging, oil yield and seed yield along with essential seed quality characters including seed oil content and seed glucosinolate content. A ridge-regression best linear unbiased prediction (RR-BLUP) model was applied in combination with 500 cross-validations for ea...
BMC Genomics
Background: A thorough verification of the ability of genomic selection (GS) to predict estimated breeding values for pea (Pisum sativum L.) grain yield is pending. Prediction for different environments (inter-environment prediction) has key importance when breeding for target environments featuring high genotype × environment interaction (GEI). The interest of GS would increase if it could display acceptable prediction accuracies in different environments also for germplasm that was not used in model training (inter-population prediction). Results: Some 306 genotypes belonging to three connected RIL populations derived from paired crosses between elite cultivars were genotyped through genotyping-by-sequencing and phenotyped for grain yield, onset of flowering, lodging susceptibility, seed weight and winter plant survival in three autumn-sown environments of northern or central Italy. The large GEI for grain yield and its pattern (implying larger variation across years than sites mainly due to year-to-year variability for low winter temperatures) encouraged the breeding for wide adaptation. Wider within-population than between-population variation was observed for nearly all traits, supporting GS application to many lines of relatively few elite RIL populations. Bayesian Lasso without structure imputation and 1% maximum genotype missing rate (including 6058 polymorphic SNP markers) was selected for GS modelling after assessing different GS models and data configurations. On average, inter-environment predictive ability using intra-population predictions reached 0.30 for yield, 0.65 for onset of flowering, 0.64 for seed weight, and 0.28 for lodging susceptibility. Using inter-population instead of intra-population predictions reduced the interenvironment predictive ability to 0.19 for grain yield, 0.40 for onset of flowering, 0.28 for seed weight, and 0.22 for lodging susceptibility. A comparison of GS vs phenotypic selection (PS) based on predicted genetic gains per unit time for same selection costs suggested greater efficiency of GS for all traits under various selection scenarios. For yield, the advantage in predicted efficiency of GS over PS was at least 80% using intra-population predictions and 20% using inter-population predictions. A genome-wide association study confirmed the highly polygenic control of most traits. Conclusions: Genome-enabled predictions can increase the efficiency of pea line selection for wide adaptation to Italian environments relative to phenotypic selection.
Agronomy
Grain protein content (GPC) is controlled by complex genetic systems and their interactions and is an important quality determinant for hard spring wheat as it has a positive effect on bread and pasta quality. GPC is variable among genotypes and strongly influenced by the environment. Thus, understanding the genetic control of wheat GPC and identifying genotypes with improved stability is an important breeding goal. The objectives of this research were to identify genetic backgrounds with less variation for GPC across environments and identify quantitative trait loci (QTLs) controlling the stability of GPC. A spring wheat nested association mapping (NAM) population of 650 recombinant inbred lines (RIL) derived from 26 diverse founder parents crossed to one common parent, ‘Berkut’, was phenotyped over three years of field trials (2014–2016). Genomic selection models were developed and compared based on predictions of GPC and GPC stability. After observing variable genetic control of ...
Silvae Genetica
To assess the genetic and environmental components of gene-expression variation among trees we used RNA-seq technology and Eucalyptus urophylla x grandis hybrid clones tested in field conditions. Leaf and xylem transcriptomes of three 20 month old clones differing in terms of growth, repeated in two blocks, were investigated. Transcriptomes were very similar between ramets. The number of expressed genes was significantly (P<0.05) higher in leaf (25,665±634) than in xylem (23,637±1,241). A pairwise clone comparisons approach showed that 4.5 to 14 % of the genes were differentially expressed (false discovery rate [FDR]<0.05) in leaf and 7.1 to 16 % in xylem. An assessment of among clone variance components revealed significant results in leaf and xylem in 3431 (248) genes (at FDR<0.2) and 160 (3) (at FDR<0.05), respectively. These two complementary approaches displayed correlated results. A focus on the phenylpropanoid, cellulose and xylan pathways revealed a large majo...
Euphytica, 2016
We provide a general framework for quantifying the effects of genotypic selection prediction accuracy and varying cost ratios of phenotyping to genotyping on the economic performance of genotypic selection relative to traditional phenotypic selection. Economic performance is measured using normalized average cost per unit of genetic gain. The economic performance of genotypic selection declines with (1) trait heritability, (2) relative cost of genotyping, and (3) the number of QTL (genes) affecting the trait. The benefits of increasing the training population size tend to be higher for traits with low heritability and traits affected by a larger number of QTL. The economically optimal sizes of the training population tend to be larger than the sizes that are typically used in current plant breeding programs.
Agronomy
Plant geneticists and breeders have used marker technology since the 1980s in quantitative trait locus (QTL) identification. Marker-assisted selection is effective for large-effect QTL but has been challenging to use with quantitative traits controlled by multiple minor effect alleles. Therefore, genomic selection (GS) was proposed to estimate all markers simultaneously, thereby capturing all their effects. However, breeding programs are still struggling to identify the best strategy to implement it into their programs. Traditional breeding programs need to be optimized to implement GS effectively. This review explores the optimization of breeding programs for variety release based on aspects of the breeder’s equation. Optimizations include reorganizing field designs, training populations, increasing the number of lines evaluated, and leveraging the large amount of genomic and phenotypic data collected across different growing seasons and environments to increase heritability estima...
Theoretical and Applied Genetics, 2017
Key message Rice breeding programs based on pedigree schemes can use a genomic model trained with data from their working collection to predict performances of progenies produced through rapid generation advancement. Abstract So far, most potential applications of genomic prediction in plant improvement have been explored using cross validation approaches. This is the first empirical study to evaluate the accuracy of genomic prediction of the performances of progenies in a typical rice breeding program. Using a cross validation approach, we first analyzed the effects of marker selection and statistical methods on the accuracy of prediction of three different heritability traits in a reference population (RP) of 284 inbred accessions. Next, we investigated the size and the degree of relatedness with the progeny population (PP) of sub-sets of the RP that maximize the accuracy of prediction of phenotype across generations, i.e., for 97 F 5 -F 7 lines derived from biparental crosses between 31 accessions of the RP. The extent of linkage disequilibrium was high (r 2 = 0.2 at 0.80 Mb in RP and at 1.1 Mb in PP). Consequently, average marker density above one per 22 kb did not improve the accuracy of predictions in the RP. The accuracy of progeny prediction varied greatly depending on the composition of the training set, the trait, LD and minor allele frequency. The highest accuracy achieved for each trait exceeded 0.50 and was only slightly below the accuracy achieved by cross validation in the RP. Our results thus show that relatively high accuracy (0.41-0.54) can be achieved using only a rather small share of the RP, most related to the PP, as the training set. The practical implications of these results for rice breeding programs are discussed. Communicated by Marcos Malosetti.
Sugar Tech, 2021
Sugarcane breeding programs aim to deliver new high-yielding varieties, resistant to diseases and pests, which contribute to profitability and sustainability of cane industries. These programs generally mobilize significant experimental, technological and human resources on long-term basis. Their efficiency in terms of genetic gains per unit of cost and time and their ability to release new varieties rely on the development of many breeding applications based on quantitative genetics theory and on statistical analyses of numerous experimental data from selection schemes including DNA marker data developed for some genomic breeding applications. New methodological approaches and new technologies that might better guide and support breeding research in cultivars development programs are continually sought. This paper presents an overview of the main applications developed in statistical methodology in support of the efficiency of sugarcane breeding programs. For each type of applicati...
Plant Breeding, 2020
Pigeonpea is an important food legume crop of semi-arid tropical regions. Plateauing of pigeonpea yield has been worrying breeders for the past 6-7 decades. Serious breeding efforts made during this period resulted in various high-yielding and disease resistant cultivars. However, the gains in pigeonpea productivity have been modest. The authors, while reviewing this situation, conclude that long generation turnover, complexity of biological traits, low selection response and overreliance upon pedigree breeding present the key bottlenecks for this situation. In this paper, some alternative breeding approaches and technologies are suggested for the genetic enhancement of yield stability and stress response of pigeonpea.
Molecular Breeding, 2017
Crop improvement is a long-term, expensive institutional endeavor. Genomic selection (GS), which uses single nucleotide polymorphism (SNP) information to estimate genomic breeding values, has proven
Plants, 2022
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY
Frontiers in Plant Science, 2021
Mixed stand (MS) cropping of pea with small-grain cereals can produce more productive and environment-friendly grain crops relative to pure stand (PS) crops but may require selection to alleviate the pea competitive disadvantage. This study aimed to assess the pea variation for competitive ability and its associated traits and the efficiency of four phenotypic or genomic selection strategies. A set of 138 semi-leafless, semi-dwarf pea lines belonging to six recombinant inbred line populations and six parent lines were genotyped using genotyping-by-sequencing and grown in PS and in MS simultaneously with one barley and one bread wheat cultivar in two autumn-sown trials in Northern Italy. Cereal companions were selected in a preliminary study that highlighted the paucity of cultivars with sufficient earliness for association. Pea was severely outcompeted in both years albeit with variation for pea proportion ranging from nearly complete suppression (<3%) to values approaching a bal...
The Plant Genome, 2020
Genomic selection (GS) based recurrent selection methods were developed to accelerate the domestication of intermediate wheatgrass [IWG, Thinopyrum intermedium (Host) Barkworth & D.R. Dewey]. A subset of the breeding population phenotyped at multiple environments is used to train GS models and then predict trait values of the breeding population. In this study, we implemented several GS models that investigated the use of additive and dominance effects and G×E interaction effects to understand how they affected trait predictions in intermediate wheatgrass. We evaluated 451 genotypes from the University of Minnesota IWG breeding program for nine agronomic and domestication traits at two Minnesota locations during 2017-2018. Genet-mean based heritabilities for these traits ranged from 0.34 to 0.77. Using fourfold cross validation, we observed the highest predictive abilities (correlation of 0.67) in models that considered G×E effects. When G×E effects were fitted in GS models, trait predictions improved by 18%, 15%, 20%, and 23% for yield, spike weight, spike length, and free threshing, respectively. Genomic selection models with dominance effects showed only modest increases of up to 3% and were trait-dependent. Crossenvironment predictions were better for high heritability traits such as spike length, shatter resistance, free threshing, grain weight, and seed length than traits with low heritability and large environmental variance such as spike weight, grain yield, and seed width. Our results confirm that GS can accelerate IWG domestication by increasing genetic gain per breeding cycle and assist in selection of genotypes with promise of better performance in diverse environments.
Current Trends in Wheat Research
Wheat (T. aestivum) is one of the key food grain crops and is a prominent source of calories and proteins globally. In addition to mushrooming population and rising abiotic stresses in this ongoing climate change era, biotic stresses pose a great threat to wheat production over the globe. Fungal diseases such as rusts, mildew, along with pests like aphid, hinder the potential yield performance of the elite wheat cultivars to a huge extent. The complex nature of plant-parasite interactions is shown to be the decisive factor for the ultimate resistance expression in wheat. However, the advancement of molecular genetics and biotechnology enabled the replacement of the tedious, time and resource consuming cytogenetic analyses of locating APR and ASR genes using molecular mapping techniques. Continuous efforts have been made to mine resistance genes from diverse genetic resources such as wild relatives for combating these diseases and pests, which are repositories of R genes. Additionall...
International Journal of Molecular Sciences, 2020
Common rust (CR) caused by Puccina sorghi is one of the destructive fungal foliar diseases of maize and has been reported to cause moderate to high yield losses. Providing CR resistant germplasm has the potential to increase yields. To dissect the genetic architecture of CR resistance in maize, association mapping, in conjunction with linkage mapping, joint linkage association mapping (JLAM), and genomic prediction (GP) was conducted on an association-mapping panel and five F3 biparental populations using genotyping-by-sequencing (GBS) single-nucleotide polymorphisms (SNPs). Analysis of variance for the biparental populations and the association panel showed significant genotypic and genotype x environment (GXE) interaction variances except for GXE of Pop4. Heritability (h2) estimates were moderate with 0.37–0.45 for the individual F3 populations, 0.45 across five populations and 0.65 for the association panel. Genome-wide association study (GWAS) analyses revealed 14 significant ma...
Frontiers in Plant Science
Graphical This review summarized heat stress-mediated morphological and physiological changes in maize and elucidated the molecular mechanisms responsible for maize response to heat stress. Furthermore, plausible approaches to dissecting the regulatory network associated with heat stress response and improving maize adaptation to global warming have been discussed. This figure was made using BioRender.
Agriculture
Sugarcane (Saccharum spp.) is an important perennial grass crop for both sugar and biofuel industries. The Louisiana sugarcane breeding program is focused on improving sugar yield by incrementally increasing genetic gain. With the advancement in genotyping and (highthroughput) phenotyping techniques, genomic selection is a promising marker-assisted breeding tool. In this study, we assessed ridge regression best linear unbiased prediction (rrBLUP) and various Bayesian models to evaluate genomic prediction accuracy using a 10-fold cross validation on 95 commercial and elite parental clones from the Louisiana sugarcane breeding program. Datasets (individual and pooled in various combinations) were constructed based on soil type (light—Commerce silty loam, heavy—Sharkey clay) and crop (plant cane, ratoon). A total of 3906 SNPs were used to predict the genomic estimated breeding values (GEBVs) of the clones for sucrose content and cane and sugar yield. Prediction accuracy was estimated b...
International Journal of Molecular Sciences, 2020
Molecular markers are one of the major factors affecting genomic prediction accuracy and the cost of genomic selection (GS). Previous studies have indicated that the use of quantitative trait loci (QTL) as markers in GS significantly increases prediction accuracy compared with genome-wide random single nucleotide polymorphism (SNP) markers. To optimize the selection of QTL markers in GS, a set of 260 lines from bi-parental populations with 17,277 genome-wide SNPs were used to evaluate the prediction accuracy for seed yield (YLD), days to maturity (DTM), iodine value (IOD), protein (PRO), oil (OIL), linoleic acid (LIO), and linolenic acid (LIN) contents. These seven traits were phenotyped over four years at two locations. Identification of quantitative trait nucleotides (QTNs) for the seven traits was performed using three types of statistical models for genome-wide association study: two SNP-based single-locus (SS), seven SNP-based multi-locus (SM), and one haplotype-block-based mul...
Frontiers in Plant Science
Cassava (Manihot esculenta Crantz) is a staple crop for ~800 million people in sub-Saharan Africa. Its production and productivity are being heavily affected by the two viral diseases: cassava brown streak disease (CBSD) and cassava mosaic disease (CMD), impacting greatly on edible root yield. CBSD is currently endemic to central, eastern and southern Africa, if not contained could spread to West Africa the largest cassava producer and consumer in the continent. Genomic selection (GS) has been implemented in Ugandan cassava breeding for accelerated development of virus resistant and high yielding clones. This study leveraged available GS training data in Uganda for pre-emptive CBSD breeding in W. Africa alongside CMD and fresh root yield (FRW). First, we tracked genetic gain through the current three cycles of GS in Uganda. The mean genomic estimated breeding values (GEBVs), indicated general progress from initial cycle zero (C0) to cycle one (C1) and cycle two (C2) for CBSD traits ...
PLoS biology, 2014
Next generation sequencing (NGS) technologies are being used to generate whole genome sequences for a wide range of crop species. When combined with precise phenotyping methods, these technologies provide a powerful and rapid tool for identifying the genetic basis of agriculturally important traits and for predicting the breeding value of individuals in a plant breeding population. Here we summarize current trends and future prospects for utilizing NGS-based technologies to develop crops with improved trait performance and increase the efficiency of modern plant breeding. It is our hope that the application of NGS technologies to plant breeding will help us to meet the challenge of feeding a growing world population.
Global Wheat Production [Working Title], 2019
This chapter covers the production and breeding status of winter wheat (Triticum aestivum L.) used for early-season animal grazing and late-season grain production in the Southern Great Plains of the United States. Besides, in the chapter, the current production status and needs, the drawbacks of current cultivars, breeding strategies of the crop, novel genomics tools, and sensor technologies that can be used to improve dual-purpose winter wheat cultivars were presented. We will focus on traits that are, in general, not required by cultivars used for grain-only production but are critical for cool-season forage production.
Frontiers in Genetics
Increasing selection differential and decreasing cycle time, the rate of genetic improvement can be accelerated. Creating and capturing higher genetic with higher accuracy within the shortest possible time is the prerequisite for enhancing genetic gain for any trait. Comprehensive yield testing at multi-locations at early generations together with the shortest line fixation time can expedite the rapid recycling of parents in the breeding program through recurrent selection. Genomic selection is efficient in capturing high breeding value individuals taking additive genetic effects of all genes into account with and without extensive field testing, thus reducing breeding cycle time enhances genetic gain. In the Bangladesh Rice Research Institute, GS technology together with the trait-specific marker-assisted selection at the early generation of RGA-derived breeding lines showed a prediction accuracy of 0.454–0.701 with 0.989–2.623 relative efficiency over the four consecutive years of...
Crop Science, 2020
Plant breeders need efficient systems to identify which inbreds to combine to create new hybrid cultivars. The North Carolina Design II (NC DII) is a useful mating design to evaluate the potential of hybrid varieties and their inbred parents. Genomic best linear unbiased prediction (GBLUP) models, either with or without the inclusion of a dominance term in the model, have been found to be an efficient method for using rich marker sets for prediction. This study used marker data and phenotypic data collected in 11 organic trials across 6 locations on 40 inbred sweet corn (Zea mays This article is protected by copyright. All rights reserved. L.) genotypes and 100 hybrid progenies formed from 4 disconnected NC DII mating blocks to predict performance of untested sweet corn hybrids. In 2017, validation trials of 24 previously untested hybrids were grown in five organic environments to assess the correlation between actual performance and the performance predicted by GBLUP or NC DII general combining abilities (GCAs). Five-fold cross-validation accuracy ranged from 0.29 to 0.82 for the GBLUP predictions based on additive effects alone, and from 0.70 to 0.91 for GBLUP predictions based on combined additive and dominance effects. For all traits except flavor, addition of dominance effects to the model increased the cross-validation accuracy. Correlations between values measured in the 2017 validation trials and values predicted from the 2015 and 2016 training trials ranged from 0.36 to 0.
PLOS ONE, 2017
The objective of this study was to explore the potential of genomic prediction (GP) for soybean resistance against Sclerotinia sclerotiorum (Lib.) de Bary, the causal agent of white mold (WM). A diverse panel of 465 soybean plant introduction accessions was phenotyped for WM resistance in replicated field and greenhouse tests. All plant accessions were previously genotyped using the SoySNP50K BeadChip. The predictive ability of six GP models were compared, and the impact of marker density and training population size on the predictive ability was investigated. Cross-prediction among environments was tested to determine the effectiveness of the prediction models. GP models had similar prediction accuracies for all experiments. Predictive ability did not improve significantly by using more than 5k SNPs, or by increasing the training population size (from 50% to 90% of the total of individuals). The GP model effectively predicted WM resistance across field and greenhouse experiments when each was used as either the training or validation population. The GP model was able to identify WM-resistant accessions in the USDA soybean germplasm collection that had previously been reported and were not included in the study panel. This study demonstrated the applicability of GP to identify useful genetic sources of WM resistance for soybean breeding. Further research will confirm the applicability of the proposed approach to other complex disease resistance traits and in other crops.
Agronomy
Although nitrogen (N) is the most limiting nutrient for agricultural production, its overuse is associated with environmental pollution, increased concentration of greenhouse gases, and several human and animal health implications. These implications are greatly affected by biochemical transformations and losses of N such as volatilization, leaching, runoff, and denitrification. Half of the globally produced N fertilizers are used to grow three major cereals—rice, wheat, and maize—and their current level of N recovery is approximately 30–50%. The continuously increasing application of N fertilizers, despite lower recovery of cereals, can further intensify the environmental and health implications of leftover N. To address these implications, the improvement in N use efficiency (NUE) by adopting efficient agronomic practices and modern breeding and biotechnological tools for developing N efficient cultivars requires immediate attention. Conventional and marker-assisted selection meth...
Frontiers in Plant Science
Soybean brown rust (SBR), caused by Phakopsora pachyrhizi, is a devastating fungal disease that threatens global soybean production. This study conducted a genome-wide association study (GWAS) with seven models on a panel of 3,082 soybean accessions to identify the markers associated with SBR resistance by 30,314 high quality single nucleotide polymorphism (SNPs). Then five genomic selection (GS) models, including Ridge regression best linear unbiased predictor (rrBLUP), Genomic best linear unbiased predictor (gBLUP), Bayesian least absolute shrinkage and selection operator (Bayesian LASSO), Random Forest (RF), and Support vector machines (SVM), were used to predict breeding values of SBR resistance using whole genome SNP sets and GWAS-based marker sets. Four SNPs, namely Gm18_57,223,391 (LOD = 2.69), Gm16_29,491,946 (LOD = 3.86), Gm06_45,035,185 (LOD = 4.74), and Gm18_51,994,200 (LOD = 3.60), were located near the reported P. pachyrhizi R genes, Rpp1, Rpp2, Rpp3, and Rpp4, respecti...
2021
ABSTRACTDespite being one of the most consumed vegetables in the United States, the elemental profile of sweet corn (Zea mays L.) is limited in its dietary contributions. To address this through genetic improvement, a genome-wide association study was conducted for the concentrations of 15 elements in fresh kernels of a sweet corn association panel. In concordance with mapping results from mature maize kernels, we detected a probable pleiotropic association of zinc and iron concentrations with nicotianamine synthase5 (nas5), which purportedly encodes an enzyme involved in synthesis of the metal chelator nicotianamine. Additionally, a pervasive association signal was identified for cadmium concentration within a recombination suppressed region on chromosome 2. The likely causal gene underlying this signal was heavy metal ATPase3 (hma3), whose counterpart in rice, OsHMA3, mediates vacuolar sequestration of cadmium and zinc in roots, whereby regulating zinc homeostasis and cadmium accu...
Theoretical and Applied Genetics, 2019
Key message The optimization of training populations and the use of diagnostic markers as fixed effects increase the predictive ability of genomic prediction models in a cooperative wheat breeding panel. Abstract Plant breeding programs often have access to a large amount of historical data that is highly unbalanced, particularly across years. This study examined approaches to utilize these data sets as training populations to integrate genomic selection into existing pipelines. We used cross-validation to evaluate predictive ability in an unbalanced data set of 467 winter wheat (Triticum aestivum L.) genotypes evaluated in the Gulf Atlantic Wheat Nursery from 2008 to 2016. We evaluated the impact of different training population sizes and training population selection methods (Random, Clustering, PEVmean and PEVmean1) on predictive ability. We also evaluated inclusion of markers associated with major genes as fixed effects in prediction models for heading date, plant height, and resistance to powdery mildew (caused by Blumeria graminis f. sp. tritici). Increases in predictive ability as the size of the training population increased were more evident for Random and Clustering training population selection methods than for PEVmean and PEVmean1. The selection methods based on minimization of the prediction error variance (PEV) outperformed the Random and Clustering methods across all the population sizes. Major genes added as fixed effects always improved model predictive ability, with the greatest gains coming from combinations of multiple genes. Maximum predictabilities among all prediction methods were 0.64 for grain yield, 0.56 for test weight, 0.71 for heading date, 0.73 for plant height, and 0.60 for powdery mildew resistance. Our results demonstrate the utility of combining unbalanced phenotypic records with genome-wide SNP marker data for predicting the performance of untested genotypes.
Crop Science, 2020
Fusarium ear rot (FER) disease of maize (Zea mays L.) is caused by Fusarium verticillioides (Sacc.) Nirenberg, which produces fumonisin (FUM), a mycotoxin linked to human and animal health risks. Extensive field trials, laborious inoculations and ear evaluations, and expensive antibody assays are required to reliably assess resistances to FER and FUM contamination in breeding populations. To evaluate the potential utility of genomic selection (GS) to improve FER and FUM in maize, we genotyped 6086 single nucleotide polymorphisms (SNPs) on 449 S0:1 lines from a recurrent selection population. Two different partitions of the S0:1 evaluation data were made to test the ability of models trained on 251 or 201 lines evaluated at three locations in 2014–2015 to predict FER and FUM of 198 or 248 different lines evaluated at three locations in 2016. Single‐stage univariate and multivariate genomic best linear unbiased predictor (GBLUP) models and two‐stage GBLUP, Bayes Cπ, Bayesian LASSO, an...
Molecular Breeding, 2019
Rice blast (RB), caused by the fungal pathogen Magnaporthe oryzae, is a major disease in rice (Oryzae sativa L.) with resistance controlled by major and minor genes. Genomic selection (GS) is a breeding technology applicable for selecting traits controlled by many genes. Our objective was to assess the utility of GS in improving RB resistance. A population of 161 accessions from Africa and another population of 162 accessions from the USA were evaluated for resistance to six and eight RB isolates, respectively. Each rice population was genotyped with single nucleotide polymorphism (SNP) markers. The accuracy of GS was determined using seven models: genomic best linear unbiased prediction (gBLUP), gBLUP with some markers as fixed effects (fgBLUP), gBLUP model with population structure as a covariate (sgBLUP), multitrait gBLUP (mgBLUP), Bayesian (BayesA and BayesC) models, and a multiple linear regression model using significant markers (MLR). Each set of population had accessions with good resistance to multiple isolates. Using cross-validation, the accuracy of gBLUP ranged from 0.15 to 0.72; the gBLUP, sgBLUP, mgBLUP, and Bayesian methods had similar accuracy, while fgBLUP gave the greatest accuracy. Without cross-validation, gBLUP, sgBLUP, fgBLUP, and Bayesian methods were similar and were superior to mgBLUP and MLR. In general, a GS model built on data from one isolate was able to predict the phenotypes generated from other isolates, suggesting common genes controlling resistance across isolates. Our results demonstrate that GS may be a very useful method to improve RB resistance. The fgBLUP model could be used to effectively select for both durable and resistance traits conferred by major genes.
Methods in Molecular Biology
Genomic-enabled prediction models are of paramount importance for the successful implementation of genomic selection (GS) based on breeding values. As opposed to animal breeding, plant breeding includes extensive multienvironment and multiyear field trial data. Hence, genomic-enabled prediction models should include genotype × environment (G × E) interaction, which most of the time increases the prediction performance when the response of lines are different from environment to environment. In this chapter, we describe a historical timeline since 2012 related to advances of the GS models that take into account G × E interaction. We describe theoretical and practical aspects of those GS models, including the gains in prediction performance when including G × E structures for both complex continuous and categorical scale traits. Then, we detailed and explained the main G × E genomic prediction models for complex traits measured in continuous and noncontinuous (categorical) scale. Rela...
Frontiers in Plant Science, 2016
Current trends in population growth and consumption patterns continue to increase the demand for wheat, a key cereal for global food security. Further, multiple abiotic challenges due to climate change and evolving pathogen and pests pose a major concern for increasing wheat production globally. Triticeae species comprising of primary, secondary, and tertiary gene pools represent a rich source of genetic diversity in wheat. The conventional breeding strategies of direct hybridization, backcrossing and selection have successfully introgressed a number of desirable traits associated with grain yield, adaptation to abiotic stresses, disease resistance, and bio-fortification of wheat varieties. However, it is time consuming to incorporate genes conferring tolerance/resistance to multiple stresses in a single wheat variety by conventional approaches due to limitations in screening methods and the lower probabilities of combining desirable alleles. Efforts on developing innovative breeding strategies, novel tools and utilizing genetic diversity for new genes/alleles are essential to improve productivity, reduce vulnerability to diseases and pests and enhance nutritional quality. New technologies of high-throughput phenotyping, genome sequencing and genomic selection are promising approaches to maximize progeny screening and selection to accelerate the genetic gains in breeding more productive varieties. Use of cisgenic techniques to transfer beneficial alleles and their combinations within related species also offer great promise especially to achieve durable rust resistance.
BMC genomics, 2017
The advent of high-throughput genotyping technologies coupled to genomic prediction methods established a new paradigm to integrate genomics and breeding. We carried out whole-genome prediction and contrasted it to a genome-wide association study (GWAS) for growth traits in breeding populations of Eucalyptus benthamii (n =505) and Eucalyptus pellita (n =732). Both species are of increasing commercial interest for the development of germplasm adapted to environmental stresses. Predictive ability reached 0.16 in E. benthamii and 0.44 in E. pellita for diameter growth. Predictive abilities using either Genomic BLUP or different Bayesian methods were similar, suggesting that growth adequately fits the infinitesimal model. Genomic prediction models using ~5000-10,000 SNPs provided predictive abilities equivalent to using all 13,787 and 19,506 SNPs genotyped in the E. benthamii and E. pellita populations, respectively. No difference was detected in predictive ability when different sets o...
2021
ABSTRACTThe selection of informative markers has been studied massively as an alternative to reduce genotyping costs for the genomic selection (GS) application. Low-density marker panels are attractive for GS because they decrease computational time-consuming and multicollinearity beyond more individuals can be genotyped with the same cost. Nevertheless, these inferences are usually made empirically using “static” training sets and populations, which are adequate only to predict a breeding program’s initial cycles but might not for long-term cycles. Moreover, to the best of our knowledge, none of these inferences considered the inclusion of dominance into the GS models, which is particularly important to predict cross-pollinated crops. Therefore, that reveals an important and unexplored topic for allogamous long-term breeding. To achieve this goal, we employed two approaches: the former used empirical maize datasets, and the latter simulations of long-term breeding cycles of phenoty...
2019
The cuticle, a hydrophobic layer of cutin and waxes synthesized by plant epidermal cells, is the major barrier to water loss when stomata are closed at night and under water-limited conditions. Elucidating the genetic architecture of natural variation for leaf cuticular conductance (gc) is important for identifying genes relevant to improving crop productivity in drought-prone environments. To this end, we conducted a genome-wide association study ofgcof adult leaves in a maize inbred association panel that was evaluated in four environments (Maricopa, AZ, and San Diego, CA in 2016 and 2017). Five genomic regions significantly associated withgcwere resolved to seven plausible candidate genes (ISTL1, two SEC14 homologs, cyclase-associated protein, a CER7 homolog, GDSL lipase, and β-D-XYLOSIDASE 4). These candidates are potentially involved in cuticle biosynthesis, trafficking and deposition of cuticle lipids, cutin polymerization, and cell wall modification. Laser microdissection RNA...
Frontiers in Genetics, 2022
Since the inception of the theory and conceptual framework of genomic selection (GS), extensive research has been done on evaluating its efficiency for utilization in crop improvement. Though, the marker-assisted selection has proven its potential for improvement of qualitative traits controlled by one to few genes with large effects. Its role in improving quantitative traits controlled by several genes with small effects is limited. In this regard, GS that utilizes genomic-estimated breeding values of individuals obtained from genome-wide markers to choose candidates for the next breeding cycle is a powerful approach to improve quantitative traits. In the last two decades, GS has been widely adopted in animal breeding programs globally because of its potential to improve selection accuracy, minimize phenotyping, reduce cycle time, and increase genetic gains. In addition, given the promising initial evaluation outcomes of GS for the improvement of yield, biotic and abiotic stress tolerance, and quality in cereal crops like wheat, maize, and rice, prospects of integrating it in breeding crops are also being explored. Improved statistical models that leverage the genomic information to increase the prediction accuracies are critical for the effectiveness of GS-enabled breeding programs. Study on genetic architecture under drought and heat stress helps in developing production markers that can significantly accelerate the development of stress-resilient crop varieties through GS. This review focuses on the transition from traditional selection methods to GS, underlying statistical methods and tools used for this purpose, current status of GS studies in crop plants, and perspectives for its successful implementation in the development of climate-resilient crops.
bioRxiv (Cold Spring Harbor Laboratory), 2023
DNA copy numbers have recently emerged as an important new marker system. In the absence of a contiguous reference genome, alternative detection systems such as the comparative hybridization method have been used to detect copy number variations (CNVs). With the advent of chromosome-level resolved reference genomes based on the incorporation of long-read sequencing and powerful bioinformatics pipelines, comprehensive detection of all structural variations (SVs) in the poplar genome is now within reach. Gene CNVs and their inheritance are important because they can cause dosage effects in phenotypic variations. These are potent genetic markers that should be considered in complex trait variation such as growth and adaptation in poplar. SVs such as CNVs could be used in future genomic selection studies for poplar, especially in cases when heterosis increases hybrid performance (hybrid vigor). This Chapter reports recent findings on SVs in natural populations of Populus spp. as well as on artificially induced SVs in poplar to understand their potential importance in generating a considerable amount of phenotypic improvement. The Chapter concludes with an outlook on the future implementation of knowledge on SVs in poplar crop breeding.
Frontiers in Plant Science, 2023
China's and Europe's dependence on imported protein is a threat to the food self-sufficiency of these regions. It could be solved by growing more legumes, including alfalfa that is the highest protein producer under temperate climate. To create productive and high-value varieties, the use of large genetic diversity combined with genomic evaluation could improve current breeding programs. To study alfalfa diversity, we have used a set of 395 alfalfa accessions (i.e. populations), mainly from Europe, North and South America and China, with fall dormancy ranging from 3 to 7 on a scale of 11. Five breeders provided materials (617 accessions) that were compared to the 400 accessions. All accessions were genotyped using Genotyping-by-Sequencing (GBS) to obtain SNP allele frequency. These genomic data were used to describe genetic diversity and identify genetic groups. The accessions were phenotyped for phenology traits (fall dormancy and flowering date) at two locations (Lusignan in France, Novi Sad in Serbia) from 2018 to 2021. The QTL were detected by a Multi-Locus Mixed Model (mlmm). Subsequently, the quality of the genomic prediction for each trait was assessed. Cross-validation was used to assess the quality of prediction by testing GBLUP, Bayesian Ridge Regression (BRR), and Bayesian Lasso methods. A genetic structure with seven groups was found. Most of these groups were related to the geographical origin of the accessions and showed that European and American material is genetically distinct from Chinese material. Several QTL associated with fall dormancy were found and most of these were linked to genes. In our study, the infinitesimal methods showed a higher prediction quality than the Bayesian Lasso, and the genomic prediction achieved high (>0.75) predicting abilities in some cases. Our results are encouraging for alfalfa breeding by showing that it is possible to achieve high genomic prediction quality.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.