Plant breeding is a vital agricultural technique aimed at developing new crop varieties with higher yield, improved quality and better adaptability. A key method within this field is hybrid breeding, which involves crossing two genetically distinct genotypes to produce hybrids exhibiting desirable traits. A central concept in hybrid breeding is
heterosis, or hybrid vigour. The term heterosis was coined by Shull in 1914 and was first detected in maize. Heterosis refers to the superiority of an F
1 hybrid in one or more characters over its parents.
Shull (1952) defined heterosis as, the ‘Interpretation of increased yield, vigour, fruitfulness, size, resistance to abiotic and biotic stress or speed of development, manifested by cross-bred organisms as compared with corresponding inbreds, as the specific results of unlikeness in the constituents of the uniting parental gametes. Heterosis is a chief cause of non-additive gene action which is the maximum in cross-pollinated crops. It is particularly significant in cross-pollinated crops, where outbreeding increases heterozygosity and improves hybrid performance.
To maximize heterosis, breeders classify germplasm into
heterotic groups-collections of genotypes that produce superior hybrids when crossed with members of another group. This categorization helps identify optimal parental combinations and utilize genetic diversity effectively. The main objective of heterotic grouping is to develop superior hybrid combinations within a short period of time. By exploiting heterosis through careful parental selection and grouping, plant breeding significantly enhances crop productivity and resilience. The present study is a review about the importance of heterotic grouping, its objectives and different methods employed for the formation of heterotic groups.
Principles of heterotic grouping
Heterotic grouping is vital in hybrid breeding, based on principles like genetic diversity, heterosis, combining ability and genetic distance. Genetic diversity enables classification by maximizing variability between lines, which enhances hybrid vigor when complementary traits combine. Heterosis is another important principle around which the heterotic grouping is done. The lines were classified into different groups and evaluated for heterosis in the F
1 hybrid. Combining ability analysis helps to classify the germplasm into heterotic groups. Lines within the same group tend to have similar combining ability than between groups. Genetic distance, often measured using molecular markers, further aids in grouping; greater distance between lines from different groups indicates more variability and higher heterosis potential.
Heterotic grouping
Heterotic grouping is a pre-requisite step in hybrid breeding. It is the process of grouping different germplasm lines into various heterotic groups.
Melchinger and Gumber (1998) defined a heterotic group ‘as a group of related or unrelated genotypes from the same or different populations, which display similar combining ability and heterotic response when crossed with genotypes from other genetically distinct germplasm groups’. When two divergent heterotic groups are crossed, it results in the formation of superior hybrid combinations. These hybrids were found to be more productive than the hybrids developed from the same heterotic group.Heterotic grouping involves identifying diverse germplasm groups that when crossed with one another, produce high-quality hybrids with greater heterosis. A heterotic group is a group of genotypes which exhibits similar combining ability and a heterotic behaviour when crossed with genotypes of other groups.
Steps in heterotic grouping
The steps given by
Melchinger and Gumber (1998) for the identification of heterotic groups were,
1. Grouping germplasm based on genetic similarity.
2. Selection of representative genotypes from subgroups for producing diallel-cross.
3. Evaluation of diallel crosses along with parents in replicated trials.
4. Selection of most promising cross combinations as potential heterotic groups.
If heterotic patterns are present already, the superior genotypes in a heterotic group can be taken as testers and used for evaluation and classification of existing germplasm. Test cross is performed and the lines with similar combining ability and heterotic behaviour can be grouped into a new heterotic group. If the lines have similar combining ability and heterotic behaviour as of the existing heterotic group then it can be merged to the existing heterotic groups.
Melchinger and Gumber (1998) recommended the following criteria for the choice of heterotic patterns:
(1) High mean performance and large genetic variance in the hybrid population.
(2) High per se performance and good adaption of the parent population to the target region(s).
(3) Low inbreeding depression, if hybrids are produced from inbred lines.
Heterotic pattern
The heterotic pattern is a key factor for utilizing germplasm to maximize the performance of the population crosses and derived hybrids (
Eberhart et al., 1995). The term heterotic pattern refers to a specific pair of two heterotic groups, which express high heterosis and consequently high hybrid performance in their cross (
Melchinger and Gumber, 1998). A heterotic pattern is improved by exploiting genetic variation generated within heterotic groups
(Riedelsheimer et al., 2012). The primary criteria to develop a heterotic pattern proposed by
Reif et al. (2005) was the hybrid form the heterotic pattern should show high heterosis and yield with a reduced SCA and lower SCA/GCA ratio or higher GCA/SCA ratio. Heterotic patterns can determine the type of genotype in hybrid breeding for a long period.
Objectives of heterotic grouping
1. To obtain improved hybrid performance and mean heterosis.
2. To lessen the specific combining ability (SCA) variance and reduce the ratio of SCA to GCA variance.
3. To eliminate unnecessary crosses which were made to find out superior hybrids by classifying lines into heterotic groups.
4. To save the time and resources for hybrid development.
5. To broaden the genetic base of hybrid by utilizing new germplasm lines.
Methods of heterotic grouping
A heterotic group classification can be done either based on a quantitative or molecular approach. The quantitative methods of classification include HGCAMT, HSGCA and SCA methods which use combining ability estimates of pedigree lines and hybrid field data to classify inbreds. The molecular-based techniques classify lines based on genetic distance (GD) or genetic similarity (GS) from molecular markers which is very useful for describing heterotic groups and studying associations among inbreds at molecular level. Results from both quantitative and molecular approaches vary depending on the environment, the test material or the molecular technique used. The different methods of heterotic grouping along with its advantages were presented in Table 1.
Pedigree analysis
Heterotic grouping with pedigree analysis is a traditional method of heterotic grouping. Classification of germplasm lines into different heterotic groups can be done using pedigree analysis. Pedigree analysis helps to obtain the genetic diversity present in the population with which the germplasm is assigned to different heterotic groups. The Reid and Lancaster heterotic groups of maize were formed using the pedigree method of analysis which is utilized in the Corn Belt (Region of United States).
Reif et al. (2003) classified sub-tropical maize into two distinct heterotic groups (flint and dent) using SSR markers and pedigree information.
Pedigree analysis was performed by
Reid et al. (2011) to classify 129 maize inbred lines out of which 119 maize inbred lines were classified into different heterotic groups using pedigree analysis combined with the SSR marker. The main concept of pedigree analysis is to trace the inbred’s genetic background to their ancestral varieties using pedigree information. The inbred is grouped based on the percentage of the parent’s contribution. The inbred will be assigned to a heterotic group of parents which contributed more than 50% of the genetic background in that particular inbred. For example, The inbred CM105 from (V3 X B14) consist of genes from B14 about 87.5% and is grouped based on the similarity of genetic background in BSSS germplasm to which B14 belongs. In the case of the inbreds CO343 and CO344 whose parents were Mo17 and CO255 are classified as the group with the genetic background of CO255. The contribution of these parents is equal in the inbreds CO343 and CO344 but it is grouped to CO255 as it is early maturing which is most suitable in Canada. Using the pedigree method 94 inbreds were classified into 8 groups but 35 inbreds lines cannot classified into any of these 8 groups. Table 2 depicts the classification of groups based on the major sources of parents and the number of genotypes in each group.
Simple sequence repeats (SSR) analysis revealed that inbred lines having closer pedigree relationships have more genetic similarity. The average genetic similarity within the BSSS group is found to be greater than the average genetic similarity between BSSS and another group (Fig 1). Discriminant analysis reveals that SSR analysis is in accordance with the pedigree groups.
Combining ability analysis
Combining ability is the ability of the genotype to transmit superior performance to its crosses. It is the ability of the parents to combine well with each other. It is the capacity of a genotype to pass superior traits to its crosses, classified as general combining ability (GCA) and specific combining ability (SCA). GCA reflects average performance across crosses, while SCA shows performance in specific crosses, measured
via diallel or line × tester mating designs. Combining ability analysis is crucial for classifying germplasm into heterotic groups, enabling the development of superior hybrids. Line × tester mating, using proven testers, is commonly used for this purpose; if testers are unavailable, diallel mating is an alternative. Overall, combining ability analysis is essential in hybrid breeding and heterotic grouping. Heterotic grouping in pigeon pea can also be done using combining ability along with next generation sequencing
(Patel et al., 2022). The heterotic grouping of mungbean lines were done using combining ability studies through diallel analysis by
Singh et al. (2016) based on seed size and yield.
Combining ability can be used in three ways to formulate a heterotic group.
1. Using SCA effects of grain yield.
2. Using both GCA and SCA effects of grain yield.
3. Using GCA effects for multiple component traits.
Heterotic grouping using SCA
In this method, inbred lines are grouped based on their SCA effects for traits like grain yield. The inbred lines are crossed to proven testers and their SCA effects for grain yield are determined. Lines with similar SCA effects are placed in the same heterotic group. For this, the identification of a proven tester is the preliminary step. The inbred lines are crossed with some testers and a pair of testers which shows contrasting positive and negative SCA effects are chosen as proven testers. Based on these estimates of GCA and SCA the inbreds are classified into different heterotic groups. The inbred lines which showed positive GCA effects are selected. Out of these lines, the inbreds which show a positive SCA effect with one tester and with higher grain yield than the grand mean can be grouped into one heterotic group and the inbreds which show a positive SCA effect with other testers with higher grain yield than grand mean can be grouped into another heterotic group. Hence when crossing inbreds present in different heterotic groups maximum heterosis can be obtained.
This method was used in a study by
Singh et al. (2021) to classify 61 maize inbred lines into heterotic groups using the SCA estimates for grain yield. First, the 61 inbred lines were crossed by two inbred testers in line × tester mating design and 122 test-crosses were generated. BML 6 and BML 7 were used as testers and these inbred testers are the parents of DHM 117, a well-adapted maize variety. Three hybrid checks were maintained along with the test crosses in a randomized block design and morphological observations are recorded to estimate GCA and SCA effects. Along with this, the economic heterosis of F
1 is calculated by comparing the standard checks. Out of 61 inbred lines, 29 lines showed positive GCA and SCA effects with higher mean grain yield. The SCA effects of these 29 inbred lines in the test cross are then compared with the mean yield of testers. The lines which show positive SCA effects with the tester BML 7 and test cross mean grain yield greater than or equal to the mean yield of test cross of inbreds with both the testers grouped to heterotic group A (12 lines) and the lines which show positive SCA effects with the tester BML 6 and with test cross mean grain yield greater than or equal to the mean yield of test cross of inbreds with both the testers was grouped to B (17 lines).
Heterotic grouping using SCA and GCA
Fan et al. (2008) proposed a heterotic group’s specific and general combining ability (HSGCA) model to classify the inbred lines based on both GCA and SCA effects.
The heterotic group’s specific and general combining ability (HSGCA) method proposed by
Fan et al. (2009) was computed as follows:
SCA = Cross mean (Xi.) × Line mean (X.j ) × Tester mean (Xi.) + Overall mean (X..)
GCA = Line mean (X.j ) × Overall mean (X..)
HSGCA = Cross mean (Xi) × Tester mean (Xi.) = GCA + SCA
Where,
Xij = Mean yield of the cross between i
th tester and j
th line.
Xi. = Mean yield of the i
th tester.
X.j = Mean yield of j
th line.
This method also involves the identification of testers and the grouping of lines into different heterotic groups.
1. The inbred lines with negative HSGCA effects were grouped into the same heterotic group in which their corresponding tester was present. So the inbreds will be classified into two known tester groups. Here a line can be grouped into more than one heterotic group.
2. In the second step, the line which is grouped into more than one heterotic group will be grouped into the heterotic group with the smallest value or largest negative value and can be removed from other heterotic group.
3. The inbred lines with positive HSGCA effects were grouped into different unknown heterotic groups different from the two testers.
Identification of representative tester lines is key for the HSGCA method
(Fan et al., 2009). This method combines both GCA and SCA effects and hence it is more efficient than the grouping method where only the SCA effect is used. This is because in SCA the interaction of hybrid and the environment may lead to mismatching of lines into different heterotic groups.
For example, a study was conducted for the classification of 21 quality protein maize (QPM) inbred lines into heterotic groups and was done using the HSGCA method by
Mekasha et al. (2021). The 21 lines were crossed with two elite QPM lines as testers CML 159 and CML 144 in a line × tester mating design. The estimates of SCA effects, GCA effects and grain main yield were obtained for 21 maize inbred lines. Then HSGCA was calculated based on the formula given by
Fan et al. (2009) for the inbreds. The procedure mentioned by
Fan et al. (2009) was also followed for the classification of heterotic groups.
These maize inbred lines were classified into 4 different groups as in Table 3. The lines which show a negative HSGCA effect with tester CML 159 were grouped along with that tester in group Heterotic group A. The lines which showed a negative HSGCA effect with tester CML 144 were grouped in Heterotic group B along with the tester. The lines which show negative HSGCA with both the testers were grouped into Heterotic group AB and the lines with positive HSGCA were grouped into unknown heterotic group.
HSGCA method was found to be most effective in classifying the maize inbred lines into different heterotic groups. To obtain heterosis for grain yield a breeder should choose both the lines and testers with positive GCA effects or at least the lines with positive GCA effects which can be used as female parents. Similarly, in wheat heterotic groups have been made using GCA and SCA effects
(Deviren et al., 2024).
Heterotic grouping using GCA with multiple traits
Badu-Apraku et al. (2013) proposed heterotic grouping based on the GCAof multiple traits (HGCAMT) method.
Badu-Apraku et al. (2015) explained that grouping by the HGCAMT method was achieved by standardizing the GCA effects (mean of zero andstandard deviation of 1) of the traits that had significant mean squares forgenotypes under each study condition using the following statistical model:

Where,
Y = HGCAMT, which is the genetic value measuring the relationship among genotypes based on the GCA of multiple traits i to n.
Yi = Individual GCA effect of genotypes for trait i.
Yi = Mean of GCA effects across genotypes for trait i.
S = Standard deviation of the GCA effects of trait i.
εij = Residual of the model associated with thecombination of inbred i and trait j.
Grouping of Inbred lines based on a single trait
i.e., grain yield can mislead the classification as grain yield is a quantitative character and controlled by polygenes. In case of grain yield, if the yield gets reduced subsequently the heritability of grain yield will also be decreased. The HGCAMT method uses multiple component traits of inbreds with significant GCA effects which have a positive strong correlation with grain yield under different environmental conditions.
A study was conducted at IITA, Nigeria by
Okunlola et al. (2023) to find the combining ability of early-maturing quality protein maize inbred lines under
Striga-infested conditions and low nitrogen conditions. After finding the GCA and SCA estimates they classified the inbreds into different heterotic groups using the HGCMAT method. A total of 47 inbred lines and 4 testers along with 2 hybrid checks that have combined resistance for both
Striga-infested and low N conditions were classified (Fig 2). 188 test crosses were done in striga–infested conditions, low N conditions and optimum conditions. Observations are recorded for morphological and yield-related traits under each experiment. Heterotic grouping using HGCMAT proposed by
Badu-Apraku et al. (2015) was followed in this study.
Forty seven inbreds were successfully placed into 4 heterotic groups along with the tester are mentioned in Table 4.
The QPM hybrids obtained from crossing lines in the different heterotic groups will be resistant to
Striga-infested conditions, tolerant to low N content in soil and will be early-maturing hybrids.
Phenotypic clustering
It is a conventional heterotic group classification method. Multivariate analysis is used to classify the germplasm into various heterotic groups based on the differences in agronomic and morphological characters. Due to environmental influences and unknown genetic mechanisms, the reliability of classification is uncertain. Hence phenotypic clustering along with genetic characterization of the population using molecular markers proved to be an effective way to classify germplasm into different heterotic groups.
The clustering method was effectively used for the development of heterotic groups in sunflower by
Ibrar et al. (2024). 109 Sunflower genotypes were classified into different heterotic groups and the efficiency of identification of these groups was evaluated using the performance of F
1 in line × tester method. Phenotypic, genotypic and biochemical data were combined and using 3 different algorithms (hierarchical, k-means, hybrid), the lines were assigned to the heterotic groups.
The hierarchical algorithm clustering (Fig 3) was found to be the best for the classification of sunflower genotypes into heterotic groups.
The D
2 method was used for the combined data of morphological, biochemical and genotypic characterization. The result of this clustering revealed two distinct groups of genotypes. Out of 109, 31 genotypes were grouped under Cluster 1 and 78 genotypes were grouped under Cluster 2 respectively. Cluster 1 consists of restorer lines and Cluster 2 consists of CMS lines, B lines and self-pollinated lines. Based on an estimate of genetic distance the clusters are further divided into subgroups
i.e., heterotic groups. Custer 1 was divided into 6 sub groups and Cluster 2 was divided into another 6 sub-groups as in Table 5.
Twelve genotypes from each heterotic group (sub-group) were crossed in line × tester mating design and 36 F
1 hybrids were evaluated for their performance. In each cross, one genotype from a heterotic group of CMS lines is crossed with other genotype from a heterotic group of restorer lines. The final results revealed that the hybrids produced by crossing lines from two different heterotic groups showed remarkable heterosis for the traits studied in a desirable direction and can be used in the future hybrid breeding of Sunflower. Multivariate analysis of fifty pea genotypes conducted by
Singh et al. (2021) to identify genetic divergence based on phenotypical traits and grouped them into six clusters. Maximum value of inter-cluster distance (D
2 = 6.471) was recorded between cluster I and cluster IV genotypes these clusters can be used as distant parents in any breeding programme for pea.
Marker assisted heterotic grouping
Molecular technology has become a vital tool in modern plant breeding, offering advantages over conventional methods through higher efficiency, reduced costs and errors and independence from environmental influences. Molecular markers allow for whole-genome assessment and have proven effective in classifying inbred lines into heterotic groups.
Barata and Carena (2006) emphasized using molecular approaches as preliminary tools in field evaluations to create distinct heterotic groups with strong within-group genetic similarities.
In hybrid breeding, particularly in maize, molecular markers are widely used to assess genetic distances, which underpin heterotic grouping. Genetically similar lines within a group have smaller distances, while genetically dissimilar lines between groups exhibit larger distances. For accurate correlations between genetic distance and hybrid performance, essential requirements include appropriate phenotypic evaluation, the right marker choice and a sufficient number of inbred lines.
Restriction fragment length polymorphism
RFLP was among the first markers used for heterotic grouping, particularly in maize. It proved useful in assessing genetic diversity and grouping lines by geographic origin. However, limitations were noted-RFLP alone could not reliably differentiate between testers from opposite heterotic groups
(Warburton et al., 2005) and was better at predicting intra-group cross performance than inter-group performance
(Melchinger et al., 1998).
Amplified fragment length polymorphism
AFLP emerged as an improvement over RFLP, offering more polymorphism and efficiency by evaluating many loci in a single PCR. Studies by
Ajmone Marsan et al. (1998) and
Pejic et al. (1998) showed its usefulness in identifying genetic diversity. AFLP-based genetic distance measure-ments were moderately correlated with hybrid performance, making AFLP a reliable but somewhat limited predictor of F
1 yield.
Li et al. (2004) successfully classified Italian and Chinese inbred maize lines using AFLP, demonstrating its utility in global germplasm classification.
Random amplified polymorphic DNA
RAPD markers have also been employed to assess genetic diversity in maize, wheat, rice and barley. Though easy to use and capable of revealing high polymorphism, RAPD markers tend to show low correlation between genetic distance and hybrid performance. For instance,
Parentoni et al. (2001) used RAPD to group 28 maize varieties, while
Bruel et al. (2007) classified 16 inbred lines into five heterotic groups. Bootstrap methods confirmed sufficient RAPD markers for reliable results, though predictive accuracy for hybrid yield remained low.
Simple sequence repeats
SSR markers are among the most widely used for heterotic grouping due to their high polymorphism and reproducibility. They enable morphological and molecular profiling, making them ideal for identifying desirable heterotic patterns. SSR-based groupings often align with pedigree and combining ability analyses.
Reif et al. (2003) and
Aguiar et al. (2008) found SSR markers useful for systematic introgression of exotic germplasm.
Sruthi et al. (2020) used 50 SSR markers to classify 96 rice parental lines into two distinct heterotic groups-Maintainer (B lines) and Restorer (R lines) with results matching pedigree data.
Single nucleotide polymorphism
Recently SNP are in great demand for molecular studies especially in the categorization of heterotic groups. SNP markers based on genetic distance show a high correlation with the heterosis of F
1 so it can be used to predict the performance of F
1 along with the classification of heterotic groups. It is a more advantageous and effective method of heterotic grouping when compared to other methods of grouping like HSGCA and HGCAMT heterotic grouping (
Badu-Apraku et al., 2015). A study was performed by
Silva et al. (2021) in sorghum to evaluate the genetic diversity and identify heterotic groups. Genotyping of 160 sorghum lines was done using 86,342 SNP markers. Population structure analysis was analysed using a Bayesian model-based clustering algorithm in a STRUCTURE software program. Principal component analysis was done on a marker-based similarity depicted in Fig 4. The clustering method was conducted using the Neighbor-Joining model as in Fig 5. Then four heterotic groups were identified using both PCA and clustering methods. The cluster G1 and G2 consists mainly of R lines, G3 has both B lines and R lines and G4 has B lines. The PCA and clustering analysis was found to be correlated with pedigree analysis.
SNPs were also used in sorghum for classifying 96 inbred lines into different heterotic groups
(Zhang et al., 2025). The inbreds were critically analyzed by whole-genome sequencing and categorized into group 1 which contains restorer lines and group 2 with male sterile lines respectively. Hybrids exhibiting strong heterosis were obtained by crossing group 1 and group 2.
Efficiency of methods of heterotic grouping
The above mentioned heterotic grouping methods by SCA for grain yield, HSGCA, HGCAMT and SNP-GD were assessed for their efficiencies in the classification of heterotic grouping.
Fan et al. (2009) and
Badu-Apraku et al. (2016) proposed a formula for estimating the efficiency of heterotic grouping methods.

Where,
HY
intergroup = Number of high yielding inter-heterotic group hybrids.
TN
intergroup = Total number of inter-heterotic group hybrids.
LY
intergroup = Number of low yielding intea-heterotic group hybrids.
TN
intergroup = Total number of inter-heterotic group hybrids.
Fan et al. (2009) state that the strength of the inter-heterotic group crosses in relation to the within-group crosses determines the effectiveness of a heterotic grouping strategy. But it also depends upon other factors like the percentage of available inbred lines, the genetic diversity among genotypes and heterotic patterns obtained from the group. Several researchers have used these techniques to look into the grouping of maize germplasm lines. Furthermore,
Akinwale et al. (2014) compared the efficiency of SCA, HSGCA and SSpR-based GD techniques for classifying early ripening yellow maize inbreds assessed under
Striga-infested and
Striga-free growth conditions using orthogonal comparisons. The HSGCA approach was the most effective method under the conditions for research.
Factors affecting heterotic grouping
Heterotic grouping is influenced by genetic structure, linkage disequilibrium (LD), molecular markers, genetic divergence measures, combining ability and phenotypic data.
Genetic structure
The genetic makeup of a population, including diversity and subpopulation, plays a vital role. High genetic variability allows broader heterotic group formation, improving breeding outcomes. Subpopulations, often shaped by geography or breeding programs, frequently form distinct groups.
Molecular markers
Markers like SSR, RFLP and RAPD are essential for estimating genetic distances between lines. Greater genetic distances typically correlate with stronger heterosis. Advancement in molecular markers with high throughput technology played a vital role in estimation of genetic diversity and analysis of heterosis with marker assisted selection (
Nair and Pandey, 2024).
Linkage disequilibrium
LD between genes and markers influences grouping precision. High LD enhances accuracy and impacts epistatic variance and breeding strategies. LD can also highlight overdominant loci, aiding hybrid performance predictions.
Genetic divergence measure
Methods like cluster analysis using SNPs offer precise heterotic grouping by directly assessing genetic diversity.
Combining ability
GCA and SCA help classify lines based on hybrid performance. HSGCA and HGCAMT methods are effective under stress conditions like drought or pests.
Phenotypic data
Phenotypic traits such as yield, resistance and stress tolerance validate genetic groupings. High phenotypic diversity suggests greater heterotic potential, guiding the selection of optimal parent lines for hybrid vigor.
Advantages and disadvantages
Heterotic grouping offers several advantages in plant breeding. The advantages and disadvantages of heterotic grouping are listed below.
