Cloning of the GDF9 promoter region
The
GDF9 promoter amplification was conducted using gDNA from Australian-Hu sheep as template, with primers P1 and P2 (Table 1) as described. Sequencing results verified that PCR product sizes matched the anticipated designs (Fig 1), confirming their sequencing suitability.
Analysis of GDF9 promoter sequence
Sequencing of the
GDF9 promoter identified six mutation sites (Fig 2): g.42117805C>T, g.42117824T>C, g.42117899G>A, g.42118070A>G, g.42118197T>G and g.42118272C>T. Linkage analysis showed complete linkage among these sites (Fig 3), indicating strong linkage disequilibrium, with the predominant haplotype being C-T-G-A-T-C.
Analysis of
GDF9 sequence identified three potential promoter regions with scores above 0.85, located at -1092 to -1043, -633 to -584 and -270 to -221 bp. JASPAR software prediction showed multiple transcription factor-binding sites near these regions (Fig 4). MethPrimer analysis identified two CpG islands at positions -1953 to -1816 and -1801 to -1472 bp.
Sites g.42118070A>G, g.42117805C>T and g.42118272C>T showed low polymorphism (PIC < 0.25), while g.42118197T>G, g.42117824T>C and g.42117899G>A showed moderate polymorphism (0.25 < PIC < 0.50). Chi-square tests showed g.42118070A>G, g.42118197T>G, g.42118272C>T, g.42117805C>T, g.42117824T>C and g.42117899G>A sites were in Hardy-Weinberg equilibrium (P>0.05) (Table 2).
Association analysis of GDF9 SNPs with lambing productivity
The
GDF9 variants g.42118070A>G, g.42118272C>T, g.42117805C>T, g.42117824T>C and g.42117899G>A showed significant correlation with lambing rate in Australian-Hu sheep (P<0.05). The CT genotype at g.42118272C>T was associated with a 0.25 lamb increase compared to CC genotype. The AG genotype at g.42118070A>G showed a 0.31 lamb increase versus AA genotype. The GA genotype at g.42117899G>A was linked to a 0.22 lamb increase compared to GG genotype and TC genotype at g.42117824T>C showed a 0.23 lamb increase versus TT genotype. The frequency of TC genotype was 0.33 higher than CC genotype (Table 3).
Construction of recombinant vectors
To examine SNP transcriptional activity, primer P3-5 (Table 1) was designed to amplify three segments encompassing the mutation site. These segments were cloned into reporter vector pGL3-Basic to identify core promoter regions, designated as pGL3-520, pGL3-991 and pGL3-1321. Fragments containing mutation sites (g.42118070A>G, g.42118197T>G, g.42118272C>T) and fragments containing mutation sites (g.42117805C>T, g.42117824T>C, g.42117899G>A, g.42118070A>G, g.42118197T>G, g.42118272C>T) were directionally cloned into dual luciferase reporter vector pGL3-Basic alongside wild-type fragment to identify critical transcription factor binding sites. These constructs were named pGL3-991(mut0), pGL3-991(mut3) and pGL3-991(mut6). Sequencing confirmed the target fragment sequences (Fig 5 and 6), verifying successful construction of the promoter region fragment and point mutation recombinant plasmid.
Activity analysis of the truncated fragment on GDF9 promoter
The dual luciferase reporter plasmids pGL3-520, pGL3-991 and pGL3-1321, along with empty vector pGL3-Basic, were co-transfected with control plasmid pRL-TK into 293T cells. After forty-eight hours, cells were harvested and luciferase activity assessed. Results (Fig 7) showed that fluorescence activity of pGL3-520 was significantly higher than pGL3-Basic (
p<0.001). pGL3-991 and pGL3-1321 did not exhibit significant difference in activity compared to pGL3-520, suggesting that primary transcription binding sites of the
GDF9 promoter are located within the (-517 to +3) region.
Activity analysis of SNPs in GDF9 promoter
Dual luciferase reporter plasmids pGL3-991(mut0), pGL3-991(mut3) and pGL3-991(mut6) were co-transfected with control plasmid pRL-TK into 293T cells. After forty-eight hours, cells were harvested and luciferase activity was assessed. Results in Fig 8 showed that luciferase activity of pGL3-991, pGL3-991(mut3) and pGL3-991(mut6) was significantly elevated (
p < 0.05) compared to control plasmid pGL3-Basic. The pGL3-991(mut3) group showed a significant reduction in luciferase activity (
p<0.05) compared to the pGL3-991 group, suggesting that g.42118070A>G, g.42118197T>G and g.42118272C>T sites may inhibit promoter transcriptional activity. No significant changes in luciferase activity were observed between pGL3-991(mut3) and pGL3-991(mut6) groups, indicating that the additional three mutation sites do not affect promoter activity.
Analysis of SNP site association with lambing performance
GDF9 is a key component of the transforming growth factor β (TGF-β) superfamily. It regulates early follicular development and oocyte maturation in sheep, with its deficiency causing follicular arrest and infertility. Research has shown that single nucleotide polymorphisms (SNPs) in the
GDF9 coding region correlate with litter size in multiple-lamb sheep breeds, such as Hu sheep and Small-tailed Han sheep
(Ling et al., 2025). However, studies of the promoter region of Australian-Hu sheep remain scarce.
Six single nucleotide polymorphisms (SNPs) were identified: g.42117805C>T, g.42117824T>C, g.42117899G>A, g.42118070A>G, g.42118197T>G and g.42118272C>T. Linkage analysis showed these sites were fully linked, forming a 0-kb strong linkage disequilibrium (LD) block, designated as Block1. The most prevalent haplotype was “C-T-G-A-T-C,” exhibiting the highest frequency. This aligns with the “-534A/G, -407T/G and -332C/T complete linkage” pattern identified by
Li et al., (2020) in the Hu sheep
GDF9 promoter region, indicating that the LD structure in the
GDF9 promoter may represent a conserved genetic pattern regulating reproductive traits in sheep. A mutation at one locus can alter transcription factor binding affinity, while other linked loci may enhance this regulatory effect by modifying the promoter region’s secondary structure. This suggests
GDF9 in Australian-Hu sheep maintains its regulatory capacity over reproductive traits under hybridization conditions. Five SNPs (g.42118070A>G, g.42118272C>T, g.42117805C>T, g.42117824T>C and g.42117899G>A) showed significant correlations with lambing rate in Australian-Hu Sheep (P<0.05), demonstrating “heterozygous genotypes yielding higher lambing rates.”
The loci g. 42118197T>G, g. 42117824T>C and g. 42117899G>A demonstrated moderate polymorphism (0.25 < PIC < 0.50), suggesting these loci maintain genetic diversity within the Australian-Hu sheep population and are viable molecular markers for breeding selection. In contrast, loci g. 42118070A>G and g. 42117805C>T exhibited low polymorphism (PIC<0.25). This low polymorphism may result from long-term artificial selection for high lambing rates, leading to increased frequency of dominant alleles. Caution is needed in future breeding programs to prevent excessive loss of genetic diversity, as reduced diversity may limit the population’s adaptive potential and selection response-especially under changing environmental conditions in southern Xinjiang. Additionally, all associated loci adhered to Hardy-Weinberg equilibrium (P>0.05), indicating a stable genetic structure within the Australian-Hu sheep population, without significant shifts from artificial selection.
Bioinformatics analysis identified two CpG islands within the
GDF9 promoter region in Australian-Hu sheep, at -1953 to -1816 and -1801 to -1472 bp. Multiple predicted binding sites for reproduction-associated transcription factors were observed. GATA4, a critical transcription factor, regulates ovarian granulosa cell proliferation (
Jin, 2016). SNPs near its binding site may affect GATA4 binding, influencing granulosa cell survival and follicular maturation, impacting lambing rate. These findings align with previous studies on sheep
GDF9 gene variations.
Pan et al., (2016) reported significant associations between SNPs in the
GDF9 regulatory region and lambing rate in Small-tailed Han sheep, with the heterozygous genotype predominant.
Wang et al., (2020) confirmed the synergistic regulatory effect of SNP linkage in the
GDF9 promoter region on reproductive traits in Luzhong meat sheep. This study identified a complete linkage pattern for six SNPs in the
GDF9 promoter region in hybrid Australian-Hu sheep and elucidated their synergistic effect on lambing rate, supplementing genetic regulatory evidence for reproductive traits in hybrid sheep.
Promoter region activity analysis
Through promoter truncated fragments of varying lengths (pGL3-520, pGL3-991 and pGL3-1321) in dual-luciferase reporter assays, the core promoter region (-517 to +3) was identified as the principal sequence driving gene transcription. This region contains multiple transcription factor-binding sites, including HOXB3, GATA4 and GRHL2. The extended regions (-908 to +3) and (-1200 to +3) facilitate binding of additional transcription factors for nuanced regulation of core transcription activity. The characterization of the
GDF9 promoter’s core functional zone provides a crucial molecular target for elucidating mechanisms that modulate follicular development through transcriptional regulation, influencing the lambing rate.
Through analysis of point mutation activity using a dual luciferase reporter system, mutations g.42118070A>G, g.42118197T>G and g.42118272C>T were identified as critical sites affecting promoter transcriptional activity. These mutations are located within essential transcription factor-binding regions and their presence alters binding efficiency of transcription factors, suppressing
GDF9 transcription. In a study on Hu sheep and Bashibai sheep by
Jin et al., (2016), mutations in the 5' regulatory region of
GDF9 gene altered the binding site of transcription factor OCT1. After co-transfection with an OCT1 expression vector, transcriptional activity of the mutant promoter was significantly reduced. These findings indicate that OCT1 exerts a pronounced inhibitory effect on promoter activity of the mutant
GDF9 gene in sheep, explaining the functional effects of the identified mutation sites. Furthermore, the 5' regulatory region of
GDF9 contains multiple binding sites for transcription factors associated with follicular development, such as GATA4 and YY1. The mutation identified may synergistically affect binding efficiency of multiple transcription factors, regulating
GDF9 expression. This hypothesis warrants further validation through subsequent EMSA or ChIP experiments.
Wang et al., (2024) investigated four Chinese sheep breeds, including Small-tailed Han sheep and Hu sheep and identified polymorphic sites within
BMP15 and
GDF9 genes. Mutations at these sites were significantly associated with lambing rate, affecting ovulation numbers. As critical paracrine factors secreted by oocytes, balanced activity of GDF9 and BMP15 is crucial in determining follicular development fate. Individuals with single mutations in BMP15 or GDF9 exhibit increased ovulation, potentially due to diminished dimerization capacity or reduced receptor-binding ability of mutated proteins
(Hanrahan et al., 2004). When mutations in both BMP15 and GDF9 co-occur, secretion levels of these proteins are low, resulting in reduced mature BMP15 and GDF9 proteins, weakened functional interaction between BMP15 and GDF9 and diminished biological activity of BMP15/GDF9 heterodimers. This leads to impaired regulation of follicular development, culminating in increased ovulation rates in mutant ewes
(Wang et al., 2020). This suggests that when using
GDF9 mutation sites for selecting prolificacy traits in sheep, it is necessary to consider the genetic background of breeds and interaction of ovarian microenvironmental factors to achieve precise improvement in reproductive performance.