Skip to main content

Exploring the association of ESR1 and ESR2 gene SNPs with polycystic ovary syndrome in human females: a comprehensive association study

Abstract

Background

Polycystic Ovary Syndrome (PCOS) affects a significant proportion of human females worldwide and is characterized by hormonal, metabolic, and reproductive dysfunctions, including infertility, irregular menstrual cycles, acanthosis nigricans, and hirsutism. Mutations in the estrogen receptor genes ESR1 and ESR2, involved in normal follicular development and ovulation, can contribute to development of the PCOS. The present study focuses on investigating the potential correlation between single nucleotide polymorphisms (SNPs) of ESR1 and ESR2 genes and the incidence of this syndrome.

Methods

For this study, SNPs in ESR1 and ESR2 genes were retrieved from the ENSEMBL database and analyzed for their effect on mutated proteins using different bioinformatics tools including SIFT, PolyPhen, CADD, REVEL, MetaLR, I-Mutant, CELLO2GO, ProtParam, SOPMA, SWISS-MODEL and HDDOCK.

Results

All the SNPs documented in the present study were deleterious. All the SNPs except rs1583384537, rs1450198518, and rs78255744 decreased protein stability. Two variants rs1463893698 and rs766843910 in the ESR2 gene altered the localization of mutated proteins i.e. in addition to the nucleus, proteins were also found in mitochondria and extracellular, respectively. SNPs rs104893956 in ESR1 and rs140630557, rs140630557, rs1596423459, rs766843910, rs1596405923, rs762454979 and rs1384121511 in ESR2 gene significantly changed the secondary structure of proteins (2D). SNPs that markedly changed 3D configuration included rs1554259481, rs188957694 and rs755667747 in ESR1 gene and rs1463893698, rs140630557, rs1596423459, rs766843910, rs1596405923, rs762454979 and rs1384121511 in ESR2 gene. Variants rs1467954450 (ESR1) and rs140630557 (ESR2) were identified to reduce the binding tendency of ESRα and β receptors with estradiol as reflected by the docking scores i.e. -164.97 and -173.23, respectively.

Conclusion

Due to the significant impact on the encoded proteins, these variants might be proposed as biomarkers to predict the likelihood of developing PCOS in the future and for diagnostic purposes.

Background

Polycystic Ovary Syndrome (PCOS) is a complicated disorder affecting the female reproductive system. It is characterized by the presence of multiple egg-containing collagen-filled follicles that are arrested during growth, a thick ovarian capsule termed the tunica albuginea, central adiposity, obesity, high luteinizing hormone serum levels, hirsutism, and acne caused by the presence of excess androgens. One of the main concerns for most human females is the infertility associated with PCOS due to anovulatory or oligoovulatory infertility and menstrual irregularities. Even after conception many human females are more prone to miscarriage due to this condition [1,2,3]. Systemic complications include a higher risk of cardiovascular disease, dyslipidemia and hypertension, insulin resistance and a four-fold increased risk of developing Type 2 diabetes. PCOS is also associated with obstructive sleep apnea, endometrial cancer, depression and lipid abnormalities [4,5,6].

The risk factors linked with the pathophysiology of PCOS can be categorized into three groups, namely genetic, non-genetic, and hormonal. The PCOS associated genes can be classified into six categories: genes associated with adrenal and ovarian steroidogenesis, including cytochrome P450, family 11, sub-family A, member 1 (CYP11A1), cytochrome P450, family 17 (CYP17), cytochrome P450, family 19 (CYP19) and cytochrome P450, family 21 (CYP21); genes associated with the effects of steroid hormones such as androgen receptor gene (AR) and sex hormone binding globular protein (SHBG); genes regulating gonadotropin activity and regulation such as luteinizing hormone (LH), follicle stimulating hormone receptor (FSHR) and anti-mullerian hormone (AMH); genes linked to the activity and release of insulin such as calpain 10 (CAPN10), insulin receptor substrate-1 (IRS-1), insulin receptor substrate-2 (IRS-2) and insulin (INS) etc. [3, 7,8,9,10]. Additionally Environmental factors that contribute to PCOS such as environmental endocrine disruptors (EEDs), obesity, and diet, have not been extensively documented [11]. In addition, hormonal factors such as the presence of hyperandrogenism and insulin resistance are also considered to contribute towards PCOS [12, 13].

The ESR1 and ESR2 genes comprise of 8 and 9 exons, respectively. Their chromosomal locations are 6q25.1 and 14q23.1, respectively [14]. ESR1 and ESR2 genes encode for ESRα and ESRβ proteins which function as binding sites for estradiol during the process of follicle development and ovulation. Estradiol is a determinant of follicle quality [15]. ESRα expression occurs in interstitial, thecal and granulosa cells (GC) of developing antral follicles while ESRβ is only expressed in GC part of follicle. Both these receptor proteins have spliced isoform i.e. three and four for ESRα (ESRα66, ESRα46 and ESRα36) and ESRβ (ESRβ1 to ESRβ5), respectively [16, 17]. Under normal conditions, estradiol binds with ESRα and β receptors and acts synergistically with follicle stimulating hormone (FSH) to upregulate the expression of steroidgenic hormones including LHR receptor resulting in prominent preovulatory follicle selection and ovulation [18]. Hence, estradiol plays vital role in normal follicle development which is strongly related with ESRα and ESRβ proteins expression [19]. These two receptors play a vital role in regulating estrogen functions and dimerize to control the transcription of several downstream genes involved in physiological ovarian functions [17, 20, 21]. ESRs expression has been found to be increased in epithelium and stroma during proliferative stage of female reproductive cycle [22, 23].

Literature also supports the link between ESRs polymorphisms and PCOS like single nucleotide variants of ESRα rs9340799 and rs1999805 have been found to be associated with PCOS patients of Pakistan and China, respectively [24, 25]. An ESRα associated polymorphism rs2234693 has been reported to be found more frequently in PCOS patients than in normal controls [26]. Connection between ESR1 and 2 genes mutations and PCOS is explained in Fig. 1.

Fig. 1
figure 1

Connection of ESR1 and ESR2 genes mutations and PCOS in human females [17]

Keeping in view the above mentioned association of ESRα and β receptors with PCOS clinical manifestations, these receptor proteins might be considered as most significant markers of PCOS. Any mutations in the ESRα and β encoding genes may disrupt the normal development of follicles, resulting in polycystic ovaries. The present study aimed to identify SNPs in ESR1 and 2 genes that may lead to abnormal development of ESR α and β receptors. These SNPs could potentially serve as biomarkers for the prognosis and diagnosis of PCOS.

Methodology

Retrieving coding sequences of genes and reported SNPs from ENSEMBL

To retrieve the coding sequences of ESR1 (ENST00000206249.8) and ESR2 (ENST00000341099.6) genes and the SNPs, ENSEMBL database (https://asia.ensembl.org/index.html, accessed on 1 September 2022) was explored. SNPs were incorporated in normal CDS of genes to generate mutated sequences. The coding sequences of normal genes are shown (Supplementary data Table 1). Stop gained, missense and frame shift mutations retrieved and addressed in present study are described in detail (Table 1).

Table 1 Single nucleotide polymorphisms (SNPs) in ESR1 and ESR2 genes associated with PCOS addressed in present study

Translation of normal and mutated genes sequences

ExPaSy tool (https://web.expasy.org/translate/, accessed on 1 September 2022) was employed to convert nucleotide sequences of ESR1 and ESR2 genes into amino acid sequences [27].

Assessment of missense SNPs deleteriousness

To assess the deletriousness of non-synonymous missense mutations, five tools were employed. i. e. Sorting Intolerant from Tolerant (SIFT) (https://sift.bii.a-star.edu.sg, accessed on 1 September 2022), Polymorphism Phenotyping (PolyPhen) (https://genetics.bwh.harvard.edu/pph2/, accessed on 1 September 2022), Combined Annotation Dependent Depletion (CADD) (https://cadd.gs.washington.edu/snv, accessed on 1 September 2022), Rare Exome Variant Ensemble Learner (REVEL) and Meta Logistic Regression (MetaLR) [28,29,30,31,32].

Effect of missense SNPs on stability of mutated proteins

To determine the effect of missense SNPs on stability of mutated proteins, I-Mutant suit (https://folding.biofold.org/i-mutant/i-mutant2.0.html, accessed on 7 September 2022) was used. Gibbs free energy (DGG) and reliability index (RI) were measured. Value of DGG above zero shows increase while below zero represents decrease in stability of mutated proteins [33].

Physicochemical properties prediction

Physicochemical properties like number of amino acids, molecular weight, isoelectric point (pI), half-life, extinction coefficient, aliphatic index, instability index and grand average of hydropathicity (GRAVY) were determined for the normal as well as mutated proteins using ProtParam tool (https://web.expasy.org/protparam/, accessed on 7 September 2022).

Sub-cellular localization prediction

To assess the SNPs effect on sub-cellular localization of mutated proteins, CELLO2GO (https://cello.life.nctu.edu.tw/cello2go/, accessed on 7 September 2022) tool was used [34].

Secondary structure prediction

To predict the effect of SNPs on 2D of mutated protein, SOPMA secondary structure predicted method (https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_sopma.html, accessed on 7 September 2022) was employed. Using this method, percentage of alpha helix, extended strand, beta turn and random coil was determined for all the SNPs containing amino acid sequences and results were compared with the normal protein [35]. Disordered residues were assessed in normal and mutated sequences using protein disorder prediction server i.e. PrDOS (https://prdos.hgc.jp/cgi-bin/result.cgi?ppid=376231p1d1662706327, accessed on 13 September 2022). This tool helped to predict the effect of SNPs on number of disordered regions and number of residues exhibiting disorder. The results were obtained in the form of disorder profile plot and sequence of proteins with disordered regions highlighted in red [36].

Three dimensional structure prediction and validation

To detect the effect of SNPs on 3D configuration of proteins, homology modelling server SWISS-MODEL (https://swissmodel.expasy.org, accessed on August 2023) was used. The pdb structure derived using this tool was validated through ERRAT and PROCHECK (https://saves.mbi.ucla.edu/results?job=1072813&p=errat, accessed on 26 August 2023) tools [37]. In ERRAT validation, the quality was calculated for pdb structures of normal and mutant proteins [38]. On the other hand, in PROCHECK validation, G-value and Ramachandran plots were predicted for the normal and mutant cases. Ramachandran plots helped us in assessment of amino acids in most favored, additional allowed, generously allowed and disallowed regions [39].

Docking analysis

To analyze the effect of SNPs on interaction of mutated forms of ESR1 and ESR2 with native form of estradiol hormone, docking was performed using web server for protein–protein docking i.e. HDOCK server (hdock.phys.hust.edu.cn, accessed on August 2023). The estradiol pdb structure was uploaded as input receptor molecule while the mutated structures of ESR1 and ESR2 were uploaded as ligands. Docking score, confidence score and ligand RMSD values were recorded.

Results

Deleteriousness of SNPs

The present study has documented a total of fifteen SNPs (four stop-gained and eleven missense) of the ESR1 gene and fifteen SNPs (six stop-gained and nine missense) of the ESR2 gene. Deleteriousness analysis of the missense SNPs showed the pathogenicity of all the missense mutations addressed in present study (Table 1).

Analysis of SNPs effect on stability of mutated proteins

To determine the effect of SNPs on stability of mutated proteins, I-Mutant tool was used. It was found that in case of ESR1 gene, two SNPs i.e. rs1583384537 and rs1584799119 increased stability of mutated proteins as compared to remaining polymorphisms which had decreasing effect. On the other hand, in case of ESR2 gene, two out of six missense mutations i.e. rs1450198518 and rs78255744 showed increasing effect while remaining four SNPs reduced the mutated proteins stability (Table 2).

Table 2 Effect of SNPs on stability of mutated proteins predicted using I-Mutant at 25°C and pH = 7

Analysis of SNPs effect on physicochemical properties of mutated proteins

In case of ESR1 gene, six out of fifteen SNPs i.e. rs1554259481, rs104893956, rs755667747, rs1467954450, rs762742833 and rs1253340312 altered physicochemical properties of mutated proteins considerably. In these six SNPs, deviation in number of amino acids from normal value of 649 was observed to be 90, 164, 318, 369, 520 and 535, respectively. Lowest value of pI was observed to be 4.73 in case of rs104893956 while the highest (6.67) was found to be caused by SNP rs755667747. SNPs rs1554259481, rs104893956, rs755667747, rs1467954450, rs762742833 and rs1253340312 changed the extinction coefficient from normal value of 62,520 M−1cm−1 to 8940, 14,900, 35,340, 36,830, 56,560 and 50,725 M−1cm−1, respectively. Only two SNPs rs755667747 and rs1467954450 changed the aliphatic index from normal value of 73.70 to 51.04 and 57.99, respectively. Only slight variation was observed in aliphatic index in cases of all other SNPs. The only SNP rs1554259481 reduced the instability index of mutated protein to 17.52 from normal 42.52. Half-life remained unaffected in all the cases. Highest alteration in GRAVY was observed in case of SNPs rs755667747 (-0.820) and rs1467954450 (-0.727) as compared to normal value of -0.499 (Table 3).

Table 3 Prediction of effect of SNPs on physicochemical properties of mutated proteins

In case of ESR2 gene, eight out of fifteen SNPs i.e. rs1463893698, rs140630557, rs1596423459, rs766843910, rs1596405923, rs762454979, rs1384121511 and rs1257844897 altered the number of amino acids i.e. 71, 132, 200, 236, 338, 346 and 478, respectively as compared to normal 530 amino acids. Not a single SNP effected the half-life of mutated proteins. Lowest and highest deviations in normal value of pI i.e. 8.81 were observed in cases of SNPs rs1463893698 (5.28) and rs1596405923, rs766843910, rs762454979 and rs1384121511 (9.24, 9.21, 9.20 and 9.02, respectively). Extinction coefficients of mutated proteins were observed to be altered greatly in cases of mutations rs1463893698 (8940 M−1cm−1), rs140630557 (21,555 M−1cm−1), rs1596423459 (33,390 M−1cm−1) and rs766843910 (36,745 M−1cm−1). Six SNPs i. e. rs1463893698 (64.51), rs140630557 (70.83), rs1596423459 (59.95), rs766843910 (58.64), rs1596405923 (68.40) and rs762454979 (71.83) caused considerable change in aliphatic index of mutated proteins. Variation in instability index was observed in cases of SNPs rs1463893698 (68.91), rs140630557 (66.53), rs1596423459 (63.83), rs766843910 (65.47), rs1596405923 (61.82) and rs1384121511 (60.37). Mutations rs766843910 and rs1257844897 showed highest and lowest deviations i.e. -0.644 and -0.231, respectively in normal value (-0.288) of GRAVY.

Analysis of SNPs effect on sub-cellular localization of mutated proteins

The sub-cellular localization prediction revealed that in case of ESR1 gene, only one SNP rs1554259481 altered the localization of mutated protein from nuclear to nuclear and cytoplasmic (Supplementary data Fig. S1).

In case of ESR2 gene, the localization remained unaffected. While in case of ESR2 gene, two SNPs i.e. rs1463893698 and rs766843910 changed the localization of mutated proteins from normal location (nuclear) to nuclear: mitochondrial and nuclear: extracellular, respectively (Table 4, Supplementary data Fig. S2).

Table 4 Effect of SNPs on stability of mutated protein predicted using I-Mutant

Analysis of SNPs effect on secondary structures of mutated proteins

Effect of SNPs on different aspects of 2D structure of mutated proteins i.e. alpha helix, extended strand, beta turn, random coil, number of disordered regions and number of disordered amino acids have been observed. It was found that in case of ESR1 gene, all the SNPs altered 2D structural properties of proteins. The highest and lowest deviations in normal values were observed to be 41.76% (rs1583384537 and rs1584799119) and 25.00% (rs104893956), respectively in case of alpha helix, 7.85% (rs1253340312) and 14.02% (rs104893956), respectively in case of extended strand, 3.08% (rs762742833) and 10.37% (rs104893956), respectively in case of beta turn and 41.60% (rs188957694) and 52.70% (rs1131692059), respectively in case of random coil. All the SNPs except rs1583384537 were found to alter the number of disordered regions and the number of amino acids in disordered regions (Supplementary data Table 2). The highest and lowest deviations from the normal values of number of disordered regions (137) and number of amino acids in disordered regions (7) were observed to be 289 and 11 in case of SNP rs1131692059 and 23 and 2 in case of rs1554259481 mutation, respectively (Supplementary data Figs. S3 and S4).

In case of ESR2 gene, all the mutations were observed to change the secondary structure of mutated proteins. The highest and lowest alterations in normal values were observed to be 25.15% (rs762454979) and 0% (rs1463893698), respectively in case of alpha helix, 16.19% (rs1463893698) and 7.17% (rs78255744), respectively in case of extended strand, 4.24% (rs766843910) and 1.52% (rs140630557), respectively in case of beta turn and 80.28% (rs1463893698) and 51.05% (rs1257844897), respectively in case of random coil.

The highest and lowest deviations from the normal values of number of disordered regions (177) and number of amino acids in disordered region (7) were observed to be 189 and 7 in case of SNP rs1414263985 and 28 and 2 in case of SNP rs1463893698, respectively (Supplementary data Figs. S5 and S6).

Analysis of SNPs effect on 3D structure of mutated proteins

The SWISSMODEL based analysis of mutated proteins of ESRα gene revealed SNPs rs1554259481, rs188957694 and rs755667747 drastically altered the 3D configuration of mutated proteins. While SNPs rs1583384537 and rs1131692059 caused slight change in structure. However, all other mutations addressed in present study did not affect the overall configuration of mutated proteins (Fig. 2).

Fig. 2
figure 2

Effect of ESR1 gene SNPs documented in present study on 3D structure of mutated proteins predicted using SWISSMODEL

In case of ESRβ gene, mutations rs1463893698, rs140630557, rs1596423459, rs766843910, rs1596405923, rs762454979 and rs1384121511 induced considerable change in mutated proteins. Slight change in 3D structure has been observed in case of SNP rs1257844897. While in all other cases, no effect was observed on tertiary structure of proteins (Fig. 3).

Fig. 3
figure 3

Effect of ESR2 gene SNPs documented in present study on 3D structure of mutated proteins predicted using SWISSMODE

The pdb structures of normal and mutated proteins generated using PHYRE2 tool were subjected to ERRAT and Ramachandran plot analysis. In case of ESRα gene, the values for all the structures including normal and mutated proteins, the ERRAT scores were above 70% and hence, are considered of good quality (Supplementary data Fig. S7). However, in cases of two stop gained SNPs i.e. rs1554259481 and rs104893956 which caused formation of truncated proteins, the overall quality factor was below 50% while in cases of SNPs rs755667747 and rs1467954450, the quality factor was below 70% but above 50%. In ESRβ gene encoded normal and mutated proteins, in all the cases the overall quality factor value was above 70% except rs1596423459, rs140630557, rs766843910 and rs1596405923. For the mutated structure induced by stop-gained mutation rs140630557, no quality score was predicted by ERRAT (Supplementary data Fig. S8).

As far as the validation of protein structures using Ramachandran plots is concerned, the residues found in most favored regions were above or closer to 90% and hence qualified the good quality score. However, in cases of ESRα gene SNPs rs1554259481 and rs104893956 and ESRβ gene SNP rs140630557, the scores were in the range of 70% (Supplementary data Figs. S9 and S10). The G-values were closer to zero in all the cases further validating the protein structures.

Analysis of SNPs effect on binding tendency of ESR1 and ESR2 receptors with E2

Docking analysis of mutated forms of ESR1 and ESR2 proteins with E2 depicted highest scores of binding energy in ESR1 cases 1, 5 and 15 documenting SNPs rs1583384537, rs778449608 and rs1436999383, respectively. While in ESR2 cases 3, 4, 5, 6, 7, 9 and 11 – 15 presenting SNPs rs1450198518, rs754945292, rs1596423459, rs766843910, rs1596405923, rs1384121511, rs1414263985, rs78255744, rs768924970, rs1257844897 and rs200502775. Lowest binding tendency was observed in case 9 (rs1467954450) for ESR1 gene and in case 2 (rs140630557) for ESR2 gene (Table 4 and Figs. 4 and 5).

Fig. 4
figure 4

Docking analysis results performed using HDOCK server to predict the effect of SNPs documented in present study on binding tendency of estradiol with ESRα receptor. a normal, b rs1583384537, c rs1554259481, d rs104893956, e rs761613029, f rs778449608, g rs866869178, h rs188957694, i rs755667747, j rs1467954450, k rs1584799119, l rs1131692059, m rs762742833, n rs758798083, o rs1253340312, p rs143699938

Fig. 5
figure 5

Docking analysis results performed using HDOCK server to predict the effect of SNPs documented in present study on binding tendency of estradiol with ESRβ receptor. a normal, b rs1463893698, c rs140630557, d rs1450198518, e rs754945292, f rs1596423459, g rs766843910, h rs1596405923, i rs762454979, j rs1384121511, k rs1249242790, l rs1414263985, m rs78255744, n rs768924970, o rs1257844897, p rs200502775

Discussion

There have been several studies showing the correlation between SNPs of ESR1 and ESR2 genes with PCOS. These studies were based on comparison of genes between the healthy and the diseased individuals using experimental methods including biochemical and hormonal analysis, restriction fragment length polymerase chain reaction (RFLP-PCR), real-time polymerase chain reaction (RT-PCR) and Sanger sequencing [24, 25, 40, 41].

A study focusing at finding ESR1 and ESR2 gene markers associated with PCOS in Tunisian human females revealed strong association of SNVs rs2234693 and rs3798577 in ESRα gene and rs1256049 in ESRβ gene with this disease [40]. Another study has reported the association of SNP rs1999805 in ESR1 gene with PCOS in Chinese population [25]. In Pakistani human females from Punjab, three SNPs i.e. rs2234693, rs9340799 and rs8179176 in ESR1 gene and rs4986938 in ESR2 gene have been reported to be significantly associated with PCOS [24]. Another research project investigated the correlation between the SNP rs4986938 of the ESR2 gene with PCOS in unmarried Iraqi human females. This study focused on the substitution of G allele with A allele, with the results indicating the A allele had a higher association with PCOS in comparison to the wild type G allele [42].

Most of the studies available in the literature have focused on biomarkers in intronic and untranslated regions (UTRs) while data about exonic region SNPs is considerably scarce. Moreover, there is lack of detailed information regarding the impact of these mutations on physicochemical characteristics, localization and 2D and 3D structures. The current study aimed to investigate the effects of ESR1 and ESR2 genes SNPs reported in Human Genome Project on the attributes of encoded proteins. The SNPs that were focused in present study have not been previously examined in literature.

The pI shows acidity or alkalinity of mutated proteins. In ESR1 gene, only the mutation rs104893956 enhanced acidity of mutated protein with pI = 4.73 and the mutations rs755667747, rs1467954450 and rs1253340312 had reducing effect on acidity. As far as the ESR2 gene is concerned, the SNP rs1463893698 induced acidity in mutated protein while the mutations rs766843910, rs1596405923, rs762454979 and rs1384121511 increased alkalinity of mutated proteins.

Aliphatic index is the measure of number of amino acids with aliphatic side chain in a protein which reflects protein thermostability over wide range of temperatures. If the value ranges between 66.5 to 84.33 then the protein is considered highly stable [43]. In case of ESR1 gene, only two SNPs i.e. rs755667747 and rs1467954450 decreased thermostability of mutated proteins. The highest thermostability was observed in case of SNP rs1253340312. In case of ESR2 gene, two SNPs i.e. rs1596423459 and rs766843910 reduced the thermostability of mutated proteins while the highest thermostability was induced by mutations rs1257844897 and rs200502775. Instability index indicates stability of protein in test tube [44]. Instability index below 40 reflects protein stability. Among the SNPs documented in ESR1 gene, only the mutation rs1554259481 has increasing effect on protein stability with instability index of 17.52. As far as the ESR2 gene is concerned, all the mutated proteins just like the normal one were found unstable with instability index above 40. Two SNPs rs1463893698 and rs140630557 reduced stability of mutated proteins to greater extent. GRAVY indicates the polarity level of proteins [45]. In cases of both the ESR1 and ESR2 genes, the negative values of GRAVY indicate that all the mutated proteins are hydrophilic just like the normal one.

In ESR1 gene, three documented SNPs i.e. rs1554259481, rs104893956 and rs755667747 have been found to cause drastic change in physicochemical properties, 2D and 3D structures of mutated proteins. In ESR2 gene, the seven SNPs i.e. rs1463893698, rs140630557, rs1596405923, rs1596423459, rs762454979, rs1384121511 and rs766843910 caused significant alterations in physicochemical properties, 2D and 3D structures of mutant proteins. So, these SNPs can be used as susceptibility markers for PCOS.

Binding of E2 with ESR1 and ESR2 receptors is a crucial event in biological actions of this hormone [46]. Any mutation altering the different attributes of ESR1 and ESR2 receptors might lead to disturbance of the binding tendency of E2 with these receptor proteins and might have serious influence on body functions. Therefore, alteration in binding tendency of ESR1 and ESR2 proteins mutated forms with E2 has been observed in present study. Single nucleotide variants i.e. rs1467954450 (ESR1 receptor), rs1463893698 and rs140630557 (ESR2 receptor) markedly reduced this binding as revealed by their docking scores. Although literature reports the association of ESR1 gene heterozygous mutation c. 619G > A/p.A207T with insensitivity of the encoded receptor towards E2 as well as the association of ESR genes polymorphisms with disturbances in E2 concentration in PCOS patients [21, 47]. However, this is the first ever study reporting the effect of rs1467954450 (ESR1 receptor), rs1463893698 and rs140630557 SNPs in ESRα and ESRβ genes on binding with E2. This reduced binding affinity might also contribute to disturbance in effective level of E2 hormone in the body.

Conclusion

The current in-silico study has demonstrated a strong association between ten SNPs present in ESR1 and ESR2 genes with PCOS. However, it is vital to conduct further research to assess the potential of these mutations as PCOS biomarkers. Identifying these SNPs could assist in predicting the likelihood of PCOS development in human females. These findings could also contribute to the development of targeted therapies for PCOS and help improve the understanding of the underlying molecular mechanisms involved in the pathogenesis of the disorder.

Availability of data and materials

SNPs documented in present study have been retrieved form ENSEMBL database (https://web.expasy.org/translate).

Abbreviations

PCOS:

Polycystic Ovary Syndrome

CYP11A1:

Cytochrome P450, family 11, sub-family A, member 1

CYP17:

Cytochrome P450, family 17

CYP19:

Cytochrome P450, family 19

CYP21:

Cytochrome P450, family 21

AR:

Androgen receptor gene

SHBG:

Sex hormone binding globular protein

LH:

Luteinizing hormone

FSHR:

Follicle stimulating hormone receptor

AMH:

Anti-mullerian hormone

CAPN10:

Calpain 10

IRS-1:

Insulin receptor substrate-1

IRS-2:

Insulin receptor substrate-2

INS:

Insulin

EEDs:

Environmental endocrine disruptors

SIFT:

Sorting Intolerant from Tolerant

PolyPhen:

Polymorphism Phenotyping

CADD:

Combined Annotation Dependent Depletion

REVEL:

Rare Exome Variant Ensemble Learner

MetaLR:

Meta Logistic Regression

DGG:

Gibbs free energy

RI:

Reliability index

GRAVY:

Grand average of hydropathicity

pI:

Isoelectric point

PrDOS:

Protein disorder prediction server

RFLP-PCR:

Restriction fragment length polymerase chain reaction

RT-PCR:

Real-time polymerase chain reaction

UTRs:

Untranslated regions

References

  1. Norman RJ, Dewailly D, Legro RS, Hickey TE. Polycystic ovary syndrome. Lancet. 2007;370:685–97.

    Article  PubMed  CAS  Google Scholar 

  2. Walters KA, Gilchrist RB, Ledger WL, Teede HJ, Handelsman DJ, Campbell RE. New perspectives on the pathogenesis of PCOS: neuroendocrine origins. Trends Endocrinol Metab. 2018;29:841–52.

    Article  PubMed  CAS  Google Scholar 

  3. Muccee F, Bijou O, Harakeh S, Adawiyah R, Sayyed RZ, Haghshenas L, Alshehri D, Ansari MJ, Ghazanfar S. In-Silico investigation of effects of single-nucleotide polymorphisms in PCOS-associated CYP11A1 gene on mutated proteins. Genes. 2022;13:1231.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  4. Tay CT, Hart RJ, Hickey M, Moran LJ, Earnest A, Doherty DA, Teede HJ, Joham AE. Updated adolescent diagnostic criteria for polycystic ovary syndrome: impact on prevalence and longitudinal body mass index trajectories from birth to adulthood. BMC Med. 2020;18:1–11.

    Article  Google Scholar 

  5. Jacewicz-Święcka M, Kowalska I. Polycystic ovary syndrome and the risk of cardiometabolic complications in longitudinal studies. Diabetes Metab Res Rev. 2018;34: e3054.

    Article  PubMed  Google Scholar 

  6. Allahbadia GN, Merchant R. Polycystic ovary syndrome in the Indian Subcontinent. Semin Reprod Med. 2008;26:022–34.

    Article  CAS  Google Scholar 

  7. Zhang T, Liang W, Fang M, Yu J, Ni Y, Li Z. Association of the CAG repeat polymorphisms in androgen receptor gene with polycystic ovary syndrome: a systemic review and meta-analysis. Gene. 2013;524:161–7.

    Article  PubMed  CAS  Google Scholar 

  8. Pusalkar M, Meherji P, Gokral J, Chinnaraj S, Maitra A. CYP11A1 and CYP17 promoter polymorphisms associate with hyperandrogenemia in polycystic ovary syndrome. Fertil Steril. 2009;92:653–9.

    Article  PubMed  CAS  Google Scholar 

  9. Peng CY, Xie HJ, Guo ZF, Nie YL, Chen J, Zhou JM, Yin J. The association between androgen receptor gene CAG polymorphism and polycystic ovary syndrome: a case-control study and meta-analysis. J Assist Reproduct Genet. 2014;31:1211–9.

    Article  Google Scholar 

  10. Khan MJ, Ullah A, Basit S. Genetic basis of polycystic ovary syndrome (PCOS): current perspectives. Appl Clin Genet. 2019;12:249.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Zhang B, Zhou W, Shi Y, Zhang J, Cui L, Chen ZJ. Lifestyle and environmental contributions to ovulatory dysfunction in women of polycystic ovary syndrome. BMC Endocr Disord. 2020;20:1–7.

    Article  Google Scholar 

  12. Ashraf S, Nabi M, Rashid F, Amin S. Hyperandrogenism in polycystic ovarian syndrome and role of CYP gene variants: a review. Egypt J Med Hum Genet. 2019;20:1–10.

    Article  Google Scholar 

  13. De Leo V, La Marca A, Petraglia F. Insulin-lowering agents in the management of polycystic ovary syndrome. Endocr Rev. 2003;24:633–67.

    Article  PubMed  Google Scholar 

  14. Gennari L, Merlotti D, De Paola V, Calabro A, Becherini L, Martini G, Nuti R. Estrogen receptor gene polymorphisms and the genetics of osteoporosis: a HuGE review. Am J Epidemiol. 2005;161:307–20.

    Article  PubMed  CAS  Google Scholar 

  15. Tang ZR, Zhang R, Lian ZX, Deng SL, Yu K. Estrogen-receptor expression and function in female reproductive disease. Cells. 2019;8:1123.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  16. Chauvin S, Cohen-Tannoudji J, Guigon CJ. Estradiol signaling at the heart of folliculogenesis: its potential deregulation in human ovarian pathologies. Int J Mol Sci. 2022;23:512.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  17. Xu XL, Deng SL, Lian ZX, Yu K. Estrogen receptors in polycystic ovary syndrome. Cells. 2021;10:459.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  18. Walters KA. Polycystic ovary syndrome: Is it androgen or estrogen receptor? Curr Opin Endocr Metab Res. 2020;12:1–7.

    Article  Google Scholar 

  19. Segawa T, Teramoto S, Omi K, Miyauchi O, Watanabe Y, Osada H. Changes in estrone and estradiol levels during follicle development: a retrospective large-scale study. Reprod Biol Endocr. 2015;13:1–7.

    Article  CAS  Google Scholar 

  20. Arao Y, Korach KS. The F domain of estrogen receptor α is involved in species-specific, tamoxifen-mediated transactivation. J Biol Chem. 2018;293:8495–507.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  21. Liu F, Tian L, Tan J, Li Z, Qin H, Xu D, Huang Z, Wu X, Chen G, Wu Q. Identification of a novel ESR1 mutation in a Chinese PCOS woman with estrogen insensitivity in IVF treatment. Reprod Biol Endocrinol. 2022;20:157.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  22. Villavicencio A, Bacallao K, Avellaira C, Gabler F, Fuentes A, Vega M. Androgen and estrogen receptors and co-regulators levels in endometria from patients with polycystic ovarian syndrome with and without endometrial hyperplasia. Gynecol Oncol. 2006;103:307–14.

    Article  PubMed  CAS  Google Scholar 

  23. Maliqueo M, Clementi M, Gabler F, Johnson MC, Palomino A, Sir-Petermann T, Vega M. Expression of steroid receptors and proteins related to apoptosis in endometria of women with polycystic ovary syndrome. Fertil Steril. 2003;80:812–9.

    Article  PubMed  Google Scholar 

  24. Liaqat I, Jahan N, Krikun G, Taylor HS. Genetic polymorphisms in Pakistani women with polycystic ovary syndrome. Reprod Sci. 2015;22:347–57.

    Article  PubMed  CAS  Google Scholar 

  25. Jiao X, Chen W, Zhang J, Wang W, Song J, Chen D, Zhu W, Shi Y, Yu X. Variant alleles of the ESR1, PPARG, HMGA2, and MTHFR genes are associated with polycystic ovary syndrome risk in a Chinese population: a case-control study. Front Endocrinol. 2018;9:504.

    Article  Google Scholar 

  26. Zhou S, Wen S, Sheng Y, Yang M, Shen X, Chen Y, Kang D, Xu L. Association of estrogen receptor genes polymorphisms with polycystic ovary syndrome: a systematic review and meta-analysis based on observational studies. Front Endocrinol. 2021;12: 726184.

    Article  Google Scholar 

  27. Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, Bairoch A. ExPASy: the proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003;31:3784–8.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  28. Sim NL, Kumar P, Hu J, Henikoff S, Schneider G, Ng PC. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 2012;40:W452–7.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  29. Adzhubei I, Jordan DM, Sunyaev SR. Predicting functional effect of human missense mutations using PolyPhen-2. Curr Protoc Hum Genet. 2013;76:21–7.

    Google Scholar 

  30. Rentzsch P, Witten D, Cooper GM, Shendure J, Kircher M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2019;47:D886–94.

    Article  PubMed  CAS  Google Scholar 

  31. Dong C, Wei P, Jian X, Gibbs R, Boerwinkle E, Wang K, Liu X. Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies. Hum Mol Genet. 2015;24:2125–37.

    Article  PubMed  CAS  Google Scholar 

  32. Ioannidis NM, Rothstein JH, Pejaver V, Middha S, McDonnell SK, Baheti S, Musolf A, Li Q, Holzinger E, Karyadi D. REVEL: an ensemble method for predicting the pathogenicity of rare missense variants. Am J Hum Genet. 2016;99:877–85.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  33. Kulshreshtha S, Chaudhary V, Goswami GK, Mathur N. Computational approaches for predicting mutant protein stability. J Comput Aided Mol Des. 2016;30:401–12.

    Article  PubMed  CAS  Google Scholar 

  34. Yu CS, Cheng CW, Su WC, Chang KC, Huang SW, Hwang JK, Lu CH. CELLO2GO: a web server for protein subCELlular LOcalization prediction with functional gene ontology annotation. PLoS One. 2014;9: e99368.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Geourjon C, Deleage G. SOPMA: significant improvements in protein secondary structure prediction by consensus prediction from multiple alignments. Bioinform. 1995;11:681–4.

    Article  CAS  Google Scholar 

  36. Ishida T, Kinoshita K. PrDOS: prediction of disordered protein regions from amino acid sequence. Nucleic Acids Res. 2007;35:W460–4.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Sheik SS, Sundararajan P, Hussain ASZ, Sekar K. Ramachandran plot on the web Bioinform. 2002;18:1548–9.

    CAS  Google Scholar 

  38. Colovos C, Yeates TO. Verification of protein structures: patterns of nonbonded atomic interactions. Protein Sci. 1993;2:1511–9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  39. Hollingsworth SA, Karplus PA. A fresh look at the Ramachandran plot and the occurrence of standard structures in proteins. Biomol Concepts. 2010;1:271–83.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  40. Douma Z, Dallel M, Bahia W, Salem AB, Ali FHB, Almawi WY, Lautier C, Haydar S, Grigorescu F, Mahjoub T. Association of estrogen receptor gene variants (ESR1 and ESR2) with polycystic ovary syndrome in Tunisia. Gene. 2020;741: 144560.

    Article  PubMed  CAS  Google Scholar 

  41. Mir R, Tayeb FJ, Barnawi J, Jalal MM, Saeedi NH, Hamadi A, Altayar MA, Alshammari SE, Mtiraoui N, Ali ME. Biochemical characterization and molecular determination of estrogen Receptor-α (ESR1 PvuII-rs2234693 T> C) and MiRNA-146a (rs2910164 C> G) polymorphic gene variations and their association with the risk of polycystic ovary syndrome. Int J Environ Res Public Health. 2022;19:3114.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  42. Al-Jumaili EFA. Allelic estrogen receptor beta (ESR2) gene association with Polycystic Ovary Syndrome in unmarried Iraqi female of Baghdad governorate. Iraqi J Biotechnol. 2019;18:26–32.

    Google Scholar 

  43. Morya VK, Yadav S, Kim EK, Yadav D. In silico characterization of alkaline proteases from different species of Aspergillus. Appl Biochem Biotechnol. 2012;166:243–57.

    Article  PubMed  CAS  Google Scholar 

  44. Gamage DG, Gunaratne A, Periyannan GR, Russell TG. Applicability of instability index for in vitro protein stability prediction. Protein Pept Lett. 2019;26:339–47.

    Article  PubMed  CAS  Google Scholar 

  45. Jin YT, Jin TY, Zhang ZL, Ye YN, Deng Z, Wang J, Guo FB. Quantitative elucidation of associations between nucleotide identity and physicochemical properties of amino acids and the functional insight. J Comput Struct Biotechnol. 2021;19:4042–8.

    Article  CAS  Google Scholar 

  46. Campello RS, Fátima LA, Barreto-Andrade JN, Lucas TF, Mori RC, Porto CS, Machado UF. Estradiol-induced regulation of GLUT4 in 3T3-L1 cells: involvement of ESR1 and AKT activation. J Mol Endocrinol. 2017;59:257–68.

    Article  PubMed  CAS  Google Scholar 

  47. Uzar I, Bogacz A, Sowińska-Przepiera E, Kotrych K, Wolek M, Sulikowski T, Kamiński A. The influence of ESR1 polymorphisms on selected hormonal, metabolic and mineral balance markers in women with hyperandrogenism. Sci Rep. 2022;12:19712.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The authors extend their appreciation to the Deputyship of Research and Innovation, ‘Ministry of Education’ in Saudi Arabia for funding this research (IFKSUOR3-191-3).

Funding

The authors extend their appreciation to the Deputyship of Research and Innovation, ‘Ministry of Education’ in Saudi Arabia for funding this research (IFKSUOR3-191–3).

Author information

Authors and Affiliations

Authors

Contributions

F.M designed methodology, N.M.A, and NH perceived the idea, S.R performed analysis, T.A write up of manuscript, FMH, visualization, H.S formal analysis.

Corresponding authors

Correspondence to Fatima Muccee or Suhail Razak.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Supplementary data Figure S1. Analysis of effect of SNPs in ESR1 gene on sub-cellular localization of mutated proteins. Supplementary data Figure S2. Analysis of effect of SNPs in ESR2 gene on sub-cellular localization of mutated proteins. Supplementary data Figure S3. Sequences of normal and mutated proteins encoded by ESR1 gene, the disordered regions are represented in red color. Supplementary data Figure S4. Sequences of normal and mutated proteins encoded by ESR2 gene, the disordered regions are represented in red color. Supplementary data Figure S5. Disorder profile plots for normal and the mutated ESRα proteins analyzed in present study based on PrDOS tool. Supplementary data Figure S6. Disorder profile plots for normal and the mutated ESRβ proteins analyzed in present study based on PrDOS tool. Supplementary data Figure S7. Validation of PHYRE2 tool generated pdb structures of normal and mutated proteins encoded by ESR1 gene using ERRAT2 tool. Supplementary data Figure S8. Validation of PHYRE2 tool generated pdb structures of normal and mutated proteins encoded by ESR2 gene using ERRAT2 tool. Supplementary data Figure S9. Validation of PHYRE2 tool generated pdb structures of normal and mutated proteins encoded by ESR1 gene using Ramachandran plots. Supplementary data Figure S10. Validation of PHYRE2 tool generated pdb structures of normal and mutated proteins encoded by ESR2 gene using Ramachandran plots. Supplementary data Table 1. Transcript IDs and CDS of ESRα and ESRβ genes analyzed in present study. Supplementary data Table 2. Predicting the effect of SNPs on number of disordered regions and the number of disordered amino acids of ESRα and ESRβ genes predicted using PrDOS server.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Muccee, F., Ashraf, N.M., Razak, S. et al. Exploring the association of ESR1 and ESR2 gene SNPs with polycystic ovary syndrome in human females: a comprehensive association study. J Ovarian Res 17, 27 (2024). https://doi.org/10.1186/s13048-023-01335-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13048-023-01335-7

Keywords