Skip to main content

A genome-wide cross-trait analysis identifies genomic correlation, pleiotropic loci, and causal relationship between sex hormone-binding globulin and rheumatoid arthritis

Abstract

Background

Our study aims to investigate an intrinsic link underlying sex hormone-binding globulin (SHBG) and rheumatoid arthritis (RA), which remains inconclusive in observational settings.

Methods

Summary statistics were collected from the largest GWAS(s) on SHBG adjusted for BMI (SHBGadjBMI; Noverall = 368,929; Nmen = 180,094; Nwomen = 188,908), crude SHBG (Noverall = 370,125; Nmen = 180,726; Nwomen = 189,473), and RA (Ncase = 22,350; Ncontrol = 74,823). A genome-wide cross-trait design was performed to quantify global and local genetic correlation, identify pleiotropic loci, and infer a causal relationship.

Results

Among the overall population, a significant global genetic correlation was observed for SHBGadjBMI and RA (\(r_{{\text{g}}}\) = 0.11, P = 1.0 × 10−4) which was further supported by local signal (1q25.2). A total of 18 independent pleiotropic SNPs were identified, of which three were highly likely causal variants and four were found to have effects on both traits through gene expression mediation. A putative causal association of SHBGadjBMI on RA was demonstrated (OR = 1.20, 95% CI = 1.01–1.43) without evidence of reverse causality (OR = 0.999, 95% CI = 0.997–1.000). Sex-specific analyses revealed distinct shared genetic regions (men: 1q32.1-q32.2 and 5p13.1; women: 1q25.2 and 22q11.21-q11.22) and diverse pleiotropic SNPs (16 in men and 18 in women, nearly half were sex-specific) underlying SHBGadjBMI and RA, demonstrating biological disparities between sexes. Replacing SHBGadjBMI with crude SHBG, a largely similar yet less significant pattern of results was observed.

Conclusion

Our cross-trait analysis suggests an intrinsic, as well as a sex-specific, link underlying SHBG and RA, providing novel insights into disease etiology.

Background

Human autoimmune disorders often present sex-specific characteristics. In rheumatoid arthritis (RA), women account for more than 70% of all cases [1] whom also exhibit higher disease activity, worse functionality, and more severe comorbidities [2]. Epidemiological studies have linked hormone alterations with RA onset. For instance, post-menopause and postpartum, characterized by declined estrogen levels, have been linked to an increased risk of developing RA. Conversely, periods during pregnancy and breastfeeding, featured by increased estrogen levels, have been associated with a reduced risk of RA [3]. However, the potential pathogenic role of hormones in RA remains largely uncharacterized.

Sex hormone-binding globulin (SHBG) acts as a carrier of sex steroids and regulates the extent to which these hormones are delivered to body tissues [4], mediating the relationship underlying various endocrine organs and pathophysiology of diseases [5]. Clarifying the role of SHBG in RA provides novel insights into not only disease etiology but perhaps also sex disparity. So far, studies investigating the SHBG-RA relationship remain sparse with largely inconsistent findings [6,7,8]. For example, a small-sized case–control study (55 RA and 50 controls) reported SHBG levels to be significantly lower in female cases than in controls, both pre-menopausal (34.8 ± 10.0 vs. 58.4 ± 17.9 nmol/L) and post-menopausal (35.5 ± 10.7 vs. 44.9 ± 7.1 nmol/L) [7], while another case–control study (120 RA and 518 controls) did not support such a difference (post-menopausal: 57.6 ± 26.9 vs. 57.3 ± 27.3 nmol/L) [6]. As for men, SHBG did not seem to differ significantly according to results from a case–control study involving 104 male cases and 99 age- and sex-matched controls [8]. Except for insufficient power, discrepancies in these results could also derive from bias, confounding, or reverse causality which are common in conventional epidemiological studies. Mendelian randomization (MR) analysis narrows the gap by elucidating a putative exposure-outcome causal association, using genetic variants (single nucleotide polymorphisms, SNPs) as instrumental variables [9]. Applying a two-sample MR framework, a positive causal association of circulation SHBG with RA has been reported (OR = 1.003; 95% CI = 1.000–1.007) [10]. Nevertheless, this MR used only a handful of instruments (NIV = 13) and a small number of RA cases (NRA = 4017), substantially restricting the robustness of the results.

The increasing availability of genetic data produced by large-scale genome-wide association studies (GWAS) enables the utilization of a compiled analytical strategy named genome-wide cross-trait analysis. This analysis features several analytical aspects and permits the quantification of shared and distinct etiology underlying complex traits. Our study, therefore, aims to extend previous findings by implementing a comprehensive genome-wide cross-trait approach, leveraging summary statistics of the hitherto largest GWAS(s) conducted for SHBG and RA. In addition to genetic data of overall circulating SHBG (Noverall = 370,125), data on sex-specific SHBG (Nmen = 180,726, Nwomen = 189,473) and BMI-adjusted SHBG (SHBGadjBMI, Noverall = 368,929, Nmen = 180,094, and Nwomen = 188,908) were further incorporated to detect sex disparity and to control for the confounding effect of BMI. We first quantified the global and local genetic correlation underlying SHBG and RA, and then identified potential pleiotropic loci affecting both traits. We conducted functional annotation of these loci and performed fine-mapping analysis and transcriptome-wide association study (TWAS) to provide biological insight. We finally evaluated a putative causal relationship. A flowchart of the overall study design is shown in Fig. 1.

Fig. 1
figure 1

Flowchart of the overall study design. We investigated the shared genetic architecture underlying sex hormone-binding globulin (SHBG) and rheumatoid arthritis (RA). We first quantified the genome-wide genetic correlation between SHBG and RA; we then estimated local genetic correlations by partitioning the genome into linkage-disequilibrium independent blocks. Next, we identified potential pleiotropic loci contributing to both traits and conducted functional annotation for these loci. We also performed fine-mapping and transcriptome-wide association analysis to provide biological insight. Finally, to make a causal inference, we conducted a bidirectional Mendelian randomization

Methods

SHBG GWAS The latest and largest GWAS of circulating SHBG was conducted using UK Biobank (UKBB), which involved 370,125 participants of European ancestry. Genetic variants were imputed using the Haplotype Reference Consortium and the 1000 Genomes Project reference panel. In addition to the sample quality control metrics performed centrally by UKBB, white European ancestry was determined by K-means clustering combined with self-report. Genetic variants with minor allele frequency (MAF) > 1% as well as those that passed quality control in batches were involved. Association test was performed using a linear mixed model, adjusted for genotyping chip, age at baseline, and ten genetically derived principal components (PCs). Genome-wide significant index variants (P < 5 × 10–8) were identified using 1 Mb distance-based clumping with linkage disequilibrium (LD) < 0.05 across all variants [11].

To account for the confounding effect of body mass index (BMI), additional analysis was performed among 368,929 UKBB individuals, which consisted of individuals enrolled in the SHBG GWAS with BMI records. BMI was adjusted (SHBGadjBMI) along with the aforementioned variables. Sex-specific analyses were also performed (SHBG: Nmen = 180,726, Nwomen = 189,473; SHBGadjBMI: Nmen = 180,094, Nwomen = 188,908) [11].

Significant index SNPs discovered by these GWAS(s) were used as instrumental variables (IVs). For crude SHBG, 335 SNPs were identified, with corresponding figures of 188 for men and 176 for women. For SHBGadjBMI, 477 SNPs were identified, with corresponding figures of 251 for men and 271 for women. The characteristics of SHBG-associated (crude, BMI-adjusted, and sex-specific) index SNPs used for MR analysis are presented in (Additional file 1: Table S1–S6). The full sets GWAS summary statistics were also obtained and used for other genetic analyses.

RA GWAS The hitherto largest GWAS of RA was conducted by meta-analyzing 25 cohorts involving 22,350 cases and 74,823 controls of European ancestry. Genotype quality control, imputation, and case–control association analyses were conducted separately for each cohort. Quality control excluded samples with low call rate, closely related individuals, or outliers based on ancestries identified through PC analysis. Additionally, variants with low call rate, low MAF, or low Hardy–Weinberg P value were excluded. Genotyping imputation was performed using 1000 Genomes Phase 3 panel with minimum imputation accuracy (r2) of 0.3. SNPs associated with RA were detected assuming additive effects using logistic regression models, correcting for sex and PCs. Finally, effect sizes of the identical SNPs across all participating studies were combined using a fixed-effect inverse variance weighted meta-analysis [12]. Index SNPs were obtained with genome-wide significance (P < 5 × 10–8) using 1 Mb distance-based clumping with linkage disequilibrium (LD) < 0.05 across all variants after removing the Major Histocompatibility Complex (MHC) region. A total of 182 RA-associated index SNPs were identified, of which 175 were available and used as IVs in our bidirectional MR analysis. The full set of GWAS summary statistics was downloaded and used for other analyses. To the best of our knowledge, none of these 25 studies overlapped with participants in UKBB, our exposure GWAS(s) [12]. For all analyses, the human reference genome build 37 (hg19) was used.

Statistical analysis

Genome-wide genetic correlation analysis

Genome-wide genetic correlation (\(r_{{\text{g}}}\)) was estimated using cross-trait LD-score regression (LDSC) [13, 14]. This algorithm quantifies the average sharing of genetic effects between pairs of traits that is independent of environmental confounders. On average, SNPs in high LD exhibit higher average χ2 statistics than SNPs in low LD in polygenic traits. Similarly, when analyzing traits with genetic correlation, the product of z-scores from two studies presents a comparable relationship with χ2 statistics for a single study. The algorithm is described in the formula as follows:

$$E_{{\left[ {\beta_{j}\gamma_{j}} \right]}} = \frac{{\sqrt {N_{1} N_{2} } r_{{\text{g}}} }}{M}l_{j} + \frac{{N_{{\text{s}}} r}}{{\sqrt {N_{1} N_{2} } }}$$

Among them, ß\(_{j}\) and γ\(_{j}\) represent the z-scores of SNP j on traits 1 and trait 2, \(r_{{\text{g}}}\) represents the genetic covariance, \(M\) represents the number of SNPs, \(N_{1}\) and \(N_{2}\) represent the sample sizes of each trait, \(N_{{\text{s}}}\) represents the number of overlapping samples, \({\text{r}}\) represents the phenotypic correlation in overlapping samples, and \(l_{j}\) represents the LD score. The estimates of genetic correlation range from − 1 to 1, with − 1 representing a completely negative correlation and 1 representing a completely positive correlation. The MHC region (chr6:28,477,797–33,448,354), known for its strong effects on autoimmune conditions and complex LD pattern, was excluded from this analysis. Given that analyses were repeated three times (overall and sex-specific populations), a Bonferroni-corrected P value of 0.05/3 was used to define statistical significance.

Local genetic correlation analysis

Genomic correlation collapses the effect of all SNPs across the whole genome. Even with a negligible global genetic correlation, there might be specific regions in the genome affecting both traits. We next quantified local genetic correlation using heritability estimation from summary statistics (ρ-HESS). The genome was partitioned into 1,703 LD-independent regions with an average size of 1.6 Mb. Firstly, genetic covariance was used to evaluate trait similarity within each LD-independent region under a fixed-effect model. Genetic covariance quantifies the covariation of traits on their original scales. To enable comparisons across different traits and genomic regions, genetic correlation was further employed to standardize covariation using the jackknife approach [15]. The MHC region was excluded from this analysis. A Bonferroni-corrected P value of 0.05/(1703 × 3) was used to define statistical significance, considering analyses across 1703 LD-independent regions for the overall and sex-specific populations.

Cross-trait meta-analysis

Genetic correlation suggests shared genetic components—either due to genetic variants having an independent effect on both traits (horizontal pleiotropy or pleiotropy) or genetic variants influencing one trait via its effect on the other (vertical pleiotropy or causality). To detect potential pleiotropic loci, we applied a cross-trait meta-analysis using cross-phenotype association analysis (CPASSOC) [16]. This algorithm integrates association evidence from multiple traits to detect variants affecting at least one trait. CPASSOC has several attractive features. In addition to allowing for sample overlap and relying only on summary statistics, it is compatible with trait heterogeneity effects, where a specific genetic variant may have varying magnitudes of effects (including different directions) on different trait [17]. Given the fact that trait homogeneity assumption is less likely to hold with multiple traits involved, test statistic SHet (rather than SHom) which improves statistical power in the presence of trait heterogeneous effects was therefore used to combine association evidence. We obtained independent top-associated loci applying PLINK LD-based clumping function (parameters: –clump-p1 5e−8 –clump-p2 1e−5 –clump-r2 0.2 –clump-kb 500). Significant pleiotropic SNPs were defined as variants with Psingle-trait < 1 × 10–5 in both traits and PCPASSOC < 5 × 10–8 in paired traits. These SNPs were further divided into four categories. First, a “known” pleiotropic SNP was one that reached genome-wide significance in both single traits (PSHBG < 5 × 10–8, PRA < 5 × 10–8, and PCPASSOC < 5 × 10–8). These SNPs were naturally pleiotropic even without performing CPASSOC. Second, a “single-trait-driven” pleiotropic SNP was one that reached genome-wide significance in one of two single traits (PSHBG < 5 × 10–8 or PRA < 5 × 10–8 and PCPASSOC < 5 × 10–8). Third, an “LD-tagged” pleiotropic SNP was one that, despite not reaching genome-wide significance in any single trait (5 × 10–8 < PSHBG / RA < 1 × 10–5 and PCPASSOC < 5 × 10–8), was in LD (r2 threshold = 0.2) with index SNPs (or any SNP located within ± 250 kb around the index SNPs) identified by single-trait GWAS(s). Finally, a “novel” pleiotropic SNP was of great interest to us, which was defined as those that neither reached genome-wide significance in any single trait nor in LD with previously identified SHBG- or RA-associated SNPs. Ensembl Variant Effect Predictor (VEP) was used to map pleiotropic SNPs to the nearest genes based on location [18].

Fine-mapping analysis

Index SNPs are not necessarily causal variants. We further identified a 99% credible set of causal variants through FM-summary method, a simplified Bayesian fine-mapping method using summary statistics. Briefly, each pleiotropic SNP and the variants within 500 kb around, extracted from the pooled results of cross-trait meta-analysis, were used as input for FM-summary. FM-summary then set a flat prior and generated a posterior inclusion probability (PIP) of a true trait/disease association for each variant using the steepest descent approximation [19, 20]. A 99% credible set is equivalent to sorting SNPs from the largest to the smallest PIPs and taking the cumulative sum of PIPs until it reaches at least 99%.

Transcriptome-wide association study

Cross-trait meta-analysis identifies pleiotropic loci affecting both traits without considering gene expression, while many pleiotropic loci influence complex traits by modulating gene expression levels. To identify relevant genes whose expression patterns vary across tissues, we performed a TWAS analysis. Imputable genes were provided by pre-trained joint-tissue imputation prediction models (GTEx v8) [21]. Gene-phenotype association analysis was performed by S-PrediXcan [21, 22]. We first performed a single-trait TWAS and then intersected these results based on gene-tissue pair to examine if they were shared across traits. Bonferroni correction was used considering the number of gene-tissue pairs tested in each trait and the analyses repeated in overall and sex-specific populations.

Bidirectional Mendelian randomization analysis

A two-sample bidirectional MR analysis was performed to identify a putative causal relationship. Inverse variance weighted (IVW) approach was used as our primary approach, assuming all IVs were valid, or the overall pleiotropy was balanced to zero [23]. A series of sensitivity analyses were conducted to validate model assumptions and to guarantee the robustness of our findings. The MR-Egger intercept test was used to reflect directional pleiotropy [24]. Weighted median approach was performed under the assumption that up to 50% of IVs contributing to analysis were invalid [25]. MR pleiotropy residual sum and outlier test (MR-PRESSO) was applied to detect outliers and obtain outlier-corrected effects [26]. Additionally, MR-PRESSO distortion test was performed to examine the discrepancies between the causal estimates before and after outlier correction [26]. IVW approach was further repeated by excluding palindromic SNPs (A/T or G/C SNPs introducing ambiguity into the identification of effect alleles) or pleiotropic SNPs (SNPs affecting phenotypes served as potential confounders of the SHBG-RA relationship, identified through LDLink, and violating the exclusion restriction assumption). Details of the excluded pleiotropic SNPs and their associated confounders are shown in Additional file 1: Table S1–S6. Direction of causality was inferred using Steiger test. Robust adjusted profile score (MR-RAPS) was further applied to validate the conformity of results. To test whether the causal estimates were driven by individual SNP, a leave-one-out analysis was performed with each SNP iteratively removed, and IVW applied using the remaining SNPs.

Statistical power was calculated using the non-centrality parameter of the test statistic as suggested by Brion et al. [27]. The proportion of variance explained by IVs was computed using the formula provided by Shim et al. [28]. Given analyses were repeated three times (overall and sex-specific populations), an assumed α of 0.05/3 was employed. The strength of the IVs was evaluated using the F-statistic.

All analyses were conducted with packages “TwoSampleMR”, “MRInstruments”, “MendelianRandomization”, “MR-PRESSO” and “mr.raps” in R v3.6.3. Given analyses were repeated three times (overall and sex-specific populations), a Bonferroni-corrected P value threshold of 0.05/3 was used to define statistical significance in MR. Additionally, a P value of < 0.05 was employed for suggestive significance.

Ethics/consent statement

This was a secondary analysis of existing, publicly available summary-level GWAS data. The statement of ethics for each GWAS can be found elsewhere, approved by relevant ethics committees [11, 12].

Results

Genetic correlations between SHBG and RA

After correcting for multiple testing (P < 0.05/3), as shown in Table 1, we found a minimal shared genetic basis between crude SHBG and RA (\(r_{{\text{g}}}\) = 0.05, P = 3.9 × 10–2). The effects remained null in both men (\({\text{r}}_{{\text{g}}}\) = 0.05, P = 3.2 × 10–2) and women (\(r_{{\text{g}}}\) = 0.04, P = 0.10), possibly due to the confounding effect of BMI as studies have found a decreased level of SHBG [29] but an increased risk of RA [30] among individuals who were overweight or obese. As expected, after adjusting for BMI, a positive genome-wide genetic correlation was observed for SHBGadjBMI and RA (\(r_{{\text{g}}}\) = 0.11, P = 1.0 × 10–4), and this effect remained consistent in sex-specific analysis (men: \(r_{{\text{g}}}\) = 0.07, P = 1.1 × 10–3, women: \(r_{{\text{g}}}\) = 0.09, P = 2.0 × 10–4). Therefore, in our subsequent analysis, SHBGadjBMI was used as primary exposure, complemented by crude SHBG.

Table 1 Genome-wide genetic correlation between sex hormone-binding globulin and rheumatoid arthritis, excluding the MHC region

Partitioning the whole genome into 1703 regions, we identified significant local genetic correlations for SHBGadjBMI with RA at one region (1q25.2) (Fig. 2). In addition, distinct genomic regions were identified in men (1q32.1-q32.2 and 5p13.1) and women (1q25.2 and 22q11.21-q11.22).

Fig. 2
figure 2

Local genetic correlation: BMI-adjusted sex hormone-binding globulin and rheumatoid arthritis (excluding MHC). X-axis represents chromosomes, Y-axis represents negative logarithm of P values, and each dot in the Manhattan plot represents a linkage-disequilibrium independent genomic region. Chromosomes and genomic regions for significant local genetic correlations are marked. MHC regions were excluded due to their complex LD pattern. SHBGadjBMI: sex hormone-binding globulin adjusted for BMI; RA: rheumatoid arthritis

Analysis of crude SHBG with RA presented insignificant results, further demonstrating the effectiveness of focusing on SHBGadjBMI (Additional file 2: Fig. S1).

Cross-trait meta-analysis of SHBG and RA

Motivated by the significant global and local genetic correlation, we further conducted a cross-trait meta-analysis to identify pleiotropic SNPs affecting both traits (Fig. 3 and Additional file 1 Table S7). Of the total 18 independent pleiotropic SNPs identified for SHBGadjBMI and RA, rs10951192 (PCPASSOC = 8.99 × 10–10) was detected as a novel SNP and located in JAZF1, a gene implicated in transcriptional repression and inflammation suppression [31]. Additionally, we identified four “known” SNPs, seven “SHBGadjBMI-driven” SNPs, three “RA-driven” SNPs, and three “LD-tagged” SNPs. The sex-specific analyses revealed distinct pleiotropic SNPs underlying SHBGadjBMI and RA for men and women. For men, 16 independent pleiotropic SNPs were detected, among which rs7512646 (PCPASSOC = 6.61 × 10–10) was novel, located in IL6R, a gene encoding a subunit of the interleukin 6 receptor complex [32] and participating in the immune response and autoimmune diseases [33]. For women, 18 independent pleiotropic SNPs were detected, two of which were novel: rs244468 (PCPASSOC = 7.70 × 10–10), located in ARHGAP26, a gene encoding GTPase activating protein that regulates tumor immunity [34] and inflammation [35], as well as rs4921915 (PCPASSOC = 3.59 × 10–10), an intergenic variant.

Fig. 3
figure 3

Cross-trait meta-analysis: BMI-adjusted sex hormone-binding globulin and rheumatoid arthritis (excluding MHC). Independent top-associated loci of sex hormone-binding globulin adjusted for BMI and rheumatoid arthritis were located in the inner and middle circles of the circular Manhattan plot, respectively. Results of cross-trait meta-analyses were listed in outer circles. Significant pleiotropic SNPs were presented by red spots, defined as variants with P SHBGadjBMI < 1 × 10–5, PRA < 1 × 10–5, and PCPASSOC < 5 × 10–8. Novel pleiotropic SNPs detected for SHBGadjBMI with RA were marked by a green spot, and defined as a significant pleiotropic SNP neither reached genome-wide significance in a single trait (5 × 10–8 < PSHBGadjBMI < 1 × 10–5, 5 × 10–8 < PRA < 1 × 10–5, and PCPASSOC < 5 × 10–8) nor in linkage-disequilibrium (LD) with previously identified SHBG- or RA-associated SNPs. MHC regions were excluded due to their complex LD pattern. Chr: chromosome; SHBGadjBMI: sex hormone-binding globulin adjusted for BMI. RA: rheumatoid arthritis

Replacing SHBGadjBMI with crude SHBG, a similar pattern of results was observed. In overall population, 70.0% of pleiotropic SNPs were either overlapped or in LD with SNPs identified using SHBGadjBMI. The proportion was 66.7% in men and 76.9% in women when analyzed by sex. Functional annotations of these pleiotropic SNPs are shown in (Additional file 1: Table S9, S10).

Fine-mapping analysis

For each of the pleiotropic SNPs, we further determined a 99% credible set of causal SNPs, providing targets for future downstream experimental analysis (Additional file 1: Table S11). In general, we found 448 candidate causal SNPs across all shared loci between SHBGadjBMI and RA. Of these, three pleiotropic SNPs (rs113243085, rs917770, and rs575623373) showed a posterior probability of 1.00. In sex-specific analysis, 327 candidate causal SNPs were identified for men, and 385 for women.

Transcriptome-wide association studies

After Bonferroni correction, single-trait TWAS identified 148 genes significantly associated with RA and 2,115 genes with SHBGadjBMI. Intersecting the single-trait TWAS results, we finally identified 16 pleiotropic genes (AC007389.1, AC007389.5, AC007613.1, AC012370.2, ACOXL-AS1, AP000553.3, CCDC116, FADS1, FADS2, PHF19, SNN, TMEM258, TRAF1, TRIM38, UBE2L3, and YDJC) shared by SHBGadjBMI and RA, and the corresponding figure was four for men and 11 for women (Additional file 1: Table S12). Notably, AC007389.1 and AC007389.5, the two pseudogenes, were originated from the same locus on chromosome 2. Moreover, seven pleiotropic genes identified through TWAS overlapped with nearby genes mapped by CPASSOC-identified SNPs.

Bidirectional Mendelian randomization analysis

We performed a bidirectional two-sample MR analysis to evaluate a causal relationship. As shown in Fig. 4 and (Additional file 1: Table S13), a 20% increased risk of RA was observed per each SD (approximately 30.3 nmol/L) increment in genetically predicted SHBGadjBMI (IVW OR = 1.20, 95% CI = 1.01–1.43). The estimates remained directionally consistent in weighted median approach although with large uncertainty. In sex-specific analyses, the magnitude of effect size indicated potential disparities between men and women, with an OR of 1.13 (95% CI = 0.93–1.34) in women, and an OR of 1.07 (95% CI = 0.90–1.26) in men, although none of these estimates reached statistical significance.

Fig. 4
figure 4

Causal associations: sex hormone-binding globulin and risk of rheumatoid arthritis in Mendelian randomization. Green squares represent odds ratio of outcomes per each SD increment in genetically predicted exposures. Bars represent 95% confidence intervals. The Mendelian randomization analysis was performed based on the inverse variance weighted approach. SHBG: sex hormone-binding globulin; SHBGadjBMI: sex hormone-binding globulin adjusted for BMI; RA: rheumatoid arthritis; SNP: Single nucleotide polymorphism; OR: odds ratio; CI: confidence interval

We performed sensitivity analyses to verify the robustness of the results. The causal associations for SHBGadjBMI with RA were remained in overall population after correcting outliers (OR = 1.21, 95% CI = 1.04–1.38), removing confounding SNPs (OR = 1.49, 95% CI = 1.10–2.01) or palindromic SNPs (OR = 1.20, 95% CI = 0.99–1.45) (Additional file 1: Table S14). Consistent findings were also observed in MR-RAPS (Additional file 1: Table S15). Leave-one-out analysis indicated small fluctuations upon the exclusion of individual SNPs (Additional file 2: Figure S2).

On the contrary, no reverse causality was observed as genetically predicted RA did not seem to affect SHBGadjBMI either in overall or sex-specific analyses. The estimates were non-significant with effect sizes close to 1. These findings were further confirmed by Steiger directionality test (all P < 1 × 10–99) (Additional file 1: Table S13).

Replacing SHBGadjBMI with crude SHBG, similar but insignificant results were observed. Genetically predicted levels of crude SHBG may increase the risk of RA overall (IVW OR = 1.18, 95% CI = 0.99–1.41), but this trend was not observed either in men (IVW OR = 1.03, 95% CI = 0.86–1.22) or women (IVW OR = 1.10, 95% CI = 0.91–1.32).

Under the current sample size of outcome (97,173 with 23.00% of RA cases) and assuming an α of 0.05/3, given the phenotypic variance explained by SHBGadjBMI-associated IVs to be 0.178, our study had 80% power to detect a 6% change for the risk of RA with SHBGadjBMI. The corresponding effect sizes were 6% for men and 7% for women. For crude SHBG, with the phenotypic variance explained by index SNPs to be 0.132, our study had 80% power to detect a 7% change, with the corresponding effect sizes of 6% for men and 8% for women (Additional file 1: Table S16).

Discussion

To the best of our knowledge, this is the first large-scale genome-wide cross-trait design that comprehensively investigates the shared genetic architecture underlying SHBG and RA. We found a significant shared genetic basis, both globally and locally, between SHBGadjBMI and RA overall. Such a genetic correlation could be further decomposed into horizontal pleiotropy and vertical pleiotropy, reflected by the multiple independent pleiotropic loci identified in cross-trait meta-analysis as well as the putative causal relationship confirmed by MR. Looking into men and women separately, most findings were replicated in both sexes, while sex-specific results were also observed, reflecting a distinct role of SHBG in affecting RA susceptibility across sex. Replacing SHBGadjBMI with crude SHBG, a largely similar yet less significant pattern of results was observed, corroborating main findings.

The SHBG-RA relationship remains largely inconclusive in clinical and epidemiological investigations [6,7,8]. A study suggested a lower SHBG level in RA patients [7], while other studies did not support this finding [6, 8]. Additionally, it is challenging to determine whether the decreased SHBG levels are caused by RA itself or by glucocorticoid treatment, a common method of treating RA which may result to decreased SHBG level [36]. Our findings, however, overcame the limitation of these studies and extended prior findings in several important ways. Through a comprehensive genome-wide cross-trait design, our study for the first time confirmed an intrinsic link underlying SHBG and RA, expanding beyond observational associations. The significant global and local SHBG-RA genetic correlations support a shared genetic basis unconfounded by environmental factors, of which magnitude (\(r_{{\text{g}}}\) = 0.11) is noteworthy in comparison with figures estimated among autoimmune diseases, e.g., between RA and systemic lupus erythematosus (\(r_{{\text{g}}}\) = 0.40, P = 6.01 × 10–10) or between RA and its well-established risk factor obesity (\(r_{{\text{g}}}\) ranging from − 0.01 to 0.08, all P > 0.05) or between RA and its possible consequence bone mineral density (\(r_{{\text{g}}}\) = − 0.059, P = 0.005) [37, 38]. Our study found a non-negligible genetic correlation of 11%, indicating a strong pathogenic link. The identification of multiple pleiotropic loci provides further evidence of a shared genetic basis. The validities of these pleiotropic SNPs were supported by fine-mapping analysis which identified three loci to be highly likely causal, and by TWAS analysis which overlapped with nearby genes mapped by CPASSOC-identified SNPs. We hereby provide a description of some of these pleiotropic genes. UBE2L3, a “known” pleiotropic gene, plays a role in both RA and sex hormone signaling [39, 40]. PHF19, TRAF1, and CCDC116 have been previously confirmed to be associated with RA [41,42,43]. However, their potential roles in SHBG were implicated by us. PHF19 is involved in the pathway of RNA polymerase I promoter opening and gene expression (transcription); TRAF1 is related to apoptosis and autophagy; CCDC116, a protein-coding gene located in centrosome, may be involved in RNA processing [44]. The level of SHBG may be regulated through these pathways. SNN, a protein-coding gene playing a role in the toxic effects of organotin [45] and endosomal maturation [46], has been proved to be linked to SHBG [11] and a number of other autoimmune diseases [47] including ankylosing spondylitis, psoriasis, ulcerative colitis, Crohn’s disease, and sclerosing cholangitis. AC007389.1 and AC007389.5, the two pseudogenes originating from the same locus, may interactively influence gene expression regulation. Finally, leveraging summary statistics of the hitherto largest GWAS(s) of both traits, the statistical power of our MR was greatly improved compared with previous MR [10]—involving a 37-fold increased number of IVs (481 vs. 13) and a six-fold augmented number of RA cases (22,350 vs. 4,017), we were able to detect a modest effect of 6–8% per standard deviation increment of genetically predicted SHBG with RA which previous studies were not well-powered to detect The negligible causal effect size observed in previous MR (OR = 1.003) is perhaps of limited clinical relevance in comparison with our findings (OR = 1.20). We were also able to determine the direction of association via bidirectional MR, suggesting SHBG may plays a pathogenic role in the development of RA rather than a secondary accompanying abnormality.

In addition to an overall landscape, distinct genetic architecture and biological mechanisms were identified for men and women separately through our sex-specific analysis. In addition to sex-specific shared genetic regions, nearly half of the pleiotropic loci we identified were sex-specific. We also observed potential differences in the pattern of pleiotropic genes between sexes, which may indicate the distinct pathways. Future studies are needed to validate the sex-specific role of these genes. TRIM38 and ZSCAN23 are men-specific genes. The former plays a role in modulating the severity of autoimmune disease [48] and negatively regulates inflammatory responses triggered by TLR3/4 and TNF/IL-1β [49] and nuclear factor (NF)-κB signaling [50]. ZSCAN23 has also been identified as an immune-related target gene [51]. In women, FADS2 is a member of the fatty acid desaturase gene family, which is involved in the pathways of alpha-linolenic acid metabolism and arachidonate biosynthesis III. Moreover, studies have shown that long-chain polyunsaturated fatty acids, synthesized by FADS2, have anti-inflammatory properties, and may be involved in regulating immune function [52, 53]. RGL2, a protein-coding gene, relates to the pathway of immune response antigen presentation by MHC class II. These identified genes may provide insights into the underlying sex-specific biological mechanisms.

Our analyses must be interpreted with caution. First, due to limited data availability, we could not examine RA subtypes characterized by anti-citrullinated antibody status or classify subgroups according to the severity of RA [41]. To our knowledge, nearly 90% of RA cases involved in our RA GWAS summary statistics were seropositive. Moreover, it appears that the seropositive and seronegative RA share heritability and have similar risk alleles outside of the MHC locus [54, 55]. Second, our study was conducted restricted to European ancestry populations. While this reduces population stratification and genetic heterogeneity, it also constrains the generalizability of findings to other ethnicities. Third, we need to be cautious in explaining the identified potential pleiotropic SNPs and genes through computational analysis. Further experimental validation is needed to strengthen the underlying biological mechanisms that we have detected. Fourth, adjusting for BMI in SHBG GWAS(s) could lead to BMI-associated SNPs being mistakenly identified as SHBG-associated SNPs (BMI as a collider), which may further violate the independence assumption of MR analysis (IVs are not associated with confounders). However, this collider bias has been minimized in the original GWAS study by discarding all loci which changed effect direction and/or had large changes in effect estimate and statistical significance when compared to the unadjusted model [11]. Moreover, we found a comparable effect of SHBG and SHBGadjBMI on RA (1.19 vs. 1.21) in MR analysis, which indicates a less likely biased result. Fifth, considering the uncertainty of the effect in a few sensitivity analyses and in sex-specific MR analysis, further validation may be needed to establish a robust causal relationship between SHBGadjBMI and RA.

Conclusions

In conclusion, leveraging the hitherto largest genome-wide genetic data and advanced statistical approaches, the current study expands understanding of the observational association of SHBG with RA by providing evidence of genetic correlation, pleiotropic loci, and causal relationships. Our findings demonstrate an intrinsic, as well as a potential sex-specific link underlying SHBG and RA, and further shed novel light on biological mechanisms. Future studies are warranted to validate the function of the identified variants and genes, and to extend our findings on the diagnostic and therapeutic values of SHBG in RA.

Availability of data and materials

All the data used in this study were from the publicly accessible GWAS summary statistics and can be accessed through the corresponding references presented in the main text.

Abbreviations

BMI:

Body mass index

CPASSOC:

Cross-phenotype association analysis

GWAS:

Genome-wide association studies

IVW:

Inverse variance weighted

LD:

Linkage disequilibrium

LDSC:

LD-score regression

MAF:

Minor allele frequency

MR:

Mendelian randomization

MR-PRESSO:

MR pleiotropy residual sum and outlier

MR-RAPS:

Robust adjusted profile score

MVMR:

Multivariable MR

PC:

Principal component

PIP:

Posterior inclusion probability

RA:

Rheumatoid arthritis

SHBG:

Sex hormone-binding globulin

SHBGadjBMI:

BMI-adjusted SHBG

SNP:

Single nucleotide polymorphism

TWAS:

Transcriptome-wide association study

UKBB:

UK Biobank

VEP:

Variant Effect Predictor

ρ-HESS:

Heritability estimation from summary statistics

References

  1. Deane KD, Demoruelle MK, Kelmenson LB, Kuhn KA, Norris JM, Holers VM. Genetic and environmental risk factors for rheumatoid arthritis. Best Pract Res Clin Rheumatol. 2017;31(1):3–18.

    PubMed  PubMed Central  Google Scholar 

  2. Krasselt M, Baerwald C. Sex, symptom severity, and quality of life in rheumatology. Clin Rev Allergy Immunol. 2019;56(3):346–61.

    PubMed  Google Scholar 

  3. Alpízar-Rodríguez D, Finckh A. Environmental factors and hormones in the development of rheumatoid arthritis. Sem Immunopathol. 2017;39(4):461–8.

    Google Scholar 

  4. Selby C. Sex hormone binding globulin: origin, function and clinical significance. Ann Clin Biochem. 1990;27(Pt 6):532–41.

    PubMed  Google Scholar 

  5. Goldstajn MS, Toljan K, Grgic F, Jurkovic I, Baldani DP. Sex hormone binding globulin (SHBG) as a marker of clinical disorders. Coll Antropol. 2016;40(3):211–8.

    PubMed  Google Scholar 

  6. Hall GM, Perry LA, Spector TD. Depressed levels of dehydroepiandrosterone sulphate in postmenopausal women with rheumatoid arthritis but no relation with axial bone density. Ann Rheum Dis. 1993;52(3):211–4.

    CAS  PubMed  PubMed Central  Google Scholar 

  7. Mirone L, Altomonte L, D’Agostino P, Zoli A, Barini A, Magaro M. A study of serum androgen and cortisol levels in female patients with rheumatoid arthritis. Correlation with disease activity. Clin Rheumatol. 1996;15(1):15–9.

    CAS  PubMed  Google Scholar 

  8. Tengstrand B, Carlstrom K, Hafstrom I. Bioavailable testosterone in men with rheumatoid arthritis-high frequency of hypogonadism. Rheumatology. 2002;41(3):285–9.

    CAS  PubMed  Google Scholar 

  9. Smith GD, Ebrahim S. “Mendelian randomization”: Can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol. 2003;32(1):1–22.

    PubMed  Google Scholar 

  10. Qu Z, Huang J, Yang F, Hong J, Wang W, Yan S. Sex hormone-binding globulin and arthritis: a Mendelian randomization study. Arthritis Res Ther. 2020;22(1):118.

    CAS  PubMed  PubMed Central  Google Scholar 

  11. Ruth KS, Day FR, Tyrrell J, Thompson DJ, Wood AR, Mahajan A, et al. Using human genetics to understand the disease impacts of testosterone in men and women. Nat Med. 2020;26(2):252–8.

    CAS  PubMed  PubMed Central  Google Scholar 

  12. Ishigaki K, Sakaue S, Terao C, Luo Y, Sonehara K, Yamaguchi K, et al. Multi-ancestry genome-wide association analyses identify novel genetic mechanisms in rheumatoid arthritis. Nat Genet. 2022;54(11):1640–51.

    CAS  PubMed  PubMed Central  Google Scholar 

  13. Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J, Schizophrenia Working Group of the Psychiatric Genomics C, et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47(3):291–5.

    CAS  PubMed  PubMed Central  Google Scholar 

  14. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47(11):1236–41.

    CAS  PubMed  PubMed Central  Google Scholar 

  15. Shi H, Mancuso N, Spendlove S, Pasaniuc B. Local genetic correlation gives insights into the shared genetic architecture of complex traits. Am J Hum Genet. 2017;101(5):737–51.

    CAS  PubMed  PubMed Central  Google Scholar 

  16. Zhu X, Feng T, Tayo BO, Liang J, Young JH, Franceschini N, et al. Meta-analysis of correlated traits via summary statistics from GWASs with an application in hypertension. Am J Hum Genet. 2015;96(1):21–36.

    CAS  PubMed  PubMed Central  Google Scholar 

  17. Li X, Zhu X. Cross-phenotype association analysis using summary statistics from GWAS. Methods Mol Biol. 2017;1666:455–67.

    CAS  PubMed  PubMed Central  Google Scholar 

  18. McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GR, Thormann A, et al. The ensembl variant effect predictor. Genome Biol. 2016;17(1):122.

    PubMed  PubMed Central  Google Scholar 

  19. Huang H, Fang M, Jostins L, Umićević Mirkov M, Boucher G, Anderson CA, et al. Fine-mapping inflammatory bowel disease loci to single-variant resolution. Nature. 2017;547(7662):173–8.

    CAS  PubMed  PubMed Central  Google Scholar 

  20. Maller JB, McVean G, Byrnes J, Vukcevic D, Palin K, Su Z, et al. Bayesian refinement of association signals for 14 loci in 3 common diseases. Nat Genet. 2012;44(12):1294–301.

    CAS  PubMed  PubMed Central  Google Scholar 

  21. Zhou D, Jiang Y, Zhong X, Cox NJ, Liu C, Gamazon ER. A unified framework for joint-tissue transcriptome-wide association and Mendelian randomization analysis. Nat Genet. 2020;52(11):1239–46.

    CAS  PubMed  PubMed Central  Google Scholar 

  22. Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, Wheeler HE, Torres JM, et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9(1):1825.

    PubMed  PubMed Central  Google Scholar 

  23. Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37(7):658–65.

    PubMed  PubMed Central  Google Scholar 

  24. Burgess S, Thompson SG. Interpreting findings from Mendelian randomization using the MR-Egger method. Eur J Epidemiol. 2017;32(5):377–89.

    PubMed  PubMed Central  Google Scholar 

  25. Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet Epidemiol. 2016;40(4):304–14.

    PubMed  PubMed Central  Google Scholar 

  26. Verbanck M, Chen C-Y, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet. 2018;50(5):693–8.

    CAS  PubMed Central  Google Scholar 

  27. Brion M-JA, Shakhbazov K, Visscher PM. Calculating statistical power in Mendelian randomization studies. Int J Epidemiol. 2012;42(5):1497–501.

    PubMed Central  Google Scholar 

  28. Shim H, Chasman DI, Smith JD, Mora S, Ridker PM, Nickerson DA, et al. A multivariate genome-wide association analysis of 10 LDL subfractions, and their response to statin treatment, in 1868 Caucasians. PLoS ONE. 2015;10(4):e0120758.

    PubMed  PubMed Central  Google Scholar 

  29. Wei S, Schmidt MD, Dwyer T, Norman RJ, Venn AJ. Obesity and menstrual irregularity: associations with SHBG, testosterone, and insulin. Obesity. 2009;17(5):1070–6.

    CAS  PubMed  Google Scholar 

  30. Crowson CS, Matteson EL, Davis JM 3rd, Gabriel SE. Contribution of obesity to the rise in incidence of rheumatoid arthritis. Arthritis Care Res. 2013;65(1):71–7.

    Google Scholar 

  31. Chen W, Zhong Y, Yuan Y, Zhu M, Hu W, Liu N, et al. New insights into the suppression of inflammation and lipid accumulation by JAZF1. Genes & Diseases. 2022.

  32. Larsen JV, Petersen CM. SorLA in interleukin-6 signaling and turnover. Mol Cell Biol. 2017;37(11).

  33. Kang S, Tanaka T, Narazaki M, Kishimoto T. Targeting interleukin-6 signaling in clinic. Immunity. 2019;50(4):1007–23.

    CAS  PubMed  Google Scholar 

  34. He H, Huang J, Wu S, Jiang S, Liang L, Liu Y, et al. The roles of GTPase-activating proteins in regulated cell death and tumor immunity. J Hematol Oncol. 2021;14:1–15.

    CAS  Google Scholar 

  35. Connelly TM, Berg AS, Harris LR III, Hegarty JP, Ruggiero FM, Deiling SM, et al. T-cell activation rho GTPase–activating protein expression varies with inflammation location and severity in Crohn’s disease. J Surg Res. 2014;190(2):457–64.

    CAS  PubMed  Google Scholar 

  36. Navarro MA, Nolla JM, Machuca MI, González A, Mateo L, Bonnin RM, et al. Salivary testosterone in postmenopausal women with rheumatoid arthritis. J Rheumatol. 1998;25(6):1059–62.

    CAS  PubMed  Google Scholar 

  37. Tang B, Shi H, Alfredsson L, Klareskog L, Padyukov L, Jiang X. Obesity-related traits and the development of rheumatoid arthritis: evidence from genetic data. Arthritis Rheumatol. 2021;73(2):203–11.

    CAS  PubMed  Google Scholar 

  38. Yu XH, Yang YQ, Cao RR, Cai MK, Zhang L, Deng FY, et al. Rheumatoid arthritis and osteoporosis: shared genetic effect, pleiotropy and causality. Hum Mol Genet. 2021;30(21):1932–40.

    CAS  PubMed  Google Scholar 

  39. Orozco G, Eyre S, Hinks A, Bowes J, Morgan AW, Wilson AG, et al. Study of the common genetic background for rheumatoid arthritis and systemic lupus erythematosus. Ann Rheum Dis. 2011;70(3):463–8.

    PubMed  Google Scholar 

  40. Christou EAA, Banos A, Kosmara D, Bertsias GK, Boumpas DT. Sexual dimorphism in SLE: above and beyond sex hormones. Lupus. 2019;28(1):3–10.

    CAS  PubMed  Google Scholar 

  41. Okada Y, Wu D, Trynka G, Raj T, Terao C, Ikari K, et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature. 2014;506(7488):376–81.

    CAS  PubMed  Google Scholar 

  42. Chang M, Rowland CM, Garcia VE, Schrodi SJ, Catanese JJ, van der Helm-van Mil AH, et al. A large-scale rheumatoid arthritis genetic study identifies association at chromosome 9q33. 2. PLoS Genet. 2008;4(6):e1000107.

    PubMed  PubMed Central  Google Scholar 

  43. Sharma A, Liu X, Hadley D, Hagopian W, Chen W-M, Onengut-Gumuscu S, et al. Identification of non-HLA genes associated with development of islet autoimmunity and type 1 diabetes in the prospective TEDDY cohort. J Autoimmun. 2018;89:90–100.

    CAS  PubMed  PubMed Central  Google Scholar 

  44. He P, Xia W, Wang L, Wu J, Guo YF, Zeng KQ, et al. Identification of expression quantitative trait loci (eQTLs) in human peripheral blood mononuclear cells (PBMCs) and shared with liver and brain. J Cell Biochem. 2018;119(2):1659–69.

    CAS  PubMed  Google Scholar 

  45. Davidson CE, Reese BE, Billingsley ML, Yun JK. Stannin, a protein that localizes to the mitochondria and sensitizes NIH-3T3 cells to trimethyltin and dimethyltin toxicity. Mol Pharmacol. 2004;66(4):855–63.

    CAS  PubMed  Google Scholar 

  46. Pueyo JI, Magny EG, Sampson CJ, Amin U, Evans IR, Bishop SA, et al. Hemotin, a regulator of phagocytosis encoded by a small ORF and conserved across metazoans. PLoS Biol. 2016;14(3):e1002395.

    PubMed  PubMed Central  Google Scholar 

  47. Ellinghaus D, Jostins L, Spain SL, Cortes A, Bethune J, Han B, et al. Analysis of five chronic inflammatory diseases identifies 27 new associations and highlights disease-specific patterns at shared loci. Nat Genet. 2016;48(5):510–8.

    CAS  PubMed  PubMed Central  Google Scholar 

  48. Lin Y, Luo Z. NLRP6 facilitates the interaction between TAB2/3 and TRIM38 in rheumatoid arthritis fibroblast-like synoviocytes. FEBS Lett. 2017;591(8):1141–9.

    CAS  PubMed  Google Scholar 

  49. Hu M-M, Shu H-B. Multifaceted roles of TRIM38 in innate immune and inflammatory responses. Cell Mol Immunol. 2017;14(4):331–8.

    CAS  PubMed  PubMed Central  Google Scholar 

  50. Hu S, Li Y, Wang B, Peng K. TRIM38 protects chondrocytes from IL-1β-induced apoptosis and degeneration via negatively modulating nuclear factor (NF)-κB signaling. Int Immunopharmacol. 2021;99:108048.

    CAS  PubMed  Google Scholar 

  51. Wang L, Rishishwar L, Mariño-Ramírez L, Jordan IK. Human population-specific gene expression and transcriptional network modification with polymorphic transposable elements. Nucleic Acids Res. 2016;45(5):2318–28.

    PubMed Central  Google Scholar 

  52. Zhao Y, Yang G, Wu N, Cao X, Gao J. Integrated transcriptome and phosphoproteome analyses reveal that fads2 is critical for maintaining body LC-PUFA homeostasis. J Proteom. 2020;229:103967.

    CAS  Google Scholar 

  53. Philippou E, Nikiphorou E. Are we really what we eat? Nutrition and its role in the onset of rheumatoid arthritis. Autoimmun Rev. 2018;17(11):1074–7.

    CAS  PubMed  Google Scholar 

  54. Frisell T, Holmqvist M, Källberg H, Klareskog L, Alfredsson L, Askling J. Familial risks and heritability of rheumatoid arthritis: role of rheumatoid factor/anti-citrullinated protein antibody status, number and type of affected relatives, sex, and age. Arthritis Rheum. 2013;65(11):2773–82.

    CAS  PubMed  Google Scholar 

  55. Padyukov L, Seielstad M, Ong RT, Ding B, Rönnelid J, Seddighzadeh M, et al. A genome-wide association study suggests contrasting associations in ACPA-positive versus ACPA-negative rheumatoid arthritis. Ann Rheum Dis. 2011;70(2):259–65.

    PubMed  Google Scholar 

Download references

Acknowledgements

We thank all investigators and consortium for sharing the genome-wide summary statistics.

Funding

Open access funding provided by Karolinska Institute. Dr. Xia Jiang is supported by a starting grant of Karolinska Institutet (2-1534/2020). Open Access funding was provided by Karolinska Institutet.

Author information

Authors and Affiliations

Authors

Contributions

YJ, QL, and XJ analyzed the data and interpreted the results. YJ and XJ wrote the manuscript. XJ and LA contributed to conceptualization. QL, LK, IK and XJ provided statistical expertise. QL, LA, LK, and IK revised the manuscript. All authors provided the corresponding author with permission to be named in the manuscript. XJ is the guarantor of this study. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Xia Jiang.

Ethics declarations

Ethics approval and consent to participate

This was a secondary analysis of existing, publicly available summary-level GWAS data. The statement of ethics for each GWAS can be found elsewhere, approved by relevant ethics committees.

Consent for publication

Not applicable.

Competing interests

The authors declare no conflicts of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

The characteristic of sex hormone-binding globulin adjusted for BMI associated index SNPs, their effect sizes with exposure and outcome, as well as their associations with potential confounders. Table S2. The characteristic of sex hormone-binding globulin adjusted for BMI associated index SNPs in men, their effect sizes with exposure and outcome, as well as their associations with potential confounders. Table S3. The characteristic of sex hormone-binding globulin adjusted for BMI associated index SNPs in women, their effect sizes with exposure and outcome, as well as their associations with potential confounders. Table S4. The characteristic of crude sex hormone-binding globulin associated index SNPs, their effect sizes with exposure and outcome, as well as their associations with potential confounders. Table S5. The characteristic of crude sex hormone binding globulin associated index SNPs in men, their effect sizes with exposure and outcome, as well as their associations with potential confounders. Table S6. The characteristic of crude sex hormone-binding globulin associated index SNPs in women, their effect sizes with exposure and outcome, as well as their associations with potential confounders. Table S7. Cross-trait meta-analysis between sex hormone-binding globulin adjusted for BMI and rheumatoid arthritis, excluding the MHC region (PCPASSOC < 5×10−8, PSHBGadjBMI < 1×10−5 and PRA < 1×10−5). Table S8. Cross-trait meta-analysis between crude sex hormone-binding globulin and rheumatoid arthritis, excluding the MHC region (PCPASSOC < 5×10−8, PSHBG < 1×10−5 and PRA < 1×10−5). Table S9. Functional annotation of SNPs shared between sex hormone-binding globulin adjusted for BMI and rheumatoid arthritis identified from the cross-trait meta-analysis, excluding the MHC region. Table S10. Functional annotation of SNPs shared between crude sex hormone-binding globulin and rheumatoid arthritis identified from the cross-trait meta-analysis, excluding the MHC region. Table S11. List of 99% credible set SNPs in each sex hormone-binding globulin adjusted for BMI and rheumatoid arthritis shared locus identified from fine-mapping analysis (r2 threshold = 0.6). Table S12. Shared significant genes between sex hormone binding globulin adjusted for BMI and rheumatoid arthritis from transcriptome-wide association studies using gene expressions across 49 GTEx tissues. Table S13. Bidirectional causal associations between genetically predicted sex hormone-binding globulin levels and risk of rheumatoid arthritis. Table S14. Sensitivity analysis of causal association between genetically predicted sex hormone-binding globulin levels and risk of rheumatoid arthritis. Table S15. Causal association between sex hormone-binding globulin levels and risk of rheumatoid arthritis based on the robust adjusted profile score (RAPS). Table S16. Power calculation in assumed and actual OR in our study.

Additional file 2: Fig. S1.

Local genetic correlation between crude sex hormone-binding globulin and rheumatoid arthritis. Colored bars represent loci with significant local genetic correlation, covariance, and SNP-heritability after multiple testing adjustment. SHBG: sex hormone-binding globulin; RA: rheumatoid arthritis. Fig. S2. Leaving one SNP out at a time for the association between sex hormone-binding globulin and rheumatoid arthritis.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jiang, Y., Liu, Q., Alfredsson, L. et al. A genome-wide cross-trait analysis identifies genomic correlation, pleiotropic loci, and causal relationship between sex hormone-binding globulin and rheumatoid arthritis. Hum Genomics 17, 81 (2023). https://doi.org/10.1186/s40246-023-00528-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40246-023-00528-x

Keywords