Integrated proteomic and metabolomic modules identified as biomarkers of mortality in the Atherosclerosis Risk in Communities study and the African American Study of Kidney Disease and Hypertension

Zhou, Linda; Surapaneni, Aditya; Rhee, Eugene P.; Yu, Bing; Boerwinkle, Eric; Coresh, Josef; Grams, Morgan E.; Schlosser, Pascal

doi:10.1186/s40246-022-00425-9

Research
Open access
Published: 03 November 2022

Integrated proteomic and metabolomic modules identified as biomarkers of mortality in the Atherosclerosis Risk in Communities study and the African American Study of Kidney Disease and Hypertension

Linda Zhou¹,
Aditya Surapaneni¹,
Eugene P. Rhee²,
Bing Yu³,
Eric Boerwinkle³,
Josef Coresh¹,
Morgan E. Grams^1,4 &
…
Pascal Schlosser¹

Human Genomics volume 16, Article number: 53 (2022) Cite this article

3451 Accesses
5 Citations
1 Altmetric
Metrics details

Abstract

Background

Proteins and metabolites are essential for many biological functions and often linked through enzymatic or transport reactions. Individual molecules have been associated with all-cause mortality. Many of these are correlated and might jointly represent pathways or endophenotypes involved in diseases.

Results

We present an integrated analysis of proteomics and metabolomics via a local dimensionality reduction clustering method. We identified 224 modules of correlated proteins and metabolites in the Atherosclerosis Risk in Communities (ARIC) study, a general population cohort of older adults (N = 4046, mean age 75.7, mean eGFR 65). Many of the modules displayed strong cross-sectional associations with demographic and clinical characteristics. In comprehensively adjusted analyses, including fasting plasma glucose, history of cardiovascular disease, systolic blood pressure and kidney function among others, 60 modules were associated with mortality. We transferred the network structure to the African American Study of Kidney Disease and Hypertension (AASK) (N = 694, mean age 54.5, mean mGFR 46) and identified mortality associated modules relevant in this disease specific cohort. The four mortality modules relevant in both the general population and CKD were all a combination of proteins and metabolites and were related to diabetes / insulin secretion, cardiovascular disease and kidney function. Key components of these modules included N-terminal (NT)-pro hormone BNP (NT-proBNP), Sushi, Von Willebrand Factor Type A, EGF And Pentraxin (SVEP1), and several kallikrein proteases.

Conclusion

Through integrated biomarkers of the proteome and metabolome we identified functions of (patho-) physiologic importance related to diabetes, cardiovascular disease and kidney function.

Background

The metabolome and the proteome are inextricably linked and essential to human physiology. Proteins perform many different biological functions, from enzymatic activity to molecular transport, and metabolites are often intermediates or end-products of these reactions. Metabolites are central to energy generation and homeostasis and their concentrations are often tightly regulated through generation, transport across compartments, as well as breakdown and excretion [1,2,3].

Over the past decade, many publications have identified single metabolites or proteins that are associated with all-cause mortality [4,5,6,7,8,9,10]. For example, Hu et al. [10] identified six serum metabolites associated with all-cause mortality in chronic kidney disease. In a study of 3523 participants from the Framingham Heart Study, 38 of 85 preselected circulating protein biomarkers were associated with all-cause mortality, and the addition of proteins to a model with traditional clinical variables improved all-cause mortality prediction [4]. When evaluating larger read-outs of the metabolome and proteome; however, many biomarkers are correlated, and an integrated analysis of both metabolomic and proteomic platforms may better elucidate pathways altered early in the disease process. Together, proteins and metabolites influence and are influenced by many externally observed phenotypes, representing endophenotypes that simultaneously highlight disease relevant physiology [11]. Similarly, many diseases are characterized by de-regulated pathways rather than single metabolic reactions [12].

In this manuscript, we performed data-driven identification of pathways (modules) based on circulating proteins and metabolites in the Atherosclerosis Risk in Communities (ARIC) study and constructed aggregate measures of these modules. For this, we used Netboost, a network-analysis-based dimension reduction technique [13, 14]. In this approach, proteins and metabolites are clustered into modules based on Spearman correlation, and then module information is aggregated by a principal component analysis. We then characterized associated modules with respect to human physiology and related them to mortality. To study their relevance for CKD, we transferred the mortality-associated modules to the African American Study of Kidney Disease and Hypertension (AASK) and tested the association with mortality within this cohort of CKD patients and found in particular insulin, cardiovascular and kidney function-related modules.

Results

ARIC study population characteristics

The 4027 participants in the ARIC study population were an average of 76.6 years old, with 53.9% women and 17.1% African American (Table 1). In the AASK CKD cohort, there were 694 participants who were an average of 54.5 years old, with 38.5% women and 100% African American.

Table 1 Baseline characteristics of participants in ARIC and AASK

Full size table

Integrated omics module formation and characterization in ARIC

The 4616 proteins and 474 metabolites (Fig. 1) were clustered into 224 modules in the ARIC data (Fig. 2, Additional file 1: Table S1). There were 81 proteins and 12 metabolites that remained unassigned. The mean module size was 22.3 proteins and / or metabolites; 119 modules consisted exclusively of proteins, 61 modules consisted exclusively of metabolites, and 44 modules were a combination of proteins and metabolites. There were 371 principal components (PCs) used to represent the 224 modules (Methods).

Module PCs were related to clinical variables, demonstrating that many modules reflect a specific phenotype (Additional file 1: Table S2). For example, > 50% of the variance in the first PCs of module 25, and module 211 were explained by sex, which is consistent with many of the protein / metabolite components being hormonal regulation proteins / metabolites. The estimated glomerular filtration rate (eGFR) explained 80.5% of the variance of PC1 of module 15 and included creatinine and cystatin C among other proteins/metabolites that are known biomarkers of kidney filtration (Additional file 1: Table S2). Similarly, many of the other variables were strongly related to modules (Module 92—glucose 41.1%; module 116—high-density lipoprotein (HDL) 39.6%; module 9—total cholesterol 42.5%).

Associations of modules with mortality

Over an average follow-up period of 6.6 years, there were 924 deaths. There were 64 module PCs that were significantly associated with mortality in ARIC in a comprehensively adjusted model (P < 0.05/371; Methods) representing 60 different modules (Additional file 1: Table S3). The most significant associations were module 67 PC1 (HR per SD: 1.39, p-value = 1.0e−16) and module 30 PC1 (HR per SD:0.74, p-value = 9.9e−15). The local network structures as two dimensional projections of their pairwise dissimilarities display the varying degree of linkage between the proteomic and metabolomics layers of these modules (Fig. 3). Of note, module 30 included the two aptamers of SVEP1 which are consistently highly linked. Module 67 showed that the metabolite ribitol was close to the six proteins in that module, whereas beta-citrylglutamate in module 30 was more loosely linked to a central cluster of proteins including the two SVEP1 aptamers and N-terminal pro BNP.

Transferability of modules to AASK

After transferring module membership and the PC loadings to AASK, the average Spearman correlation of module components (i.e., proteins and metabolites) to the first PC were consistent with that observed in ARIC (correlation of the average correlation coefficients, 0.91, Fig. 4). More than a third (36.2%) of the modules even had higher average Spearman correlation coefficients of proteins / metabolites with the first PC in AASK compared to ARIC, despite the PC directions being fitted on the ARIC data. Relatively few modules displayed a noticeable drop in correlation (Δcorrelation < − 0.1, 12.5%). Similarly, the regressions of module PCs on clinical traits were comparable between AASK and ARIC, particularly sex, eGFR / measured GFR (mGFR) and urinary albumin-to-creatinine ratio (ACR) / 24 h urine protein levels displayed high agreement. Notably, age displayed low transferability between the general population cohort (ARIC) and the CKD cohort (AASK) (Fig. 5). However, this appeared related to the positive association of GFR and age in AASK, an artifact of the CKD study design. Once age was adjusted for eGFR, we observed consistent correlations of age-module PCs between ARIC and AASK (Fig. 5).

In AASK, there were 148 deaths over 8.75 follow-up years. Of the 64 associations significant in ARIC 60 were direction consistent and four were also significant in AASK (P < 0.05/64; Table 2, Additional file 1: Table S3). All of these were mixed modules with both proteins and metabolites (Modules 30, 42, 67 and 98). The hazard ratios in AASK were consistently more pronounced than the ones in ARIC and explained a considerably proportion of risk with hazard ratios ranging from 0.61 to 1.49 per standard deviation unit.

Table 2 Mortality associations of the four modules associated in both the Atherosclerosis Risk in Communities study (ARIC) general population cohort and the African American Study of Kidney Disease and Hypertension (AASK) cohort of patients with chronic kidney disease

Full size table

Discussion

Metabolites and proteins are intricately linked: as substrates and enzymes, in allosteric interactions, and the assembly of protein complexes. However, few studies simultaneously evaluate the proteome and metabolome. In the present study, we integrate proteomic and metabolomic data into correlation-driven modules, demonstrate face validity through cross-sectional associations with baseline phenotypes, clinical relevance via linkage to mortality, and generalizability through transferal to a CKD cohort. We identified 60 modules of proteins and metabolites significantly associated with mortality in the general population and four of them additionally associated in the CKD cohort. As testament to the utility of combining multiple sources of omics data, all four of the modules were mixed, containing both proteins and metabolites.

We can discern specific pathological patterns associated with the four modules. For example, module 67 can be placed in the context of insulin secretion and diabetes, with many of its components associated with diabetes risk. Chiro-inositol is a secondary messenger in the insulin signaling pathway. It modulates insulin secretion, the mitochondrial respiratory chain, and glycogen storage [15]. Ribitol has been associated with diabetic retinopathy stage and was closely correlated to the module proteins in our study (Fig. 3) [16]. The protein TSP2 has been associated with levels of plasma glucose (P < 0.001), insulin (P < 0.01) and homeostasis model assessment of insulin resistance (HOMA-IR) (P < 0.001) by Morikawa et al. [17]. ApoA1, ApoB, and the ApoB/A1 ratio have been suggested as early indicators for predicting type II diabetes [18]. In fact, each of the module components has been implicated with insulin, risk of diabetes or both in some manner (ADAM17 [19], ATL2 [20], MGP [21], SPLC2 [22], N-methylproline [23], 3-methylhistidine [24]). Taken together, this nominates new connections between the module components and proposes module 67 as a biomarker of diabetes.

Module 30 relates to cardiovascular disease, with several of the individual components associated with hypertension and heart disease. A missense variant of the sushi, von Willebrand factor type A, EGF and pentraxin domain containing SVEP1 has been associated with coronary artery disease [25] . N-terminal pro BNP and galectin-3 are prognostic biomarkers of acute heart failure [26]. Kallikrein is active in multiple proteolytic reactions, including that of the kallikrein-kinin system and the renin-angiotensin system, and thus helps regulate blood pressure. It has been suggested that kallikrein inhibitors may have utility in the treatment of cardiovascular disease [27]. Interestingly, reduced urinary kallikrein levels have been associated with the development of high blood pressure, which is one of the major risk factors in the development of cardiac hypertrophy, ischemic heart disease, and cardiac failure [28]. Finally, the sole metabolite in module 30, beta-citrylglutamate has been associated with the single nucleotide polymorphism (SNP) rs10911021 on chromosome 1q25 and this SNP is associated with coronary heart disease in patients with type 2 diabetes [29]. Interestingly, in a recent review while some other serpins have been associated with cardiovascular pathologies SPB13 had no known pathophysiological links [30].

Module 98 and its components are related to kidney function. PC1 of module 98 showed a high correlation with GFR (cor_ARIC = 0.52, cor_AASK = 0.44; Additional file 1: Table S2). The mortality-associated PC2 of module 98 showed correlations with both sex (cor_ARIC = 0.4, cor_AASK = 0.38) and GFR (cor_ARIC = 0.25, cor_AASK = 0.26). Of its components high plasma guanidinoacetate-to-homoarginine ratio is associated with high all-cause mortality rate in adult renal transplant recipients with a hazard ratio of 1.35 [95% CI 1.19–1.53]) [31] Moreover, guanidinoacetate is very closely correlated to the proteins in the module (Additional file 1: Fig. S1). Lower kidney clearances of kynurete, a highly protein-bound solute, were associated with significantly greater risks of CKD progression [32] and has been reported to be in close association with xanthurenate [33]. ANGL3 plays a critical role in nephrotic syndrome, among several other diseases [34]. Considering the comprehensive adjustment of our mortality analyses, including sex and GFR, the module illustrates the data-driven pathway effect that goes beyond GFR-related mortality but still might reflect some form of kidney function. Notably, to our knowledge ENPP5 and GDF-11/8, the most central components of the module (Additional file 1: Table S1 and Fig. S1) have not been well studied in relation to kidney physiology.

Lastly, for module 42 we did not observe a clear pattern across all twelve components (six proteins, six metabolites). While some of the metabolites are involved in the tryptophan pathway and/or relate to kidney function (N-formylanthranilic acid, phenylacetylglutamate, anthranilate) [35], the first PC was only moderately associated with GFR (cor_ARIC = 0.35, cor_AASK = 0.30) and other components were associated with rare disorders of sulfur amino acid metabolism (cysteine s-sulfate) [36] or immune response (NRP1) [37].

A major strength of this study is the use of network methods to integrate proteins and metabolites in well-designed cohorts with large sample sizes of population and events, long follow-up, extensive metabolomics and proteomics panels, and the demonstration of transferability to an external population very different from the initial cohort. Through the unsupervised rank-based design of the network abstraction, we were able to identify data-driven pathways across the two omics domains and simultaneously structure our data and reduce the multiple testing burden. Literature review underlined the consistency of the identified modules in the endpoint associations and provided initial hypotheses with respect to their potentially shared biological pathways.

Limitations included that Netboost, similar to other correlation-based approaches, does not infer causal relations and module membership in some instances might be confounded by external influences, i.e., module members might be downstream of a common cause. Second, biological networks as reflected in proteomics and metabolomics data are complex and different network methodologies might identify different aspects of the underlying physiology. Hence, the modules inferred in our analyses should not be viewed as absolute but rather as one representation and other approaches might highlight further aspects relevant to mortality. Third, this is the first application of Netboost to proteomics data. While the approach has not been validated for this datatype, proteomics shares many of the distributional properties of metabolomics. Finally, the two cohorts are quite distinct and thus only a subset of the ARIC mortality associations was reproducible in the younger AASK CKD cohort. Whether this relates to the underlying biology or limited sample size remains to be determined. While the small sample size did limit power for the evaluation of the associations with mortality, those that do appear were among the strongest in the ARIC general population cohort and are well supported in their generalizability.

Conclusions

This study identifies integrated biomarkers of the proteome and metabolome that relate to physiological and pathological changes important in human health and disease. We used a novel clustering technique to begin to unravel how correlated proteins and metabolites together contribute to adverse health outcomes in addition to established risk factors. Future studies are needed to explore the co-regulation of proteins and metabolites in a functional manner and to apply the findings on mortality risk with prevention and treatment in mind.

Methods

Study population

The ARIC study is a prospective community-based cohort of 15,792 individuals who were recruited and enrolled between 1987 and 1989 from four US communities (Forsyth County, NC; Jackson, MS; Minneapolis suburbs, MN; Washington County, MD). Details on the ARIC study design and methods have been previously published [38]. During the fifth study visit between 2011 and 2013 blood samples were collected for quantification of plasma protein and serum metabolite levels. Institutional review boards at each field center have approved of the study and written informed consent has been obtained from participants at baseline and follow-up visits. All 4046 participants with available proteomic and metabolomic profiling at visit 5 (61.6% of study visit participants) were included. The censoring date for follow-up was December 31st, 2018.

The AASK study was a trial of 1094 adult African Americans aged 18–70 years with hypertensive chronic kidney disease (mGFR 20–65 ml/min per 1.73 m²) recruited from 21 clinical centers in the United States. AASK trial enrollment occurred between February 1995 and September 1998, and the trial phase ended in September 2001. All 694 participants with available proteomic and metabolomic profiling at baseline in the trial phase were included in our analysis [39].

Proteomic and metabolomic profiling

ARIC has a uniform blood collection protocol (https://sites.cscc.unc.edu/aric/Cohort_Manuals/Blood_Collection_And_Processing_7.PDF) for serum separate tubes (SST) and EDTA tubes across all 4 sites. EDTA tubes were spun (3000 g for 10 min at 4 °C) and plasma frozen. Similarly, AASK has a routine blood collection protocol for SSTs (https://repository.niddk.nih.gov/studies/aask-trial/MOOP/). In ARIC, 5282 plasma proteins were quantified in ARIC participants using a Slow Off-rate Modified Aptamer–based capture array and plasma collected at visit 5, using the SomaScan® platform v4. Similar procedures, using the expanded SomaScan® v4.1 platform, were applied to serum samples from the baseline visit in AASK, resulting in quantification of 7596 serum proteins in the AASK study [39]. For both studies, proteins were log₂-transformed to account for skewed raw value distributions, and values outside of 5 SDs on the log₂-scale were winsorized. In addition, we excluded proteins if the Bland Altman coefficient of variation among blind duplicate samples was greater than 0.5 (Fig. 1). The final analysis included only human proteins that were quantified in both cohorts (N = 4616).

Serum metabolite profiling was performed using untargeted mass spectrometry following standard protocols at Metabolon, Inc. (Morrisville, NC) using the SST samples in both studies (HD4 Platform). There were 970 and 820 metabolites of known identity quantified in the ARIC and AASK study, respectively [40]. Xenobiotics were excluded during preprocessing. Endogenous metabolites with > 80% missing was excluded. All metabolites were scaled to a median of 1 and log₂-transformed, and metabolites with variance < 0.01 on log₂-scales were removed. The final analysis included only metabolites that were quantified in both cohorts (N = 474). Missing data were imputed with minimum values (0.71% of the combined protein and metabolite analysis dataset) and capped at 5 standard deviations above or below the mean (Fig. 1).

Module formation

Netboost is an unsupervised three-step dimension reduction technique developed in the context of DNA methylation and gene expression data [13]. In brief, first, unrelated variable pairs are filtered such that a sparse correlation-based network can be constructed on the strongest network edges. Second, variables are hierarchically clustered into modules based on the sparse network. Modules form a data-driven partition of all metabolites and proteins included in the analysis. The background module consists of 81 proteins and 12 metabolites that were left without closely related components. Third, module-aggregated measures are quantified using the PCs of each module except the background module. In this study, we used Netboost to characterize modules using combined proteomic and metabolomic data similar to previous applications to mass spectrometry data [41, 42]. The minimal module size was set to two, distance measures were based on Spearman coefficients, and robust PCs were used [13]. Highly correlated preface modules (i.e., modules with correlation of the first PCs greater than 0.9) were merged to further reduce the dimensionality. Three PCs of the modules were exported, or fewer if they already accounted for at least 50% of the module variance.

Characterization of modules and association with mortality

After identifying modules of proteins and metabolites using Netboost in ARIC, to characterize modules we regressed module PCs on clinical traits. Clinical traits included age, sex, eGFR, ACR, HDL, body mass index (BMI), fasting plasma glucose, total cholesterol, systolic blood pressure, history of cardiovascular disease (CVD), and history of smoking. eGFR was defined using the CKD Epi 2009 equation using creatinine and cystatin C.

Next, we evaluated the associations between the module PCs and mortality using Cox proportional hazards models. Analyses were adjusted for age, sex, race-center, eGFR [43], CVD, history of smoking, diabetes, fasting plasma glucose, log 2 transformed ACR, systolic blood pressure, antihypertensive medications, HDL, total cholesterol, and BMI. Adjustment for total cholesterol and BMI used linear splines with knots at 200 mg/dL and 25 kg/m², respectively [44, 45].

Transferability of modules and relevance in a cohort with CKD

We next evaluated whether module membership transferred to a separate cohort with CKD patients. To do this, module memberships and PC loadings developed from the ARIC cohort were applied to the AASK cohort. Cross sectional regression models with the same clinical traits were used to characterize the modules and compared with those done in ARIC. To account for the AASK study design where participants were selected based on mGFR 20–65 ml/min per 1.73 m², we additionally calculated correlations with age residuals from a regression on GFR.

As in ARIC, a Cox proportional hazards model was used to test for associations between the module PCs and mortality. Only those modules that had a statistically significant association with mortality in ARIC were tested in AASK. In AASK, model covariates included age, sex, mGFR, CVD, history of smoking, fasting plasma glucose, log 2 transformed 24 h urine protein levels, systolic blood pressure, HDL, total cholesterol, and BMI. Again, adjustment for total cholesterol and BMI used linear splines with knots at 200 mg/dL and 25 kg/m², respectively [44, 45].

Both ARIC and AASK study analyses accounted for multiple testing by a Bonferroni adjustment for the number of analyses (P-value < 0.05/371 and P-value < 0.05/64, respectively).

Availability of data and materials

Pre-existing data access policies for each of the parent cohort studies specify that research data requests can be submitted to each steering committee; these will be promptly reviewed for confidentiality or intellectual property restrictions and will not unreasonably be refused. Please refer to the data sharing policies of these studies on https://www2.cscc.unc.edu/aric/node/10303 (ARIC) and https://repository.niddk.nih.gov/studies/aask-trial/ (AASK).

References

Kelly RS, Chawes BL, Blighe K, et al. An integrative transcriptomic and metabolomic study of lung function in children with asthma. Chest. 2018;154(2):335–48.
Article PubMed PubMed Central Google Scholar
Kottgen A, Raffler J, Sekula P, Kastenmuller G. Genome-wide association studies of metabolite concentrations (mGWAS): relevance for nephrology. Semin Nephrol. 2018;38(2):151–74.
Article PubMed Google Scholar
Dubin RF, Rhee EP. Proteomics and metabolomics in kidney disease, including insights into etiology, treatment, and prevention. Clin J Am Soc Nephrol. 2020;15(3):404–11. https://doi.org/10.2215/CJN.07420619.
Article CAS PubMed Google Scholar
Ho JE, Lyass A, Courchesne P, et al. Protein biomarkers of cardiovascular disease and mortality in the community. J Am Heart Assoc. 2018;7(14):e008108. https://doi.org/10.1161/JAHA.117.008108.
Article CAS PubMed PubMed Central Google Scholar
Yu B, Heiss G, Alexander D, Grams ME, Boerwinkle E. Associations between the serum metabolome and all-cause mortality among african americans in the atherosclerosis risk in communities (ARIC) study. Am J Epidemiol. 2016;183(7):650–6. https://doi.org/10.1093/aje/kwv213.
Article PubMed PubMed Central Google Scholar
Deelen J, Kettunen J, Fischer K, et al. A metabolic profile of all-cause mortality risk identified in an observational study of 44,168 individuals. Nat Commun. 2019;10(1):3346. https://doi.org/10.1038/s41467-019-11311-9.
Article CAS PubMed PubMed Central Google Scholar
Harris TB, Ferrucci L, Tracy RP, et al. Associations of elevated interleukin-6 and C-reactive protein levels with mortality in the elderly. Am J Med. 1999;106(5):506–12.
Article CAS PubMed Google Scholar
Orwoll ES, Wiedrick J, Jacobs J, et al. High-throughput serum proteomics for the identification of protein biomarkers of mortality in older men. Aging Cell. 2018. https://doi.org/10.1111/acel.12717.
Article PubMed PubMed Central Google Scholar
Li Z, Zhong W, Lv Y, et al. Associations of plasma high-sensitivity C-reactive protein concentrations with all-cause and cause-specific mortality among middle-aged and elderly individuals. Immun Ageing. 2019;16(1):28. https://doi.org/10.1186/s12979-019-0168-5.
Article CAS PubMed PubMed Central Google Scholar
Hu JR, Coresh J, Inker LA, et al. Serum metabolites are associated with all-cause mortality in chronic kidney disease. Kidney Int. 2018;94(2):381–9.
Article CAS PubMed PubMed Central Google Scholar
Suhre K, Arnold M, Bhagwat AM, et al. Connecting genetic risk to disease end points through the human blood plasma proteome. Nat Commun. 2017;8:14357. https://doi.org/10.1038/ncomms14357.
Article CAS PubMed PubMed Central Google Scholar
Gomari DP, Schweickart A, Cerchietti L, et al. Variational autoencoders learn universal latent representations of metabolomics data. bioRxiv. 2021. https://doi.org/10.1101/2021.01.14.426721.
Article Google Scholar
Schlosser P, Knaus J, Schmutz M, et al. Netboost: Boosting-supported network analysis improves high-dimensional omics prediction in acute myeloid leukemia and huntington’s disease. IEEE/ACM Trans Comput Biol Bioinform. 2021;18(6):2635–48. https://doi.org/10.1109/TCBB.2020.2983010.
Article CAS PubMed Google Scholar
Schlosser P, Li Y, Sekula P, et al. Genetic studies of urinary metabolites illuminate mechanisms of detoxification and excretion in humans. Nat Genet. 2020;52(2):167–76. https://doi.org/10.1038/s41588-019-0567-8.
Article CAS PubMed PubMed Central Google Scholar
Wright JD, Folsom AR, Coresh J, et al. The ARIC (atherosclerosis risk in communities) study: JACC focus seminar 3/8. J Am Coll Cardiol. 2021;77(23):2939–59.
Article PubMed PubMed Central Google Scholar
Curovic VR, Suvitaival T, Mattila I, et al. Circulating metabolites and lipids are associated to diabetic retinopathy in individuals with type 1 diabetes. Diabetes. 2020;69(10):2217. https://doi.org/10.2337/db20-0104.
Article CAS PubMed PubMed Central Google Scholar
Morikawa N, Adachi H, Enomoto M, et al. Thrombospondin-2 as a potential risk factor in a general population. Int Heart J. 2019;60(2):310–7. https://doi.org/10.1536/ihj.18-246.
Article CAS PubMed Google Scholar
Gao L, Zhang Y, Wang X, Dong H. Association of apolipoproteins A1 and B with type 2 diabetes and fasting blood glucose: a cross-sectional study. BMC Endocr Disord. 2021;21(1):59. https://doi.org/10.1186/s12902-021-00726-5.
Article CAS PubMed PubMed Central Google Scholar
Shalaby L, Thounaojam M, Tawfik A, et al. Role of endothelial ADAM17 in early vascular changes associated with diabetic retinopathy. J Clin Med. 2020;9(2):400. https://doi.org/10.3390/jcm9020400.
Article CAS PubMed Central Google Scholar
Lundbäck V, Kulyté A, Arner P, Strawbridge RJ, Dahlman I. Genome-wide association study of diabetogenic adipose morphology in the GENetics of adipocyte lipolysis (GENiAL) cohort. Cells. 2020;9(5):1085. https://doi.org/10.3390/cells9051085.
Article CAS PubMed Central Google Scholar
Antonopoulos S, Mylonopoulou M, Angelidi AM, Kousoulis AA, Tentolouris N. Association of matrix γ-carboxyglutamic acid protein levels with insulin resistance and lp(a) in diabetes: a cross-sectional study. Diabetes Res Clin Pract. 2017;130:252–7.
Article CAS PubMed Google Scholar
Nandula SR, Huxford I, Wheeler TT, Aparicio C, Gorr SU. The parotid secretory protein BPIFA2 is a salivary surfactant that affects lipopolysaccharide action. Exp Physiol. 2020;105(8):1280–92. https://doi.org/10.1113/EP088567.
Article CAS PubMed PubMed Central Google Scholar
Chai JC, Chen GC, Yu B, et al. Serum metabolomics of incident diabetes and glycemic changes in a population with high diabetes burden: the hispanic community health study/study of latinos. Diabetes. 2022;71(6):1338–49. https://doi.org/10.2337/db21-1056.
Article CAS PubMed Google Scholar
Marchesini G, Forlani G, Zoli M, Vannini P, Pisi E. Muscle protein breakdown in uncontrolled diabetes as assessed by urinary 3-methylhistidine excretion. Diabetologia. 1982;23(5):456–8. https://doi.org/10.1007/BF00260962.
Article CAS PubMed Google Scholar
Winkler MJ, Müller P, Sharifi AM, et al. Functional investigation of the coronary artery disease gene SVEP1. Basic Res Cardiol. 2020;115(6):67. https://doi.org/10.1007/s00395-020-00828-6.
Article CAS PubMed PubMed Central Google Scholar
Sani MU, Damasceno A, Davison BA, et al. N-terminal pro BNP and galectin-3 are prognostic biomarkers of acute heart failure in sub-saharan africa: lessons from the BAHEF trial. ESC Heart Fail. 2021;8(1):74–84. https://doi.org/10.1002/ehf2.13032.
Article PubMed Google Scholar
Kolte D, Shariat-Madar Z. Plasma kallikrein inhibitors in cardiovascular disease: an innovative therapeutic approach. Cardiol Rev. 2016;24(3):99–109. https://doi.org/10.1097/CRD.0000000000000069.
Article PubMed Google Scholar
Sharma JN, Narayanan P. The kallikrein-kinin pathways in hypertension and diabetes. Prog Drug Res. 2014;69:15–36.
PubMed Google Scholar
Pipino C, Shah H, Prudente S, et al. Association of the 1q25 diabetes-specific coronary heart disease locus with alterations of the γ-glutamyl cycle and increased methylglyoxal levels in endothelial cells. Diabetes. 2020;69(10):2206–16. https://doi.org/10.2337/db20-0475.
Article CAS PubMed PubMed Central Google Scholar
Sánchez-Navarro A, González-Soria I, Caldiño-Bohn R, Bobadilla NA. An integrative view of serpins in health and disease: the contribution of SerpinA3. Am J Physiol Cell Physiol. 2021;320(1):C106–18. https://doi.org/10.1152/ajpcell.00366.2020.
Article PubMed Google Scholar
Hanff E, Said MY, Kayacelebi AA, et al. High plasma guanidinoacetate-to-homoarginine ratio is associated with high all-cause and cardiovascular mortality rate in adult renal transplant recipients. Amino Acids. 2019;51(10–12):1485–99. https://doi.org/10.1007/s00726-019-02783-6.
Article CAS PubMed Google Scholar
Chen Y, Zelnick LR, Wang K, et al. Kidney clearance of secretory solutes is associated with progression of CKD: the CRIC study. J Am Soc Nephrol. 2020;31(4):817–27. https://doi.org/10.1681/ASN.2019080811.
Article CAS PubMed PubMed Central Google Scholar
Cheng Y, Li Y, Benkowitz P, Lamina C, Köttgen A, Sekula P. The relationship between blood metabolites of the tryptophan pathway and kidney function: a bidirectional mendelian randomization analysis. Sci Rep. 2020;10(1):12675–x. https://doi.org/10.1038/s41598-020-69559-x.
Article CAS PubMed PubMed Central Google Scholar
Jiang S, Qiu GH, Zhu N, Hu ZY, Liao DF, Qin L. ANGPTL3: a novel biomarker and promising therapeutic target. J Drug Target. 2019;27(8):876–84. https://doi.org/10.1080/1061186X.2019.1566342.
Article CAS PubMed Google Scholar
Barrios C, Beaumont M, Pallister T, et al. Gut-microbiota-metabolite axis in early renal function decline. PLoS One. 2015;10(8):e0134311. https://doi.org/10.1371/journal.pone.0134311.
Article CAS PubMed PubMed Central Google Scholar
Olney JW, Misra CH, Gubareff TD. Cysteine-S-sulfate: Brain damaging metabolite in sulfite oxidase Deficiency1. J Neuropathol Exp Neurol. 1975;34(2):167–77. https://doi.org/10.1097/00005072-197503000-00005.
Article CAS PubMed Google Scholar
Chaudhary B, Khaled YS, Ammori BJ, Elkord E. Neuropilin 1: function and therapeutic potential in cancer. Cancer Immunol Immunother. 2014;63(2):81–99. https://doi.org/10.1007/s00262-013-1500-0.
Article CAS PubMed Google Scholar
Aric Investigators. The atherosclerosis risk in communities (ARIC) study: design and objectives the ARIC investigators. Am J Epidemiol. 1989;129(4):687–702.
Article Google Scholar
Grams ME, Surapaneni A, Chen J, et al. Proteins associated with risk of kidney function decline in the general population. J Am Soc Nephrol. 2021;32(9):2291. https://doi.org/10.1681/ASN.2020111607.
Article PubMed PubMed Central Google Scholar
Luo S, Coresh J, Tin A, et al. Serum metabolomic alterations associated with proteinuria in CKD. Clin J Am Soc Nephrol. 2019;14(3):342–53. https://doi.org/10.2215/CJN.10010818.
Article CAS PubMed PubMed Central Google Scholar
Bernard L, Zhou L, Surapaneni A, et al. Serum metabolites and kidney outcomes: the atherosclerosis risk in communities study. Kidney Med. 2022;4(9):100522. https://doi.org/10.1016/j.xkme.2022.100522.
Article PubMed PubMed Central Google Scholar
Bächle H, Sekula P, Schlosser P et al. Uromodulin and its association with urinary metabolites: the german chronic kidney disease study. Nephrol Dial Transplant. 2022. https://doi.org/10.1093/ndt/gfac187.
Levey AS, Stevens LA, Schmid CH, et al. A new equation to estimate glomerular filtration rate. Ann Intern Med. 2009;150(9):604–12.
Article PubMed PubMed Central Google Scholar
Bhaskaran K, Dos-Santos-Silva I, Leon DA, Douglas IJ, Smeeth L. Association of BMI with overall and cause-specific mortality: a population-based cohort study of 3·6 million adults in the UK. Lancet Diabetes Endocrinol. 2018;6(12):944–53.
Article PubMed PubMed Central Google Scholar
Yi S, Yi J, Ohrr H. Total cholesterol and all-cause mortality by sex and age: a prospective cohort study among 128 million adults. Sci Rep. 2019;9(1):1596. https://doi.org/10.1038/s41598-018-38461-y.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors thank the staff and participants of the ARIC and AASK study for their important contributions. The opinions presented do not necessarily represent those of the NIDDK, the NIH, the Department of Health and Human Services, or the US Government. The interpretation and reporting of these data are the responsibility of the authors and in no way should be seen as an official policy or interpretation of the US Government. SomaLogic Inc. conducted the SomaScan assays in exchange for use of ARIC data.

Funding

The work of Pascal Schlosser was funded by the German Research Foundation (DFG) grant SCHL 2292/1-1 (Walter Benjamin Fellowship), and the EQUIP Program for Medical Scientists, Faculty of Medicine, University of Freiburg. The work of Morgan Grams was funded by NIDDK: R01 DK108803, R01 DK124399, NHLBI: K24 HL155861. The Atherosclerosis Risk in Communities study has been funded in whole or in part with Federal funds from the National Heart, Lung, and Blood Institute, National Institutes of Health, Department of Health and Human Services, under Contract nos. (75N92022D00001, 75N92022D00002, 75N92022D00003, 75N92022D00004, 75N92022D00005). The metabolite data at ARIC visit 5 was supported by R01HL141824. The proteomic data at ARIC visit 5 was supported in part by NIH/NHLBI grant R01 HL134320.

Author information

Authors and Affiliations

Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, 2024 E. Monument St., Baltimore, MD, 21287, USA
Linda Zhou, Aditya Surapaneni, Josef Coresh, Morgan E. Grams & Pascal Schlosser
Nephrology Division and Endocrine Unit, Massachusetts General Hospital, Boston, MA, USA
Eugene P. Rhee
Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, University of Texas Health Science Center at Houston, Houston, TX, USA
Bing Yu & Eric Boerwinkle
Division of Precision Medicine, Department of Medicine, New York University, New York, NY, USA
Morgan E. Grams

Authors

Linda Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Aditya Surapaneni
View author publications
You can also search for this author in PubMed Google Scholar
Eugene P. Rhee
View author publications
You can also search for this author in PubMed Google Scholar
Bing Yu
View author publications
You can also search for this author in PubMed Google Scholar
Eric Boerwinkle
View author publications
You can also search for this author in PubMed Google Scholar
Josef Coresh
View author publications
You can also search for this author in PubMed Google Scholar
Morgan E. Grams
View author publications
You can also search for this author in PubMed Google Scholar
Pascal Schlosser
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Research idea and study design: LZ, AS, JC, MG, PS; Data acquisition: BY, EB, MG, JC; Data analysis/interpretation: LZ, AS, EPR, JC, MG, PS; Supervision or mentorship: MG, PS. Each author contributed important intellectual content during manuscript drafting or revision and agrees to be personally accountable for the individual’s own contributions and to ensure that questions pertaining to the accuracy or integrity of any portion of the work. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Pascal Schlosser.

Ethics declarations

Ethics approval and consent to participate

The Atherosclerosis Risk In Communities (ARIC) study was approved by the IRB of the University of North Carolina at Chapel Hill, Johns Hopkins University, University of Mississippi Medical Center, Wake Forest University, University of Minnesota, Brigham and Women's Hospital, and Baylor College of Medicine. The African American Study of Kidney Disease and Hypertension clinical protocol was approved by the Institutional Review Board (IRB) of each participating institution, and each patient provided informed consent.

Consent for publication

All authors have approved the manuscript and give their consent for submission and publication.

Competing interests

The authors declare no competing interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Additional materials including the module memberships and annotations of proteins/metabolites; the cross-sectional associations of modules and participant characteristics; the mortality associations of modules; and network representations of module 42 and 98.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Zhou, L., Surapaneni, A., Rhee, E.P. et al. Integrated proteomic and metabolomic modules identified as biomarkers of mortality in the Atherosclerosis Risk in Communities study and the African American Study of Kidney Disease and Hypertension. Hum Genomics 16, 53 (2022). https://doi.org/10.1186/s40246-022-00425-9

Download citation

Received: 18 August 2022
Accepted: 18 October 2022
Published: 03 November 2022
DOI: https://doi.org/10.1186/s40246-022-00425-9

Integrated proteomic and metabolomic modules identified as biomarkers of mortality in the Atherosclerosis Risk in Communities study and the African American Study of Kidney Disease and Hypertension

Abstract

Background

Results

Conclusion

Background

Results

ARIC study population characteristics

Integrated omics module formation and characterization in ARIC

Associations of modules with mortality

Transferability of modules to AASK

Discussion

Conclusions

Methods

Study population

Proteomic and metabolomic profiling

Module formation

Characterization of modules and association with mortality

Transferability of modules and relevance in a cohort with CKD

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Human Genomics

Contact us