Polymorphisms in transcription factor binding sites and enhancer regions and pancreatic ductal adenocarcinoma risk

Ünal, Pelin; Lu, Ye; Bueno-de-Mesquita, Bas; van Eijck, Casper H. J.; Talar-Wojnarowska, Renata; Szentesi, Andrea; Gazouli, Maria; Kreivenaite, Edita; Tavano, Francesca; Małecka-Wojciesko, Ewa; Erőss, Bálint; Oliverius, Martin; Bunduc, Stefania; Nóbrega Aoki, Mateus; Vodickova, Ludmila; Boggi, Ugo; Giaccherini, Matteo; Kondrackiene, Jurate; Chammas, Roger; Palmieri, Orazio; Theodoropoulos, George E.; Bijlsma, Maarten F.; Basso, Daniela; Mohelnikova-Duchonova, Beatrice; Soucek, Pavel; Izbicki, Jakob R.; Kiudelis, Vytautas; Vanella, Giuseppe; Arcidiacono, Paolo Giorgio; Włodarczyk, Barbara; Hackert, Thilo; Schöttker, Ben; Uzunoglu, Faik G.; Bambi, Franco; Goetz, Mara; Hlavac, Viktor; Brenner, Hermann; Perri, Francesco; Carrara, Silvia; Landi, Stefano; Hegyi, Péter; Dijk, Frederike; Maiello, Evaristo; Capretti, Giovanni; Testoni, Sabrina Gloria Giulia; Petrone, Maria Chiara; Stocker, Hannah; Ermini, Stefano; Archibugi, Livia; Gentiluomo, Manuel; Cavestro, Giulia Martina; Pezzilli, Raffaele; Di Franco, Gregorio; Milanetto, Anna Caterina; Sperti, Cosimo; Neoptolemos, John P.; Morelli, Luca; Vokacova, Klara; Pasquali, Claudio; Lawlor, Rita T.; Bazzocchi, Francesca; Kupcinskas, Juozas; Capurso, Gabriele; Campa, Daniele; Canzian, Federico

doi:10.1186/s40246-024-00576-x

Research
Open access
Published: 02 February 2024

Polymorphisms in transcription factor binding sites and enhancer regions and pancreatic ductal adenocarcinoma risk

Pelin Ünal¹,
Ye Lu¹,
Bas Bueno-de-Mesquita²,
Casper H. J. van Eijck³,
Renata Talar-Wojnarowska⁴,
Andrea Szentesi⁵,
Maria Gazouli⁷,
Edita Kreivenaite⁸,
Francesca Tavano⁹,
Ewa Małecka-Wojciesko⁴,
Bálint Erőss^5,10,11,
Martin Oliverius¹²,
Stefania Bunduc^10,11,13,
Mateus Nóbrega Aoki¹⁴,
Ludmila Vodickova^15,16,17,
Ugo Boggi¹⁸,
Matteo Giaccherini¹⁹,
Jurate Kondrackiene⁸,
Roger Chammas²⁰,
Orazio Palmieri⁹,
George E. Theodoropoulos²¹,
Maarten F. Bijlsma^22,23,
Daniela Basso²⁴,
Beatrice Mohelnikova-Duchonova²⁵,
Pavel Soucek¹⁵,
Jakob R. Izbicki²⁶,
Vytautas Kiudelis⁸,
Giuseppe Vanella^27,28,
Paolo Giorgio Arcidiacono²⁷,
Barbara Włodarczyk⁴,
Thilo Hackert²⁹,
Ben Schöttker^30,31,
Faik G. Uzunoglu²⁶,
Franco Bambi³²,
Mara Goetz²⁶,
Viktor Hlavac¹⁵,
Hermann Brenner^30,31,33,34,
Francesco Perri⁹,
Silvia Carrara³⁵,
Stefano Landi¹⁹,
Péter Hegyi^5,6,10,11,
Frederike Dijk³⁶,
Evaristo Maiello³⁷,
Giovanni Capretti^38,39,
Sabrina Gloria Giulia Testoni²⁷,
Maria Chiara Petrone²⁷,
Hannah Stocker^30,31,
Stefano Ermini³²,
Livia Archibugi^27,28,
Manuel Gentiluomo¹⁹,
Giulia Martina Cavestro⁴⁰,
Raffaele Pezzilli⁴¹,
Gregorio Di Franco⁴²,
Anna Caterina Milanetto⁴³,
Cosimo Sperti⁴³,
John P. Neoptolemos²⁹,
Luca Morelli⁴²,
Klara Vokacova^16,17,
Claudio Pasquali⁴³,
Rita T. Lawlor⁴⁴,
Francesca Bazzocchi⁴⁵,
Juozas Kupcinskas⁸,
Gabriele Capurso^27,28,
Daniele Campa¹⁹ &
…
Federico Canzian¹

Human Genomics volume 18, Article number: 12 (2024) Cite this article

1605 Accesses
3 Altmetric
Metrics details

Abstract

Genome-wide association studies (GWAS) are a powerful tool for detecting variants associated with complex traits and can help risk stratification and prevention strategies against pancreatic ductal adenocarcinoma (PDAC). However, the strict significance threshold commonly used makes it likely that many true risk loci are missed. Functional annotation of GWAS polymorphisms is a proven strategy to identify additional risk loci. We aimed to investigate single-nucleotide polymorphisms (SNP) in regulatory regions [transcription factor binding sites (TFBSs) and enhancers] that could change the expression profile of multiple genes they act upon and thereby modify PDAC risk. We analyzed a total of 12,636 PDAC cases and 43,443 controls from PanScan/PanC4 and the East Asian GWAS (discovery populations), and the PANDoRA consortium (replication population). We identified four associations that reached study-wide statistical significance in the overall meta-analysis: rs2472632(A) (enhancer variant, OR 1.10, 95%CI 1.06,1.13, p = 5.5 × 10⁻⁸), rs17358295(G) (enhancer variant, OR 1.16, 95%CI 1.10,1.22, p = 6.1 × 10⁻⁷), rs2232079(T) (TFBS variant, OR 0.88, 95%CI 0.83,0.93, p = 6.4 × 10⁻⁶) and rs10025845(A) (TFBS variant, OR 1.88, 95%CI 1.50,1.12, p = 1.32 × 10⁻⁵). The SNP with the most significant association, rs2472632, is located in an enhancer predicted to target the coiled-coil domain containing 34 oncogene. Our results provide new insights into genetic risk factors for PDAC by a focused analysis of polymorphisms in regulatory regions and demonstrating the usefulness of functional prioritization to identify loci associated with PDAC risk.

Introduction

Despite rapid advances in modern medical technology and significant improvements in survival rates of many cancers, pancreatic ductal adenocarcinoma (PDAC) is still highly lethal, with a 5-year survival after diagnosis of 11% [1]. PDAC is rarely detected at an early stage, and its etiology is still not completely clear [2, 3]. As a consequence, there is an urgent need to construct a successful PDAC risk assessment model to identify susceptible individuals for prevention or early detection and advance our understanding of pancreatic carcinogenesis. The ultimate goal is to reduce the incidence and mortality of PDAC.

Cigarette smoking, increased body mass index, heavy alcohol consumption, and a diagnosis of diabetes mellitus have all been demonstrated to increase the risk of PDAC [4]. Family history of pancreatic cancer has been associated with increased risk, suggesting that inherited genetic factors also play an essential role, with approximately 5–10% of PDAC patients reporting a family history of pancreatic cancers [5].

Among inherited genetic factors, single-nucleotide polymorphisms (SNPs) are the most frequently studied variations, mainly in genome-wide association studies (GWAS). Thanks to GWAS, many associations of genome-wide significance (p < 5 × 10⁻⁸) have been reported between genetic variants and common diseases and traits [6]. These associations have led to insights into the architecture of disease susceptibility which might lead to advances in clinical care and personalized medicine. The number of independent susceptibility variants for PDAC has been estimated to be nearly 2000 according to a method to estimate the degree of polygenicity [7]; however, only 30 independent loci at genome-wide significance level have been discovered so far [8]. Therefore, a large number of PDAC risk SNPs remains to be found [9, 10].

Much research is focused on genetic variants in protein-coding regions because their potential impact on proteins is relatively easy to predict; however, the majority of risk variants are located in non-coding regions. Non-coding variants are unrelated to the final amino acid sequences and protein functions such as DNA binding, catalytic activity, and ligand–receptor interaction. However, the possible effect of these variants is differential gene expression.

Secondary analyses have been conducted on existing GWAS data to identify novel loci, and additional cases and controls have been genotyped from independent populations, which has also been implemented successfully on PDAC. For instance, we and others have successfully investigated the association of SNPs in long noncoding RNAs (lncRNAs) [11], microRNAs [12], expression quantitative trait loci (eQTLs) [13] and particular pathway-related genes [14,15,16] for PDAC, resulting in several novel germline risk loci.

Here, we focused on genetic variants located in two major types of regulatory regions, enhancers and transcription factor (TF) binding sites (TFBSs). Enhancers are cis-acting DNA sequences that can boost gene transcription and therefore play a critical role in regulating tissue-specific gene expression. They typically function independently of orientation and at varying distances from their target promoters [17]. TFBSs are another major class of non-coding regulatory regions. They are frequently found clustered in short sequences of 5–30 nucleotides within the promoters [18]. SNPs of these two types of regulatory regions can directly constitute an important part of regulation in the human genome through altered binding affinity for TFs. Since TFs recognize and bind specific DNA sequences and affect the expression of target genes, polymorphic variants located in TFBSs and enhancers could perturb transcription factor binding and eventually alter gene expression [19]. With this research, we sought to assess whether SNPs in these regulatory regions are germline cancer susceptibility gene variants in PDAC by using GWAS data.

Material and methods

Study populations

As the discovery population, genotyping data from PanScan I, PanScan II, PanScan III, and PanC4 were downloaded from the database of Genotypes and Phenotypes (dbGaP) website (study accession numbers: phs000206.v5.p3 and phs000648.v1.p1, project reference: #12644). All the individuals were genotyped using Illumina InfiniumHumanHap550v3 (PanScan I), Illumina InfiniumHuman610-Quad (PanScan II), OmniExpress arrays (PanScan III) or HumanOmniExpressExome-8v1 (PanC4) DNA Analysis Genotyping BeadChips. After merging the four genotype datasets (hereafter referred to as PanScan/PanC4), we performed imputation on the genotype data with the TOPMed imputation panel (version TOPMed-r2), followed by quality control steps. We excluded subjects with cryptic relatedness (PI_HAT > 0.2), gender mismatches, and variants with a minor allele frequency (MAF) < 0.01, completion rate and call rate < 98%, low-quality imputation score (INFO score < 0.7), evidence for violations of Hardy–Weinberg equilibrium (p < 1 × 10⁻⁵), leaving 7,509,345 variants genotyped on 14,266 individuals (7205 cases and 7061 controls) in the final dataset. PLINK 2.0 was used to perform principal component analysis on genotypes from all study populations, merged with genotypes of subjects from phase 3 of the 1000 Genomes Project. Individuals who did not cluster with the 1000 Genomes subjects of European descent in the principal component analysis (N = 439) were excluded from further analysis.

In order to narrow the list of variants, the summary statistics of a meta-analysis based on three East Asian studies [the Japan Pancreatic Cancer Research (JaPAN) consortium GWAS, the National Cancer Center (NCC) GWAS, and the BioBank Japan (BBJ) GWAS] comprising 2,039 pancreatic cancer patients and 32,592 controls in the Japanese population was used [20]. Genotyping on these individuals was performed using Illumina HumanCoreExome (JaPAN), Illumina HumanHap550/Illumina Human610-Quad (NCC) or Illumina HumanOmniExpressExome/Illumina HumanOmniExpress (BBJ) Genotyping BeadChips. Imputation was performed on each dataset with the 1000G phase3 v5 reference panel. A total of 7,914,378 variants remained after the post-imputation quality control, excluding variants with a MAF < 0.01 and low-quality imputation score (INFO score < 0.5).

A total of 7182 individuals (3392 PDAC cases and 3790 controls) from the PANcreatic Disease ReseArch (PANDoRA) consortium were genotyped to validate the previously selected variants. PANDoRA was previously described in detail [21]. It is a multicentric consortium consisting of 11 European countries (Greece, Italy, Germany, the Netherlands, Denmark, the Czech Republic, Hungary, Poland, Ukraine, Lithuania, and the UK), whose samples and data have been collected at the German Cancer Research Center (DKFZ, Heidelberg, Germany), where the DNA bank and the central database were established. PDAC cases were defined as individuals with an established diagnosis of PDAC. Controls were patients from the general population without any pancreatic disease at recruitment, individuals hospitalized for reasons other than cancer, or blood donors. Data were collected on sex, age, and country of origin for each case and control. Controls were recruited in the same geographical regions as the cases. Controls from the Netherlands and Germany were obtained respectively from the ‘European Prospective Investigation into Cancer and Nutrition’ (EPIC) [22] and the ‘Epidemiological investigations on chances of preventing, recognizing early and optimally treating chronic diseases in an elderly population’ (ESTHER) [23]. For this study, only PANDoRA subjects with self-declared European ancestry were included.

SNP selection

In order to improve our chances of finding associations with PDAC risk, first of all we limited our selection to SNPs showing association at p < 10⁻⁴ in the PanScan/PanC4 dataset and not in LD among them [r² > 0.6, checked with LDlink (https://ldlink.nci.nih.gov)], resulting into 2575 SNPs. The SNPs in TFBSs were retrieved from the SNP2TFBS database [24], containing annotations for 200 transcription factors with SNPs predicted to alter their affinity for binding. The effects of SNPs in the whole genome on TF binding were estimated using position weight matrices (PWM), which model the specificity of TF binding. The database calculates a score based on the difference between the PWM match scores of both alleles for each SNP-TF binding. The TFBS SNP list was generated by downloading all predicted variant-TF interactions from the SNP2TFBS database (N = 2,281,137, involving 1,900,881 unique SNPs).

To obtain enhancer SNPs, we used a defined list of enhancer regions from published research [25], which used the activity-by-contact (ABC) model to predict which enhancers regulate which genes in 131 human cell types and tissues. First, we extracted the genomic positions of enhancers and their target genes reported for the normal pancreatic tissue and the PANC-1 pancreatic cancer cell line. We thus obtained 55,967 enhancer regions. As the next step, we mapped SNPs on the enhancer regions via UCSC genome browser tools [26] to create a list of SNPs situated in enhancers consisting of 1,190,420 SNPs.

We checked the associations of polymorphisms in TFBS and enhancers with PDAC risk in the discovery dataset (PanScan/PanC4). In the following step, we performed a meta-analysis between TFBS SNPs and enhancer SNPs present in the results of both PanScan/PanC4 and East Asian GWAS summary statistics, using the “meta” and “metafor” R packages. We then excluded known pancreatic cancer risk loci and the SNPs in LD with them (r² > 0.6). To select SNPs for the replication phase, we applied the following inclusion criteria: p < 10⁻⁴ in the meta-analysis and a significant association in both PanScan/PanC4 and the East Asian GWAS summary statistics (p < 0.05). In addition, we selected SNP rs17358295, because it showed the most significant association in the East Asian GWAS summary statistics (p = 4.6 × 10⁻³), despite having only a modestly significant association in the meta-analysis (p = 2.7 × 10⁻²). We finally picked the top significant SNPs in TFBSs and enhancers for genotyping analysis on the PANDoRA population. Summary information on the SNPs in TFBSs and enhancers selected for replication is included in Additional file 3: Data S3.

Sample preparation and genotyping

The sample preparation and genotyping process were conducted at a single laboratory at German Cancer Research Center in Heidelberg, Germany. DNA extraction from the whole blood of both cases and controls within the PANDoRA consortium was carried out using a Qiagen-manufactured kit (Hilden, Germany). To ensure uniformity, the order of DNA samples from case and control subjects was randomized on plates, guaranteeing an equal representation of cases and controls in each batch. Genotyping was conducted via allele-specific PCR-based TaqMan technology (ThermoFisher, Applied Biosystems, Waltham, MA) by ordering TaqMan SNP Genotyping Assays for the selected seven SNPs. The PCR protocol was performed with TaqMan Genotyping Master Mix in 384-well plates following the manufacturer's recommendations. The PCR plates were read on a ViiA7 real-time instrument (Applied Biosystems), and genotypes were determined using the ViiA7 RUO Software, version 1.2.2 (Applied Biosystems).

Statistical analysis

In the discovery phase, a logistic regression analysis was carried out by computing odds ratio (OR), 95% confidence intervals (95% CI), and p values to test the association between the SNPs and PDAC risk. The analysis was performed on 14,266 individuals (PanScan/PanC4) and was adjusted for sex, age, and the top ten principal components to avoid confounding due to population stratification. A meta-analysis was performed between the East Asian GWAS summary statistics and the discovery population. The top seven SNPs were analyzed in PANDoRA using logistic regression, adjusting for age, sex and country of origin (PANDoRA lacks GWAS data, therefore principal component data are not available). Deviation from Hardy–Weinberg equilibrium was tested for the variants genotyped in PANDoRA using the control subjects. Finally, a meta-analysis was conducted between the results of the three populations with a total of 56,079 individuals. Meta-analysis models were chosen depending on the heterogeneity (fixed-effect: I² < 50%, random-effect: I² ≥ 50%). In order to take into account the number of independent tests, LD, with a threshold of r² > 0.6, was used to discard variants representing the same association. The Bonferroni-corrected threshold for statistical significance was 0.05/2575 = 1.94 × 10⁻⁵.

Functional annotation

Several databases were utilized to link the variants with the best associations to potential functional explanations. To identify the possible effect of the SNPs on gene expression (eQTL/sQTL analysis), we used the data available in the Genotype-Tissue Expression (GTEx) project (https://www.gtexportal.org). We used the Ensembl Variant Effect Predictor (VEP) (https://www.ensembl.org/info/docs/tools/vep/index.html), HaploReg (https://pubs.broadinstitute.org/mammals/haploreg/haploreg.php), RegulomedB (https://www.regulomedb.org), Expression Atlas (https://www.ebi.ac.uk/gxa/home), The Human Protein Atlas (https://www.proteinatlas.org), TNMplot (https://tnmplot.com/analysis/) to check for regulatory potentials (for example, changes in transcription factor affinity, chromatin state regulation, changes in the expression). By using the Ensemble website (https://www.ensembl.org/), we analyzed the regions near the significant SNPs to look for regulatory regions.

Results

In this study, we used the SNP2TFBS database and the published data [25] on genome-wide enhancer maps. These two datasets were used to establish two comprehensive lists of SNPs with a potential regulatory role. We checked their possible associations with PDAC risk with a two-phase approach, a discovery phase consisting of data on 7205 cases and 7061 controls from a GWAS conducted on PDAC risk (PanScan/PanC4) and 2039 cases and 32,592 controls from an East Asian GWAS, and a replication phase comprising 3392 PDAC patients and 3790 controls from the PANDoRA consortium. Therefore, the final sample size used in this study was 12,636 PDAC cases and 43,443 controls, as shown in Table 1.

Table 1 Characteristics of the study populations

Full size table

The TFBS and enhancer SNP lists were first intersected with 2575 independent SNPs associated with PDAC risk with p < 10⁻⁴ from the PanScan/PanC4 dataset, which resulted in 778 TFBS SNPs and 84 enhancer SNPs. In the following step, we performed a meta-analysis between 673 TFBS SNPs and 12 enhancer SNPs which are present in PanScan/PanC4 and East Asian GWAS summary statistics. This resulted in 46 SNPs in TFBS and 12 in enhancers showing association with PDAC risk with p < 0.05. By considering a combination of low association p values and in-silico functional after exclusions of the known PDAC risk loci and SNPs in high linkage disequilibrium (LD; r² > 0.6) (see materials and methods for details), we finally picked the top 4 significant SNPs in TFBSs (rs10025845, rs11241697, rs11032793, rs2232079) and the top 3 significant SNPs in enhancers (rs2472632, rs17358295, rs11624002) for genotyping analysis in PANDoRA. A scheme of SNP selection is shown in Fig. 1. The results of the overall meta-analysis for four study-wise significant SNPs are shown in Table 2 and Fig. 2. The associations between these seven SNPs and the risk of PDAC for all three populations are shown in Additional file 1: Data S1.

Table 2 Associations of the selected SNPs with PDAC risk

Full size table

rs2472632(A) was observed to be associated with increased PDAC risk at a nearly genome-wide significance level after the meta-analysis of all three populations (p = 5.52 × 10⁻⁸) with the same effect size direction in all populations. This enhancer region variant was predicted to affect expression of the coiled-coil domain containing 34 (CCDC34) gene (Fig. 3a), in normal pancreas tissue (ABC score = 0.016; computation of ABC scores is explained in the original paper [25]). In Pancreatic Adenocarcinoma (PAAD) tumor tissues, the CCDC34 gene was significantly overexpressed with p = 1.22 × 10⁻¹⁹ compared to normal pancreas tissues (Fig. 3b) according to the TNMplot database, which has RNA-seq data of tumor and healthy tissues from TCGA and GTEx repositories.

Additionally, rs2472632(A) is located in the intron region of the leucine-rich repeat-containing G protein-coupled receptor 4 (LGR4) gene, the expression of which was 3.1 times higher in PDAC tissues than in normal pancreas tissues (false discovery rate-adjusted p = 1.92 × 10⁻¹¹) in the Expression Atlas database. The splicing quantitative trait loci (sQTL) analysis in GTEx showed that the rs2472632(A) was associated with an alternative splicing mechanism of LGR4 mRNA (p = 2.12 × 10⁻²³) (Fig. 3c), and eQTL analysis in GTEx suggests that the rs2472632(A) was associated with higher expression of lin-7 homolog C (LIN7C) (p = 8.62 × 10⁻⁵) in the pancreatic tissue (Fig. 3d).

Furthermore, rs2472632(A) is associated with decreasing protein levels of defensin, alpha 5 (DEFA5), also known as human alpha defensin 5 (HD5), in the GWAS Catalog with p = 1.0 × 10⁻²⁵ (Study accession: GCST90247261).

The enhancer variant rs17358295(G) with p = 6.1 × 10⁻⁷ and the TFBS variant rs2232079(T) with p = 6.38 × 10⁻⁶ were observed to be significantly associated with PDAC risk in the overall meta-analysis. While rs17358295(G) had an increased risk, the TFBS variant had a decreasing risk for PDAC in all three populations. rs17358295 maps to an enhancer that targets 20 genes in normal pancreas tissue, with ABC scores ranging between 0.015 and 0.130 (see Additional file 2: Data S2).

The TFBS variant, rs10025845(A), was associated (p < 0.05) in all three populations and became statistically significant at the study-wide level after overall meta-analysis (p = 1.35 × 10⁻⁵). While this variant’s minor allele (G) creates a binding site for the transcription factor Yin Yang 1 (YY1), the effect allele (A) is predicted by SNP2TFBS not to bind any TF. Additionally, this variant overlaps with a lincRNA coding gene, LINC01258.

The enhancer variant rs11624002(T) showed an association with PDAC risk in the overall meta-analysis, which however was not significant when considering a Bonferroni-corrected threshold. This variant is located within an enhancer near a protein-coding gene, Delta 4-Desaturase, Sphingolipid 2 (DEGS2), with ABC score = 0.018 (Additional file 3: Data S3).

Finally, two SNPs, rs11241697(C) and rs11032793(C), predicted to create/abrogate binding patterns for TFs, did not show statistically significant associations in the overall meta-analysis, including all cases and controls. Moreover, the meta-analysis showed high heterogeneity for them (I² > 75%).

Discussion

The majority of SNPs found to be associated with disease risk lie outside of protein-coding regions [27], which makes interpreting GWAS results challenging. This remains true even after fine mapping around the associated loci [28]. Most disease-associated variants affect gene expression by altering functional DNA elements. New tools are available for predicting the functional characteristics of non-coding variants. In this study, we leveraged recently developed advanced functional annotation to perform a comprehensive association analysis between non-coding variants and the susceptibility to PDAC.

The overall meta-analysis showed a variant, rs2472632(A), associated with PDAC risk close to genome-wide significance. The enhancer where rs2472632 is located is predicted to target the CCDC34 gene, an oncogene that has been reported to be up-regulated in bladder cancer [29], cervical cancer [30], colorectal cancer [31], and PDAC [32]. Qi et al. and the TNMplot database, both showed by using the TCGA and GTEx datasets that CCDC34 mRNA expression levels were significantly increased in PAAD compared with normal pancreatic tissues and were associated with significantly poor overall survival. This finding suggests that the A allele of the enhancer variant might increase the expression of the CCDC34 oncogene by creating a stronger binding affinity for transcription factors in the locus. However, in the literature there are not yet functional studies available to elucidate the direct effect of the increased expression of this gene on cancer development mechanisms in PDAC cells.

Moreover, rs2472632 is located in an intronic region of LGR4, a gene that functions as an activator of the Wnt/β-catenin signaling pathway [33]. Although this pathway plays an important role during development, its abnormal activation has been reported as one of the predisposing factors in many cancer types, such as melanoma [34], multiple myeloma [35], ovarian cancer [36], thyroid carcinoma [37], etc. In addition, the Wnt/β-catenin signaling pathway promotes apoptosis resistance, which contributes to pancreatic cancer pathogenesis [38]. As a member of the signaling pathway, LGR4 is best known for regulating the cells’ ability to respond to Wnt ligands and is widely expressed in the pancreatic tissue [39]. According to the sQTL analysis results, rs2472632 seems to be located at a splicing site of an alternative exon of LGR4. This finding suggests that the variant could contribute to PDAC risk through alternative RNA splicing. In addition, eQTL analysis showed that rs2472632 is associated with LIN7C expression, a membrane trafficking protein linked to metastasis development in some cancers [40]. Furthermore, deletions on 11p14.1, the chromosomal region where the enhancer variant is located, have been associated with attention deficit hyperactivity disorder (ADHD), developmental delay, and obesity [41].

Additionally, the rs2472632(A) variant has been associated with a reduction in HD5 peptide levels. HD5, a member of the defensin protein family, is a crucial antimicrobial peptide with powerful activity against various pathogens due to its ability to create pores in their membranes and enter their cytosol [42]. Notably, Paneth cells, located at the base of small intestinal crypts, release HD5 in response to stimuli like bacteria and cholinergic signals [43]. Also, we know that HD5 participates in the regulation of acute and chronic inflammatory processes [44]. This emphasizes the role of HD5 in reducing tissue inflammation, including pancreatic tissue. Indeed, in one study researchers demonstrated the presence of HD5 protein in PDAC tissues using immunohistochemistry [45]. In another study, researchers damaged the pancreatic duct in rats chemically and they found increased level of alfa-defensin-5 protein as a response [46]. The hypothesis posits that individuals carrying the A allele of the rs2472632 variant might exhibit lower levels of expressed HD5 protein. This potential decrease in HD5 expression could lead to reduced protection against both acute and chronic pancreatitis, conditions that have been linked to an increased risk of developing PDAC. Furthermore, in a healthy Japanese population (35–81 years old) HD5 concentration in fecal samples was significantly lower in the elderly group (age > 70 years old) than the middle-aged group (age ≤ 70 years old) [47]. This suggests that reduced levels of HD5 could be associated with an elevated risk of diseases in the elderly population. This also supports our hypothesis to make the point that having low HD5 levels could increase the risk of developing PDAC.

Thus, although the exact mechanism by which rs2472632 is associated with PDAC risk is not clear, various lines of evidence suggest that the locus and variations in this locus deserve more attention to be investigated functionally.

Our study pointed out two further loci significantly associated with PDAC risk, rs10025845(A) and rs2232079(T). While the minor allele (A) of rs10025845 is not predicted to result in any TF binding site, the G allele creates a binding site for the YY1 TF, which has also been shown to play a tumor suppressor role in PDAC [48]. This finding suggests that, as a result of the A allele, the ability to suppress tumors by YY1 might be reduced. The other variant we identified in our study, rs2232079, is located at the binding site of TF Paired Box 5 (PAX5) and the promoter region of Fermitin family member 1 (FERMT1), which encodes Kindlin-1. The Kindlin-1 protein is overexpressed in pancreatic cancer cell lines but only expressed at a low level in normal pancreatic epithelial cells and fibroblasts [49].

According to the enhancer map data, rs17358295(G) maps to an enhancer with 20 different target genes by activity-binding-contact with a range of ABC scores. The highest ABC score (0.130) for this variant was with the ETS homologous factor gene (EHF), whose roles in cancer remain largely unknown. Some studies showed that the overexpression of EHF protein plays a role in metastasis, proliferation, and shorter survival rates in cancer patients [50,51,52].

Several GWAS published in recent years have found that most disorders are associated with only a few common SNPs, and even when considered as a whole, their associated SNPs explain only a small percentage of the risk. Going beyond the analysis of GWAS primary results, performing secondary analyses on existing GWAS data has proven to help further our understanding of genetic risk factors which makes it possible to achieve better risk stratification.

The strengths of our study are its large sample size with multiple ethnicities and its comprehensive evaluation of the two major functional classes of polymorphisms, which led to our finding of a germline variant with a nearly genome-wide significance level for PDAC risk. On the other hand, one of the limitations of this study was the data we used for retrieving TFBS SNPs is only based on in silico predictions, and likewise for enhancers SNPs, our prediction of possible polymorphism function is based only on position. Another limitation is the lack of experimental validation of our findings. In the future, this limitation could be addressed by in vitro experiments with CRISPR-Cas9-edited cell lines. These experiments aim to assess SNP impacts on gene expression and pathways, revealing insights into differential binding to transcription factors and resulting expression. Integrating functional genomics, transcriptomics, and proteomics provides a comprehensive understanding. Optimal setting to perform these experiments would be creating healthy pancreatic ductal cells from induced pluripotent stem cell (iPSC) cultures [53].

Conclusions

With this study, we discovered several novel promising germline genetic risk loci for genetic susceptibility to PDAC, which are candidates for experimental functional validation.

Availability of data and materials

The PanScan and PanC4 genotyping data are publicly available from the database of Genotypes and Phenotypes (dbGaP, study accession numbers phs000206.v5.p3 and phs000648.v1.p1). The summary statistics of the East Asian GWAS are publicly available at http://www.aichi-med-u.ac.jp/JaPAN/current_initiatives-e.html. The PANDoRA primary data for this work will be made available to researchers who submit a reasonable request to the corresponding author, conditional to approval by the PANDoRA Steering Committee and Ethics Commission of the Medical Faculty of the University of Heidelberg, Germany. Data will be stripped from all information allowing the identification of study participants.

Abbreviations

ABC:: Activity-by-contact
ADHD:: Attention deficit hyperactivity disorder
BBJ:: BioBank Japan
CCDC34 :: Coiled-coil domain containing 34
dbGaP:: Database of genotypes and phenotypes
DEGS2 :: Delta 4-desaturase, sphingolipid 2
EHF :: ETS homologous factor gene
EPIC:: European Prospective Investigation into Cancer and Nutrition’
eQTLs:: Expression quantitative trait loci
ESTHER:: Epidemiological investigations on chances of preventing, recognizing early and optimally treating chronic diseases in an elderly population
FERMT1 :: Fermitin family member 1
GTEx:: The genotype-tissue expression
GWAS:: Genome-wide association studies
HD5 :: Human alpha defensin 5
JaPAN:: Japan Pancreatic Cancer Research
LGR4 :: Leucine-rich repeat-containing G protein-coupled receptor 4
LIN7C :: Lin-7 homolog C
lncRNAs:: Long noncoding RNAs
NCC:: National Cancer Center
MAF:: Minor allele frequency
OR:: Odds ratio
PAAD:: Pancreatic adenocarcinoma
PanC4:: Pancreatic Cancer Case–Control Consortium
PANDoRA:: The PANcreatic Disease ReseArch Consortium
PAX5 :: Paired box 5
PDAC:: Pancreatic ductal adenocarcinoma
PWM:: Position weight matrices
SNP:: Single-nucleotide polymorphisms
sQTL:: Splicing quantitative trait loci
TCGA:: The cancer genome atlas
TFBS:: Transcription factor binding site
UCSC:: University of California, Santa Cruz
YY1 :: Yin Yang 1

References

Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2022. CA Cancer J Clin. 2022;72(1):7–33. https://doi.org/10.3322/caac.21708.
Article PubMed Google Scholar
Afghani E, Klein AP. Pancreatic adenocarcinoma: trends in epidemiology, risk factors, and outcomes. Hematol Oncol Clin N Am. 2022;36(5):879–95. https://doi.org/10.1016/j.hoc.2022.07.002.
Article Google Scholar
Klein AP. Pancreatic cancer epidemiology: understanding the role of lifestyle and inherited risk factors. Nat Rev Gastroenterol Hepatol. 2021;18(7):493–502. https://doi.org/10.1038/s41575-021-00457-x.
Article PubMed PubMed Central Google Scholar
Ushio J, Kanno A, Ikeda E, et al. Pancreatic ductal adenocarcinoma: epidemiology and risk factors. Diagnostics. 2021;11(3):562. https://doi.org/10.3390/diagnostics11030562.
Article CAS PubMed PubMed Central Google Scholar
Klein AP. Genetic susceptibility to pancreatic cancer. Mol Carcinog. 2012;51(1):14–24. https://doi.org/10.1002/mc.20855.
Article CAS PubMed PubMed Central Google Scholar
Buniello A, MacArthur JAL, Cerezo M, et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47(D1):D1005–12. https://doi.org/10.1093/nar/gky1120.
Article CAS PubMed Google Scholar
Zhang Y, Qi G, Park JH, Chatterjee N. Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits. Nat Genet. 2018;50(9):1318–26. https://doi.org/10.1038/s41588-018-0193-x.
Article CAS PubMed Google Scholar
Gentiluomo M, Canzian F, et al. Germline genetic variability in pancreatic cancer risk and prognosis. Semin Cancer Biol. 2022;79:105–31. https://doi.org/10.1016/j.semcancer.2020.08.003.
Article CAS PubMed Google Scholar
Childs EJ, Mocci E, Campa D, et al. Common variation at 2p13.3, 3q29, 7p13 and 17q25.1 associated with susceptibility to pancreatic cancer. Nat Genet. 2015;47(8):911–6. https://doi.org/10.1038/ng.3341.
Article CAS PubMed PubMed Central Google Scholar
Zhang YD, Hurson AN, Zhang H, et al. Assessment of polygenic architecture and risk prediction based on common variants across fourteen cancers. Nat Commun. 2020;11(1):3353. https://doi.org/10.1038/s41467-020-16483-3.
Article ADS CAS PubMed PubMed Central Google Scholar
Corradi C, Gentiluomo M, Gajdán L, et al. Genome-wide scan of long noncoding RNA single nucleotide polymorphisms and pancreatic cancer susceptibility. Int J Cancer. 2021;148(11):2779–88. https://doi.org/10.1002/ijc.33475.
Article CAS PubMed Google Scholar
Lu Y, Corradi C, Gentiluomo M, et al. Association of genetic variants affecting microRNAs and pancreatic cancer risk. Front Genet. 2021;12:693933. https://doi.org/10.3389/fgene.2021.693933.
Article CAS PubMed PubMed Central Google Scholar
Pistoni L, Gentiluomo M, Lu Y, et al. Associations between pancreatic expression quantitative traits and risk of pancreatic ductal adenocarcinoma. Carcinogenesis. 2021;42(8):1037–45. https://doi.org/10.1093/carcin/bgab057.
Article CAS PubMed Google Scholar
Gentiluomo M, Lu Y, Canzian F, Campa D. Genetic variants in taste-related genes and risk of pancreatic cancer. Mutagenesis. 2019;34(5–6):391–4. https://doi.org/10.1093/mutage/gez032.
Article CAS PubMed Google Scholar
Gentiluomo M, García PP, Galeotti AA, et al. Genetic variability of the ABCC2 gene and clinical outcomes in pancreatic cancer patients. Carcinogenesis. 2019;40(4):544–50. https://doi.org/10.1093/carcin/bgz006.
Article CAS PubMed Google Scholar
Walsh N, Zhang H, Hyland PL, et al. Agnostic pathway/gene set analysis of genome-wide association data identifies associations for pancreatic cancer. J Natl Cancer Inst. 2019;111(6):557–67. https://doi.org/10.1093/jnci/djy155.
Article CAS PubMed Google Scholar
Pennacchio LA, Bickmore W, Dean A, Nobrega MA, Bejerano G. Enhancers: five essential questions. Nat Rev Genet. 2013;14(4):288–95. https://doi.org/10.1038/nrg3458.
Article CAS PubMed PubMed Central Google Scholar
Wray GA, Hahn MW, Abouheif E, et al. The evolution of transcriptional regulation in eukaryotes. Mol Biol Evol. 2003;20(9):1377–419. https://doi.org/10.1093/molbev/msg140.
Article CAS PubMed Google Scholar
Johnston AD, Simões-Pires CA, Thompson TV, Suzuki M, Greally JM. Functional genetic variants can mediate their regulatory effects through alteration of transcription factor binding. Nat Commun. 2019;10(1):3472. https://doi.org/10.1038/s41467-019-11412-5.
Article ADS CAS PubMed PubMed Central Google Scholar
Lin Y, Nakatochi M, Hosono Y, et al. Genome-wide association meta-analysis identifies GP2 gene risk variants for pancreatic cancer. Nat Commun. 2020;11(1):3175. https://doi.org/10.1038/s41467-020-16711-w.
Article ADS CAS PubMed PubMed Central Google Scholar
Campa D, Rizzato C, Capurso G, et al. Genetic susceptibility to pancreatic cancer and its functional characterisation: the PANcreatic Disease ReseArch (PANDoRA) consortium. Dig Liver Dis Off J Ital Soc Gastroenterol Ital Assoc Study Liver. 2013;45(2):95–9. https://doi.org/10.1016/j.dld.2012.09.014.
Article CAS Google Scholar
Riboli E, Hunt KJ, Slimani N, et al. European prospective investigation into cancer and nutrition (EPIC): study populations and data collection. Public Health Nutr. 2002;5(6B):1113–24. https://doi.org/10.1079/PHN2002394.
Article CAS PubMed Google Scholar
Löw M, Stegmaier C, Ziegler H, Rothenbacher D, Brenner H, ESTHER Study. Epidemiological investigations of the chances of preventing, recognizing early and optimally treating chronic diseases in an elderly population (ESTHER study). Dtsch Med Wochenschr. 2004;129(49):2643–7. https://doi.org/10.1055/s-2004-836089.
Article PubMed Google Scholar
Kumar S, Ambrosini G, Bucher P. SNP2TFBS—a database of regulatory SNPs affecting predicted transcription factor binding site affinity. Nucleic Acids Res. 2017;45(D1):D139–44. https://doi.org/10.1093/nar/gkw1064.
Article CAS PubMed Google Scholar
Nasser J, Bergman DT, Fulco CP, et al. Genome-wide enhancer maps link risk variants to disease genes. Nature. 2021;593(7858):238–43. https://doi.org/10.1038/s41586-021-03446-x21.
Article ADS CAS PubMed PubMed Central Google Scholar
Kent WJ, Sugnet CW, Furey TS, et al. The human genome browser at UCSC. Genome Res. 2002;12(6):996–1006. https://doi.org/10.1101/gr.229102.
Article CAS PubMed PubMed Central Google Scholar
French JD, Edwards SL. The role of noncoding variants in heritable disease. Trends Genet. 2020;36(11):880–91. https://doi.org/10.1016/j.tig.2020.07.004.
Article CAS PubMed Google Scholar
Farh KKH, Marson A, Zhu J, et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature. 2015;518(7539):337–43. https://doi.org/10.1038/nature1383.
Article ADS CAS PubMed Google Scholar
Gong Y, Qiu W, Ning X, et al. CCDC34 is up-regulated in bladder cancer and regulates bladder cancer cell proliferation, apoptosis and migration. Oncotarget. 2015;6(28):25856–67. https://doi.org/10.18632/oncotarget.4624.
Article PubMed PubMed Central Google Scholar
Liu LB, Huang J, Zhong JP, et al. High expression of CCDC34 Is associated with poor survival in cervical cancer patients. Med Sci Monit Int Med J Exp Clin Res. 2018;24:8383–90. https://doi.org/10.12659/MSM.913346.
Article CAS Google Scholar
Geng W, Liang W, Fan Y, Ye Z, Zhang L. Overexpression of CCDC34 in colorectal cancer and its involvement in tumor growth, apoptosis and invasion. Mol Med Rep. 2018;17(1):465–73. https://doi.org/10.3892/mmr.2017.7860.
Article CAS PubMed Google Scholar
Qi W, Shao F, Huang Q. Expression of coiled-coil domain containing 34 (CCDC34) and its prognostic significance in pancreatic adenocarcinoma. Med Sci Monit Int Med J Exp Clin Res. 2017;23:6012–8. https://doi.org/10.12659/msm.907951.
Article CAS Google Scholar
Glinka A, Dolde C, Kirsch N, et al. LGR4 and LGR5 are R-spondin receptors mediating Wnt/β-catenin and Wnt/PCP signaling. EMBO Rep. 2011;12(10):1055–61. https://doi.org/10.1038/embor.2011.175.
Article CAS PubMed PubMed Central Google Scholar
Fan G, Ye D, Zhu S, et al. RTL1 promotes melanoma proliferation by regulating Wnt/β-catenin signaling. Oncotarget. 2017;8(62):106026–37. https://doi.org/10.18632/oncotarget.22523.
Article PubMed PubMed Central Google Scholar
van Andel H, Ren Z, Koopmans I, et al. Aberrantly expressed LGR4 empowers Wnt signaling in multiple myeloma by hijacking osteoblast-derived R-spondins. Proc Natl Acad Sci. 2017;114(2):376–81. https://doi.org/10.1073/pnas.1618650114.
Article ADS CAS PubMed Google Scholar
Wang Z, Yin P, Sun Y, et al. LGR4 maintains HGSOC cell epithelial phenotype and stem-like traits. Gynecol Oncol. 2020;159(3):839–49. https://doi.org/10.1016/j.ygyno.2020.09.020.
Article CAS PubMed Google Scholar
Kang YE, Kim JM, Kim KS, et al. Upregulation of RSPO2-GPR48/LGR4 signaling in papillary thyroid carcinoma contributes to tumor progression. Oncotarget. 2017;8(70):114980–94. https://doi.org/10.18632/oncotarget.22692.
Article PubMed PubMed Central Google Scholar
Modi S, Kir D, Banerjee S, Saluja A. Control of apoptosis in treatment and biology of pancreatic cancer. J Cell Biochem. 2016;117(2):279–88. https://doi.org/10.1002/jcb.25284.
Article CAS PubMed PubMed Central Google Scholar
Mazerbourg S, Bouley DM, Sudo S, et al. Leucine-rich repeat-containing, G protein-coupled receptor 4 null mice exhibit intrauterine growth retardation associated with embryonic and perinatal lethality. Mol Endocrinol Baltim Md. 2004;18(9):2241–54. https://doi.org/10.1210/me.2004-0133.
Article CAS Google Scholar
Onda T, Uzawa K, Nakashima D, et al. Lin-7C/VELI3/MALS-3: an essential component in metastasis of human squamous cell carcinoma. Cancer Res. 2007;67(20):9643–8. https://doi.org/10.1158/0008-5472.CAN-07-1911.
Article CAS PubMed Google Scholar
Shinawi M, Sahoo T, Maranda B, et al. 11p14.1 microdeletions associated with ADHD, autism, developmental delay, and obesity. Am J Med Genet A. 2011;155A(6):1272–80. https://doi.org/10.1002/ajmg.a.33878.
Article CAS PubMed Google Scholar
Jung SW, Lee J, Cho AE. Elucidating the bacterial membrane disruption mechanism of human α-defensin 5: a theoretical study. J Phys Chem B. 2017;121(4):741–8. https://doi.org/10.1021/acs.jpcb.6b11806.
Article CAS PubMed Google Scholar
Yang E, Shen J. The roles and functions of Paneth cells in Crohn’s disease: a critical review. Cell Prolif. 2021;54(1):e12958. https://doi.org/10.1111/cpr.12958.
Article CAS PubMed Google Scholar
Selsted ME, Ouellette AJ. Mammalian defensins in the antimicrobial immune response. Nat Immunol. 2005;6:551–7. https://doi.org/10.1038/ni1206.
Article CAS PubMed Google Scholar
Tobi M, Kim M, Weinstein DH, et al. Prospective markers for early diagnosis and prognosis of sporadic pancreatic ductal adenocarcinoma. Dig Dis Sci. 2013;58:744–50. https://doi.org/10.1007/s10620-012-2387-x.
Article PubMed Google Scholar
Cunha DM, Koike MK, Barbeiro DF, et al. Increased intestinal production of α-defensins in aged rats with acute pancreatic injury. Exp Gerontol. 2014;60:215–9. https://doi.org/10.1016/j.exger.2014.11.008.
Article CAS PubMed Google Scholar
Shimizu Y, Nakamura K, Kikuchi M, et al. Lower human defensin 5 in elderly people compared to middle-aged is associated with differences in the intestinal microbiota composition: the DOSANCO health study. Geroscience. 2022;44(2):997–1009. https://doi.org/10.1007/s11357-021-00398-y.
Article CAS PubMed Google Scholar
Zhang JJ, Zhu Y, Xie KL, et al. Yin Yang-1 suppresses invasion and metastasis of pancreatic ductal adenocarcinoma by downregulating MMP10 in a MUC4/ErbB2/p38/MEF2C-dependent mechanism. Mol Cancer. 2014;13(1):130. https://doi.org/10.1186/1476-4598-13-130.
Article CAS PubMed PubMed Central Google Scholar
Mahawithitwong P, Ohuchida K, Ikenaga N, et al. Kindlin-1 expression is involved in migration and invasion of pancreatic cancer. Int J Oncol. 2013;42(4):1360–6. https://doi.org/10.3892/ijo.
Article CAS PubMed Google Scholar
Wang L, Ai M, Nie M, et al. EHF promotes colorectal carcinoma progression by activating TGF-β1 transcription and canonical TGF-β signaling. Cancer Sci. 2020;111(7):2310–24. https://doi.org/10.1111/cas.14444.
Article CAS PubMed PubMed Central Google Scholar
Zhou T, Liu J, Xie Y, et al. ESE3/EHF, a promising target of rosiglitazone, suppresses pancreatic cancer stemness by downregulating CXCR4. Gut. 2022;71(2):357–71. https://doi.org/10.1136/gutjnl-2020-321952.
Article CAS PubMed Google Scholar
Liu J, Jiang W, Zhao K, et al. Tumoral EHF predicts the efficacy of anti-PD1 therapy in pancreatic ductal adenocarcinoma. J Exp Med. 2019;216(3):656–73. https://doi.org/10.1084/jem.2018074920.
Article CAS PubMed PubMed Central Google Scholar
Merz S, Breunig M, Melzer MK, et al. Single-cell profiling of GP2-enriched pancreatic progenitors to simultaneously create acinar, ductal, and endocrine organoids. Theranostics. 2023;13(6):1949–73. https://doi.org/10.7150/thno.78323.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to acknowledge the contribution of the late Dr. Bas Bueno-de-Mesquita. This publication is based upon work from COST Action TRANSPAN, CA21116, supported by COST (European Cooperation in Science and Technology).

Novelty and impact statement

Germline polymorphisms in transcription factor binding sites and enhancer regions might control the expression of multiple genes, thus having a powerful effect on cancer susceptibility. Focusing on transcription factor binding sites and enhancer regions genomic regions led us to discover four novel strong SNP associations with pancreatic cancer risk, with one approaching genome-wide significance. (rs2472632, p = 5.5 × 10⁻⁸).

Funding

Open Access funding enabled and organized by Projekt DEAL. This work was supported by intramural funding of German Cancer Research Center (DKFZ). Francesca Tavano’s team part funded by the Italian Minister of Health, Ricerca Corrente program 2022–2024, to the Division of Gastroenterology Fondazione IRCCS “Casa Sollievo della Sofferenza” Hospital, San Giovanni Rotondo. Pavel Souček’s team part funded by the project National Institute for Cancer Research—NICR (Programme EXCELES, ID Project No. LX22NPO5102)—Funded by the European Union—Next Generation EU. Ludmila Vodickova’s team part funded by the project AZV NU21-07–00247, National Operation Programme: National Institute for cancer research LX22NPO05102, and grant by Charles University Research fund (Cooperation No.43—Surgical Disciplines) Operational Programme Integrated Infrastructure for the project: Integrative strategy in development of personalized medicine of selected malignant tumors and its impact on quality of life, IMTS: 313011V446, co-financed by the European Regional Development Fund.

Author information

Authors and Affiliations

Genomic Epidemiology Group, German Cancer Research Center, In Neuenheimer Feld 280, 69120, Heidelberg, Germany
Pelin Ünal, Ye Lu & Federico Canzian
Department for Determinants of Chronic Diseases, National Institute for Public Health and the Environment, Bilthoven, The Netherlands
Bas Bueno-de-Mesquita
Department of Surgery, Erasmus MC University Medical Center, Rotterdam, The Netherlands
Casper H. J. van Eijck
Department of Digestive Tract Diseases, Medical University of Lodz, Lodz, Poland
Renata Talar-Wojnarowska, Ewa Małecka-Wojciesko & Barbara Włodarczyk
Institute for Translational Medicine, Medical School, University of Pécs, Pécs, Hungary
Andrea Szentesi, Bálint Erőss & Péter Hegyi
János Szentágothai Research Center, University of Pécs, Pécs, Hungary
Péter Hegyi
Laboratory of Biology, Medical School, National and Kapodistrian University of Athens, Athens, Greece
Maria Gazouli
Gastroenterology Department and Institute for Digestive Research, Lithuanian University of Health Sciences, Kaunas, Lithuania
Edita Kreivenaite, Jurate Kondrackiene, Vytautas Kiudelis & Juozas Kupcinskas
Division of Gastroenterology and Research Laboratory, Fondazione IRCCS “Casa Sollievo della Sofferenza” Hospital, San Giovanni Rotondo, FG, Italy
Francesca Tavano, Orazio Palmieri & Francesco Perri
Center for Translational Medicine, Semmelweis University, Budapest, Hungary
Bálint Erőss, Stefania Bunduc & Péter Hegyi
Division of Pancreatic Diseases, Heart and Vascular Center, Semmelweis University, Budapest, Hungary
Bálint Erőss, Stefania Bunduc & Péter Hegyi
Department of Surgery, University Hospital Kralovske Vinohrady, Third Faculty of Medicine, Charles University, Prague, Czech Republic
Martin Oliverius
Carol Davila University of Medicine and Pharmacy, Bucharest, Romania
Stefania Bunduc
Laboratory for Applied Science and Technology in Health, Carlos Chagas Institute, Curitiba, PR, Brazil
Mateus Nóbrega Aoki
Biomedical Center, Faculty of Medicine in Pilsen, Charles University, Plzeň, Czech Republic
Ludmila Vodickova, Pavel Soucek & Viktor Hlavac
Department of Molecular Biology of Cancer, Institute of Experimental Medicine of the Czech Academy of Sciences, Prague, Czech Republic
Ludmila Vodickova & Klara Vokacova
Institute of Biology and Medical Genetics, Institute of Physiology, 1st Faculty of Medicine, Charles University, Prague, Czech Republic
Ludmila Vodickova & Klara Vokacova
Division of General and Transplant Surgery, Pisa University Hospital, Pisa, Italy
Ugo Boggi
Department of Biology, University of Pisa, Pisa, Italy
Matteo Giaccherini, Stefano Landi, Manuel Gentiluomo & Daniele Campa
Department of Radiology and Oncology, Institute of Cancer of São Paulo, São Paulo, Brazil
Roger Chammas
First Propaedeutic University Surgery Clinic, Hippocratio General Hospital, Medical School, National and Kapodistrian University of Athens, Athens, Greece
George E. Theodoropoulos
Laboratory for Experimental Oncology and Radiobiology, Center of Experimental Molecular Medicine, Amsterdam UMC Location University of Amsterdam, Amsterdam, the Netherlands
Maarten F. Bijlsma
Cancer Center Amsterdam, Imaging and Biomarkers, Amsterdam, the Netherlands
Maarten F. Bijlsma
Department of Medicine, Laboratory Medicine, University of Padova, Padua, Italy
Daniela Basso
Department of Oncology, Faculty of Medicine and Dentistry, Palacky University, Olomouc, Czech Republic
Beatrice Mohelnikova-Duchonova
Department of General Visceral and Thoracic Surgery, University of Hamburg Medical Institutions, Hamburg, Germany
Jakob R. Izbicki, Faik G. Uzunoglu & Mara Goetz
PancreatoBiliary Endoscopy and Endosonography Division, Pancreas Translational and Clinical Research Center, San Raffaele Scientific Institute, Milan, Italy
Giuseppe Vanella, Paolo Giorgio Arcidiacono, Sabrina Gloria Giulia Testoni, Maria Chiara Petrone, Livia Archibugi & Gabriele Capurso
Digestive and Liver Disease Unit, S. Andrea Hospital, Rome, Italy
Giuseppe Vanella, Livia Archibugi & Gabriele Capurso
Department of General, Visceral and Transplant Surgery, Heidelberg University Hospital, Heidelberg, Germany
Thilo Hackert & John P. Neoptolemos
Division of Clinical Epidemiology and Aging Research, German Cancer Research Center, Heidelberg, Germany
Ben Schöttker, Hermann Brenner & Hannah Stocker
Network Aging Research (NAR), Heidelberg University, Heidelberg, Germany
Ben Schöttker, Hermann Brenner & Hannah Stocker
Blood Transfusion Service, Meyer Children’s Hospital, Florence, Italy
Franco Bambi & Stefano Ermini
Division of Preventive Oncology, German Cancer Research Center and National Center for Tumor Diseases, Heidelberg, Germany
Hermann Brenner
German Cancer Consortium (DKTK), German Cancer Research Center, Heidelberg, Germany
Hermann Brenner
Endoscopic Unit, Department of Gastroenterology, IRCCS Humanitas Research Hospital, Milan, Italy
Silvia Carrara
Department of Pathology, Cancer Center Amsterdam, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands
Frederike Dijk
Department of Oncology, Fondazione IRCCS “Casa Sollievo della Sofferenza” Hospital, San Giovanni Rotondo, FG, Italy
Evaristo Maiello
Pancreatic Unit, IRCCS Humanitas Research Hospital, Milan, Italy
Giovanni Capretti
Department of Biomedical Sciences, Humanitas University, Milan, Italy
Giovanni Capretti
Gastroenterology and Gastrointestinal Endoscopy Unit, IRCCS San Raffaele Scientific Institute, Vita-Salute San Raffaele University, Milan, Italy
Giulia Martina Cavestro
Department of Gastroenterology, San Carlo Hospital, Potenza, Italy
Raffaele Pezzilli
General Surgery Unit, Department of Translational Research and New Technologies in Medicine and Surgery, University of Pisa, Pisa, Italy
Gregorio Di Franco & Luca Morelli
Department of Surgery, Oncology and Gastroenterology, University of Padova, Padua, Italy
Anna Caterina Milanetto, Cosimo Sperti & Claudio Pasquali
Department of Diagnostics and Public Health, ARC-Net Centre for Applied Research on Cancer, University of Verona, Verona, Italy
Rita T. Lawlor
Department of Surgery, Fondazione IRCCS “Casa Sollievo della Sofferenza” Hospital, San Giovanni Rotondo, FG, Italy
Francesca Bazzocchi

Authors

Pelin Ünal
View author publications
You can also search for this author in PubMed Google Scholar
Ye Lu
View author publications
You can also search for this author in PubMed Google Scholar
Bas Bueno-de-Mesquita
View author publications
You can also search for this author in PubMed Google Scholar
Casper H. J. van Eijck
View author publications
You can also search for this author in PubMed Google Scholar
Renata Talar-Wojnarowska
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Szentesi
View author publications
You can also search for this author in PubMed Google Scholar
Maria Gazouli
View author publications
You can also search for this author in PubMed Google Scholar
Edita Kreivenaite
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Tavano
View author publications
You can also search for this author in PubMed Google Scholar
Ewa Małecka-Wojciesko
View author publications
You can also search for this author in PubMed Google Scholar
Bálint Erőss
View author publications
You can also search for this author in PubMed Google Scholar
Martin Oliverius
View author publications
You can also search for this author in PubMed Google Scholar
Stefania Bunduc
View author publications
You can also search for this author in PubMed Google Scholar
Mateus Nóbrega Aoki
View author publications
You can also search for this author in PubMed Google Scholar
Ludmila Vodickova
View author publications
You can also search for this author in PubMed Google Scholar
Ugo Boggi
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Giaccherini
View author publications
You can also search for this author in PubMed Google Scholar
Jurate Kondrackiene
View author publications
You can also search for this author in PubMed Google Scholar
Roger Chammas
View author publications
You can also search for this author in PubMed Google Scholar
Orazio Palmieri
View author publications
You can also search for this author in PubMed Google Scholar
George E. Theodoropoulos
View author publications
You can also search for this author in PubMed Google Scholar
Maarten F. Bijlsma
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Basso
View author publications
You can also search for this author in PubMed Google Scholar
Beatrice Mohelnikova-Duchonova
View author publications
You can also search for this author in PubMed Google Scholar
Pavel Soucek
View author publications
You can also search for this author in PubMed Google Scholar
Jakob R. Izbicki
View author publications
You can also search for this author in PubMed Google Scholar
Vytautas Kiudelis
View author publications
You can also search for this author in PubMed Google Scholar
Giuseppe Vanella
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Giorgio Arcidiacono
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Włodarczyk
View author publications
You can also search for this author in PubMed Google Scholar
Thilo Hackert
View author publications
You can also search for this author in PubMed Google Scholar
Ben Schöttker
View author publications
You can also search for this author in PubMed Google Scholar
Faik G. Uzunoglu
View author publications
You can also search for this author in PubMed Google Scholar
Franco Bambi
View author publications
You can also search for this author in PubMed Google Scholar
Mara Goetz
View author publications
You can also search for this author in PubMed Google Scholar
Viktor Hlavac
View author publications
You can also search for this author in PubMed Google Scholar
Hermann Brenner
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Perri
View author publications
You can also search for this author in PubMed Google Scholar
Silvia Carrara
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Landi
View author publications
You can also search for this author in PubMed Google Scholar
Péter Hegyi
View author publications
You can also search for this author in PubMed Google Scholar
Frederike Dijk
View author publications
You can also search for this author in PubMed Google Scholar
Evaristo Maiello
View author publications
You can also search for this author in PubMed Google Scholar
Giovanni Capretti
View author publications
You can also search for this author in PubMed Google Scholar
Sabrina Gloria Giulia Testoni
View author publications
You can also search for this author in PubMed Google Scholar
Maria Chiara Petrone
View author publications
You can also search for this author in PubMed Google Scholar
Hannah Stocker
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Ermini
View author publications
You can also search for this author in PubMed Google Scholar
Livia Archibugi
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Gentiluomo
View author publications
You can also search for this author in PubMed Google Scholar
Giulia Martina Cavestro
View author publications
You can also search for this author in PubMed Google Scholar
Raffaele Pezzilli
View author publications
You can also search for this author in PubMed Google Scholar
Gregorio Di Franco
View author publications
You can also search for this author in PubMed Google Scholar
Anna Caterina Milanetto
View author publications
You can also search for this author in PubMed Google Scholar
Cosimo Sperti
View author publications
You can also search for this author in PubMed Google Scholar
John P. Neoptolemos
View author publications
You can also search for this author in PubMed Google Scholar
Luca Morelli
View author publications
You can also search for this author in PubMed Google Scholar
Klara Vokacova
View author publications
You can also search for this author in PubMed Google Scholar
Claudio Pasquali
View author publications
You can also search for this author in PubMed Google Scholar
Rita T. Lawlor
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Bazzocchi
View author publications
You can also search for this author in PubMed Google Scholar
Juozas Kupcinskas
View author publications
You can also search for this author in PubMed Google Scholar
Gabriele Capurso
View author publications
You can also search for this author in PubMed Google Scholar
Daniele Campa
View author publications
You can also search for this author in PubMed Google Scholar
Federico Canzian
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

FC conceived and designed the study. PÜ performed the lab work and the analysis of the data. PÜ wrote the first draft of the manuscript. All authors contributed to the writing and approved the final version of the manuscript.

Corresponding author

Correspondence to Federico Canzian.

Ethics declarations

Competing interests

MFB has received research funding from Celgene, Frame Therapeutics and Lead Pharma and has acted as a consultant to Servier. Other coauthors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Deceased: Bas Bueno-de-Mesquita.

Supplementary Information

Additional file 1.

Data S1 Detailed information of overall meta-analysis result in all-three populations.

Additional file 2.

Data S2 ABC scores and targeted genes of the selected enhancer SNPs.

Additional file 3.

Data S3 Detailed information of selected variants from discovery phase.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Ünal, P., Lu, Y., Bueno-de-Mesquita, B. et al. Polymorphisms in transcription factor binding sites and enhancer regions and pancreatic ductal adenocarcinoma risk. Hum Genomics 18, 12 (2024). https://doi.org/10.1186/s40246-024-00576-x

Download citation

Received: 09 November 2023
Accepted: 23 January 2024
Published: 02 February 2024
DOI: https://doi.org/10.1186/s40246-024-00576-x

Polymorphisms in transcription factor binding sites and enhancer regions and pancreatic ductal adenocarcinoma risk

Abstract

Introduction