A new bioinformatics tool to help assess the significance of BRCA1 variants
Human Genomics volume 12, Article number: 36 (2018)
Germline pathogenic variants in the breast cancer type 1 susceptibility gene BRCA1 are associated with a 60% lifetime risk for breast and ovarian cancer. This overall risk estimate is for all BRCA1 variants; obviously, not all variants confer the same risk of developing a disease. In cancer patients, loss of BRCA1 function in tumor tissue has been associated with an increased sensitivity to platinum agents and to poly-(ADP-ribose) polymerase (PARP) inhibitors. For clinical management of both at-risk individuals and cancer patients, it would be important that each identified genetic variant be associated with clinical significance. Unfortunately for the vast majority of variants, the clinical impact is unknown. The availability of results from studies assessing the impact of variants on protein function may provide insight of crucial importance.
Results and conclusion
We have collected, curated, and structured the molecular and cellular phenotypic impact of 3654 distinct BRCA1 variants. The data was modeled in triple format, using the variant as a subject, the studied function as the object, and a predicate describing the relation between the two. Each annotation is supported by a fully traceable evidence. The data was captured using standard ontologies to ensure consistency, and enhance searchability and interoperability. We have assessed the extent to which functional defects at the molecular and cellular levels correlate with the clinical interpretation of variants by ClinVar submitters. Approximately 30% of the ClinVar BRCA1 missense variants have some molecular or cellular assay available in the literature. Pathogenic variants (as assigned by ClinVar) have at least some significant functional defect in 94% of testable cases. For benign variants, 77% of ClinVar benign variants, for which neXtProt Cancer variant portal has data, shows either no or mild experimental functional defects. While this does not provide evidence for clinical interpretation of variants, it may provide some guidance for variants of unknown significance, in the absence of more reliable data.
The neXtProt Cancer variant portal (https://www.nextprot.org/portals/breast-cancer) contains over 6300 observations at the molecular and/or cellular level for BRCA1 variants.
The breast cancer type 1 susceptibility gene BRCA1 encodes a large protein of 1863 amino acids that acts as a tumor suppressor. Among other cellular functions, the BRCA1 protein is essential for maintaining the genome integrity by promoting DNA double-strand break repair via homologous recombination in response to DNA damage, critical for its tumor suppressor activity [1,2,3]. Pathogenic variants in BRCA1 confer susceptibility to breast and ovarian cancer. By the age of 70, women carrying germline mutations in BRCA1 have a 60% average cumulative risk for breast cancer and a 59% risk for ovarian cancer . About 3–5% of all breast cancers and 10–15% of all ovarian cancers are associated with BRCA1 germline pathogenic variants [5,6,7,8]. Since the identification of BRCA1 pathogenic variants allows for specific preventive and surveillance measures, the establishment of the pathogenicity of BRCA1 variants is crucial for clinical management of cancer patients and family members at risk of breast and ovarian malignancy. Indeed, prophylactic bilateral salpingo-oophorectomy from 35 years of age and prophylactic bilateral mastectomy in BRCA1 carriers reduce the risk of developing breast cancer by 50% and by more than 90%, respectively [9, 10], although this has been recently challenged [11, 12]. In cancer patients, inactivation of BRCA1 in tumor tissue, either due to BRCA1 germline or somatic pathogenic variants, epigenetic changes, or loss of wild-type alleles, is predictive for response to crosslinking chemotherapeutic agents, such as platinum-based drugs, and to PARP-inhibitors, leading to synthetic lethality in the presence of BRCA1/BRCA2 deficiency [13, 14]. Thus, two PARP inhibitors have been approved in the European Union and in the USA for the treatment of patients affected by advanced ovarian cancer with pathogenic or likely pathogenic BRCA1 or BRCA2 germline or somatic variants.
The American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG-AMP) have published guidelines for the interpretation of sequence variants . Genetic variants are classified based on clinical assessment as pathogenic, likely pathogenic, variant of uncertain significance (VUS), likely benign, or benign. This interpretation often takes into account co-segregation, the variant frequency in unaffected individuals, and the variant impact analyzed with prediction tools such as SIFT  and PolyPhen-2 . Of particular concern are the VUS, whose association with disease risk is unclear. For these, evidence from in vitro studies can provide insight of crucial importance for patient management.
Several public databases report clinical data for BRCA1 variants: ClinVar,Footnote 1 NHGRI Breast Cancer Information Core (BIC),Footnote 2 Breast Cancer IARC database,Footnote 3 the ARUP database,Footnote 4 and the BRCA Share database (UMD)Footnote 5 . The COSMIC databaseFootnote 6 (Catalog Of Somatic Mutations In Cancer) collects somatic variants that have been identified in cancer specimens. The Cancer Gene Census project within COSMIC is an effort to identify cancer-causing genes . More recently, the BRCA Exchange siteFootnote 7 has been providing information on cataloged BRCA1 and BRCA2 genetic variants. BRCA Exchange is a product of the BRCA ChallengeFootnote 8 of the Global Alliance for Genomics and Health.Footnote 9 These resources are focused on the clinical impact of variants.
Some information on molecular and cellular impacts of variants is available at the BRCA1 implementation of Leiden Open Variation Database (LOVD),Footnote 10 which, in addition to clinical relevance of variants, reports data from in vitro studies. BRCA CircosFootnote 11  is a visualization resource that compiles and displays functional data on all documented BRCA1 missense variants available at the time (679 variants).
While a large amount of information is available on BRCA1 variants, their pathogenic classification may vary depending on how the assessment was done. In a recent study , classification of variants was entrusted to nine molecular diagnostic laboratories on 99 variants, using both the laboratory’s own method and the ACMG-AMP criteria . This study reports that there was only 34% concordance for either classification system across laboratories, and that the agreement was improved to 71% after discussions and detailed review of the ACMG-AMP criteria. Another study by  tested pairwise comparisons between BRCA Share™, ClinVar, and ARUP and found that BRCA Share™ and ClinVar agree on 72% of classifications, BRCA Share™ and ARUP on 81%, and ARUP and ClinVar on 60% of shared variants. Approximately 24% variants classified as VUS by BRCA Share™ were classified in another category by ClinVar, whereas 19% of variants classified as VUS by ClinVar were classified otherwise by BRCA Share™.
Hence, there is a need for an integrated and comprehensive resource that reports the current state of knowledge of the impact of variants at the molecular and/or cellular levels, in a format readily compatible with computational analysis . This requires describing variants, functional data, and experimental details according to standard nomenclature and implementing ontologies. The data provenance should also be explicitly described, ensuring citing studies that only present original data.
In this paper, we describe the generation of a data corpus of BRCA1 variants consisting of nearly 6300 observations at the molecular and/or cellular level on 3654 variants. We included two types of mutations: mutations found in patients, which we refer to as “variants,” and mutations generated by site-directed mutagenesis to study specific aspects of protein function, since these may be found in patients as sequencing of individual genomes becomes more widespread for diagnostic and prognostic purposes. All variants are on the protein-coding region of BRCA1; most variants are missense (3455), as well as a small number of truncations, frameshifts, deletions, and indels.
BRCA1 structure and function
The molecular phenotypes reported in the literature relate to the normal function of the protein, which we briefly summarized here.
BRCA1 is composed of three parts: the amino-terminus, containing a RING-type zinc finger; a large central part that contains a nuclear location signal and a coiled coil region; and a carboxyl-terminus bearing two BRCT (BRCA1 C-terminus) domains (Fig. 1).
The helix (amino acids 65–90) immediately next to the RING domain (amino acids 24 to 65) at the N-terminus of the protein forms a heterodimer with BARD1 , another RING zinc finger-containing protein [24,25,26]. The RING domain of BRCA1 possesses E3 ubiquitin ligase activity and interacts with E2 enzymes, allowing the transfer of ubiquitin from the E2 enzyme to the substrate . The BRCA1/BARD1 heterodimer is a core component of all BRCA1 complexes . BRCA1/BARD1 mediates the polyubiquitination via Lys6 of ubiquitin, which acts as a post-translational modification [29,30,31]. Ubiquitinated BRCA1/BARD1 substrates, including RBBP8 and NMP1, have been implicated in the assembly of protein complexes important for homologous recombination [32, 33]. BRCA1/BARD1 itself is a substrate for its own ubiquitin ligase E3 activity, which increases the stability and activity of the complex [30, 34]. Substrates of BRCA1/BARD1 include RBBP8, an end resection factor that generates 3′ single-stranded DNA tails essential for homologous recombination, and UIMC1 (RAP80), a component of the BRCA1-A complex (see section on the BRCT domains), which targets BRCA1 to DNA double strand breaks [35,36,37].
The central part of BRCA1 contains binding domains for several proteins including RAD50, RAD51, MYC, and PALB2. These interactions are important for homologous recombination. The central region also contains important regulatory phosphorylation sites: the CHEK2-dependent phosphorylation site Ser988 [38,39,40], required for the BRCA1/PALB2/BRCA2 complex function in RAD51-mediated homologous recombination . The Serine Cluster Domain (SCD), spanning amino acids 1280 to 1524, is a Ser-Gln/Thr-Gln-rich cluster that contains approximately 10 ATM phosphorylation sites [42,43,44]. SCD phosphorylation by ATM is important for BRCA1-mediated G2/M and the G1/S-checkpoint activation.
The carboxyl terminus bears two BRCT domains (amino acids 1642 to 1736 and 1756 to 1855) that are phosphoprotein-binding modules . Interaction partners BRIP1 , RBBP8 [47,48,49,50], and ABRAXAS1 [51, 52] contain a consensus BRCT-interacting motif (SerXXPhe), which is phosphorylated at Ser to mediate the interaction [53,54,55]. BRCA1 is part of three distinct complexes, called BRCA1-A, BRCA1-B, and BRCA1-C  involved in multiple cellular functions, such as transcriptional regulation, cell-cycle checkpoint activation, and DNA repair. BRCA1 binds double-strand breaks through its association with ABRAXAS1, which recruits UIMC1, both components of the BRCA1-A complex that participates to the G2-M phase checkpoint regulation . The BRCA1-B complex contains, among other members, BRIP1 and TOPBP1, and binds to DNA damage sites . The BRCA1-B complex is required for S-phase checkpoint activation when replication forks are stalled or collapsed [46, 58, 59], and thus involved in repair during DNA replication. BRCA1 binds RBBP8 within the BRCA1-C complex which facilitates DNA double-strand breaks resection, promotes ATR activation and homologous recombination [60,61,62].
Annotation statements (Table 1A) are triplets composed of (1) a subject, which corresponds to the protein variation being annotated; (2) a predicate (or relation) describing how the property is affected (Table 1B); and (3) an object that describes the function, localization, or protein property being tested.
For functional phenotypes, the annotation subjects are variants, which can be of three different origins: germline, somatic, or artificial. All variants affecting the protein sequence are captured: non-synonymous codon, stop gained, frameshift, and in-frame codon loss variants. The object can correspond to the protein’s molecular function or its localization, effect at the level of the organism, or interactions with proteins or small molecules. The relations linking subjects and objects are listed in Table 1 (also available from our ftp site).
We make extensive use of standard nomenclature, ontologies, and controlled vocabularies to capture data, to ensure consistency and unambiguity of the molecular entities and concepts captured. We capture three types of molecular entities: (i) proteins, which are captured using the entry accession number from neXtProt; for example BARD1 corresponds to NX_Q99728, which can be found on the neXtProt website at https://www.nextprot.org/entry/NX_Q99728; (ii) variants, described using the HGVS nomenclature, a standard for unambiguously describing mutations at the DNA, RNA, and protein level developed by the Human Genome Variation Society , and (iii) small molecules, captured using the ChEBI dictionary of molecular entities, an ontology of molecular entities focused on “small” chemical compounds ).
Concepts are captured using either Gene Ontology  that provides a logical structure of biological functions (“terms”) and their relationships to one another; phenotypic observations are captured with the mammalian phenotype ontology , while changes in protein stability or abundance are captured with in-house vocabulary available on our FTP site (ftp://ftp.nextprot.org/pub/current_release/controlled_vocabularies/). Example uses of these different vocabularies are shown in Table 1B.
For each annotation, detailed information about the experimental support of each statement is captured as evidence statements (Table 1C). The annotation evidence is composed of (1) one or more terms from ECO, Evidence and Conclusion Ontology , describing the experiment performed, (2) the Protein origin, which represents the species from which the protein was obtained for the experiment described using the NCBI taxonomy (http://www.ncbi.nlm.nih.gov/taxonomy); (3) the biological system in which the experiment was done that may contain one or more of these elements: the organism from the NCBI taxonomy; the tissue or cell type, from the CALOHA human anatomy vocabulary (ftp://ftp.nextprot.org/pub/current_release/controlled_vocabularies/caloha.obo) or the cell line, from the Cellosaurus database (http://web.expasy.org/cellosaurus/); (4) a qualitative assessment of the severity of the phenotype, either “mild,” “moderate,” or “severe”; and (5) a quality flag: each evidence is labeled as either Gold (high quality) or Silver (good quality). For experiments that involved statistical analyses (such as proteomics studies), Gold quality is assigned for p ≤ 0.01, and Silver for 0.01 < p < 0.05. Data is not integrated for p values greater than 0.05. In most cases, however, the decision to assign a Gold or Silver tag is based on the curator’s judgment. Decision factors include parameters such as the lack of qualitative and/or statistical evaluation when that would be expected, very large errors in replicates, low confidence assay (for example, low replicate number, and poorly defined experimental systems), or experiments are carried out using non-human proteins that is evolutionarily distant from the human protein; and (6) a reference, captured as a cross-link to PubMed.
Selection of data for curation
The effects of BRCA1 variants found in breast and ovarian cancer patients, as well as the effect of experimental amino acid mutations, were manually annotated based on in vitro data from the primary literature. Variants include non-synonymous and nonsense substitutions, in-frame deletions, and frameshifts. Papers describing the functional impact of variants and mutants were obtained from PubMed.
Both automated and manual checks are performed on the annotations to ensure data integrity. For example, for variants, our software checks that the original amino acid at the position annotated is found in the sequence being annotated. For the annotations, automated checks ensure that the annotation is complete, i.e., that it contains a subject, a relation, an object, a reference, at least one evidence code, and the species in which the experiment was done. Additional sanity checks are performed, for example to ensure that the evidence codes are consistent with the annotation made, e.g., protein levels cannot be detected by Northern blots.
Using the information derived from 100 publications, the functional impact of 3654 unique BRCA1 variants was captured: 431 natural variants (both somatic and germline) and 3223 mutants generated by site-directed mutagenesis. The mutants generated by site-directed mutagenesis were annotated as such if they had never been identified as natural variants at the time of annotation. Of these, 3453 variants are missense mutations. A total of 6317 observations, including 6020 distinct triplet statements, were captured.
The most assayed phenotypes for BRCA1 variants (Fig. 2) are its ubiquitin-protein transferase activity and its binding to BARD1. Most of this data is the work of Starita et al. , who have performed extensive mutational analysis of the first 300 amino acids of BRCA1. As shown in Fig. 3, there is a clear overlap in BARD1/UBE2D1 binding defects and defects in ubiquitin transferase activity. Also, the mutations in the RING domain most often cause severe defects in BRCA1 function (Fig. 3). Apart from the ubiquitin transferase activity, the most frequently tested phenotypes are defects in transcriptional regulation, defects in DNA damage response (survival, checkpoints, DNA repair, etc.), changes in protein stability, changes in the nuclear localization, and impact on the regulation of cell proliferation.
Correlation between molecular phenotypes and clinical severity of variants
The number of possible variants over a large gene such as BRCA1 is in the tens of thousands. However, only about 50 BRCA1 missense variants are clearly documented in the literature as pathogenic. There is a strong correlation between defects in protein function and clinical pathogenicity . While this correlation is not absolute, it provides useful insight for estimating the potential pathogenicity of a variant of unknown significance.
To determine the correlation between the clinical significance of variants and their functional defects, we have compared the ClinVar pathogenicity assessments with the severity of the defect on the protein function. The vast majority of variants in a high confidence classes “pathogenic” and “benign” have evidence from clinical tests, thus avoiding potential issues of circular reasoning in the assessment of our work. Unfortunately, in many cases the evidence is not provided, as much of the data comes from testing done by genetic screening companies. Nevertheless, this clinical information does provide an independent benchmark to evaluate the predictive value of the functional impact of variants.
As of June 2017, ClinVar listed 1546 missense variants for BRCA1, broken down into 50 pathogenic/likely pathogenic, 105 benign, 188 with conflicting evidence, 1126 VUS, and 77 variants for which no pathogenicity assessment was provided. In the neXtProt Cancer variant portal, we have captured data for 466, i.e., 30% of the BRCA1 missense variants available in ClinVar (Table 2). Out of the 50 ClinVar pathogenic or likely pathogenic missense variants, 11 variants are adjacent to splice junctions (Arg71Gly/Lys/Met/Thr, Arg1495Met/Thr, Glu1559Gln/Lys, Asp1692Asn/His/Tyr) and 4 on the first methionine of the protein (Met1Arg/Ile/Thr/Val). These 15 variants are thus expected to be pathogenic for reasons unrelated to a missense substitution in the encoded protein, but are likely to result in a nonfunctional truncated product. These were excluded from our analysis because they are almost never tested in vitro. Of the remaining 35, 31 have experimental data testing the BRCA1 function, 29 (94%) of which having at least one severe or moderate defect in functional assays. For example, Ala1708Glu affects BRCA1’s binding to BRIP1, which impacts its transcription factor activity and its role in DNA recombination. Among the 33 function and phenotype evidences reported for this variant, 72% are tagged as severe and 12% as moderate.
We have found experimental data for 49 variants out of the 103 missense variants (48%) located in protein coding regions and annotated as benign or likely benign in ClinVar. There are 14 variants (23%) classified as benign in ClinVar that have at least some moderate or severe functional defect in the Cancer Variant Portal. When we looked more closely at the data, we noticed that these variants affected phenotypes of a relatively minor function, or in a “Silver” grade assay. This highlights the fact that any pathogenicity predictor that could be developed from the data presented here should not be a binary classifier. None of these 14 variants looked reliably pathogenic based on the functional data we have captured.
Four of these variants have at one severe functional defect: Thr826Lys, Met1652Thr, Gly1706Ala, and Val1804Asp; while one has a severe defect in two assays, Gln356Arg. We took a closer look at these variants to understand how these apparently conflicting data could be reconciled with the ClinVar assessments. The classification as benign seems appropriate for three variants: Gln356Arg affects estrogen receptor-dependent transcriptional repressor function ; this function may not be important for the tumor-suppressor role of BRCA1. The other severe defect for this protein is a defect in binding to a transcription factor, ZNF350 . This interaction is not very well studied and there is no evidence that it provides any predictive value for pathogenicity. Moreover, Gln356Arg has been identified as a polymorphism in several studies [72,73,74], indicating that it is very likely benign.
Met1652Thr impairs cell proliferation in a yeast assay. This assay is not very reliable since yeast lacks a BRCA1 ortholog. The other assays examining transcription, BRIP1 binding, and protein stability are not majorly impaired.
Gly1706Ala gave contradictory results in two different transcriptional assays in one article; in one case, it has normal activity; and in another experiment, it is severely impaired . Two other papers found no defect in this function [40, 76], indicating that this function is likely normal. This variant is also normal in seven other tests, including protein stability, nuclear localization , and double-strand break repair , further supporting its classification as a benign variation. However, according to the AGCM guidelines, when two criteria for pathogenicity assessment are contradictory, the variant is classified as a VUS.
Classification of Thr826Lys and Val1804Asp as benign has less support. Thr826Lys has little evidence for pathogenicity assessment, either functional or clinical, but has been found in patients and not in control populations [78, 79], so it should be further investigated before being assigned as benign. Val1804Asp shows embryonic lethality in mouse embryonic stem cells  and was described as potentially deleterious.
VUS and variants with conflicting interpretations
For 53% of the ClinVar variants with conflicting interpretations, the neXtProt Cancer variant portal has identified functional data. Approximately half of these variants have severe to moderate functional defects. The neXtProt Cancer variant portal has data for 42% of the ClinVar variants of unassigned pathogenicity, 37% of which have severe to moderate functional defects. Out of the 1126 ClinVar VUS, we have functional data for 244 variants, of which 30% have at least severe to moderate functional defects.
Data access and visualization
The functional impact of BRCA1 variants is available on our neXtProt Cancer variant portal (Fig. 4), accessible at https://www.nextprot.org/portals/breast-cancer. The data is presented in table form, with the following information:
Position: Position of the mutation on the canonical protein sequence
Protein variation: Protein mutation name according to HGVS nomenclature (http://varnomen.hgvs.org/recommendations/protein/)
Mutation origin: The mutation origin describes either inherited mutation (germline_variant), acquired mutation (somatic_variant) or mutations generated by site-directed mutagenesis (mutated_variant_site) according to Sequence Ontology
Phenotype intensity: Amplitude of the functional defect/phenotype observed: severe, moderate or mild. Not applicable for observation where the mutant has no significant impact: N/A
Relation: In-house vocabulary of relations describing whether there is a functional defect/phenotype (Impact), or the absence of functional defect/phenotype (No impact) for a variant relative to a Function.
Effect on protein function/biological process/cellular localization is captured with Gene Ontology terms
Effect on binding to proteins and protein complexes is captured with neXtProt entry accessions
Effect on binding to chemicals is captured with ChEBI entry accessions
Data confidence: Evidence is tagged “Gold” or “Silver” according to curator judgment on the quality of the data (see the “Data Model” section for details)
Reference: PubMed ID reference of the study supporting the evidence http://www.ncbi.nlm.nih.gov/pubmed
Protein origin: Organism species from which the protein being studied was derived. Note that it is different from the experimental system.
Experimental system: Organism species of the model in which the mutated protein is studied, such as cell lines, primary cells, and whole organism.
Each column of the table can be filtered for specific information and sorted alphabetically or numerically, according to the data type. The entire dataset can be downloaded in cvs format.
Correlation between functional defects and variant pathogenicity
Our data shows that there is a very good correlation between defects in protein function and the clinical classification of variants (Table 2). For ClinVar pathogenic variants for which neXtProt Cancer variant portal has data, 94% of variants are reported with severe to moderate functional defects. About 77% of ClinVar benign variants, for which neXtProt Cancer variant portal has data, shows either no or mild experimental functional defects.
We detailed the discrepancies of the functional studies and the ClinVar variants for the five benign variants having at least one functional defect in the results section. This analysis leads to a number of observations: first, not all phenotypes are of equal value in determining the potential pathogenicity of a variant. Also, the additional information that we are integrating (phenotype severity and quality) can be very valuable in evaluating the potential pathogenicity. Clearly, a binary classifier that would put any variant with an aberrant phenotype as potentially pathogenic would not perform very well; a much finer decision mechanism must be developed. We are developing a pathogenicity prediction tool that gives different weights to different phenotypes for different targets, with promising results.
The neXtProt Cancer variant portal data, although not conclusive for clinical decisions, may provide guidance for variant classification. Hence, for the 375 variants classified as VUS, unassigned, and having conflicting interpretations having functional data, that data may provide some insight for the potential pathogenicity.
Workload and sustainability
The manual annotation work needed to compile all this data is substantial: at an estimated average of up 10 annotations per hour, it takes about 15 weeks to complete the annotation of a protein with as much literature as BRCA1. The systematic approach we have used is likely to be more exhaustive than doing literature search for specific variants, since the variants are not easily found in the literature using standard nomenclature, especially for older papers. Moreover, the literature also contains numerous mistakes. One such example is Leu22Ser a study in which the Leu22Ter variant is mislabeled Leu22Ser in Table 2 in . Leu22Ser has been found in another clinical study  in which the patient has a family history of breast cancer, but the relative has not been genotyped. Hence, the pathogenic variant is more likely to be Leu22Ter. There are also numerous errors in converting the one-letter amino acids code to the three-letter code, errors in the reference sequence provided, lack of a reference sequence, etc. We have tried to resolve these inconsistencies whenever possible.
This important effort is amply justified by the benefits gained by researchers and clinicians in having the variants annotated in a standardized, computable manner. While the current study does not aim to provide pathogenicity assessments for those variants, we believe that the functional phenotype data can provide a most useful additional source of information to help experts refine their final decision based on the corpus of criteria that need to be aggregated for variant prioritization. The neXtProt Cancer variant portal provides an easily accessible overview of experimental variant’s phenotypic impact, which provides useful information to assess a VUS’ potential pathogenicity. Clinicians might consider a closer monitoring of patients bearing variants with some evidence of a functional defect.
As mentioned earlier, the best evidence for the pathogenicity of a genetic variation is its strong co-occurrence with the associated disease phenotype(s). In most patients, clinical decisions must nevertheless be made based on available evidence. While evidence that a genetic variation may be pathogenic based on functional assays should provide incentive for a close monitoring of the patient’s condition, researchers and clinicians should be highly aware that the functional assays captured in the present work may not relate to the in situ effect of the variants. To avoid misinterpretation of the effect of a variant based on functional or predictive methods alone, the BRCA1 portal should be used as supporting data to validate clinical observations, as appropriate, but not to guide clinical decisions.
The neXtProt Cancer variant portal we have developed provides an exhaustive list of BRCA1 variants for which molecular phenotypes are available, curated in a highly structured model, without redundancy in the data and with complete traceability to the original experimental results. Researchers, as well as clinical geneticists will be able to consult this database to have a comprehensive overview of the available data. We are capturing the functional defects in variants of other cancer genes, including BRCA2 and the Lynch syndrome genes (MSH2, MSH6, and MLH1), among others. These annotations are available in the neXtProt Cancer variant portal.
American College of Medical Genetics and Genomics
Association for Molecular Pathology
BRCA1 carboxyl terminus domain
Evidence and Conclusion Ontology
Human Genome Variation Society
National Center for Biotechnology Information
Variant of uncertain (or unknown) significance
Moynahan ME, Chiu JW, Koller BH, Jasin M. Brca1 controls homology-directed DNA repair. Mol Cell. 1999;4(4):511–8.
Li ML, Greenberg RA. Links between genome integrity and BRCA1 tumor suppression. Trends Biochem Sci. 2012;37(10):418–24.
Roy R, Chun J, Powell SN. BRCA1 and BRCA2: different roles in a common pathway of genome protection. Nat Rev Cancer. 2012;12(1):68–78.
Mavaddat N, Peock S, Frost D, Ellis S, Platte R, Fineberg E, Evans DG, Izatt L, Eeles RA, Adlard J, et al. Cancer risks for BRCA1 and BRCA2 mutation carriers: results from prospective analysis of EMBRACE. J Natl Cancer Inst. 2013;105(11):812–22.
Walsh T, Casadei S, Lee MK, Pennil CC, Nord AS, Thornton AM, Roeb W, Agnew KJ, Stray SM, Wickramanayake A, et al. Mutations in 12 genes for inherited ovarian, fallopian tube, and peritoneal carcinoma identified by massively parallel sequencing. Proc Natl Acad Sci U S A. 2011;108(44):18032–7.
ACOG Committee on Practice Bulletins. Hereditary breast and ovarian cancer syndrome. Gynecol Oncol. 2009;113(1):6–11.
Kuchenbaecker KB, Neuhausen SL, Robson M, Barrowdale D, McGuffog L, Mulligan AM, Andrulis IL, Spurdle AB, Schmidt MK, Schmutzler RK, et al. Associations of common breast cancer susceptibility alleles with risk of breast cancer subtypes in BRCA1 and BRCA2 mutation carriers. Breast Cancer Res. 2014;16(6):3416.
Cancer Genome Atlas Research Network. Integrated genomic analyses of ovarian carcinoma. Nature. 2011;474(7353):609–15.
Stuckey AR, Onstad MA. Hereditary breast cancer: an update on risk assessment and genetic testing in 2015. Am J Obstet Gynecol. 2015;213(2):161–5.
Hartmann LC, Lindor NM. The role of risk-reducing surgery in hereditary breast and ovarian cancer. N Engl J Med. 2016;374(5):454–68.
Heemskerk-Gerritsen BA, Seynaeve C, van Asperen CJ, Ausems MG, Collee JM, van Doorn HC, Gomez Garcia EB, Kets CM, van Leeuwen FE, Meijers-Heijboer HE, et al. Breast cancer risk after salpingo-oophorectomy in healthy BRCA1/2 mutation carriers: revisiting the evidence for risk reduction. J Natl Cancer Inst. 2015;107(5).
Kotsopoulos J, Huzarski T, Gronwald J, Singer CF, Moller P, Lynch HT, Armel S, Karlan B, Foulkes WD, Neuhausen SL, et al. Bilateral oophorectomy and breast cancer risk in BRCA1 and BRCA2 mutation carriers. J Natl Cancer Inst. 2017;109(1).
Fong PC, Boss DS, Yap TA, Tutt A, Wu P, Mergui-Roelvink M, Mortimer P, Swaisland H, Lau A, O'Connor MJ, et al. Inhibition of poly(ADP-ribose) polymerase in tumors from BRCA mutation carriers. N Engl J Med. 2009;361(2):123–34.
Byrski T, Gronwald J, Huzarski T, Grzybowska E, Budryk M, Stawicka M, Mierzwa T, Szwiec M, Wisniowski R, Siolek M, et al. Pathologic complete response rates in young women with BRCA1-positive breast cancers after neoadjuvant chemotherapy. J Clin Oncol. 2010;28(3):375–9.
Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, Grody WW, Hegde M, Lyon E, Spector E, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17(5):405–24.
Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS One. 2012;7(10):e46688.
Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7(4):248–9.
Beroud C, Letovsky SI, Braastad CD, Caputo SM, Beaudoux O, Bignon YJ, Bressac-De Paillerets B, Bronner M, Buell CM, Collod-Beroud G, et al. BRCA share: a collection of clinical BRCA gene variants. Hum Mutat. 2016;37(12):1318–28.
Futreal PA, Coin L, Marshall M, Down T, Hubbard T, Wooster R, Rahman N, Stratton MR. A census of human cancer genes. Nat Rev Cancer. 2004;4(3):177–83.
Jhuraney A, Velkova A, Johnson RC, Kessing B, Carvalho RS, Whiley P, Spurdle AB, Vreeswijk MP, Caputo SM, Millot GA, et al. BRCA1 Circos: a visualisation resource for functional analysis of missense variants. J Med Genet. 2015;52(4):224–30.
Amendola LM, Jarvik GP, Leo MC, McLaughlin HM, Akkari Y, Amaral MD, Berg JS, Biswas S, Bowling KM, Conlin LK, et al. Performance of ACMG-AMP variant-interpretation guidelines among nine laboratories in the Clinical Sequencing Exploratory Research Consortium. Am J Hum Genet. 2016;98(6):1067–76.
Toland AE, Andreassen PR. DNA repair-related functional assays for the classification of BRCA1 and BRCA2 variants: a critical review and needs assessment. J Med Genet. 2017;54(11):721–31.
Atipairin A, Canyuk B, Ratanaphan A. The RING heterodimer BRCA1-BARD1 is a ubiquitin ligase inactivated by the platinum-based anticancer drugs. Breast Cancer Res Treat. 2011;126(1):203–9.
Ayi TC, Tsan JT, Hwang LY, Bowcock AM, Baer R. Conservation of function and primary structure in the BRCA1-associated RING domain (BARD1) protein. Oncogene. 1998;17(16):2143–8.
Meza JE, Brzovic PS, King MC, Klevit RE. Mapping the functional domains of BRCA1. Interaction of the ring finger domains of BRCA1 and BARD1. J Biol Chem. 1999;274(9):5659–65.
Wu LC, Wang ZW, Tsan JT, Spillman MA, Phung A, Xu XL, Yang MC, Hwang LY, Bowcock AM, Baer R. Identification of a RING protein that can interact in vivo with the BRCA1 gene product. Nat Genet. 1996;14(4):430–40.
Lorick KL, Jensen JP, Fang S, Ong AM, Hatakeyama S, Weissman AM. RING fingers mediate ubiquitin-conjugating enzyme (E2)-dependent ubiquitination. Proc Natl Acad Sci U S A. 1999;96(20):11364–9.
Prakash R, Zhang Y, Feng W, Jasin M. Homologous recombination and human health: the roles of BRCA1, BRCA2, and associated proteins. Cold Spring Harb Perspect Biol. 2015;7(4):a016600.
Nishikawa H, Ooka S, Sato K, Arima K, Okamoto J, Klevit RE, Fukuda M, Ohta T. Mass spectrometric and mutational analyses reveal Lys-6-linked polyubiquitin chains catalyzed by BRCA1-BARD1 ubiquitin ligase. J Biol Chem. 2004;279(6):3916–24.
Chen A, Kleiman FE, Manley JL, Ouchi T, Pan ZQ. Autoubiquitination of the BRCA1*BARD1 RING ubiquitin ligase. J Biol Chem. 2002;277(24):22085–92.
Wu-Baer F, Ludwig T, Baer R. The UBXN1 protein associates with autoubiquitinated forms of the BRCA1 tumor suppressor and inhibits its enzymatic function. Mol Cell Biol. 2010;30(11):2787–98.
Ohta T, Sato K, Wu W. The BRCA1 ubiquitin ligase and homologous recombination repair. FEBS Lett. 2011;585(18):2836–44.
Sharma B, Preet Kaur R, Raut S, Munshi A. BRCA1 mutation spectrum, functions, and therapeutic strategies: the story so far. Curr Probl Cancer. 2018;42(2):189–207.
Mallery DL, Vandenberg CJ, Hiom K. Activation of the E3 ligase function of the BRCA1/BARD1 complex by polyubiquitin chains. EMBO J. 2002;21(24):6755–62.
Huen MS, Grant R, Manke I, Minn K, Yu X, Yaffe MB, Chen J. RNF8 transduces the DNA-damage signal via histone ubiquitylation and checkpoint protein assembly. Cell. 2007;131(5):901–14.
Yan J, Jetten AM. RAP80 and RNF8, key players in the recruitment of repair proteins to DNA damage sites. Cancer Lett. 2008;271(2):179–90.
Yan J, Kim YS, Yang XP, Li LP, Liao G, Xia F, Jetten AM. The ubiquitin-interacting motif containing protein RAP80 interacts with BRCA1 and functions in DNA damage repair response. Cancer Res. 2007;67(14):6647–56.
Lee JS, Collins KM, Brown AL, Lee CH, Chung JH. hCds1-mediated phosphorylation of BRCA1 regulates the DNA damage response. Nature. 2000;404(6774):201–4.
Stolz A, Ertych N, Kienitz A, Vogel C, Schneider V, Fritz B, Jacob R, Dittmar G, Weichert W, Petersen I, et al. The CHK2-BRCA1 tumour suppressor pathway ensures chromosomal stability in human somatic cells. Nat Cell Biol. 2010;12(5):492–9.
Zhang J, Willers H, Feng Z, Ghosh JC, Kim S, Weaver DT, Chung JH, Powell SN, Xia F. Chk2 phosphorylation of BRCA1 regulates DNA double-strand break repair. Mol Cell Biol. 2004;24(2):708–18.
Sy SM, Huen MS, Chen J. PALB2 is an integral component of the BRCA complex required for homologous recombination repair. Proc Natl Acad Sci U S A. 2009;106(17):7155–60.
Cortez D, Wang Y, Qin J, Elledge SJ. Requirement of ATM-dependent phosphorylation of brca1 in the DNA damage response to double-strand breaks. Science. 1999;286(5442):1162–6.
Xu B, O'Donnell AH, Kim ST, Kastan MB. Phosphorylation of serine 1387 in Brca1 is specifically required for the Atm-mediated S-phase checkpoint after ionizing irradiation. Cancer Res. 2002;62(16):4588–91.
Kang Y, Cheong HM, Lee JH, Song PI, Lee KH, Kim SY, Jun JY, You HJ. Protein phosphatase 5 is necessary for ATR-mediated DNA repair. Biochem Biophys Res Commun. 2011;404(1):476–81.
Wu Q, Jubb H, Blundell TL. Phosphopeptide interactions with BRCA1 BRCT domains: more than just a motif. Prog Biophys Mol Biol. 2015;117(2–3):143–8.
Cantor SB, Bell DW, Ganesan S, Kass EM, Drapkin R, Grossman S, Wahrer DC, Sgroi DC, Lane WS, Haber DA, et al. BACH1, a novel helicase-like protein, interacts directly with BRCA1 and contributes to its DNA repair function. Cell. 2001;105(1):149–60.
Wong AK, Ormonde PA, Pero R, Chen Y, Lian L, Salada G, Berry S, Lawrence Q, Dayananth P, Ha P, et al. Characterization of a carboxy-terminal BRCA1 interacting protein. Oncogene. 1998;17(18):2279–85.
Drikos I, Nounesis G, Vorgias CE. Characterization of cancer-linked BRCA1-BRCT missense variants and their interaction with phosphoprotein targets. Proteins. 2009;77(2):464–76.
Yu X, Wu LC, Bowcock AM, Aronheim A, Baer R. The C-terminal (BRCT) domains of BRCA1 interact in vivo with CtIP, a protein implicated in the CtBP pathway of transcriptional repression. J Biol Chem. 1998;273(39):25388–92.
Yu X, Chen J. DNA damage-induced cell cycle checkpoint control requires CtIP, a phosphorylation-dependent binding partner of BRCA1 C-terminal domains. Mol Cell Biol. 2004;24(21):9478–86.
Kim H, Huang J, Chen J. CCDC98 is a BRCA1-BRCT domain-binding protein involved in the DNA damage response. Nat Struct Mol Biol. 2007;14(8):710–5.
Liu Z, Wu J, Yu X. CCDC98 targets BRCA1 to DNA damage sites. Nat Struct Mol Biol. 2007;14(8):716–20.
Rodriguez M, Yu X, Chen J, Songyang Z. Phosphopeptide binding specificities of BRCA1 COOH-terminal (BRCT) domains. J Biol Chem. 2003;278(52):52914–8.
Mohammad DH, Yaffe MB. 14-3-3 proteins, FHA domains and BRCT domains in the DNA damage response. DNA Repair (Amst). 2009;8(9):1009–17.
Lee MS, Green R, Marsillac SM, Coquelle N, Williams RS, Yeung T, Foo D, Hau DD, Hui B, Monteiro AN, et al. Comprehensive analysis of missense variations in the BRCT domain of BRCA1 by structural and functional assays. Cancer Res. 2010;70(12):4880–90.
Wang B, Matsuoka S, Ballif BA, Zhang D, Smogorzewska A, Gygi SP, Elledge SJ. Abraxas and RAP80 form a BRCA1 protein complex required for the DNA damage response. Science. 2007;316(5828):1194–8.
Cantor SB, Andreassen PR. Assessing the link between BACH1 and BRCA1 in the FA pathway. Cell Cycle. 2006;5(2):164–7.
Litman R, Peng M, Jin Z, Zhang F, Zhang J, Powell S, Andreassen PR, Cantor SB. BACH1 is critical for homologous recombination and appears to be the Fanconi anemia gene product FANCJ. Cancer Cell. 2005;8(3):255–65.
Greenberg RA, Sobhian B, Pathania S, Cantor SB, Nakatani Y, Livingston DM. Multifactorial contributions to an acute DNA damage response by BRCA1/BARD1-containing complexes. Genes Dev. 2006;20(1):34–46.
Williams RS, Dodson GE, Limbo O, Yamada Y, Williams JS, Guenther G, Classen S, Glover JN, Iwasaki H, Russell P, et al. Nbs1 flexibly tethers Ctp1 and Mre11-Rad50 to coordinate DNA double-strand break processing and repair. Cell. 2009;139(1):87–99.
Limbo O, Chahwan C, Yamada Y, de Bruin RA, Wittenberg C, Russell P. Ctp1 is a cell-cycle-regulated protein that functions with Mre11 complex to control double-strand break repair by homologous recombination. Mol Cell. 2007;28(1):134–46.
Sartori AA, Lukas C, Coates J, Mistrik M, Fu S, Bartek J, Baer R, Lukas J, Jackson SP. Human CtIP promotes DNA end resection. Nature. 2007;450(7169):509–14.
den Dunnen JT, Dalgleish R, Maglott DR, Hart RK, Greenblatt MS, McGowan-Jordan J, Roux AF, Smith T, Antonarakis SE, Taschner PE. HGVS recommendations for the description of sequence variants: 2016 update. Hum Mutat. 2016;37(6):564–9.
Hastings J, Owen G, Dekker A, Ennis M, Kale N, Muthukrishnan V, Turner S, Swainston N, Mendes P, Steinbeck C. ChEBI in 2016: improved services and an expanding collection of metabolites. Nucleic Acids Res. 2016;44(D1):D1214–9.
Gene Ontology Consortium. Gene Ontology Consortium: going forward. Nucleic Acids Res. 2015;43(Database issue):D1049–56.
Smith CL, Eppig JT. The mammalian phenotype ontology as a unifying standard for experimental and high-throughput phenotyping data. Mamm Genome. 2012;23(9–10):653–68.
Chibucos MC, Mungall CJ, Balakrishnan R, Christie KR, Huntley RP, White O, Blake JA, Lewis SE, Giglio M. Standardized description of scientific evidence using the Evidence Ontology (ECO). Database (Oxford). 2014;2014.
Starita LM, Young DL, Islam M, Kitzman JO, Gullingsrud J, Hause RJ, Fowler DM, Parvin JD, Shendure J, Fields S. Massively parallel functional analysis of BRCA1 RING domain variants. Genetics. 2015;200(2):413–22.
Woods NT, Baskin R, Golubeva V, Jhuraney A, De-Gregoriis G, Vaclova T, Goldgar DE, Couch FJ, Carvalho MA, Iversen ES, et al. Functional assays provide a robust tool for the clinical annotation of genetic variants of uncertain significance. NPJ Genom Med. 2016;1.
Schoumacher F, Glaus A, Mueller H, Eppenberger U, Bolliger B, Senn HJ. BRCA1/2 mutations in Swiss patients with familial or early-onset breast and ovarian cancer. Swiss Med Wkly. 2001;131(15–16):223–6.
Zheng L, Pan H, Li S, Flesken-Nikitin A, Chen PL, Boyer TG, Lee WH. Sequence-specific transcriptional corepressor function for BRCA1 through a novel zinc finger protein, ZBRK1. Mol Cell. 2000;6(4):757–68.
Tavtigian SV, Deffenbaugh AM, Yin L, Judkins T, Scholl T, Samollow PB, de Silva D, Zharkikh A, Thomas A. Comprehensive statistical study of 452 BRCA1 missense substitutions with classification of eight recurrent substitutions as neutral. J Med Genet. 2006;43(4):295–305.
Johnston JJ, Rubinstein WS, Facio FM, Ng D, Singh LN, Teer JK, Mullikin JC, Biesecker LG. Secondary variants in individuals undergoing exome sequencing: screening of 572 individuals identifies high-penetrance mutations in cancer-susceptibility genes. Am J Hum Genet. 2012;91(1):97–108.
Dombernowsky SL, Weischer M, Freiberg JJ, Bojesen SE, Tybjaerg-Hansen A, Nordestgaard BG. Missense polymorphisms in BRCA1 and BRCA2 and risk of breast and ovarian cancer. Cancer Epidemiol Biomark Prev. 2009;18(8):2339–42.
Phelan CM, Dapic V, Tice B, Favis R, Kwan E, Barany F, Manoukian S, Radice P, van der Luijt RB, van Nesselrooij BP, et al. Classification of BRCA1 missense variants of unknown clinical significance. J Med Genet. 2005;42(2):138–46.
Lovelock PK, Healey S, Au W, Sum EY, Tesoriero A, Wong EM, Hinson S, Brinkworth R, Bekessy A, Diez O, et al. Genetic, functional, and histopathological evaluation of two C-terminal BRCA1 missense variants. J Med Genet. 2006;43(1):74–83.
Bouwman P, van der Gulden H, van der Heijden I, Drost R, Klijn CN, Prasetyanti P, Pieterse M, Wientjens E, Seibler J, Hogervorst FB, et al. A high-throughput functional complementation assay for classification of BRCA1 missense variants. Cancer Discov. 2013;3(10):1142–55.
Tonin PN, Mes-Masson AM, Futreal PA, Morgan K, Mahon M, Foulkes WD, Cole DE, Provencher D, Ghadirian P, Narod SA. Founder BRCA1 and BRCA2 mutations in French Canadian breast and ovarian cancer families. Am J Hum Genet. 1998;63(5):1341–51.
Meindl A. Comprehensive analysis of 989 patients with breast or ovarian cancer provides BRCA1 and BRCA2 mutation profiles and frequencies for the German population. Int J Cancer. 2002;97(4):472–80.
Chang S, Biswas K, Martin BK, Stauffer S, Sharan SK. Expression of human BRCA1 variants in mouse ES cells allows functional analysis of BRCA1 mutations. J Clin Invest. 2009;119(10):3160–71.
Cunningham F, Moore B, Ruiz-Schultz N, Ritchie GR, Eilbeck K. Improving the Sequence Ontology terminology for genomic variant annotation. J Biomed Semantics. 2015;6:32.
Katagiri T, Kasumi F, Yoshimoto M, Nomizu T, Asaishi K, Abe R, Tsuchiya A, Sugano M, Takai S, Yoneda M, et al. High proportion of missense mutations of the BRCA1 and BRCA2 genes in Japanese breast cancer families. J Hum Genet. 1998;43(1):42–8.
Sweet K, Senter L, Pilarski R, Wei L, Toland AE. Characterization of BRCA1 ring finger variants of uncertain significance. Breast Cancer Res Treat. 2010;119(3):737–43.
We thank the members of our teams for useful discussions and the reviewers for constructive suggestions.
The work was funded by the Recherche Suisse Contre le Cancer (Grant #KFS-3297-08-2013 to AB and PH).
Availability of data and materials
The data presented here is available on https://www.nextprot.org/, in tabular format at https://www.nextprot.org/portals/breast-cancer, via the neXtProt API (https://api.nextprot.org/), SPARQL endpoint (https://sparql.nextprot.org/), and in XML (ftp://ftp.nextprot.org/pub/current_release/xml/).
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
About this article
Cite this article
Cusin, I., Teixeira, D., Zahn-Zabal, M. et al. A new bioinformatics tool to help assess the significance of BRCA1 variants. Hum Genomics 12, 36 (2018). https://doi.org/10.1186/s40246-018-0168-0
- Genetic variants
- Molecular phenotypes
- Functional defect assessment
- Biological database