- Primary research
- Open Access
Whole-exome sequencing identifies novel pathogenic mutations and putative phenotype-influencing variants in Polish limb-girdle muscular dystrophy patients
Human Genomics volume 12, Article number: 34 (2018)
Limb girdle muscular dystrophies (LGMD) are a group of heterogeneous hereditary myopathies with similar clinical symptoms. Disease onset and progression are highly variable, with an elusive genetic background, and around 50% cases lacking molecular diagnosis.
Whole exome sequencing (WES) was performed in 73 patients with clinically diagnosed LGMD. A filtering strategy aimed at identification of variants related to the disease included integrative analysis of WES data and human phenotype ontology (HPO) terms, analysis of genes expressed in muscle, analysis of the disease-associated interactome and copy number variants analysis.
Genetic diagnosis was possible in 68.5% of cases. On average, 36.3 rare variants in genes associated with various muscle diseases per patient were found that could relate to the clinical phenotype. The putative causative mutations were mostly in LGMD-associated genes, but also in genes not included in the current LGMD classification (DMD, COL6A2, and COL6A3). In three patients, mutations in two genes were suggested as the joint cause of the disease (CAPN3+MYH7, COL6A3+CACNA1S, DYSF+MYH7). Moreover, a variety of phenotype-influencing variants were postulated, including in patients with an identified already known primary pathogenic mutation.
We hypothesize that LGMD could be better described as oligogenic disorders in which dominant clinical presentation can result from the combined effect of mutations in a set of genes. In this view, the inter- and intrafamilial variability could reflect a specific genetic background and the presence of sets of phenotype-influencing or co-causative mutations in genes that either interact with the known LGMD-associated genes or are a part of the same pathways or structures.
Limb girdle muscular dystrophies (LGMD) are a heterogeneous group of genetic disorders with similar clinical features, and a diverse and partially unknown genetic background. LGMD are characterized clinically by progressive muscle weakness and atrophy, predominantly or primarily of the pelvic and shoulder girdle muscles, without facial muscle dysfunction. The clinical course of the disease may be variable, ranging from severe forms with early onset and rapid progression to milder forms with later onset and minor physical disability. In the majority of cases, serum creatine kinase (CK) is elevated and a dystrophic pattern with necrosis and regeneration is observed on muscle biopsy [1, 2]. Most patients show a definable phenotype, but there are numerous exceptions as well as intrafamilial variability. LGMD are very rare disorders, fulfilling the criteria for orphan diseases with an estimated prevalence of 1/44000–1/123000 [3, 4].
Numerous studies performed world-wide in the last two decades have led to the identification of mutations in 30 genes (and one associated locus) causally involved in LGMD pathophysiology . Current LGMD classification is based on the mode of inheritance and the particular gene involved. The two general categories, autosomal dominant LGMD1 and autosomal recessive LGMD2, are divided into subgroups with different alphabetic designators, each caused by mutations in a specific gene.
To date, eight LGMD1 and 23 LGMD2 subtypes have been recognized [6,7,8]. This list is still expanding, with seven subtypes added in the last 3 years. The diagnosis of muscular dystrophies (including within the LGMD group) is difficult due to the presence of a number of different conditions with similar clinical phenotypes, including limb-girdle muscle weakness, e.g., myofibrillar myopathies, Bethlem myopathy, Becker muscular dystrophy, facioscapulohumeral muscular dystrophy, and Emery-Dreifuss muscular dystrophy. In fact, some of the latter have been considered a form of LGMD [1, 6].
According to the latest guidelines, the precise diagnosis of LGMD should rely on a detailed clinical examination, muscle biopsy, and genetic analysis to detect the causative mutations . However, muscle biopsy findings are often not sufficiently specific; therefore, genetic testing is considered the most reliable tool in LGMD diagnosis.
The molecular pathophysiology of LGMD is heterogeneous, with mechanisms ranging from defects in the dystrophin-dystroglycan complex, through abnormal localization of components of the muscle cytoskeleton and enzymatic defects, to sarcomeric and nuclear lamina dysfunctions. Different mutations in the same gene can cause widely different phenotypes (e.g., individual FKRP mutations cause a form of muscle-eye brain disease, a congenital severe muscular dystrophy, and a classical, adult LGMD form [10, 11]). The functional diversity of the protein products of the disease-causing genes makes the diagnosis very difficult and complex, requiring deep phenotyping.
It should be emphasized that despite intensive research, especially on the identification of novel causative genes, up to 50% of clinically defined LGMD cases are still without genetic diagnosis. Furthermore, the treatment of LGMD remains supportive and palliative, although it is expected that early diagnosis of the disease subtypes, based mainly on genetic testing, will allow the development of therapeutic strategies preventing or delaying the pathological process in the foreseeable future [12, 13]. Proactive multidisciplinary care and genetic counseling of LGMD patients is recommended, preferably immediately after diagnosis.
Since not all genetic risk factors of LGMD have been identified, further studies into the genetic background of the disease are necessary. Whole exome sequencing (WES) and, even more so, whole genome sequencing (WGS) provide a non-biased approach towards discovery of potentially causative mutations . Subsequent comprehensive bioinformatic analyses of the resulting list of genomic variants could not only pinpoint novel genes that could be associated with the disease, but also reveal mutations in genes related to other disorders explaining some of the as-yet molecularly undiagnosed cases. Additionally, apart from the causative mutations, variants that could be called phenotype-influencing, or even co-causative, could add up to the clinical phenotype.
Here, we report genetic variants identified using WES and comprehensive bioinformatic analyses in a fairly large group of Polish patients with clinically diagnosed LGMD. We found putative pathogenic mutations in known myopathy-related genes in 68.5% of cases. In all the cases, we propose numerous possibly phenotype-influencing or even co-causative mutations, including in genes not previously related to LGMD.
The study involved 72 cases (73 patients including a pair of siblings) with clinically diagnosed limb-girdle muscular dystrophy from a single neuromuscular diagnosis and treatment medical center. LGMD was defined as a progressive muscle weakness and atrophy of the pelvic and shoulder girdle muscles, as well as proximal limb muscles, without an involvement of facial muscles. The diagnosis was made on the basis of clinical assessment and muscle biopsy. Childhood cases with early-onset asymptomatic persistent hyperCKemia were included when the muscle biopsy showed evident features of muscular dystrophy. Miyoshi myopathy phenotypes (muscular dystrophy with predominant calf atrophy and high CK level) were also included because of their considerable genetic and phenotypic overlap with LGMD2B and 2L. Other types of muscular dystrophies (Duchenne muscular dystrophy, Becker muscular dystrophy, facioscapulohumeral muscular dystrophy, Emery-Dreifuss muscular dystrophy, and myotonic dystrophy type 1 and 2) and other myopathies (congenital, metabolic, mitochondrial, myofibrillar, and inflammatory) were excluded on the base of their clinical, electrophysiological, and morphological characteristics. To avoid Becker muscular dystrophy cases, we included male patients who had either similarly affected female siblings or previous negative MLPA (multiplex ligation-dependent probe amplification) results (which excluded large deletions or duplications in the dystrophin gene) together with normal muscle immunofluorescence staining for dystrophin.
The mean age of the patients was 26 years (range 3–78). The basic clinical data are shown in a supplementary file (see Additional file 1). Thirty-nine probands had one or more affected siblings. Two patients (no. 243 and 407) had a positive family history suggesting autosomal dominant inheritance; however, no parent was available for clinical assessment. In four patients, there was a background of second-degree parental consanguinity.
We also performed WES for 12 patients with non-classic muscular disease phenotypes where an LGMD diagnosis could not be definitively excluded (see Additional file 1). Data from these cases, and from an additional 16 patients from seven families with non-muscular neurological diseases, were used for comparison during the bioinformatic assessment of WES results.
DNA was extracted from peripheral blood using standard methods . Whole exome sequencing (WES) was performed commercially at BGI Tech Solutions (Hong Kong) using a SureSelect Human All Exon v5+UTR enrichment kit and paired-end 100-nt sequencing on the Illumina HiSeq2000 platform. Fast read files were generated from the sequencing platform via the Illumina pipeline. Adapter sequences in the raw data were removed and low-quality reads with low base quality discarded. On average, 240,451,900 “clean” paired-end reads per sample were aligned to the human reference genome hg19 using the Burrows-Wheeler Alignment (BWA) package . Duplicate reads were removed with Picard and base quality Phred scores were recalibrated using GATK’s covariance recalibration . The obtained 15 Giga-bases of aligned sequence data resulted in 125x median coverage of the target capture regions with 97.4% of target bases covered at least 10×. Capture performance statistics were calculated using CollectHsMetrics in Picard 2.17.10. The alignments were viewed with an Integrative Genomics Viewer . SNVs (single-nucleotide variants) and indels (small insertion/deletion) variants were called using the GATK Unified Genotyper. Annovar was used for initial variant annotation  with further annotation, filtering, and analysis performed on Galaxy (on PL-Grid Infrastructure) and GeneTraps (Intelliseq) platforms.
Copy number variant analysis
Copy number variants (CNVs) were called using CODEX software (version 1.8) . The analysis was performed within technical batches of samples. CNVs were annotated with allele frequencies using best-matching CNVs from 1000 genomes, and all the CNVs matching common CNVs (MAF > 1%) were removed. Genes overlapping each CNV in patients were identified using Ensembl version 86. The genes assigned to CNVs were annotated using diseases and phenotypes from Human Phenotype Ontology (HPO)  and tissue expression scores obtained from the GeneAtlas.
Whole exome sequencing identified on average 125,000 SNVs and 23,000 indels in each sample, of which 76,000 and 15,000, respectively, were off target (defined as intergenic or intronic, but not affecting splice sites) and therefore removed from further consideration. Only variants with an impact on coding regions were retained: missense, nonsense, frameshift, and essential splice site mutations. Further filtering was based on Phred quality scores, allele frequency in the ExAC (Exome Aggregation Consortium) database (< 3% for variants in genes already associated with LGMD, and < 1% for variants in other genes), association with HPO terms and predicted pathogenicity . The HPO terms used were: “muscular_dystrophy,” “muscle_weakness,” “limb-girdle,” “myopathy,” “muscular_atrophy,” “muscle_atrophy,” and “creatine phosphokinase.” Variants predicted to be pathogenic by at least one of the following programs were taken into further consideration: Mutation Taster, PolyPhen2, and SIFT. In total, among all the samples, 1880 variants were analyzed further (see Additional file 2). Prioritization was based on the following: the predicted effect, with truncating and elongating variants being evaluated more carefully; the predicted pathogenicity; and, finally, known association with myopathic phenotypes. All these variants were individually assessed by a board of geneticists and clinicians according to the guidelines of the American College of Medical Genetics and Genomics , with emphasis on the actual phenotype of each patient. Variants were then categorized as putative pathogenic (fit the phenotype effect very well) or potentially phenotype-influencing (could be responsible for naturally occurring variability of symptoms in frame of a typical LGMD clinical phenotype), with other variants assessed as unlikely to be related to the disease.
Independently, additional analyses were carried out with Exomiser2, PhenIX and Exome walker, with prioritization of variants based on possible association with limb-girdle muscular dystrophy (HP:0006785), and (based) on random-walk analysis of protein interaction networks with proteins already associated with the LGMD phenotype .
Additionally, in a second approach, rare (< 1% in ExAC database) variants in genes expressed in the human muscle, and in genes whose mouse homologs are expressed in muscle, were analyzed. Further analysis included ultra-rare variants (< 0.1% in ExAC), highly damaging mutations (including nonsense, frameshift, splice site mutations), interactome and association with pathways and structures that could play a role in LGMD pathogenesis. Known LGMD-associated genes were used to query the Biological General Repository for Interaction Datasets (BioGRID, version 3.4.151, accessed 3 August 2017) and Kyoto Encyclopedia of Genes and Genomes (KEGG, release 83.1, accessed 3 August 2017), which yielded potential interactors and muscle related pathways. Variants in genes that either interact with known LGMD-associated genes or are in the same pathway (as suggested by BioGRID and KEGG databases) were selected (see Additional file 3). Again, at the end of this discrimination pipeline (Fig. 1.), extracted variants were also correlated with patient phenotype and results of clinical examinations.
Selected variants (including all the putative causative mutations) were confirmed using direct fluorescence-based sequencing (ABI 3130 Genetic Analyzer, Applied Biosystems, USA). Segregation analysis including confirmation of trans configuration of compound heterozygotes could not always be performed because of the limited availability of DNA samples of the relatives.
Using whole exome sequencing, we found putative pathogenic mutations (pathogenic or likely pathogenic according to ACMG criteria) in known myopathy genes in 50 of the 72 LGMD cases (68.5%). In 43 cases, the identified variants were known to be pathogenic (found in OMIM, ClinVar, HGMD, or LOVD databases, or already described). These were associated mainly with the LGMD phenotype, but also with collagen-related myopathy and MYH-7 related myopathy. Putative causative mutations were found not only in LGMD-related genes (45 cases, 62.5%) but also in other myopathy-related genes (5 cases, 8%), highlighting the clinical overlap between muscular disorders. The latter cases had mutations in collagen myopathy-related genes (4 cases) and Becker’s muscular dystrophy (1 case). Selected variants most probably influencing the phenotype are presented in Table 1. A full version of the table including basic clinical data and evidence of pathogenicity is provided in a supplementary table (see Additional file 1).
The dominant forms of LGMD were relatively uncommon, with only one case each of LGMD1B (patient19) and LGMD1E (patient 275B). In two cases, mutations in MYOT (patient 175d) and CAV3 (patient 196) could also be considered responsible for the disease, but mutations in other genes (CAPN3 and COL6A2, respectively) were assessed as better explaining the patients’ phenotypes. It is noteworthy that all four mutations, G523R in LMNA, G77E in DNAJB6, R126H in CAV3, and R370C in MYOT, are ultra-rare, with a prevalence of < 0.02% in the European population according to the ExAC database.
In 22 cases, we found homozygous or compound heterozygous mutations in CAPN3. Additionally, in 13 cases, we found only a single heterozygous putative pathogenic mutation in CAPN3, and 11 of these cases were in the group of patients without an identified causative mutation. These cases were familial and clearly autosomal recessive.
Mutations in DYSF (homozygous or compound heterozygous) were found to be responsible for the disease in six patients from five families. Interestingly, we found the same compound heterozygous mutations in the DYSF gene in siblings with discordant phenotypes (LGMD in the sister, Miyoshi myopathy in the brother). In seven cases, we found mutations in FKRP; sarcoglycanopathies were represented by four families: two cases with mutations in SGCA and two in SGCB. Another two cases had ANO5 mutations.
In five cases, we found putative causative mutations in genes typically associated with other forms of myopathy. Four families had mutations in COL6A2 and COL6A3, typically associated with Bethlem myopathy, but recently also found in LGMD-like cases. In one male proband, a single known pathogenic deletion of three nucleotides was found in the DMD gene.
In all the cases, additional variants that could relate to case-specific muscle weakness phenotypes were found. Possible causes of phenotypic overlap, and of inter- and intrafamilial (patients 24 and 3) differences, were mutations in other LGMD-associated genes, and mutations in genes associated with myofibrillar myopathy, congenital muscular dystrophy, collagen myopathy, Duchenne / Becker muscular dystrophy, Emery-Dreifuss muscular dystrophy, or cardiomyopathy. A number of other variants in muscle pathogenesis-related genes were found, with uncertain significance. Filtering of the variants annotated with myopathy phenotype-related HPO terms returned between 21 and 59 variants per sample, with an average of 36.3 per sample.
Analysis of genes expressed in muscle (based on the Geneatlas database) gave 2036 variants (total for all analyzed patients), of which 1271 were ultra-rare (< 0.1%) and 214 had a putative high impact on the protein (nonsense, frameshift, splice site mutations). Interactome-based analysis of these variants reduced their number to 83 in 20 genes associated with known LGMD-related genes (Table 2). A supplementary table lists variants found in genes whose products interact with myopathy-related proteins (see Additional file 3).
Rare copy number variants in LGMD-related genes were found in 18 cases. A supplementary file shows detected CNVs in detail, not only in LGMD-related genes but also in the interactome of those genes, in other myopathy-related genes, and in genes expressed in muscles (see Additional file 4). It must be stressed that CNV predictions from WES data are not completely reliable [25, 26] and are presented in the supplementary material for indicative purposes only.
We performed the first comprehensive genetic analysis of patients with clinically defined LGMD in the Polish population.
On average, 36.3 rare variants per sample possibly related to the myopathic phenotype were identified. These variants were located in genes previously implicated in diverse muscle diseases (not just LGMD). These genes can be grouped according to the functional or structural association of their products: (i) dystrophin glycoprotein complex (SGCA, SGCB, SGCD, SGCG, DAG1), (ii) sarcomere structure (TCAP, TTN, PLEC, DES, MYOT) or assembly (CAPN3, DNAJB6), (iii) glycosylation (FKRP, POMT1, POMT2, POMGNT1, ISPD), (iv) signal transduction (CAV3, DAG1, BVES), (v) trafficking (TRAPPC11, CAV3, DYSF, BVES), and (vi) splicing (TNPO3, HNRPDL). After confirming the consistency of these results with the clinical and pathological characteristics of the patients, highly probable pathogenic genotypes could be identified in 50 out of 72 cases (68.5%). The above results gave a similar diagnostic rate to other recent NGS (next-generation sequencing) studies in genetically undiagnosed cohorts of LGMD: 47% in the Czech Republic, 62% in China, and 76% in Saudi Arabia [27,28,29]. Lower yields have been reported in studies involving patients pre-screened by targeted gene sequencing: 33% in Germany, 40% in the USA, and 45% in Australia [30,31,32]. The distribution of LGMD subtypes was similar to those observed in Germany  and Italy , with CAPN3 being the most frequent main putative pathogenic cause, and frequent cases with FKRP and DYSF mutations.
It should be noted that putative causative mutations in genes not included in the LGMD classification, COL6A2, COL6A3, and DMD, have also been reported by other authors in their LGMD cohorts [27,28,29,30,31], indicating phenotypic overlap between LGMD and other myopathies, making clinical diagnosis difficult in some cases. As a result of WES analysis, a diagnosis correction was made in the case of patient 135 to Becker muscular dystrophy. In the case of patients with causative mutations in COL6 genes, the diagnosis was also changed to suspected collagen myopathy (as COL6 genes mutations are not included in the current LGMD classification). In other cases, including those that were genetically unsolved, we upheld the clinical diagnosis of LGMD.
In the unsolved cases, the pathogenic mutations or copy number variants could be located in noncoding, regulatory or deep intronic regions. This could explain, for instance, the excess of single CAPN3 mutation carriers as compared to population-wide data. It is also likely that additional co-responsible CAPN3 mutations are located in the regulatory regions of the gene and therefore missed by exome sequencing. In sporadic or first cases in a family, a post-zygotic mutation event in the muscle could also be the cause of the disease .
In at least three cases, the patient’s clinical phenotype could be plausibly explained by mutations in more than one gene (CAPN3 + MYH7, COL6A3 + CACNA1S, DYSF + MYH7). In these cases apart from clinical phenotype of LGMD, additional features included the following: considerable distal weakness with early onset typical for MYH7-related myopathies (patient 18, CAPN3 + MYH7), early disease onset not typical for LGMD2B and possible autosomal dominant inheritance (patient 407, DYSF + MYH7), and almost exclusively type 1 fibers in biopsy unexpected for LGMD or Bethlem myopathy, whereas encountered in CACNA1S-related myopathies (patient 901, COL6A3 + CACNA1S). In these cases, a clinical diagnosis was upheld with possible co-existing MYH7 and CACNA1S-related myopathy. Likewise, in the majority of cases, additional variants in other genes apart from the highly probable major pathogenic mutations could at least add to the phenotypic manifestation; however, selecting co-causative variants from those classified as potentially modifying was not possible. Discrimination between possible phenotype-influencing variants and thousands of insignificant variants harbored by each individual became one of the most difficult novel challenges. Indeed, in all the studied cases, we encountered novel and rare variants related to LGMD and other myopathies, but their relevance could not be established based on the inheritance mode, patient’s phenotype, and known effect of mutations in these genes.
Here, we adopted a strategy for identifying the phenotype-influencing variants that linked the genes bearing found variants with any of the terms from the HPO database pointing to muscle physiology or structures. However, this approach could result in missing variants located in genes not yet associated with muscle disease, or missing variants coding for an interactome of the known causative proteins. Therefore, we additionally tried to identify putative phenotype-influencing variants by comprehensively analyzing those with MAF < 0.1%, expressed in muscle (human and/or murine) and markedly influencing the structure/function of the encoded protein (human and/or murine), but with no known association with the myopathy clinical phenotype (therefore excluding variants identified in the first approach). These variants were analyzed further based on the association of respective genes with known LGMD-related genes or pathways in which LGMD-related genes are involved. This reduced the overall number of such variants to 68 in 19 genes (0–3 per case). In light of their inheritance pattern, their presence in our in-house WES/WGS control group as well, and the presence of other variants that seem to explain the phenotype well in many cases, it is unlikely that the aforementioned 68 variants are causative. Still, putative phenotype-influencing variants could be within those in genes expressed in skeletal muscle.
By using various filtering approaches to WES results, one can gain insight into the possible influence of new genes on the disease. A list of such selected genes previously not associated with LGMD, but, according to our analysis, with a likely effect on the disease, is presented in Table 3.
Numerous genetic muscular disorders phenotypically overlap with LGMD, as limb-girdle weakness is one of their common symptoms. NGS-based genetic analyses can resolve clinical dilemmas and facilitate exact diagnosis [35,36,37]. Additionally, with the reporting of new cases, the spectrum of clinical manifestations of mutations in a given gene is likely to expand. Moreover, recent mass sequencing results show that the genetic background is more complex than previously considered [5, 6]. Also, our data suggest that mutations in more than one gene in a single patient can result in the LGMD phenotype. Taking into account the phenotypic variability within a given LGMD subtype or even between patients with the same causative mutation [38,39,40], one should expect a strong influence of disease-modifying genes, although no specific modifier or co-causative genes have been described to date. In our patients with identified primary causative mutations, at least a dozen additional variants that could influence or modify the phenotype were found, even when only genes known to be associated with muscle pathology were taken into consideration. It is therefore likely that the spectrum of genetic factors influencing the disease is substantially wider than previously recognized.
Indeed, a common polymorphism in the LTPB4 gene has been shown to be a disease-modifying factor in dystrophinopathy . Moreover, in some cases, mutations in more than one gene could be necessary to cause the disease . Thus, digenic inheritance has been proven for a subtype of facioscapulohumeral muscular dystrophy  found in congenital myasthenic syndrome , and it has also been suggested for calpainopathy .
Proteins of the muscle cell form a complex machinery where structural or functional impairment of any of its components can result in progressive muscle dysfunction and eventual destruction. The mutational burden in the numerous genes involved in muscular diseases must not be overlooked, as accumulation of minor defects, even those without an apparent overall effect when present in isolation, could result in a similar phenotype. Indeed, oligogenic etiology may be most easily observable in unsolved, sporadic LGMD cases, where the main putative causative mutation has not been identified. Interestingly, two of our “double trouble” cases were found precisely in sporadic patients.
On the other hand, some of the mutations described as disease-causing prior to the NGS era might only have a modifying effect, incomplete penetrance, and require additional variants to bring about pathology [40, 46]. The overrepresentation of single heterozygous CAPN3 mutations in our group may also indicate digenic or oligogenic inheritance.
Multiple annotation tools have become available using various algorithms and databases to predict the functional effects of genomic variants. One should bear in mind, however, that the functional scores of a given variant may differ substantially between different databases and prediction tools as they can be based on different functional aspects and prior knowledge. The superiority of high-scale bioinformatic analysis over focused genetic studies lies in the possibility of repeating the analysis and making use of novel knowledge .
Ideally, genetic testing should be combined not only with deep phenotyping but also with comprehensive analyses of transcripts and protein isoforms to pinpoint novel causative, co-causative, and phenotype-modifying variants.
The availability of exome and whole genome data for various conditions, including LGMD, challenges the classical definition of genetic causality and the concept of strictly monogenic disorders  and underlines the heterogeneity and complexity of the human genome . Our results show a range of phenotypes associated not only with genes previously and typically associated with LGMD but also with genes related to similar muscular disorders, such as Bethlem myopathy, myofibrillar myopathy, or congenital muscle dystrophy, as well as with genes not previously considered in the context of myopathies. Even if it is not always possible to prove the effect of putative modifying variants on the phenotype, aggregate analysis of mutations suggests that the sheer “variant burden” contributes to phenotypic variability.
Based on the obtained exomic data, we propose that LGMD could be better defined as a group of oligogenic disorders, in which variable clinical symptoms result from the combined effects of mutations in a set of genes and can result in a broad spectrum of clinical presentation rather than distinct disease entities. This could explain the fact that NGS methods fail to identify a single main causative gene in many LGMD cases, but indicate a range of possibly pathogenic and/or co-causative mutations in almost every case.
This could also explain the clinical heterogeneity not only of LGMD or within subtypes but also among individuals harboring the same known pathogenic mutations, and even between affected members of the same family. While a considerable proportion of LGMD cases can be easily attributed clinically to a single gene, the high number of variants that could relate to myopathy and sometimes to specific phenotype features in cases with mutations in known LGMD-associated genes suggests that the oligogenic nature of the disease may be important even in patients with a well-defined primary pathogenic cause.
However, unequivocal identification of such modifying variants requires comprehensive bioinformatic analyses integrated with deep phenotyping to make a final diagnosis . It should be remembered, nevertheless, that it is practically impossible to ascertain the causality even of a single gene in a single subject or a risk-family . Identification of all risk or co-causative factors requires bioinformatical analysis of combined genomic and clinical data on large groups of ethnically diverse patients with various muscle diseases followed by functional in vitro studies.
We expect that with the appearance of genomic data from large groups of patients with a large spectrum of myopathies, it will become possible to examine not just a limited number of genes and variants, but groups of genes encoding entire pathways and modules . As a result, the traditional descriptive classification of muscle diseases will transform into a systemic and pathway-based view of clinical phenotypes . The presented results are the first and indispensable step towards this goal of translational medicine.
Burrows-Wheeler alignment (software)
Serum creatine kinase
Copy number variant
The exome aggregation consortium (database)
Genome analysis toolkit (software)
Human gene mutation database
The human phenotype ontology (database)
- indel :
Limb-girdle muscular dystrophy
Leiden open variation database
Multiplex ligation-dependent probe amplification
Online Mendelian inheritance in man (database)
Polymerase chain reaction
Sorting intolerant from tolerant (software)
Whole exome sequencing
Whole genome sequencing
Mitsuhashi S, Kang PB. Update on the genetics of limb girdle muscular dystrophy. Semin Pediatr Neurol. 2012;19(4):211–8. https://doi.org/10.1016/j.spen.2012.09.008.
Bushby K. Diagnosis and management of the limb girdle muscular dystrophies. Pract Neurol. 2009;9:314–23. https://doi.org/10.1136/jnnp.2009.193938.
Norwood FL, Harling C, Chinnery PF, Eagle M, Bushby K, Straub V. Prevalence of genetic muscle disease in Northern England: in-depth analysis of a muscle clinic population. Brain. 2009;132:3175–86. https://doi.org/10.1093/brain/awp236.
van der Kooi AJ, Barth PG, Busch HF, de Haan R, Ginjaar HB, van Essen AJ et al. The clinical spectrum of limb girdle muscular dystrophy. A survey in The Netherlands. Brain. 1996;119(Pt 5): 1471–1480 doi: https://doi.org/10.1093/brain/119.5.1471.
Nigro V, Savarese M. Genetic basis of limb-girdle muscular dystrophies: the 2014 update. Acta Myol. 2014;33(1):1–12.
Vissing J. Limb girdle muscular dystrophies: classification, clinical spectrum and emerging therapies. Curr Opin Neurol. 2016;29(5):635–41. https://doi.org/10.1097/WCO.0000000000000375.
Ghaoui R, Benavides T, Lek M, Waddell LB, Kaur S, North KN, et al. TOR1AIP1 as a cause of cardiac failure and recessive limb-girdle muscular dystrophy. Neuromuscul Disord. 2016;26(8):500–3. https://doi.org/10.1016/j.nmd.2016.05.013.
Servián-Morilla E, Takeuchi H, Lee TV, Clarimon J, Mavillard F, Area-Gómez E, et al. A POGLUT1 mutation causes a muscular dystrophy with reduced Notch signaling and satellite cell loss. EMBO Mol Med. 2016;8(11):1289–309. https://doi.org/10.15252/emmm.201505815.
Narayanaswami P, Weiss M, Selcen D, David W, Raynor E, Carter G, et al. Evidence-based guideline summary: diagnosis and treatment of limb-girdle and distal dystrophies: report of the Guideline Development Subcommittee of the American Academy of Neurology and the Practice Issues Review Panel of the American Association of Neuromuscular & Electrodiagnostic medicine. Neurology. 2014;83(16):1453–63. https://doi.org/10.1212/WNL.0000000000000892.
Beltran-Valero de Bernabé D, Voit T, Longman C, Steinbrecher A, Straub V, Yuva Y, et al. Mutations in the FKRP gene can cause muscle-eye-brain disease and Walker-Warburg syndrome. J Med Genet. 2004;41:e61. https://doi.org/10.1136/jmg.2003.013870.
Chan YM, Keramaris-Vrantsis E, Lidov HG, Norton JH, Zinchenko N, Gruber HE, et al. Fukutin-related protein is essential for mouse muscle, brain and eye development and mutation recapitulates the wide clinical spectrums of dystroglycanopathies. Hum Mol Genet. 2010;19:3995–4006. https://doi.org/10.1093/hmg/ddq314.
Qiao C, Wang CH, Zhao C, Lu P, Awano H, Xiao B, et al. Muscle and heart function restoration in a limb girdle muscular dystrophy 2I (LGMD2I) mouse model by systemic FKRP gene delivery. Mol Ther. 2014;22:1890–9. https://doi.org/10.1038/mt.2014.141.
Sondergaard PC, Griffin DA, Pozsgai ER, Johnson RW, Grose WE, Heller KN, et al. AAV dysferlin overlap vectors restore function in dysferlinopathy animal models. Ann Clin Transl Neurol. 2015;2:256–70. https://doi.org/10.1002/acn3.172.
Lek M, MacArthur D. The challenge of next generation sequencing in the context of neuromuscular diseases. J Neuromuscul Dis. 2014;1:135–49. https://doi.org/10.3233/JND-140032.
Miller SA, Dykes DD, Polesky HF. A simple salting out procedure for extracting DNA from human nucleated cells. Nucleic Acids Res. 1988;16:1215. https://doi.org/10.1093/nar/16.3.1215.
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60. https://doi.org/10.1093/bioinformatics/btp324.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303. https://doi.org/10.1101/gr.107524.110.
Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29:24–6. https://doi.org/10.1038/nbt.1754.
Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164. https://doi.org/10.1093/nar/gkq603.
Jiang Y, Oldridge DA, Diskin SJ, Zhang NR. CODEX: a normalization and copy number variation detection method for whole exome sequencing. Nucleic Acids Res. 2015;43(6):e39. https://doi.org/10.1093/nar/gku1363.
Köhler S, Schoeneberg U, Czeschik JC. Clinical interpretation of CNVs with cross-species phenotype data. J Med Genet. 2014;51(11):766–72. https://doi.org/10.1136/jmedgenet-2014-102633.
Masino AJ, Dechene ET, Dulik MC, Wilkens A, Spinner NB, Krantz ID, et al. Clinical phenotype-based gene prioritization: an initial study using semantic similarity and the human phenotype ontology. BMC Bioinformatics. 2014;15:248. https://doi.org/10.1186/1471-2105-15-248.
Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17(5):405–24. https://doi.org/10.1038/gim.2015.30.
Smedley D, Robinson PN. Phenotype-driven strategies for exome prioritization of human Mendelian disease genes. Genome Med. 2015;7(1):81. https://doi.org/10.1186/s13073-015-0199-2.
Khateb S, Hanany M, Khalaileh A, Beryozkin A, Meyer S, Abu-Diab A, et al. Identification of genomic deletions causing inherited retinal degenerations by coverage analysis of whole exome sequencing data. J Med Genet. 2016;53(9):600–7. https://doi.org/10.1136/jmedgenet-2016-103825.
Belkadi A, Bolze A, Itan Y, Cobat A, Vincent QB, Antipenko A, et al. Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants. Proc Natl Acad Sci U S A. 2015;112(17):5473–8. https://doi.org/10.1073/pnas.1418631112.
Stehlíková K, Skálová D, Zídková J, Haberlová J, Voháňka S, Mazanec R, et al. Muscular dystrophies and myopathies: the spectrum of mutated genes in the Czech Republic. Clin Genet. 2017;91(3):463–9. https://doi.org/10.1111/cge.12839.
Yu M, Zheng Y, Jin S, Gang Q, Wang Q, Yu P, et al. Mutational spectrum of Chinese LGMD patients by targeted next-generation sequencing. PLoS One. 2017;12(4):e0175343. https://doi.org/10.1371/journal.pone.0175343.
Monies D, Alhindi HN, Almuhaizea MA, Abouelhoda M, Alazami AM, Goljan E, et al. A first-line diagnostic assay for limb-girdle muscular dystrophy and other myopathies. Hum Genomics. 2016;10(1):32. https://doi.org/10.1186/s40246-016-0089-8.
Kuhn M, Gläser D, Joshi PR, Zierz S, Wenninger S, Schoser B, et al. Utility of a next-generation sequencing-based gene panel investigation in German patients with genetically unclassified limb-girdle muscular dystrophy. J Neurol. 2016;263(4):743–50. https://doi.org/10.1007/s00415-016-8036-0.
Reddy HM, Cho KA, Lek M, Estrella E, Valkanas E, Jones MD, et al. The sensitivity of exome sequencing in identifying pathogenic mutations for LGMD in the United States. J Hum Genet. 2017;62(2):243–52. https://doi.org/10.1038/jhg.2016.116.
Ghaoui R, Cooper ST, Lek M, Jones K, Corbett A, Reddel SW, et al. Use of whole-exome sequencing for diagnosis of limb-girdle muscular dystrophy: outcomes and lessons learned. JAMA Neurol. 2015;72(12):1424–32. https://doi.org/10.1001/jamaneurol.2015.2274.
Magri F, Nigro V, Angelini C, Mongini T, Mora M, Moroni I, et al. The italian limb girdle muscular dystrophy registry: relative frequency, clinical features, and differential diagnosis. Muscle Nerve. 2017;55(1):55–68. https://doi.org/10.1002/mus.25192.
Acuna-Hidalgo R, Bo T, Kwint MP, van de Vorst M, Pinelli M, Veltman JA, et al. Post-zygotic point mutations are an underrecognized source of de novo genomic variation. Am J Hum Genet. 2015;97(1):67–74. https://doi.org/10.1016/j.ajhg.2015.05.008.
Simeoni S, Russo V, Gigli GL, Scalise A. Facioscapulohumeral muscular dystrophy and limb-girdle muscular dystrophy: “double trouble” overlapping syndrome? J Neurol Sci. 2015;348(1–2):292–3. https://doi.org/10.1016/j.jns.2014.12.009.
Ciafaloni E, Chinnery PF, Griggs RC. Evaluation and Treatment of myopathies. Oxford: Oxford University Press; 2014.
Pegoraro E and Hoffman EP, Limb-Girdle Muscular Dystrophy Overview. In Pagon RA, Bird TD, Dolan CR, eds. GeneReviews, University of Washington, Seattle, WA, USA, 2012.
Weiler T, Bashir R, Anderson LV, Davison K, Moss JA, Britton S, et al. Identical mutation in patients with limb girdle muscular dystrophy type 2B or Miyoshi myopathy suggests a role for modifier gene(s). Hum Mol Genet. 1999;8:871–7. https://doi.org/10.1093/hmg/8.5.871.
Illarioshkin SN, Ivanova-Smolenskaya IA, Greenberg CR, Nylen E, Sukhorukov VS, Poleshchuk VV, et al. Identical dysferlin mutation in limb-girdle muscular dystrophy type 2B and distal myopathy. Neurology. 2000;55:1931–3. https://doi.org/10.1212/WNL.55.12.1931.
Fee DB, So YT, Barraza C, Figueroa KP, Pulst SM. Phenotypic variability associated with Arg26Gln mutation in caveolin3. Muscle Nerve. 2004;30(3):375–8. https://doi.org/10.1002/mus.20092.
Flanigan KM, Ceco E, Lamar KM, Kaminoh Y, Dunn DM, Mendell JR, et al. LTBP4 genotype predicts age of ambulatory loss in Duchenne muscular dystrophy. Ann Neurol. 2013;73(4):481–8. https://doi.org/10.1002/ana.23819.
Posey JE, Harel T, Liu P, Rosenfeld JA, James RA, Coban Akdemir ZH, et al. Resolution of disease phenotypes resulting from multilocus genomic variation. N Engl J Med. 2017;376(1):21–31. https://doi.org/10.1056/NEJMoa1516767.
Lemmers RJ, Tawil R, Petek LM, Balog J, Block GJ, Santen GW, et al. Digenic inheritance of an SMCHD1 mutation and an FSHD-permissive D4Z4 allele causes facioscapulohumeral muscular dystrophy type 2. Nat Genet. 2012;44(12):1370–4. https://doi.org/10.1038/ng.2454.
Lam CW, Wong KS, Leung HW, Law CY. Limb girdle myasthenia with digenic RAPSN and a novel disease gene AK9 mutations. Eur J Hum Genet. 2017;25(2):192–9. https://doi.org/10.1038/ejhg.2016.162.
Saenz A, Lopez de Munain A. Dominant LGMD2A: alternative diagnosis or hidden digenism? Brain. 2017;140(Pt 2):e7. https://doi.org/10.1093/brain/aww281.
Di Fruscio G, Garofalo A, Mutarelli M, Savarese M, Nigro V. Are all the previously reported genetic variants in limb girdle muscular dystrophy genes pathogenic? Eur J Hum Genet. 2016;24(1):73–7. https://doi.org/10.1038/ejhg.2015.76.
Need AC, Shashi V, Schoch K Petrovski S, Goldstein DB. The importance of dynamic re-analysis in diagnostic whole exome sequencing. J Med Genet. 2017;54:155–6. https://doi.org/10.1136/jmedgenet-2016-104306.
Katsanis N. The continuum of causality in human genetic disorders. Genome Biol. 2016;17:233. https://doi.org/10.1186/s13059-016-1107-9.
Gao F, Keinan A. High burden of private mutations due to explosive human population growth and purifying selection. BMC Genomics. 2014;15:S3. https://doi.org/10.1186/1471-2164-15-S4-S3.
Hennekam RC, Biesecker LG. Next-generation sequencing demands next-generation phenotyping. Hum Mutat. 2012;33(5):884–6. https://doi.org/10.1002/humu.22048.
Marian AJ. The case of “missing causal genes” and the practice of medicine. Circ Res. 2016;119:21–4. https://doi.org/10.1161/CIRCRESAHA.116.308830.
McCarthy MI, MacArthur DG. Human disease genomics: from variants to biology. Genome Biol. 2017;30(18):20. https://doi.org/10.1186/s13059-017-1160-z.
Thompson R, Straub V. Limb-girdle muscular dystrophies—international collaborations for translational research. Nat Rev Neurol. 2016;12(5):294–309. https://doi.org/10.1038/nrneurol.2016.35.
This work was supported by National Science Centre, Poland grants (NCN, 2013/09/B/NZ4/03258 and 2015/19/N/NZ2/02915), and by the KNOW-MMRC project financed by the Ministry of Science and Higher Education (to JPF). The bioinformatics part of the study (JPF) was supported by PL-Grid Infrastructure financed by the Polish Ministry of Science and Higher Education and co-funded by the European Regional Development Fund as part of the Innovative Economy program. The funding sources had no involvement in the experiment design, analysis, data interpretation, and manuscript preparation.
Availability of data and materials
All data generated during this study are included in the supplementary information files. The raw data analyzed during the current study are available from the corresponding author on reasonable request.
Ethics approval and consent to participate
The study was approved by the Ethics Committee of the Warsaw Medical University and MSW Hospital (Warsaw, Poland) in compliance with national legislation and the Code of Ethical Principles for Medical Research Involving Human Subjects of the World Medical Association. Written consent was obtained from all patients and healthy individuals according to the Declaration of Helsinki (BMJ 1991; 302:1194). The authors are very grateful to all the families for their participation in this study.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Clinical characteristics and WES results (including evidence of pathogenicity according to ACMG criteria) of 85 patients studied. (XLSX 24 kb)
Lists of rare (< 1%) variants in genes related with “muscle phenotype” for individual patients. (XLSX 3003 kb)
List of variants in genes expressed in the human muscle and in genes whose mouse homologs are expressed in muscle, which are in the interactome of known LGMD- and MFM-associated genes. (XLSX 1451 kb)
List of copy number variants (CNVs) identified in the vicinity of LGMD-related genes. (XLS 2184 kb)
Selected genes with reported skeletal muscle expression which could contribute to LGMD. (DOCX 16 kb)