Evolutionary acquisition of promoter-associated non-coding RNA (pancRNA) repertoires diversifies species-dependent gene activation mechanisms in mammals

Background Recent transcriptome analyses have shown that long non-coding RNAs (ncRNAs) play extensive roles in transcriptional regulation. In particular, we have reported that promoter-associated ncRNAs (pancRNAs) activate the partner gene expression via local epigenetic changes. Results Here, we identify thousands of genes under pancRNA-mediated transcriptional activation in five mammalian species in common. In the mouse, 1) pancRNA-partnered genes confined their expression pattern to certain tissues compared to pancRNA-lacking genes, 2) expression of pancRNAs was significantly correlated with the enrichment of active chromatin marks, H3K4 trimethylation and H3K27 acetylation, at the promoter regions of the partner genes, 3) H3K4me1 marked the pancRNA-partnered genes regardless of their expression level, and 4) C- or G-skewed motifs were exclusively overrepresented between−200 and−1 bp relative to the transcription start sites of the pancRNA-partnered genes. More importantly, the comparative transcriptome analysis among five different mammalian species using a total of 25 counterpart tissues showed that the overall pancRNA expression profile exhibited extremely high species-specificity compared to that of total mRNA, suggesting that interspecies difference in pancRNA repertoires might lead to the diversification of mRNA expression profiles. Conclusions The present study raises the interesting possibility that the gain and/or loss of gene-activation-associated pancRNA repertoires, caused by formation or disruption of the genomic GC-skewed structure in the course of evolution, finely shape the tissue-specific pattern of gene expression according to a given species. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3662-1) contains supplementary material, which is available to authorized users.


Background
Mycoplasma pneumoniae pneumonia (MPP), as a common community-acquired pneumonia, counts for 20 to 40% of children pneumonia and may reach 50 to 80% during the time of local outbreak [1,2]. MPP is usually described as mild and self-limited; however, more and more severe or even fatal cases of MPP with severe complications such as pulmonary necrosis and chronic interstitial fibrosis have been reported recently [3][4][5]. Macrolide-resistant and excessive immunological inflammation are also commonly found in severe MPP [6]. Therefore, it is essential for pediatricians to recognize severe MPP early, treat it promptly, and prevent the progression of the disease effectively. However, the mechanism and etiology of severe MPP are largely unknown.
Based on published hypotheses, severe MPP is considered as a hyper-immune response that originates from repeated or longer lasting childhood MP infections in the lung [7]; further, severe MPP can be an overactive innate immune response such as macrophage activation via heterodimerization of Toll-like receptors two and six of the bronchoepithelial cells to M. pneumoniae lipoproteins [8]. With ELISA and real-time quantitative PCR techniques, researchers have found that the cellmediated immune response plays an important role in the pathogenesis of MPP [9][10][11] but the role of humoralmediated immune response in mild and severe MPP is still unclear.
High-throughput RNA sequencing technology, so called next-generation sequencing, revolutionarily enhanced our understanding on the complexity of eukaryotic transcriptome [12,13]. It has several key advantages including being independent on the predetermined genome sequences, highly accurate in detecting gene expression with very wide dynamic detection ranges with low background. Thus, RNA sequencing is not only useful to precisely determine gene expression profiles but also particularly powerful to detect novel transcription variants via alternative splicing [12].
In the present study, we observed the transcriptome of bronchoalveolar lavage fluid (BALF) from children with mild MPP and severe MPP. The large sum of novel information on the gene expression profiles as well as novel transcripts through alternative splicing would provide not only insights into the pathogenesis of severe MPP but also as basis for the development of biomarkers and therapeutic targets.

Study subjects
The current study was conducted at the First Hospital of Jilin University (Changchun City, Jilin Province, People's Republic of China). Six newly diagnosed children (three male and three female) with acute stage of MPP admitted to our hospital were recruited [see Additional file 1: Table  S1]. All of the children enrolled in this study had no recurrent severe or unusual infections and had no inflammatory disorders or autoimmunity. Therefore, based on the published diagnostic criteria, they had no history of common variable immunodefiency (CVID) [14]. After admission to our hospital, the levels of immunoglobulins in the blood of these children had been examined; the levels of IgG, IgA, and IgM had been found within normal range published for children [see Additional file 2: Figure  S1] [15]. Lymphocyte profiles in the peripheral blood of these children had also been examined, the cell numbers and percentage of T cells, B cells, and natural killer cells had been found within normal range [see Additional file 3: Table S2] [15]. Therefore, the enrolled children had been excluded from having CVID, autosomal recessive agammaglobulinemia [15], or high IgM syndrome [16]. All children did not have untreated metabolic/congenital systemic diseases. The diagnosis of pneumonia was based on clinical manifestations (cough, fever, dry or productive sputum, dyspnea, abnormal breath sound, radiological pulmonary abnormalities). The diagnosis of Mycoplasma pneumoniae (MP) infection was based on positive results of serologic test (MP-IgM test ≥1:40) and positive results of MP DNA (>500 copy/L) in BALF with real-time quantitative PCR. MP was the only pathogen identified in all the MPP subjects. The mild and severe community-acquired pneumonia was defined based on the criteria described [17,18]. Mild group was defined as fever <38.5°C at any age, tachypnea but respiratory rate <70 breaths/min at age <3 years old or <50 breaths/min at age ≥3 years old, normal food-intake, and no dehydration. Severe group was defined as fever ≥38.5°C at any age, breathless with respiratory rate ≥70 breaths/min at age <3 years old or ≥50 breaths/min at age ≥3 years old (excluding the reasons of fever and cry), cyanosis, marked retractions, anorexia, and dehydration.
The written informed consents were obtained by care givers of all children. The study was approved by the Institutional Medical Ethics Review Board of the First Hospital of Jilin University in compliance with the Declaration of Helsinki.

Bronchoscopy and bronchoalveolar lavage
Following the guidelines described previously [17], flexible fiber optic bronchoscopy with bronchoalveolar lavage (BAL) was performed within 3 days after the admission. Both groups received similar supportive and symptomatic treatment, including sputum aspiration, nebulization, and fluid therapy. Corticosteroid and the other immune regulation agents were not permitted before bronchoscopy. A 2.8-mm pediatric flexible bronchoscope (Olympus BF-XP60, New Hyde Park, NY) or a 4.0-mm flexible bronchoscope (Olympus P-260, New Hyde Park, NY) was used for children depending on their age and body weight. All of the enrolled subjects had indications for bronchoscopy and BAL: radiologically proven large pulmonary lesions (including atelectasis and consolidation of lung fields). Supplemental oxygen was administered during the procedure. Transcutaneous oxygen saturation and pulse rate (Masimo Radical-7 pulse oximetry, Masimo, CA) were continuously monitored during the bronchoscopy.
Intravenous injection of midazolam (0.1-0.15 mg/kg) was used for sedation; aerosolized lidocaine spraying on the throat insertion was performed 5-10 min before bronchoscope for throat local anesthesia; dripping 2% lidocaine through flexible bronchoscopy was used for the topical anesthesia of the upper and lower airways. BAL was performed in an area most prominently affected based on the chest radiology (MPP groups) by gently wedging the tip of the Bronchoscope in a segmental or subsegmental bronchus. 1 ml/kg sterile saline was instilled through the instrumentation channel. The BALF was gently aspirated and collected in a sterile container and immediately centrifuged. The pallet was resuspended in TRIzol (Life Technologies, CA, USA) and stored in −80°C freezer. The composition of the nucleated cells in the BALF was counted; results were shown in Additional file 4: Table S3.

RNA preparation and sequencing
Total RNA was extracted using TRIzol according to the manufacturer's instructions. RNA degradation and contamination were monitored on 1% agarose gels. RNA purity was checked using the NanoPhotometer spectrophotometer (IMPLEN, CA, USA). RNA concentration was measured using Qubit RNA Assay Kit in Qubit 2.0 Flurometer (Life Technologies, CA, USA). RNA integrity was assessed using the RNA Nano 6000 Assay Kit of the Bioanalyzer 2100 system (Agilent Technologies, CA, USA).
A total amount of 3 μg RNA per sample was used as input material for the RNA sample preparation. Sequencing libraries were generated using NEBNext® Ultra™ RNA Library Prep Kit for Illumina® (NEB, USA) following the manufacturer's recommendations, and index codes were added to attribute sequences to each sample. The clustering of the index-coded samples was performed on a cBot Cluster Generation System using TruSeq PE Cluster Kit v3-cBot-HS (Illumia) according to the manufacturer's instructions. After cluster generation, the library preparations were sequenced on an Illumina Hiseq platform and 125 bp/150 bp paired-end reads were generated.

Sequencing data analysis
Raw data (raw reads) of fastq format were firstly processed through in-house perl scripts, and clean data (clean reads) were obtained. Index of the reference genome was built using Bowtie v2.2.3, and paired-end clean reads were aligned to the reference genome using TopHat v2.0.12. Only uniquely and properly mapped read pairs were used for further analysis. The differentially expressed genes between BALF samples were identified using the DESeq R package (1.18.0) [19]. Differentially expressed genes were defined as those with changes of at least twofold between samples. The resulting p values were adjusted using the Benjamini and Hochberg approach for controlling the false discovery rate (FDR). Genes with an adjusted p value <0.05 found by DESeq were assigned as differentially expressed. Protein functional classification of differentially expressed genes was performed using the PANTHER classification system [20]. KOBAS software was used to test the statistical enrichment of differential expression genes in KEGG pathways. The significance of enriched KEGG pathways were determined by corrected p value <0.05. Cufflinks v2.1.1 Reference Annotation Based Transcript (RABT) assembly method was used to construct and identify both known and novel transcripts from TopHat alignment results. The analysis of alternative splicing events was performed using MATS and IGV software [21]; alternative splicing events were classified to five basic types by software Asprofile v1.0. The differences in alternative splicing of genes were considered significant with a cutoff of 5% FDR.

RNA sequencing results
Total RNA was extracted from six BALF samples of children with severe MPP or mild MPP [see Additional file 1: Table S1]. Then, mRNAs from each sample were sequenced. After the removal of adaptor sequences, ambiguous reads and low-quality reads, about 40-70 million pairs of clean read, were generated for each sample ( Table 1). The percentage of reads mapped to the forward chain was equal to that mapped to the reverse chain. When compared with the reference sequence of the Genome Reference Consortium GRCh37/ hg19, more than 85% of total read pairs were uniquely mapped on the human genome (Table 1). A correlation matrix shown a high consistency of measurements within each group, R 2 > 0.8 [see Additional file 5: Figure  S2A]. Principal component analysis (PCA) was carried out to assess the clustering nature of these samples. Samples of each group had been clustered together; data shown good repeatability and correlation [see Additional file 5: Figure S2B].
Identification and classification of differentially expressed genes between severe MPP and mild MPP Totally, 48 differentially expressed genes were identified between the severe MPP group and mild MPP group ( Fig. 1a). The 14 up-regulated genes listed in Table 2 were IGHV1-69, CH17-472G23.  (Table 3).
Protein functional classification of differentially expressed genes between severe MPP and mild MPP was performed. As a result, the 48 differentially expressed genes were divided into 13 different classes of protein (Fig. 1b). The expression levels of genes classified as signaling molecule (CLEC5A, IL13, INHBA), transporter (ATP1B2), and transfer/carrier protein (APOL4) significantly increased in the severe MPP group comparing to the mild MPP group [see Additional file 6: Table S4]. On the other hand, genes in the categories of nucleic acid binding molecule (AICDA, PAX5, SPIB, TCF7), genes encodes defense/ immunity protein (IL22RA2, CD79A), hydrase (PLA2G2D, HTRA3, AICDA), receptor (MS4A1, IL22RA2), and transcription factor (PAX5), were predominantly expressed in the mild MPP group. Among the cell adhesion molecules, FCER2 and FCRLB were up-regulated in the severe MPP group; FCRLA and FCRL4 were up-regulated in the mild MPP group.
The clustering analysis of differentially expressed genes indicated that FCER2, FLT1, IL13, and CLEC5A were The statistics of sequence that were mapped on the "+" chain of the genome Reads map to "−": The statistics of sequence that were mapped on the "−" chain of the genome Fig. 1 Differentially expressed genes between the severe MPP group and mild MPP group in BALF. a Volcano plot of genes differentially expressed between the severe MPP group and mild MPP group. Each point represents one gene that is detectable in both groups. The red points represent significantly up-regulated genes; the green points represent significantly down-regulated genes. b Protein functional classification of differentially expressed genes has been performed using the PANTHER tool between the severe MPP group and mild MPP group. The protein category is shown on x-axis and the gene frequency is shown on y-axis up-regulated in the severe MPP group comparing to the mild MPP group (Fig. 2). Similar up-regulation or downregulation patterns among these genes were identified, which may indicate related local protein functions of these genes under MP infection in children.
Functional annotation of KEGG pathway enrichment analysis revealed changes related to primary immunodeficiency in the BALF of severe MPP children, which was associated with the down-regulation of CD19, TNFRSF13C, CD79A, and AICDA genes (Table 4). CD19, TNFRSF13C, CD79A, and AICDA were involved in the B cell differentiation process; down-regulation of these genes may restrain B cell maturation and antibody production ( Fig. 3) [22]. Furthermore, COL6A1, COL6A2, COL5A3, COL3A1, and COL1A2 genes were involved in the pathways including protein digestion and absorption, ECM-receptor interaction, P13K-Akt signaling, and focal adhesion; these molecules were found down-regulated in the severe MPP group comparing to the mild MPP group (Table 4). COL6A1, COL6A2, COL5A3, COL3A1, and COL1A2 belong to the collagen super family, which play a role in maintaining the integrity of various tissues including the lung [23]. FLT1, which encoded a member of vascular endothelial growth factor receptor (VEGFR) family, was also found significantly increased in the severe MPP group comparing to the mild MPP group in the P13K-Akt signaling and focal adhesion pathways (Tables 2 and 4).
Alternative splicing events between severe MPP and mild MPP More than 90% of human genes are alternatively spliced through different types of splicing [24]. MATS analysis of the RNA sequencing data significantly revealed 1500 differential alternative splicing events with a cutoff of 5% FDR (Table 5), more than half (50.5%) of them belong to the skipped exon type. Among the differentially expressed genes between the severe MPP group and mild MPP group, FECR2 and FCRLA were identified to have significantly alternative splicing [see Additional file 7: Table S5, Additional file 8: Table S6, Additional file 9: Table S7, Additional file 10: Table  S8, and Additional file 11: Table S9].
FCER2 locates on chromosome 19p13.2 (chr 19:7,688,758-7,702,146); its encoding protein is a B cell specific antigen, which is a low-affinity receptor for IgE. This protein plays essential roles in B cell growth, differentiation, and regulation of IgE production. It also exists as a soluble secreted form and acts as a potent mitogenic growth factor. Retained intron (RI) was identified in FCER2 in some of the samples. RI lies on chr19: 7,698,355-7,698,854; the upstream exon locates on chr19: 7,698,355-7,698,409 and the downstream exon locates on chr19: 7,698,740-7,698,854 [see Additional file 8: Table S6]. Compared to the mild MPP group, the severe MPP group had less RI spliced transcript events of FCER2, which could be an explanation for the upregulation of the FCER2 in severe MPP children (Fig. 4a).  Additional file 9: Table S7]. The analysis of alternative splicing events indicated that the MEX frequency of the first exon in the severe MPP group was significantly higher than that in the mild MPP group, which may result in the down-regulated expression of FCRLA in severe MPP children (Fig. 4b).

Discussion
Recent developments in RNA sequencing technology enabled elaborate analysis of gene expression in numerous human diseases. However, to our best knowledge, no report of RNA-sequencing study on human MPP has been published yet. The current study provides extensive information on gene expression and alternative splicing in the BALF of MPP children through transcriptome analysis, which is crucial for understanding the pathogenesis of severe MPP. The gene expression analysis revealed 14 up-regulated genes and 34 down-regulated genes in severe MPP children comparing to mild MPP children. The top 10 most up-regulated genes are IGHV1-   (Fig. 1a, Table 2). The top 10 most down-regulated genes are OSTN-AS1, IL22RA2, COL3A1, C1orf141, IGKV2-29, RP11-731F5.2, IGHV4-4, KIRREL, DNASE1L3, and COL6A2 (Fig. 1a, Table 3).
Several key genes that are differentially expressed between severe MPP and mild MPP are associated with hyper-immune response and signaling. For example, IL13, a cytokine secreted by T helper type 2 (Th2) cells, can induce many features of allergic lung disease including airway hyper-responsiveness, goblet cell metaplasia, and mucus hyper-secretion, which all contribute to airway obstruction [25,26]. IL13 is found to cause mucin overproduction through STAT6/EGFR-FOXA2 signaling and mucus plugging formation in MP infection, which results in pulmonary atelectasis or consolidation [27]. These results prove that IL13 play an important role in the airway obstruction of severe MPP. In addition, Wu Q et al. [26] have reported that IL13 can restrain MP clearance by the suppression of Toll-like receptor 2 in mice. Therefore, high levels of IL13 may make severe MPP children lose the ability to eradicate MP from the lung in primary infection, resulting in longer lasting MP infection and a hyper-immune response [7]. Another gene associated with hyper-immune response is FCER2.
FCER2 is a B cell specific antigen and a low-affinity receptor for IgE. It has essential roles in B cell growth, differentiation, and the regulation of IgE production. On the basis of study with animal models, FCER2 has been implicated in IgE-mediated allergic diseases and bronchial hyper-reactivity [28]. It has been proved that FCER2 is involved in the pharmacogenetic basis for severe exacerbations in children with asthma [29]. Upregulated FCER2 in severe MPP children has been discovered in the present study. Similarly, increased IgE levels in the serum of MPP patients have been reported [30]. Therefore, further study will be needed to prove that up-regulated FCER2 causes the bronchial inflammation and hyper-immune reactivity. Among the upregulated genes in severe MPP comparing to mild MPP, FLT1 may also associate with hyper-immune response. FLT1 (fms-like tyrosine kinase 1) encodes a member of the vascular endothelial growth factor receptor (VEGFR) family. Wu WK et al. demonstrate that Th2-related cytokines, such as IL4 and IL13, could drive the expression of FLT1 [31]. Moreover, Th2-related cytokines can promote VEGF release in the airway, and VEGF has been proposed to be associated with severe MPP [32,33]. Therefore, atopic children may be more prone to develop severe pneumonia [34]. We tentatively put forward the hypothesis that IL13, FCER2, and FLT1 may be  associated with each other and all of them are involved in the pathogenesis of severe MPP. Based on our data [see Additional file 4: Table S3] and published results [35], more than half of the nucleated cells in BALF of MPP children without other pathogen infection are macrophages. But it is still unclear how macrophages are involved in the pathogenesis of severe MPP. Clustering analysis of the differentially expressed genes reveals that the expression patterns of IL13, FCER2, FLT1, and CLEC5A are similar in the severe MPP group (Fig. 2). One possible mechanism of severe MPP is the overactivation of macrophage in innate immune response [7]. CLEC5A (C-type lectin domain family 5, member A) is expressed on alveolar macrophages; it has been demonstrated to mediate macrophage response and play roles in pro-inflammatory cytokine expression and airspace enlargement in a mice model of chronic obstructive pulmonary disease (COPD) [36]. Muro S et al. suggested that MP infection could be an independent risk factor for COPD in the general population [37]. CLEC5A encodes a member of the CTL/CTLD (C-type lectin/C-type lectin-like domain) superfamily, which family members play roles in inflammation and immune responses. Teng O et al. have revealed that CLEC5A-mediated enhancement of the inflammatory response in myeloid cells contributes to influenza's pathogenicity in vivo [38]. We found significant higher expression levels of CLEC5A in the BALF of severe MPP comparing to that of mild MPP. Therefore, our results support the hypothesis that CLEC5A is involved in the pathogenesis of severe MPP through the overactivation of macrophage. Protein functional classification of the differentially expressed genes indicates that signaling molecules including IL13, CLEC5A, and INHBA are obviously increased in severe MPP comparing to mild MPP [see Fig. 1b, Additional file 6: Table  S4]. INHBA (Inhibin beta A) encodes a member of the transforming growth factor superfamily. The encoded preproprotein is proteolytically processed to generate a subunit of the dimeric activin and inhibin protein complexes. Rheumatoid arthritis synovium fluid (RA-SF) promotes INHBA production as a pro-inflammatory cytokine from macrophages in vitro [39]. Similarly, INHBA is reported to be up-regulated in endometritis by Hoelker M et al. [40]. It is interesting that the current study finds increased levels of INHBA in severe MPP. The causal relationship between the up-regulation of INHBA and severe MPP requires further investigation.
It is still unclear how B cells are involved in the pathogenesis of severe MPP. In the current study, KEGG pathway enrichment analyses have revealed changes related to primary immunodeficiency in severe MPP patients, which involves CD79A, AICDA, CD19, and TNFRSF13C genes (Table 4, Fig. 3) [22]. CD79A, AICDA, CD19, and TNFRSF13C are related to the B cell antigen signaling pathway; lower expression of these genes can lead to the deficiency of B cell functions. CD79A encodes the Igα protein of the B cell antigen component, which is Fig. 4 Differential alternative splicing of FCER2 and FCRLA. a Read distribution plot for FCER2 with differential isoform expression due to retained intron in the mild MPP group is shown. The black box and red arrows indicated the location of the retained intron. b Read distribution plot for FCRLA with differential isoform expression due to mutually exclusive exon is shown. The ratio of mutually exclusive exon happened in the severe MPP group is compared with that in the mild MPP group necessary for expression and function of the B cell antigen receptor. Defected CD79A has been discovered in immunodeficiency-related diseases [41]. Similarly, AICDA (activation-induced cytidine deaminase) encodes a RNAediting deaminase, which is expressed in a B cell differentiation stage-specific fashion. AICDA is involved in somatic hyper-mutation, gene conversion, and classswitch recombination of immunoglobulin genes [42]. Defects in AICDA can cause autosomal recessive hyper-IgM immunodeficiency syndrome type 2 (HIGM2) [43]. CD19 is a B cell-specific molecule, which serves as a major costimulatory molecule for amplifying B cell receptor (BCR) responses. Morbach H et al. have revealed that CD19 is required for TLR9-induced B cell activation and CD19/PI3K/AKT/BTK is an essential axis integrating BCRs and TLR9 signaling in human B cells [44]. In addition, biallelic CD19 gene mutations cause common variable immunodeficiency in human. BCR-induced B cell responses are impaired in most patients with common variable immunodeficiency. TNFRSF13C (tumor necrosis factor receptor superfamily member 13C) encodes a receptor for BAFF (B cell activating factor), which enhances B cell survival in vitro and regulates the peripheral B cell population. TNFRSF13C is a principal receptor required for BAFF-mediated mature B cell survival and it has been reported to be associated with common variable immunodeficiency [45]. In our study, significantly decreased expression levels of CD79A, AICDA, CD19, and TNFRSF13C have been observed in the BALF of the severe MPP group comparing to the mild MPP group. Comparing to the local reduction of CD79A, AICDA, CD19, and TNFRSF13C genes found in BALF, these patient's immunoglobulins and lymphocyte profiles in the peripheral blood are normal, which means they do not have any systemic immunodeficiency disease [see Additional file 2: Fig. S1, Additional file 3: Table S2 ]. We have found these local changes related to primary immunodeficiency in the bronchoalveolar of severe MPP. Local B cell-related immunodeficiency may be involved in the pathogenesis of severe MPP comparing with mild MPP. Figure 3 shows how these molecules are involved in the pathway of B cell differentiation and immunoglobulin gene class switch; down-regulation of these molecules may restrain antibody production and make the MPP children lose the ability to eradicate MP from the lung, which leads to happen of severe MPP [7].
Alternative splicing events of genes are involved in the diversity of proteome as well as genome evolution, control of developmental processes, and physiological regulation of various biological systems [46]. It could be deduced that dysregulation of alternative splicing event is often linked to various human diseases [47]. However, alternative splicing events in the context of MPP have rarely been investigated. The current study discovered significant differential alternative splicing events in FCER2 and FCRLA which could be an explanation for the differential expression of these two genes between severe MPP and mild MPP. The alternative splicing type of FCER2 is retained intron (RI), which can change the function and expression levels of a gene [48]. Therefore, lower rate of RI in the severe MPP group comparing to the mild MPP group may partly explain the significantly increased expression levels of FCER2 in the severe MPP group. FCRLA is a soluble resident endoplasmic reticulum protein. It is capable of associating with multiple Ig isotypes including IgM, IgG, and IgA, which makes it unique among the large family of Fc receptors. The expression of FCRLA is restricted to B lineage and is most abundant in germinal center B lymphocytes [49]. In the present study, the alternative splicing type of FCRLA is MEX, which is significantly increased in the severe MPP group comparing to the mild MPP group. According to the literatures [50,51], alternative splicing may be a causal explanation for the down-regulation of FCRLA in the severe MPP group.
Previous studies have found that cell-mediated immune response, specifically Th1-type cytokines such as IL8, IL18, and IFNγ, plays important roles in the mechanism of MPP [9][10][11]52]. Most of these studies observed serum levels of Th1 cytokines, but we have chosen BALF to study the local differentially expressed genes. Cytokines may be expressed differently in BALF comparing to that in the peripheral blood; this could partly explain why significant difference of IL8, IL18, and IFNγ was not found in the current study. Additionally, our transcriptome analysis focused on comparing severe MPP with mild MPP in order to study the specific pathogenesis of severe MPP; similar results have been reported by Kang YM et al. group [53]. In their study, higher levels of IFNγ in BALF of MPP patients comparing with that of control children were discovered, but no significant difference of IFNγ was found between the severe MPP group and mild MPP group.
Some limitations of the current study should be discussed. First, this is a small size transcriptome analysis between mild MPP children and severe MPP children; bigger sized analysis would be preferred in future study to support solid conclusions. However, three samples in each group have been chosen carefully to receive nextgeneration sequencing separately; quality control analysis of the whole process has been greatly satisfied. Correlation matrix shows a high consistency of measurements within each group. Principal component analysis (PCA) shows good repeatability and correlation of these samples. Second, this is a preliminary study on the pathogenesis of severe MPP comparing to mild MPP. Further study will be needed to further prove the hypothesis that gene reduction related to primary immunodeficiency in bronchoalveolar is involved in the pathogenesis of severe MPP. Patient's primary cell culture and in vivo experiments using animal model may help to clarify how these genes and pathways are involved in the pathogenesis of severe MPP in the future.