Stemness-related gene signature for predicting therapeutic response in patients with esophageal cancer
Introduction
Esophageal cancer (ESCA) is the second deadliest gastrointestinal cancer. Patients generally have poor prognosis, with a 5-year survival rate of 40% for early-stage patients (1). There are two pathological types of ESCA that show regional specificities because of differences in living habits and genetic backgrounds. Esophageal squamous cell carcinoma is more prevalent in Southeast Asia, while esophageal adenocarcinoma is more frequent in Europe and America (2). Patients with ESCA can be divided into resectable patients and those who are inoperable (3). Traditional treatments such as surgery, radiotherapy, chemotherapy, immunotherapy, and targeted therapy, have limited efficacy in both patients with resectable and those with inoperable ESCA, leading to poor outcomes (4). In patients with advanced metastatic ESCA, nanoparticles AbraxaneTM and DoxilTM can be used as treatment options (3). Carboplatin and paclitaxel are the most common preoperative treatment agents for ESCA (5), while capecitabine and fluorouracil are administered postoperatively (6). Minimally invasive esophagectomy is a common surgical method for patients with operable ESCA to reduce postoperative morbidity. For patients with inoperable ESCA, cebox and cisplatin are used in combination to prolong survival (3). PembrolizumabTM, which targets specific genetic features, has been approved by the Food and Drug Administration (FDA) for the treatment of advanced ESCA (7). Therefore, current research on ESCA should be continuously adjusted to develop more effective treatment strategies.
Cancer stem cells (CSCs) are highly relevant for tumorigenesis, progression, recurrence, and metastasis of various cancers (8). The messenger RNA (mRNA) expression-based stemness index (mRNAsi) is used to evaluate the unique characteristics of CSCs and to assess tumor development (9). There is mounting evidence to support the existence of CSCs in lung adenocarcinoma (10), breast cancer (11), colorectal cancer (12), and prostate cancer (13), as well as their roles in metastasis, drug resistance, and cancer adaptation to changing microenvironments. For instance, single-cell transcriptome analysis revealed the landscape of intra-tumoral heterogeneity and stemness-related subpopulations of liver cancer cells (14).
Stemness-related genes (SRGs) have been identified in numerous cancers, such as cervical squamous cell carcinoma (15), colorectal cancer (16), renal clear cell carcinoma (17), lung cancer (18), gastric cancer (19), and ESCA (20). Moreover, SRGs have been shown to promote the formation, progression, and metastasis of different cancers (21). The AGR3 gene has been shown to promote the stemness of CSCs by upregulating SRGs via Frizzled 4 (FZD4), which is involved in the Wnt/β-catenin signaling pathway (22). Study examining ESCA stemness and clinical characteristics have indicated that higher-stage and metastatic tumors feature great phenotypic dedifferentiation (20). This study examined how SRGs promote ESCA formation, progression, and metastasis. The differentially expressed stemness-related (DESR) genes (DESRGs), DESR microRNAs (DESRmiRNAs), and DESR long non-coding RNAs (DESRlncRNAs) were firstly identified and functional analyses were performed. CSCs could change tumor microenvironment into immunosuppressive (23). Tumors with significant positive DESRGs, DESRlncRNAs, and DESRmiRNAs expression may exist higher immune infiltration in the tumor microenvironment (21). The DESRGs, DESRmiRNAs, and DESRlncRNAs were used to develop a prognostic risk assessment model and to construct a protein-protein interaction (PPI) network and a competing endogenous RNA (ceRNA) network, which together with the tumor-infiltrating immune cell analyses, corroborated the model results (Figure 1). Potential stemness-related prognostic markers for ESCA were identified. We present the following article in accordance with the TRIPOD reporting checklist (available at https://tcr.amegroups.com/article/view/10.21037/tcr-22-1723/rc).
Methods
Data collection and analysis of differential gene expression
The ESCA gene expression profile was obtained from The Cancer Genome Atlas-ESCA (TCGA-ESCA) database containing 11 normal tissues and 152 tumor tissues, and analyzed using the R package TCGA biolinks (24). Among these, 145 tumor tissues had complete clinical information, including age, sex, survival status, and tumor, node, metastasis (TNM) stage. The read counts were converted to transcripts per kilobase million (TPM) to compare the relative gene expression level between samples, while the miRNA data were converted to reads per kilobase million (RPKM) (25).
Differential gene expression between tumor and normal tissue samples was analysis using the R package DESeq2 (26). The significant threshold for the mRNA and lncRNA differential expression was defined as the log2 of the absolute fold change (FC) value greater than 1 (log2FC >1) and adjusted P value less than 0.05 (q value <0.05). The threshold to determine differentially expressed miRNA was set as log2FC >0.5 and q value <0.05. The differential analysis results are displayed as heat maps using the R package pheatmap (https://CRAN.R-project.org/package=pheatmap) and as volcano maps using the ggplot2 R package (27). The DESRGs were obtained using the gene set enrichment analysis (GSEA) (https://www.gsea-msigdb.org/gsea/index.jsp) tool of the GSEA software (28). The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013).
Construction of the PPI network and the ceRNA network
The single-sample GSEA (ssGSEA) method (29) was used to obtain the enrichment score for cell stemness of each sample based on the ESCA expression data and the DESRG set. To identify the DESRmiRNAs and DESRlncRNAs, correlation analysis was performed between the differentially expressed mRNA and the lncRNA or miRNA levels. Significant correlation coefficient greater than 0.2 (ρ>0.2; P<0.05) indicated strong positive correlation.
To construct the PPI network, the STRING database (https://string-db.org/), containing 2,031 species, 9.6 million proteins, and 1.38 million PPIs was used. Our PPI analysis included experimental results, text-mined results from PubMed abstracts, and synthesized and predicted results from bioinformatics analyses (30). A correlation coefficient above 0.7 (ρ>0.7; P<0.05) indicated a strong association between the differentially expressed genes and cell stemness. The PPI results were exported from the STRING database and visualized using the Cytoscape StringApp (31). In addition, the molecular complex detection (MCODE) plugin from Cytoscape was used to analyze hub genes in the PPI network.
MiRcode (https://bio.tools/miRcode) provides transcriptome-wide human miRNA target predictions based on comprehensive GENCODE gene annotations (https://www.gencodegenes.org/), including 10,000 lncRNA genes (32). Therefore, MiRcode was used to identify the miRNAs and lncRNAs in our dataset. The TargetScan database was used to predict the biological targets of the identified miRNAs by searching for the presence of conserved 6mer-8mer sites on the mRNA sequences matching the seed region of each miRNA (33). The miRDB database (http://mirdb.org/) was used to confirm the predicted miRNA targets and functional annotations by implementing its MirTarget tool, which uses thousands of miRNA-target interactions (MTIs) from high-throughput sequencing experiments (34). miRTarBase (https://mirtarbase.cuhk.edu.cn) is an experimentally validated MTI database that has accumulated more than 360,000 MTIs obtained from natural language processing after a manual survey of relevant literature (35). For the analysis using miRTarBase, research articles were systematically screened to collect those related to miRNA functional studies. The MTIs from TargetScan, miRDB, and miRTarBase analysis were combined with the identified DESRGs, DESRmiRNAs, and DESRlncRNAs to construct a ceRNA network, which was visualized using Cytoscape.
Functional analysis
Gene Ontology (GO) analysis is a standard method for large-scale functional enrichment studies that includes terms related to biological processes (BPs), molecular functions (MFs), and cellular components (CCs) (36). The Kyoto Encyclopedia of Genes and Genomes (KEGG) is a widely used database for storing information on genomes, biological pathways, diseases, and drugs (37). GO and KEGG pathway enrichment analyses of DESRGs were performed using the clusterProfiler (38) R package, with a cutoff value of false discovery rate (FDR) <0.05 for statistical significance.
To investigate the differences in enriched BPs between normal and tumor tissues, GSEA was performed based on the TCGA gene expression profiling dataset of patients with ESCA (39). For GSEA analysis, the control “c2.cp.kegg.v7.1.entrez” dataset used was downloaded from the MsigDB database and compared with the gene expression profiling dataset of patients with ESCA (FDR <0.25).
Development and validation of a prognostic risk assessment model
Univariate Cox (uni-Cox) regression analysis (P<0.05) was performed for the DESRGs, DESRmiRNAs, and DESRlncRNAs to estimate ESCA prognosis. The uni-Cox results were visualized using forest plot. Survival analysis was performed on the DESRGs, DESRmiRNAs, and DESRlncRNAs and the results were displayed using the R package survminer (40).
Least absolute shrinkage and selection operator (LASSO) regression analysis was performed to filter the DESRGs, DESRmiRNAs, and DESRlncRNAs with a 10-fold cross-validation. Furthermore, the DESRGs, DESRmiRNAs, and DESRlncRNAs screened by the LASSO method were used for multivariate Cox (multi-Cox) proportional hazards regression and risk model construction. The LASSO glmnet package (41) was used to divide the TCGA samples with complete clinical information into a training set and a validation set at a ratio of 1:1. The training dataset was composed of the DESRGs, DESRmiRNAs, and DESRlncRNAs (single factor P<0.05). The accuracy of the final model was verified through the validation set. The R package ggrisk (https://CRAN.R-project.org/package=ggrisk) was used to display the risk factor and perform the survival analysis on the risk scores of the training and validation sets. The R package pROC was used to obtain the receiver operating characteristic (ROC) curve, nomogram, and standard curve, showing the accuracy of the model. Finally, multi-Cox analysis with clinical traits was used to determine whether the risk score is independent of other traits.
Tumor-infiltrating immune cells analyses
CIBERSORT (https://cibersort.stanford.edu/index.php) was used to analysis 22 tumor-infiltrating immune cell types in each tissue, using the LM22 signature matrix with 1,000 permutations (42). The gene expression matrix data (TPM) was uploaded to CIBERSORT combined with the LM22 eigengene matrix, and the immune cell infiltration matrix was derived. A histogram was drawn showing the distribution of the 22 infiltrating immune cells in each sample. The R package corrplot (https://github.com/taiyun/corrplot) was used to calculate and display the correlation between the prognosis-related mRNA, lncRNA, miRNA, and the 22 tumor-infiltrating immune cell types.
Statistical analyses
All statistical analyses were performed using R software version 4.1.3 (https://www.r-project.org/). The ROC curves assessed the accuracy of the risk score in estimating the prognosis by calculating the area under the ROC curve (AUC). Spearman’s correlation (ρ) was used to determine all correlations between two samples. All statistical analyses were two-sided, with a significance of P<0.05.
Results
Differential expression analyses
DESeq2 was used to analyze the differential expression between TCGA-ESCA tumors [152] and normal tissues [11]. Baseline data of TCGA patients with ESCA was detected in Table 1. A total of 3,504 differentially expressed mRNAs (including 1,900 upregulated and 1,604 downregulated mRNAs; Figure 2A), 1,564 differentially expressed lncRNAs (including 993 upregulated and 571 downregulated lncRNAs; Figure 2B), and 209 differentially expressed miRNAs (including 130 upregulated and 79 downregulated miRNAs; Figure 2C) were identified. Heatmaps were used to identify the most significant differentially expressed mRNAs, lncRNAs, and miRNAs (Figure 2D-2F).
Table 1
Type | Alive (n=98) | Deceased (n=46) | P value |
---|---|---|---|
Tumor grade, n (%) | 0.238 | ||
G1 | 16 (16.3) | 2 (4.3) | |
G2 | 38 (38.8) | 19 (41.3) | |
G3 | 21 (21.4) | 12 (26.1) | |
GX | 23 (23.5) | 13 (28.3) | |
Gender, n (%) | 0.448 | ||
Female | 17 (17.3) | 5 (10.9) | |
Male | 81 (82.7) | 41 (89.1) | |
Age (years), n (%) | 0.912 | ||
<60 | 43 (43.9) | 19 (41.3) | |
≥60 | 55 (56.1) | 27 (58.7) | |
Stage, n (%) | 0.0011 | ||
I | 17 (17.3) | 3 (6.5) | |
II | 48 (49.0) | 14 (30.4) | |
III | 31 (31.6) | 22 (47.8) | |
IV | 2 (2.0) | 7 (15.2) | |
T staging, n (%) | 0.191 | ||
T1 | 18 (18.4) | 9 (19.6) | |
T2 | 25 (25.5) | 9 (19.6) | |
T3 | 55 (56.1) | 26 (56.5) | |
T4 | 0 (0.0) | 2 (4.3) | |
M staging, n (%) | 0.00238 | ||
M0 | 86 (87.8) | 34 (73.9) | |
M1 | 1 (1.0) | 7 (15.2) | |
MX | 11 (11.2) | 5 (10.9) | |
N staging, n (%) | 0.00909 | ||
N0 | 48 (49.0) | 11 (23.9) | |
N1 | 33 (33.7) | 28 (60.9) | |
N2 | 6 (6.1) | 4 (8.7) | |
N3 | 6 (6.1) | 0 (0.0) | |
NX | 5 (5.1) | 3 (6.5) |
TCGA, The Cancer Genome Atlas; ESCA, esophageal cancer.
Stemness-related ceRNA network
Correlation analysis of the stemness enrichment scores revealed 1,106 DESRGs, 84 DESRmiRNAs, and 320 DESRlncRNAs (Figure 3A). Network clustering was performed (Figure 3B-3D) and the top 20 connection points were identified, including CDC20 that connects to 136 adjacent nodes (Figure 3E). A ceRNA network was constructed, including 17 DESRmiRNAs, 44 DESRlncRNAs, and 55 DESRGs (Figure 3F).
Functional analysis of the DESRGs
GO enrichment analysis for the DESRGs (Figure 4A,4B) identified nuclear division, organelle fission, mitotic nuclear division, DNA-dependent DNA replication, and DNA replication as enriched initiation terms. KEGG pathway enrichment analysis revealed that the DESRGs were mainly enriched in biological pathways such as cell cycle, DNA replication, human papillomavirus infection, Cushing syndrome, and protein digestion and absorption (Figure 4C). GSEA functional analysis revealed that the DESRGs were enriched in the following KEGG pathways: cytokine-cytokine receptor interaction, cell cycle, calcium signaling pathway, neuroactive ligand-receptor interaction, fatty acid metabolism, DNA replication, spliceosome, NOD-like receptor signaling pathway, vascular smooth muscle contraction pathway, and graft-versus-host disease (Figure 5A). The most enhanced pathways were cytokine-cytokine receptor interaction, cell cycle, and DNA replication (Figure 5B), whereas the most inhibited pathways were related to calcium signaling pathway, neuroactive ligand-receptor interaction, and fatty acid metabolism (Figure 5C).
Survival analysis of the DESRGs, DESRmiRNAs, and DESRlncRNAs
Uni-Cox and survival analysis of the DESRGs, DESRmiRNAs, and DESRlncRNAs was performed to identify the correlation on the survival of patients with ESCA. The uni-Cox model (P<0.05) used 73 DESRGs, 7 DESRmiRNAs, and 23 DESRlncRNAs (Table S1). The pseudogene MTRNR2L13 and the chromosome 15 open reading frame 54 (C15orf54) were identified as high-risk factors associated with patient mortality. Notably, the centromere protein I (CENPI), PRICKLE2 antisense RNA 1 (PRICKLE2-AS1), and human miRNAs hsa-miR-101-1 and hsa-miR-101-2 were found to negatively correlate with patient mortality. Survival analysis revealed six highly significant DESRGs, DESRmiRNAs, and DESRlncRNAs (Figure S1).
Construction and validation of the prognostic model
The TCGA-ESCA datasets were randomly divided into training and validation sets, and univariate analysis was performed to detect prognostic-related DESRGs. LASSO regression was used to identify meaningful genes in the uni-Cox analysis, including six DESRGs, namely, NCAPG, HIST1H4J, KIAA1456, SCUBE2, PCCA, and ARID3A, that were selected to build the prediction model (Figure 6A). The expression profiles of these selected genes were used to calculate the risk scores in the training and validation sets separately. The total risk score was calculated as follows: expression level of (NCAPG × 0.0245 + HIST1H4J ×1.2293 + KIAA1456 × 0.0358 + SCUBE2 × 0.1375 + PCCA × 0.0738 + ARID3A × 0.0134). The distribution of risk scores in patients with ESCA and the correlation between survival time and risk score in the training set revealed that patients with high scores were associated with a higher mortality (Figure 6B), indicating that high-risk patients generally had poor overall survival (OS). A similar trend was observed in the validation set (Figure 6C). Based on the median value of the risk scores, the patients were divided into a high-risk group and a low-risk group. The survival rate of patients in the high-risk group was significantly lower than that in the low-risk group in both the training (P<0.0001; Figure 6D) and validation sets (P=0.01; Figure 6E). The AUC for 1-, 2-, and 3-year survival rates in the training group were 0.995, 0.937, and 0.818, respectively (Figure 6F), while those for the validation group were 0.732, 0.838, and 0.955, respectively (Figure 6G). These results indicated that the DESRGs selected as prognostic indicators have robust potential for survival prediction in patients with ESCA.
Uni-Cox (Figure 6H) and multi-Cox (Figure 6I) regression analyses with clinical variables (risk score, sex, age, grade, and stage) revealed that the risk score and stage were independent prognostic indicators for patients with ESCA. A nomogram combining risk score and clinicopathological parameters was selected to predict ESCA patient prognosis (Figure 6J). Calibration curves were used to compare the actual probabilities of survival and the predicted survival rates. The results showed a significant correlation between the probability of OS and the actual survival rates for 1, 2, and 3 years (Figure 6K).
A prognostic model for the DESRlncRNAs and DESRmiRNAs was constructed and validated. The regression analysis identified SCOC-AS1, RP11-626E13.1, RP1-136B1.1, and RP11-16E23.3 as meaningful lncRNAs for the prediction model. The risk score for DESRlncRNAs was calculated as follows: expression level of SCOC-AS1 × 0.7209 + expression level of RP11-626E13.1 × 2.2361 + expression level of RP1-136B1.1 × 1.4229 + expression level of RP11-16E23.3 × 6.7802. Furthermore, the miRNAs selected for the risk analysis were hsa-miR-269a, hsa-miR-214, hsa-miR-3652, and hsa-miR-503. The DESRmiRNAs risk score was calculated as follows: expression level of hsa-miR-1269a × 0.003 + expression level of hsa-miR-214 × 0.03181 + expression level of hsa-miR-3652 × 0.1195 + expression level of hsa-miR-503 × 0.04297. The risk score results for the DESRlncRNAs (Figure S2) and DESRmiRNAs (Figure S3) showed that the model provided high accuracy in predicting the prognosis of patients with ESCA.
Tumor-infiltrating immune cells
To analyze the effect of tumor-infiltrating immune cells on the model, the CIBERSORT algorithm was used to calculate the degree of infiltration of 22 types of infiltrating cells and display their overall distribution scores (Figure 7A) and correlations (Figure 7B). The correlation heatmap of the 22 types of infiltrating cells revealed that activated and resting natural killer (NK) cells, activated and resting mast cells, resting M0 macrophages, and CD4+ memory T cells had significant negative correlations. Monocytes were positively correlated with resting dendritic cells.
The association between infiltrating levels of the immune cells and the expression of DESRGs (Figure 7C), DESRlncRNAs (Figure 7D), and DESRmiRNAs (Figure 7E) from patients with ESCA was analyzed. ARID3A was found to be positively correlated with infiltrating levels of immune cells, including follicular helper T cells (P<0.001) and activated CD4+ memory T cells (P<0.01). KIAA1456 expression was positively correlated with gamma delta (γδ) T cells (P<0.01) and negatively correlated with activated mast cells (P<0.01). The expression of NCAPG was positively correlated with resting NK cells (P<0.01). The expression of PCCA was positively correlated with resting CD4+ memory T cells (P<0.01) and plasma cells (P<0.01), and negatively correlated with activated CD4+ memory T cells (P<0.01). SCUBE2 expression was negatively correlated with activated dendritic cells (P<0.01). RP1-136B1.1 levels were positively correlated with activated NK cells (P<0.001) and CD8+ T cells (P < 0.01), and negatively correlated with follicular helper T cells (P<0.01), resting NK cells (P<0.001), and M0 macrophage (P<0.001). Finally, hsa-miR-1269a expression was negatively correlated with resting CD4+ memory T cells (P<0.01).
Discussion
To the best of our knowledge, this is the first study to perform a comprehensive characterization of ESCA stemness based on combined analyses of mRNA, miRNA, and lncRNA expression in ESCA tumors and normal tissues. A total of 1,106 DESRGs, 84 DESRmiRNAs, and 320 DESRlncRNAs were identified using the TCGA-ESCA dataset.
Non-coding RNAs, including miRNAs and lncRNAs, play important roles in transcriptional and translational regulation in several cancers (43). lncRNAs act as miRNA sponges to reduce their regulatory abilities (44). Interactions between lncRNAs, miRNAs, and mRNAs form ceRNA networks (45). Understanding the role of ceRNA networks in pan-cancer analysis is crucial (46). However, to date, no such study has been conducted on stemness-related ceRNA networks in ESCA. In this present investigation, we constructed a stemness-related ceRNA network based on 55 DESRGs, 17 DESRmiRNAs, and 44 DESRlncRNAs, and analyzed its correlation with the survival data of patients with ESCA. PRICKLE2-AS1, FAM13A-AS1, C15orf54, and LINC00304 were identified as high-risk DESRlncRNAs that correlated with high ESCA severity. Previous study has demonstrated that LINC00304 directly promotes cell proliferation and cell cycle progression in prostate cancer (47). PRICKLE2-AS1 has been shown to be differentially methylated in vulvar squamous cell carcinoma (48) and FAM13A-AS1 is involved in regulating RNA splicing and mRNA processing in thyroid cancer (49). Herein, PRICKLE2-AS1, FAM13A-AS1, and LINC00304 were shown to be downregulated in ESCA tumors, whereas C15orf54 was upregulated (log2FC =1.51; q value =0.02). C15orf54 is known to be positively associated with high-grade gastric cancer (50) and indeed, C15orf54 may be a potential biomarker for ESCA prognosis.
NCAPG is one of the DESRGs selected to build the prediction model. Increased expression of NCAPG promotes oncogenesis in non-small cell lung cancer (51), glioma (52), lung adenocarcinoma (53), prostate cancer (54), and esophageal squamous cell carcinoma (55). NCAPG (log2FC =1.81; q value =2.68×10−11) was significantly upregulated in ESCA and positively correlated with resting NK cells, suggesting that human NK cells rest via the overexpression of NCAPG in ESCA. Candidate DESRGs, DESRlncRNAs, and DESRmiRNAs that show significant positive correlations with immune infiltration cells, increased expression of SRGs and increased immune infiltration may be related to poor prognosis. Notably, the oncogene NCAPG in prostate cancer is directly targeted by the antitumor hsa-miR-145-3p, which is associated with cancer pathogenesis (54). In the present study, we found that hsa-miR-1269a was significantly upregulated in ESCA tumors (log2FC =3.98; q value =2.92×10−4) with poor prognostic features. Additionally, the number of resting CD4+ memory T cell as negatively corrected with hsa-miR-1269a (P<0.01). hsa-miR-145-3p is known as a diagnostic and oncogenic marker in hepatocellular carcinoma (56), colorectal cancer (57), and non-small cell lung cancer (58), and it may be involved in the molecular pathogenesis of various cancers, including ESCA (55).
ARID3A acts as an oncogene, and its overexpression affects the development of colorectal cancer (59), ovarian cancer (60), gastric cancer (61), hepatocellular carcinoma (62), and rectal cancer (63). This report revealed that ARID3A was upregulated and positively associated with infiltrating follicular helper T cells and activated CD4+ memory T cells, and thus, may play an immunological role in ESCA. Follicular helper T cells are a subset of T-helper cells, which provide critical support to B cells and are essential for maintaining humoral immune responses (64), such as germinal center formation, affinity maturation, and the development of high-affinity antibodies (65). Therefore, ARID3A is a promising independent prognostic biomarker corresponding to follicular helper T cells and activated CD4+ memory T cells.
A previous report showed that a de novo variant in the human HIST1H4J gene causes a syndrome analogous to HIST1H4C-associated neurodevelopmental disorder (66). Furthermore, HIST1H4J is the strongest negative predictor of ovarian cancer (67) and an important prognostic and chemoresistance predictor of ESCA (68). Herein, upregulation expression of HIST1H4J (log2FC =1.39; q value =1.87×10−3) was observed in patients with ESCA.
Hsa-miR-1269a serves as a diagnostic and oncogenic marker in hepatocellular carcinoma (56), colorectal cancer (57) and non‐small cell lung cancer (58). Our investigations showed that hsa-miR-1269a is significantly upregulated in ESCA patients with poor prognostic features. CD4+ resting memory T cells (P<0.01) were significantly negatively correlated with hsa-miR-1269a.
This is the first study to perform a comprehensive characterization of ESCA stemness based on a combined analysis of mRNA, miRNA, and lncRNA expression, and correlation with prognosis prediction. SRG network will facilitate improved treatment regimens for patients with ESCA and may promote the development of innovative therapeutics in the future.
Acknowledgments
We thank the editor and the four anonymous reviewers for their thoughtful comments.
Funding: This project was supported by the Talent Scientific Research Start-Up Foundation of Yijishan Hospital, Wannan Medical College (Grant Nos. YR202001 and YR201806), the Opening Foundation of the Key Laboratory of Non-Coding RNA Transformation Research of the Anhui Higher Education Institution (Grant No. RNA202004), and the Key Projects of Natural Science Research of Universities in Anhui Province (Grant No. KJ2020A0622).
Footnote
Reporting Checklist: The authors have completed the TRIPOD reporting checklist. Available at https://tcr.amegroups.com/article/view/10.21037/tcr-22-1723/rc
Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://tcr.amegroups.com/article/view/10.21037/tcr-22-1723/coif). The authors have no conflicts of interest to declare.
Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013).
Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.
References
- Lundberg E, Lagergren P, Mattsson F, et al. Life Expectancy in Survivors of Esophageal Cancer Compared with the Background Population. Ann Surg Oncol 2022;29:2805-11. [Crossref] [PubMed]
- Li K, Chen J, Lou X, et al. HNRNPA2B1 Affects the Prognosis of Esophageal Cancer by Regulating the miR-17-92 Cluster. Front Cell Dev Biol 2021;9:658642. [Crossref] [PubMed]
- Li X, Chen L, Luan S, et al. The development and progress of nanomedicine for esophageal cancer diagnosis and treatment. Semin Cancer Biol 2022; Epub ahead of print. [Crossref] [PubMed]
- Zayac A, Almhanna K. Esophageal, gastric cancer and immunotherapy: small steps in the right direction? Transl Gastroenterol Hepatol 2020;5:9. [Crossref] [PubMed]
- Ai D, Ye J, Wei S, et al. Comparison of 3 Paclitaxel-Based Chemoradiotherapy Regimens for Patients With Locally Advanced Esophageal Squamous Cell Cancer: A Randomized Clinical Trial. JAMA Netw Open 2022;5:e220120. [Crossref] [PubMed]
- Shim YM, Yun J, Im YH, et al. The efficacy of adjuvant chemotherapy with capecitabine and cisplatin after surgery in locally advanced esophageal squamous cell carcinoma: a multicenter randomized phase III trial. Dis Esophagus 2022;35:doab040.
- Harada K, Yamamoto S, Kato K. Pembrolizumab for the treatment of advanced esophageal cancer. Future Oncol 2022;18:2311-9. [Crossref] [PubMed]
- Najafi M, Farhood B, Mortezaee K. Cancer stem cells (CSCs) in cancer progression and therapy. J Cell Physiol 2019;234:8381-95. [Crossref] [PubMed]
- Chen D, Liu J, Zang L, et al. Integrated Machine Learning and Bioinformatic Analyses Constructed a Novel Stemness-Related Classifier to Predict Prognosis and Immunotherapy Responses for Hepatocellular Carcinoma Patients. Int J Biol Sci 2022;18:360-73. [Crossref] [PubMed]
- Zeng H, Ji J, Song X, et al. Stemness Related Genes Revealed by Network Analysis Associated With Tumor Immune Microenvironment and the Clinical Outcome in Lung Adenocarcinoma. Front Genet 2020;11:549213. [Crossref] [PubMed]
- Yin P, Bai Y, Wang Z, et al. Non-canonical Fzd7 signaling contributes to breast cancer mesenchymal-like stemness involving Col6a1. Cell Commun Signal 2020;18:143. [Crossref] [PubMed]
-
Wei R Quan J Li S Development and Validation of a Unique Signature of Cancer Stemness-Related Genes for Predicting the Overall Survival of Colorectal Cancer Patients. - Zhuang J, Li M, Zhang X, et al. Construction of Bone Metastasis-Specific Regulation Network Based on Prognostic Stemness-Related Signatures in Prostate Cancer. Dis Markers 2022;2022:8495923. [Crossref] [PubMed]
- Ho DW, Tsui YM, Sze KM, et al. Single-cell transcriptomics reveals the landscape of intra-tumoral heterogeneity and stemness-related subpopulations in liver cancer. Cancer Lett 2019;459:176-85. [Crossref] [PubMed]
- Guo H, Wang S, Ju M, et al. Identification of Stemness-Related Genes for Cervical Squamous Cell Carcinoma and Endocervical Adenocarcinoma by Integrated Bioinformatics Analysis. Front Cell Dev Biol 2021;9:642724. [Crossref] [PubMed]
- Wang W, Xu C, Ren Y, et al. A Novel Cancer Stemness-Related Signature for Predicting Prognosis in Patients with Colon Adenocarcinoma. Stem Cells Int 2021;2021:7036059. [Crossref] [PubMed]
- Liu Y, Wang J, Li L, et al. AC010973.2 promotes cell proliferation and is one of six stemness-related genes that predict overall survival of renal clear cell carcinoma. Sci Rep 2022;12:4272. [Crossref] [PubMed]
- Jiang W, Xie N, Xu C. Characterization of a prognostic model for lung squamous cell carcinoma based on eight stemness index-related genes. BMC Pulm Med 2022;22:224. [Crossref] [PubMed]
- Lu YJ, Lian L, Shen XM, et al. A bioinformatics analysis to evaluate the prognostic value of stemness-related genes in gastric cancer. Transl Cancer Res 2021;10:174-83. [Crossref] [PubMed]
- Yi L, Huang P, Zou X, et al. Integrative stemness characteristics associated with prognosis and the immune microenvironment in esophageal cancer. Pharmacol Res 2020;161:105144. [Crossref] [PubMed]
- Hong L, Zhou Y, Xie X, et al. A stemness-based eleven-gene signature correlates with the clinical outcome of hepatocellular carcinoma. BMC Cancer 2021;21:716. [Crossref] [PubMed]
- Chi J, Zhang H, Hu J, et al. AGR3 promotes the stemness of colorectal cancer via modulating Wnt/β-catenin signalling. Cell Signal 2020;65:109419. [Crossref] [PubMed]
- Müller L, Tunger A, Plesca I, et al. Bidirectional Crosstalk Between Cancer Stem Cells and Immune Cell Subsets. Front Immunol 2020;11:140. [Crossref] [PubMed]
- Colaprico A, Silva TC, Olsen C, et al. TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res 2016;44:e71. [Crossref] [PubMed]
- Jiang L, Zhong M, Chen T, et al. Gene regulation network analysis reveals core genes associated with survival in glioblastoma multiforme. J Cell Mol Med 2020;24:10075-87. [Crossref] [PubMed]
- Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 2014;15:550. [Crossref] [PubMed]
- Wickham H, Chang W, Wickham MH. Package ‘ggplot2’. 2014. Available online: https://cran.microsoft.com/snapshot/2015-01-06/web/packages/ggplot2/ggplot2.pdf
- Subramanian A, Tamayo P, Mootha VK, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005;102:15545-50. [Crossref] [PubMed]
- Hänzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics 2013;14:7. [Crossref] [PubMed]
- von Mering C, Huynen M, Jaeggi D, et al. STRING: a database of predicted functional associations between proteins. Nucleic Acids Res 2003;31:258-61. [Crossref] [PubMed]
- Doncheva NT, Morris JH, Gorodkin J, et al. Cytoscape StringApp: Network Analysis and Visualization of Proteomics Data. J Proteome Res 2019;18:623-32. [Crossref] [PubMed]
- Jeggari A, Marks DS, Larsson E. miRcode: a map of putative microRNA target sites in the long non-coding transcriptome. Bioinformatics 2012;28:2062-3. [Crossref] [PubMed]
- McGeary SE, Lin KS, Shi CY, et al. The biochemical basis of microRNA targeting efficacy. Science 2019;366:eaav1741.
- Wong N, Wang X. miRDB: an online resource for microRNA target prediction and functional annotations. Nucleic Acids Res 2015;43:D146-52. [Crossref] [PubMed]
- Hsu SD, Lin FM, Wu WY, et al. miRTarBase: a database curates experimentally validated microRNA-target interactions. Nucleic Acids Res 2011;39:D163-9. [Crossref] [PubMed]
- Harris MA, Clark J, Ireland A, et al. The Gene Ontology (GO) database and informatics resource. Nucleic Acids yoto2004;32:D258-61.
- Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 2000;28:27-30. [Crossref] [PubMed]
- Wu T, Hu E, Xu S, et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation (Camb) 2021;2:100141. [Crossref] [PubMed]
- Subramanian A, Kuehn H, Gould J, et al. GSEA-P: a desktop application for Gene Set Enrichment Analysis. Bioinformatics 2007;23:3251-3. [Crossref] [PubMed]
- Kassambara A, Kosinski M, Biecek P, et al. Package ‘survminer’. 2017. Available online: https://cran.microsoft.com/snapshot/2017-04-21/web/packages/survminer/survminer.pdf
- Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw 2010;33:1-22. [Crossref] [PubMed]
- Newman AM, Liu CL, Green MR, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods 2015;12:453-7. [Crossref] [PubMed]
- Wang M, Mao C, Ouyang L, et al. Long noncoding RNA LINC00336 inhibits ferroptosis in lung cancer by functioning as a competing endogenous RNA. Cell Death Differ 2019;26:2329-43. [Crossref] [PubMed]
- Sun P, Hamblin MH, Yin KJ. Non-coding RNAs in the regulation of blood-brain barrier functions in central nervous system disorders. Fluids Barriers CNS 2022;19:27. [Crossref] [PubMed]
- Lin P, Wen DY, Li Q, et al. Genome-Wide Analysis of Prognostic lncRNAs, miRNAs, and mRNAs Forming a Competing Endogenous RNA Network in Hepatocellular Carcinoma. Cell Physiol Biochem 2018;48:1953-67. [Crossref] [PubMed]
- Zhang Y, Han P, Guo Q, et al. Oncogenic Landscape of Somatic Mutations Perturbing Pan-Cancer lncRNA-ceRNA Regulation. Front Cell Dev Biol 2021;9:658346. [Crossref] [PubMed]
- Kumar S, Prajapati KS, Singh AK, et al. Long non-coding RNA regulating androgen receptor signaling in breast and prostate cancer. Cancer Lett 2021;504:15-22. [Crossref] [PubMed]
- Dasgupta S, Ewing-Graham PC, Swagemakers SMA, et al. Exploring Differentially Methylated Genes in Vulvar Squamous Cell Carcinoma. Cancers (Basel) 2021;13:3580. [Crossref] [PubMed]
- Rao Y, Liu H, Yan X, et al. In Silico Analysis Identifies Differently Expressed lncRNAs as Novel Biomarkers for the Prognosis of Thyroid Cancer. Comput Math Methods Med 2020;2020:3651051. [Crossref] [PubMed]
- Liu M, Li J, Huang Z, et al. Gastric cancer risk-scoring system based on analysis of a competing endogenous RNA network. Transl Cancer Res 2020;9:3889-902. [Crossref] [PubMed]
- Sun H, Zhang H, Yan Y, et al. NCAPG promotes the oncogenesis and progression of non-small cell lung cancer cells through upregulating LGALS1 expression. Mol Cancer 2022;21:55. [Crossref] [PubMed]
- Zheng G, Han T, Hu X, et al. NCAPG Promotes Tumor Progression and Modulates Immune Cell Infiltration in Glioma. Front Oncol 2022;12:770628. [Crossref] [PubMed]
- Wang X, Tian X, Sui X, et al. Increased expression of NCAPG (Non-SMC condensing I complex subunit G) is associated with progression and poor prognosis of lung adenocarcinoma. Bioengineered 2022;13:6113-25. [Crossref] [PubMed]
- Shimonosono M, Idichi T, Seki N, et al. Molecular pathogenesis of esophageal squamous cell carcinoma: Identification of the antitumor effects of miR-145-3p on gene regulation. Int J Oncol 2019;54:673-88. [PubMed]
- Chen N, Zhang G, Fu J, et al. Identification of Key Modules and Hub Genes Involved in Esophageal Squamous Cell Carcinoma Tumorigenesis Using WCGNA. Cancer Control 2020;27:1073274820978817. [Crossref] [PubMed]
- Hu Y, Dingerdissen H, Gupta S, et al. Identification of key differentially expressed MicroRNAs in cancer patients through pan-cancer analysis. Comput Biol Med 2018;103:183-97. [Crossref] [PubMed]
- Qin S, Shi X, Wang C, et al. Transcription Factor and miRNA Interplays Can Manifest the Survival of ccRCC Patients. Cancers (Basel) 2019;11:1668. [Crossref] [PubMed]
- Wang X, Jiang X, Li J, et al. Serum exosomal miR-1269a serves as a diagnostic marker and plays an oncogenic role in non-small cell lung cancer. Thorac Cancer 2020;11:3436-47. [Crossref] [PubMed]
- Tang J, Yang L, Li Y, et al. ARID3A promotes the development of colorectal cancer by upregulating AURKA. Carcinogenesis 2021;42:578-86. [Crossref] [PubMed]
- Dausinas P, Pulakanti K, Rao S, et al. ARID3A and ARID3B induce stem promoting pathways in ovarian cancer cells. Gene 2020;738:144458. [Crossref] [PubMed]
- Xu G, Li K, Zhang N, et al. Screening Driving Transcription Factors in the Processing of Gastric Cancer. Gastroenterol Res Pract 2016;2016:8431480. [Crossref] [PubMed]
- Wang X, Zhou Y, Dong K, et al. Exosomal lncRNA HMMR-AS1 mediates macrophage polarization through miR-147a/ARID3A axis under hypoxia and affects the progression of hepatocellular carcinoma. Environ Toxicol 2022;37:1357-72. [Crossref] [PubMed]
- Zhou D, Ji G, Wei G, et al. MiR-361-3p promotes tumorigenesis of osteosarcoma cells via targeting ARID3A. Tissue Cell 2022;76:101759. [Crossref] [PubMed]
- Zhang X, Ing S, Fraser A, et al. Follicular helper T cells: new insights into mechanisms of autoimmune diseases. Ochsner J 2013;13:131-9. [PubMed]
- Ise W. Development and function of follicular helper T cells. Biosci Biotechnol Biochem 2016;80:1-6. [Crossref] [PubMed]
- Tessadori F, Rehman AU, Giltay JC, et al. A de novo variant in the human HIST1H4J gene causes a syndrome analogous to the HIST1H4C-associated neurodevelopmental disorder. Eur J Hum Genet 2020;28:674-8. [Crossref] [PubMed]
- Bachmayr-Heyda A, Aust S, Auer K, et al. Integrative Systemic and Local Metabolomics with Impact on Survival in High-Grade Serous Ovarian Cancer. Clin Cancer Res 2017;23:2081-92. [Crossref] [PubMed]
-
Zhang G Zhang Y Yang H Genome-Wide Profiling of a Prognostic RNA-Binding Protein Signature in Esophageal Cancer.
(English Language Editor: J. Teoh)