J Cancer 2023; 14(9):1660-1672. doi:10.7150/jca.84407 This issue Cite
Research Paper
1. Key Laboratory of Shanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an 710004, China.
2. Laboratory Center of Stomatology, College of Stomatology, Xi'an Jiaotong University, Xi'an 710004, China.
3. Department of Oral and Maxillofacial Surgery, College of Stomatology, Xi'an Jiaotong University, Xi'an 710004, China.
4. Department of Ophthalmology, The First Affiliated Hospital of Xinjiang Medical University, Urumqi, 830001, China.
Objectives: Head and neck squamous cell carcinoma (HNSCC) is the most common malignancy of the head and neck. However, the molecular mechanisms governing the development of HNSCC have not been fully elucidated.
Materials and Methods: Differentially expressed genes (DEGs) were screened out from The Cancer Genome Atlas (TCGA) and GSE23036 datasets. Weighted gene coexpression network analysis (WGCNA) was used to reveal the correlations among genes and to search for significantly correlated gene modules. The expression levels of genes in HNSCC and normal samples according to antibody-based detected methods was assessed by utilizing the Human Protein Atlas (HPA). The impact of the selected hub genes on the prognosis of HNSCC patients was assessed by analysing immunohistochemistry (IHC) and immunofluorescence (IF) expression levels and clinical data.
Results: Twenty-four genes positively correlated with tumour status and 15 genes negatively correlated with tumour status were screened out by WGCNA. PLAU and LAMC2 were associated with a poor prognosis in patients with HNSCC and were finally screened out and verified by GEPIA and HPA database analysis. Immunohistochemistry of samples collected from 175 patients with HNSCC and subsequent statistical analysis also showed that PLAU and LAMC2 were associated with a poor prognosis in patients with HNSCC, and the levels of these two factors were positively correlated. The expression and co-localization of PLAU and LAMC2 in HNSCC tissues were confirmed by double immunofluorescence labeling.
Conclusions: There was a positive correlation between PLAU and LAMC2 expression in HNSCC samples, and PLAU and LAMC2 might be independent prognostic biomarkers for HNSCC.
Head and neck squamous cell carcinoma (HNSCC) is the most common malignancy of the head and neck, accounting for more than 800,000 new cases and 450,000 deaths in 2018 worldwide [1]. In the United States, 54,000 new HNSCC cases were diagnosed, and 11,230 patients died from HNSCC in 2021 [2]. HNSCC has gradually become a serious problem worldwide, and despite the rapid development of clinical treatment in the past few decades, the 5-year survival rate of this disease has not significantly improved [3]. Given the high incidence and mortality, early detection and risk assessment can improve the prognosis of HNSCC. Therefore, better prognostic biomarkers are needed to identify early-stage disease, and key gene-targeted therapy is necessary and urgent to improve the survival rate of HNSCC patients.
The Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) databases have recently been extensively used to identify key differentially expressed genes (DEGs) associated with the prognosis of HNSCC [4-6]. However, most studies only focused on a single database and screened out DEGs but ignored the combined analysis of multiple databases. In addition, the complicated network of the genome and the high degree of interconnection between function‐related genes were ignored. Currently, systematic bioinformatics methods are widely used in the research of various kinds of cancers [7-9].
In this study, we first aimed to combine the GEO and TCGA databases for analysis and then screen out the key DEGs of HNSCC in these two databases. Second, we constructed a coexpression network of related genes and a protein-protein interaction (PPI) network based on the DEGs of HNSCC and identified key genes deserving further investigation for their potential as biomarkers. An integrated bioinformatics analysis was used to further investigate the functions, pathways, and regulation mechanisms and ultimately screened out the key DEGs related to HNSCC prognosis. In addition, we verified the key DEGs by immunohistochemistry (IHC) combined with clinical data, and their effect on the prognosis of HNSCC patients was analysed.
One hundred seventy-five formalin-fixed, paraffin-embedded blocks of HNSCC samples and 10 normal samples were collected from the Department of Oral and Maxillofacial Surgery, College of Stomatology, Xi'an Jiaotong University, between March 2017 and March 2022. The screening criteria were as follows: (1) patients were pathologically diagnosed with HNSCC; (2) patients had undergone surgical resection; (3) patients had complete clinical data and follow-up data; (4) patients did not have other malignant tumours. The use of HNSCC samples was approved by the ethics review board (approval no. xjkqll[2022]NO.028). The TNM and clinicopathological classification and staging of patients with HNSCC were performed according to the American Joint Committee on Cancer (AJCC) guidelines [10]. Clinical data, including age, gender, survival status, differentiation, TNM stage, clinical stage, and recurrence were collected.
The present study design is shown in the form of a flow diagram in Figure 1. The version of TCGA used in this study was updated in April 2022 (https://portal. gdc.cancer.gov). The TCGA is a comprehensive and coordinated project designed to improve diagnostic methods and treatment standards and ultimately prevent cancer. Information about sequencing and the pathological data of more than 30 kinds of human tumours can be analysed using TCGA [11]. The gene expression data and corresponding clinical data of HNSCC samples were downloaded from the TCGA data portal. The RNA sequencing (RNA-Seq) FPKM data of 270 HNSCC samples and 19 adjacent normal samples were downloaded. The location of squamous cell carcinoma (1. lip; 2. palate; 3. gum; 4. base of the tongue; 5. other and unspecified parts of the mouth; 6. floor of the mouth; 7. other and unspecified parts of the tongue; 8. oropharynx) were obtained from the TCGA database for further analysis. Then, differential expression analysis was performed in the R software “edgeR” package with the cut-off criteria of |log2 (fold change [FC])| > 1.0 and false discovery rate (FDR) adjusted P value < 0.01.
Flow diagram of the dates of processing and analysis.
GEO (http://www.ncbi.nlm.nih.gov/geo) is a public functional genomics data repository containing array and sequence-based data. The GSE23036 [12] dataset (Affymetrix HG-U133A 2.0 Array; Affymetrix, Santa Clara, CA), with a total of 63 HNSCC samples and 5 normal mucosa samples, was downloaded from GEO to screen the DEGs. The probe ID was converted into an international standard name for gene symbols using Perl programming.
We used the “limma” and “edgeR” R packages to screen out the DEGs between HNSCC samples and normal samples from the TCGA database [13] and the GEO database [14]. Adjusted p value < 0.05 and |log2FC| >1) were chosen as the cut‐off thresholds.
Weighted gene coexpression network analysis (WGCNA) is an algorithm that is based on high‐throughput gene expression profiles and is widely used in gene coexpression network identification in various diseases to reveal the correlation of genes and to search for significantly correlated gene modules. In this study, the coexpression analyses were conducted using the “WGCNA” R package [15] with TCGA and GEO data and patients' corresponding clinical data. The power of β = 15 (scale-free R2 = 0.83) and a cut‐off module size = 30 were set as the soft threshold to ensure a scale-free network. Gene significance and module membership were statistically calculated in an intramodular analysis of the module, and the genes in the hub module with a correlation ≥ 0.8 were selected as hub genes. Significant modules were defined with a p value < 0.05.
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed using the “colorspace” and “string” R packages and the Bioconductor packages “DOSE”, “clusterProfiler”, and “Pathview” [16]. Each module in WGCNA was analysed in Search Tool for the Retrieval of Interacting Genes (STRING) version 11 (https://string-db.org/) and Cytoscape software. The STRING online database and a combined score > 0.7 were used for PPI network construction. The top 10 hub genes were visualized using Cytoscape software.
The significant intersecting genes from the hub modules in WGCNA were identified as hub genes. Further validation and survival analysis of these hub genes were performed by using the Gene Expression Profiling Interactive Analysis (GEPIA) database (http://gepia2.cancer-pku.cn/index.html) [17]. We plotted the survival curves and expression levels of the hub genes in the GEPIA database. The Human Protein Atlas (HPA) (https://www.proteinatlas.org) is a website that contains IHC-based expression data for nearly 20 highly common kinds of cancers, and each tumour type includes 12 individual tumours [18]. In this study, a direct comparison of the protein expression of the selected hub genes between normal oral mucosa and HNSCC samples was performed by IHC imaging.
HNSCC samples and normal samples were collected from humans for IHC; the methods for IHC are described in our previous study [19]. IHC was performed with anti-PLAU (17968-1-AP, Proteintech, 1:500) and anti-LAMC2 (ab210959, Abcam, 1:500) antibodies. The methods of immunofluorescence staining of PLAU and LAMC2 were similar to those of IHC. Sections were subjected to rehydration and antigen retrieval, followed by elimination of autofluorescence. Primary antibodies were then applied and incubated at 4°C overnight, followed by fluorescence-conjugated secondary antibody incubation and DAPI staining of nuclei.
Image-Pro Plus version 6.0 software (Media Cybernetics, Inc., Bethesda, MD, USA) was used to evaluate the intensity score (IS) of IHC by calculating the integrated optical density (IOD) three times in each field, and the IOD/I of the total area of each field was also calculated simultaneously. According to the IOD value, the staining IS values were 0 (-), 1 (+), 2 (++), and 3 (+ + +). We divided the staining proportion score (PS) into four levels: 0 (0%), 1 (1-25%), 2 (26-50%), 3 (51-75%); and 4 (76-100%). The IS multiplied by the PS was the final result of the staining score [20]. The expression of PLAU and LAMC2 was categorized into a low expression group (0-6) and a high expression group (7-12) in this study according to the staining score results.
SPSS 25.0 software (IBM) was used for statistical analysis. For analysis of clinical data, counting data were expressed by frequency and composition ratio, the chi-square test was used for comparison between groups, and Cox regression was used to analyse the correlation between clinical indicators and death. Statistical significance was defined as P < 0.05 (*P < 0.05, **P < 0.01, and ***P < 0.001).
The gene expression profiles of the TCGA RNA-Seq FPKM data of 270 HNSCC samples and 19 adjacent normal samples were analysed using the “limma” package of R software. A total of 952 DEGs were identified (293 upregulated and 659 downregulated), and the volcano plot of all DEGs and the heatmap of the top 50 DEGs are shown in Figure 2A. The gene expression profiles of the GSE23036 dataset from the GEO database were also analysed using the “limma” package of R software. A total of 570 DEGs were identified (236 upregulated and 334 downregulated). The volcano plot of all DEGs and the heatmap of the top 50 DEGs are shown in Figure 2B.
Differentially expressed genes in the TCGA and GEO datasets. (A) Volcano map and heatmap of significantly differentially expressed genes between HNSCC and normal samples in the TCGA dataset. (B). Volcano map and heatmap of significantly differentially expressed genes between HNSCC and normal samples in the GEO dataset.
The "WGCNA" package in R was used to place the DEGs with highly relevant expression patterns into modules by average linkage clustering. A soft-thresholding procedure was performed by WGCNA, and a best-fit cut-off value (β = 13) was selected at the lowest mean connective value and appropriate scale-free topology fit index (0.75) (Figure 3A, B). A total of 11 modules were identified in the TCGA database (Figure 3C), and 12 modules were identified in the GEO database (Figure 3D). We set the MED threshold as 0.25 to merge similar modules (Figure 3E, F). In the TCGA and GEO databases, the module-trait heatmap revealed the differences in the expression of different module gene sets between tumour and normal samples. In the TCGA database, the blue module was positively significantly correlated with tumour status (R = 0.52 and P = 2e-21), while the red module was negatively significantly correlated with tumour status (R = -0.45 and P = 1e-15) (Figure 3G).
WGCNA modular analysis of the TCGA and GEO datasets. (A, B) Analysis of the two datasets of the scale-free fit index for various soft-thresholding powers (β) and the mean connectivity for the soft-thresholding powers. (C, D) DEG clustering and module screening based on gene expression patterns in the two datasets. (E, F) Clustering of module eigengenes. A cut-line (0.25) was selected for the module dendrogram, and some modules were merged according to the dissimilarity of estimated module eigengenes. (G, H) Correlation module heatmap of tumour samples compared to normal samples. The blue gene module was most positively associated with tumour samples, and the red module was negatively associated with tumour samples in the TCGA dataset. The red gene module was most positively associated with tumour samples, and the black module was negatively associated with tumour samples in the GEO dataset.
In the GEO database, the green module was positively significantly correlated with tumour status (R = 0.78 and P = 5e-15), while the black module was negatively significantly correlated with tumour status (R = -0.54 and P = 2e-06) (Figure 3H). In the TCGA database, the intramodular analysis showed that genes in the blue module were highly positively correlated with tumour status (cor = 0.75, P < 1e‐200), and those in the red module were highly negatively correlated with tumour status (cor = 0.7, P = 7.7e‐138) (Figure 4A). In the GEO database, the intramodular analysis showed that genes in the green module were highly positively correlated with tumour status (cor = 0.84, P = 3.7e‐123), and those in the black module were highly negatively correlated with tumour status (cor = 0.77, P = 7.1e‐52) (Figure 4B).
To identify the common DEGs that were positively correlated with HNSCC in the TCGA and GEO databases, we intersected a total of 24 genes from four gene sets. In addition, for the common DEGs negatively correlated with HNSCC in the TCGA and GEO databases, we intersected a total of 15 genes from four gene sets (Figure 4C).
The gene sets of 24 positively and 15 negatively correlated DEGs were subjected to GO and KEGG pathway analyses to further explore their biological functions. For the 24 positively correlated DEGs of the gene sets, a total of 30 GO terms and 9 pathways (P < 0.05) were identified. The most significantly enriched biological process, cellular component, and molecular function of the GO terms were the regulation of the extrinsic apoptotic signalling pathway in the absence of ligand, the laminin complex, and the extracellular matrix structural constituent, respectively. Similarly, the significant pathways were mainly enriched in the ECM-receptor interaction signalling pathway (Supplementary Figure 1A, B). For the 15 negatively correlated DEGs of the gene sets, a total of 30 GO terms and 11 pathways (P < 0.05) were identified. The most significantly enriched biological process, cellular component, and molecular function of the GO terms were complement activation: the alternative pathway, platelet alpha granule lumen, and flavin adenine dinucleotide binding, respectively. Similarly, the significant pathways were mainly enriched in the tyrosine metabolism signalling pathway (Supplementary Figure 1C, D).
The outcomes of the PPI network of positively correlated DEG sets were obtained from STRING online analysis software, and Cytoscape software was used for visualization. The top 10 outstanding proteins (including proteins from similar families of hub genes on the Cytoscape website) were identified as hub genes and selected for further analysis (Figure 5A, B). In addition, the negatively correlated DEG sets were obtained from STRING online analysis software, and Cytoscape software was used for visualization. The top 10 outstanding proteins (including proteins from similar families of hub genes on the Cytoscape website) were identified as hub genes and selected for further analysis (Figure 5C, D).
GEPIA online software was used to perform survival analysis of the 20 potential hub genes of 2 gene sets. The results revealed the significance of 8 potential hub genes as prognostic factors of patients with HNSCC. Specifically, for the positively correlated DEGs, high expression levels of INHBA (P = 0.0011), LAMC2 (P = 0.013), PLAU (P = 0.00049), TGFβ1 (P = 0.013) and TIMP1 (P = 0.042) were strongly associated with poor prognosis. In addition, except for TIMP1 (P > 0.05), the expression levels of INHBA (P < 0.05), LAMC2 (P < 0.05), PLAU (P < 0.05) and TGFβ1 (P < 0.05) were significantly different between tumour and normal samples (Figure 6A). For the negatively correlated DEGs, high expression levels of FAM107A (P = 0.017), PACSIN1 (P = 0.0097) and PTGDS (P = 0.031) were strongly associated with better prognosis. In addition, except for PACSIN1 (P > 0.05), the expression levels of FAM107A (P < 0.05) and PTGDS (P < 0.05) were significantly different between tumour and normal samples (Figure 6B).
To further analyse the protein levels of the hub genes, the HPA database was used to determine the expression levels and locations of the hub genes. Since IHC results were not available for INHBA in the HPA online database, we excluded it. Among the positively correlated gene sets, PLAU and LAMC2 were more strongly expressed in tumour samples than in normal samples. TGFβ1 and TIMP1 expression was negative, and there was no significant difference between tumour and normal samples (Figure 7A). For the negatively correlated gene sets, FAM107A, PACSIN1, and PTGDS expression was negative, and there was no significant difference between tumour and normal samples (Figure 7B). This study mainly focuses on the screening and identification of genes related to a poor prognosis in patients with HNSCC. Based on the above results, PLAU and LAMC2 were selected for further studies.
Intersecting DEGs of different modules in both the TCGA and GEO datasets. (A) Relationship between module membership in the blue and red modules and gene significance for the tumour in the TCGA dataset. (B) Relationship between module membership in the green and black modules and gene significance for the tumour in the GEO dataset. (C) In the left panel, the gene sets were screened out by intersecting the genes positively associated with the tumour and the DEGs. In the right panel, the gene sets were screened out by intersecting the genes negatively associated with the tumour and the DEGs.
Protein-protein interaction network analysis of genes from different gene sets using the STRING online database and visualization using Cytoscape software. (A, B) PPI network analysis and Cytoscape analysis of gene sets that are positively associated with the tumour. (C, D) PPI network analysis and Cytoscape analysis of gene sets negatively associated with the tumour.
The IHC results of 175 HNSCC samples showed that PLAU and LAMC2 were positively expressed, and there was a difference between strong and weak expression in HNSCC samples (Figure 8A).
To further evaluate the impact of PLAU and LAMC2 in HNSCC patients, the association between the expression of PLAU and LAMC2 and patients' clinicopathological parameters was investigated. In total, 175 primary HNSCC patients were divided into two groups according to the expression levels of PLAU and LAMC2. High PLAU and LAMC2 expression was found to be correlated with clinical indicators (Table 1). Furthermore, multivariate Cox regression analysis showed that PLAU and LAMC2 were independent prognostic factors for HNSCC patients (Table S1 and Table S2 and Figure 8B).
A significant positive correlation between PLAU and LAMC2 expression in HNSCC samples was found based on the results of the TCGA database through gene correlation analysis via the cBioPortal database (Figure 8C). A positive correlation between PLAU and LAMC2 expression was also found in 175 HNSCC patients (contingency coefficient = 0.558, P < 0.001, Table S3). In addition, a clear colocalization expression relationship between PLAU and LAMC2 was found by immunofluorescence double-labelling in HNSCC samples (Figure 8D).
Survival analysis and expression levels of hub genes in different gene set analyses by the GEPIA online database. (A) INHBA, LAMC2, PLAU, TGFβ1, and TIMP1 were screened from the set of genes positively associated with tumours, and high expression of all five genes was associated with poor prognosis in HNSCC patients. The expression levels of INHBA, LAMC2, PLAU, and TGFβ1 were significantly higher in tumour tissues than in normal tissues, but there was no significant difference in the expression levels of TIMP1 between tumour tissues and normal tissues. (B) FAM107A, PACSIN1, and PTGDS were screened from the set of genes negatively associated with tumours, and high expression of all five genes was associated with better prognosis in HNSCC patients. The expression levels of FAM107A and PTGDS were significantly lower in tumour tissues than in normal tissues, but there was no significant difference in the expression levels of PACSIN1 between tumour tissues and normal tissues.
HNSCC is one of the most prevalent tumours, with approximately 550,000 new patients diagnosed yearly worldwide [21]. This kind of tumour is characterized by high morbidity, a high risk of recurrence, regional cervical metastasis, and a poor prognosis [22]. Currently, despite progress in treatment intervention and large numbers of basic studies, the prognosis and survival rate have not improved significantly [23]. Increasing research has illustrated that the abnormal expression of genes is one of the risk factors for the occurrence and progression of HNSCC, and some dysregulated genes in HNSCC might be potential candidate biomarkers for prognosis [24]. Therefore, our study aimed to investigate gene expression through bioinformatics analysis of the GSE23036 and TCGA datasets to identify significant molecules that might be used as biomarkers and therapeutic targets in HNSCC.
In the present study, a total of 952 DEGs were identified (293 upregulated and 659 downregulated) in the TCGA dataset, and a total of 570 DEGs were identified (236 upregulated and 334 downregulated) in the GEO dataset. WGCNA was performed for modular gene analysis of the DEGs in the 2 databases. In the TCGA and GEO datasets, 4 highly relevant expression patterns were identified, which are represented in the blue, red, green, and black modules. In the TCGA database, the blue module (positively correlated with tumour status) and the red module (negatively correlated with tumour status) were selected for further analysis.
Immunohistochemical expression of PLAU and LAMC2 in samples from 175 patients with HNSCC according to clinical data and follow‑up.
Variable | LAMC2 | P | PLAU | P | |||
---|---|---|---|---|---|---|---|
Weak | Strong | Weak | Strong | ||||
Gender | 0.951 | 0.694 | |||||
Male | 51(43.6) | 66(56.4) | 48(41) | 69(59) | |||
Female | 25(43.1) | 33(56.9) | 22(37.9) | 36(62.1) | |||
Age | 0.001** | 0.000 *** | |||||
<60 years | 35(60.3) | 23(39.7) | 34(58.6) | 24(41.4) | |||
≥60 years | 41(35) | 76(65) | 36(30.8) | 81(69.2) | |||
Differentiation | 0.206 | 0.409 | |||||
Poor- Moderately | 25(51) | 24(49) | 22(44.9) | 27(55.1) | |||
Well | 51(40.5) | 75(59.5) | 48(38.1) | 78(61.9) | |||
T Stage | 0.001** | 0.001** | |||||
T1-2 | 58(53.2) | 51(46.8) | 54(49.5) | 55(50.5) | |||
T3-4 | 18(27.3) | 48(72.7) | 16(24.2) | 50(75.8) | |||
N Stage | 0.005** | 0.033* | |||||
N0 | 32(59.3) | 22(40.7) | 28(51.9) | 26(48.1) | |||
N+ | 44(36.4) | 77(63.6) | 42(34.7) | 79(65.3) | |||
Clinical Stage | 0.002** | 0.025* | |||||
I-II | 27(64.3) | 15(35.7) | 23(54.8) | 19(45.2) | |||
III-IV | 49(36.8) | 84(63.2) | 47(35.3) | 86(64.7) | |||
Recurrence | 0.065 | 0.663 | |||||
No | 49(49.5) | 50(50.5) | 41(41.4) | 58(58.6) | |||
Yes | 27(35.5) | 49(64.5) | 29(38.2) | 47(61.8) | |||
Living Status | 0.001** | 0.002** | |||||
Living | 65(50.8) | 63(49.2) | 60(46.9) | 68(53.1) | |||
Dead | 11(23.4) | 36(76.6) | 10(21.3) | 37(78.7) |
*P < 0.05, **P < 0.01, ***P < 0.001.
Differential expression levels of hub genes in tumour and normal tissues in two gene sets analysed by the HPA online database. (A) The expression levels of LAMC2, PLAU, TGFβ1 and TIMP1 in tumour and normal tissues. (B) The expression levels of FAM107A, PACSIN1 and PTGDS in tumour and normal tissues.
Correlation analysis of the differential expression levels of PLAU and LAMC2 with clinicopathological parameters in HNSCC patients. (A) The expression levels of PLAU and LAMC2 in 175 HNSCC patients. (B) Prognosis analysis between the differential expression levels of PLAU and LAMC2 in 175 patients with HNSCC. (C) Significant positive correlation between PLAU and LAMC2 expression in HNSCC samples according to the cBioPortal database. (D) Colocalization of PLAU and LAMC2 expression in HNSCC samples verified by immunofluorescence double-labelling.
In addition, in the GEO database, the green module (positively correlated with tumour status) and the black module (negatively correlated with tumour status) were selected for further analysis. Twenty-four genes positively correlated with tumour status and 15 genes negatively correlated with tumour status were subjected to GO and KEGG analyses. In the set of genes positively correlated with tumour status, GO term analysis showed that the hub genes were mainly enriched in the regulation of the extrinsic apoptotic signalling pathway in the absence of ligands, laminin complexes, and extracellular matrix structural constituents. KEGG pathway analysis revealed that the hub genes were mainly enriched in ECM-receptor interaction. Debodipta Das et al [25] found that the ECM-receptor pathway was significantly enriched in HNSCC patients, and the expression of some key genes (COL4A1, COL4A2, COL4A6, and LAMC2) in this pathway may be used as prognostic biomarkers for HNSCC. In the set of genes negatively correlated with tumour status, GO term analysis showed that the hub genes were mainly enriched in complement activation: the alternative pathway, platelet alpha granule lumen, and flavin adenine dinucleotide binding. KEGG pathway analysis revealed that the hub genes were mainly enriched in tyrosine metabolism. Brandon Leonard et al [26] showed that the tyrosine metabolism pathway was associated with chemotherapeutic resistance to cetuximab in HNSCC.
Moreover, according to the PPI network and Cytoscape analysis, we identified 20 hub genes in the 2 gene sets. Based on the prognosis analysis using the GEPIA online analysis tool, high expression levels of INHBA, LAMC2, PLAU, TGFβ1, and TIMP1 were strongly associated with poor prognosis in HNSCC patients. In addition, high expression levels of FAM104A, PACSIN1, and PTGDS were strongly associated with a better prognosis in HNSCC patients. Moreover, we used the HPA online database to determine the expression levels of 8 genes in HNSCC and normal samples. Finally, we found that the expression of PLAU and LAMC2 was consistent with the prognostic analysis results. LAMC2 is overexpressed in oesophageal squamous cell carcinoma (ESCC) samples and facilitates ESCC progression and metastasis, which is correlated with the poor prognosis of ESCC patients [27], however, we did not find any relevant high-quality research studies on LAMC2 in HNSCC. PLAU may be an independent prognostic biomarker for HNSCC, and downregulation of miR-23b-3p contributes to the tumorigenic effect of PLAU in HNSCC [28]. In our study, PLAU and LAMC2 were first screened out by bioinformatics methods, and we found that PLAU and LAMC2 were highly expressed in HNSCC and were associated with poor prognosis. Therefore, we included the two genes in the subsequent analysis. IHC analyses of 175 HNSCC samples showed that PLAU and LAMC2 were positively expressed in HNSCC samples and that high expression levels of PLAU and LAMC2 were associated with age, T stage, N stage, clinical stage, and survival status. Moreover, univariate and multivariate Cox regression analyses demonstrated that high expression levels of PLAU and LAMC2 might be risk factors for poor prognosis in HNSCC, and the expression levels of PLAU and LAMC2 were positively correlated in HNSCC samples.
In conclusion, the present study successfully screened out PLAU and LAMC2, and comprehensive analysis showed that high expression of both PLAU and LAMC2 might predict a poor prognosis in patients with HNSCC.
Supplementary tables and figure.
JBT and ZCG led the team and were responsible for whole project administration. SLJ and SJN carried out all the experiments and wrote the manuscript. HC and YLX contributed to sample collection and validated the results. ZCG and SLJ analysed data and provided advice. All authors have read and approved the manuscript.
The datasets used and analysed during the current study are available from the corresponding author on reasonable request.
The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of College of Stomatology affiliated Xi'an Jiaotong University (approval no. xjkqll[2022]NO.028). Written informed consent was obtained from all patients for use of tissue sample. The TNM stage, clinicopathological classification and tumour stage of patients with HNSCC were assessed according to the American Joint Committee on Cancer (AJCC) guidelines.
The authors have declared that no competing interest exists.
1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394-424
2. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer Statistics, 2021. CA Cancer J Clin. 2021;71:7-33
3. von Witzleben A, Wang C, Laban S, Savelyeva N, Ottensmeier CH. HNSCC: Tumour Antigens and Their Targeting by Immunotherapy. CELLS-BASEL. 2020;9:2103-32
4. Yao Y, Yan Z, Lian S. et al. Prognostic value of novel immune-related genomic biomarkers identified in head and neck squamous cell carcinoma. J IMMUNOTHER CANCER. 2020;8:e000444-55
5. Chen Y, Li ZY, Zhou GQ, Sun Y. An Immune-Related Gene Prognostic Index for Head and Neck Squamous Cell Carcinoma. CLIN CANCER RES. 2021;27:330-41
6. Tao W, Tian G, Xu S, Li J, Zhang Z, Li J. NAT10 as a potential prognostic biomarker and therapeutic target for HNSCC. CANCER CELL INT. 2021;21:413
7. Zhao XT, Zhu Y, Zhou JF, Gao YJ, Liu FZ. Development of a novel 7 immune-related genes prognostic model for oral cancer: A study based on TCGA database. ORAL ONCOL. 2021;112:105088
8. Huang GZ, Lu ZY, Rao Y, Gao H, Lv XZ. Screening and identification of autophagy-related biomarkers for oral squamous cell carcinoma (OSCC) via integrated bioinformatics analysis. J CELL MOL MED. 2021;25:4444-54
9. Zeng Y, Li N, Liu W, Zeng M, Cheng J, Huang J. Analyses of expressions and prognostic values of Polo-like kinases in non-small cell lung cancer. J Cancer Res Clin Oncol. 2020;146:2447-60
10. Lydiatt WM, Patel SG, O'Sullivan B. et al. Head and Neck cancers-major changes in the American Joint Committee on Cancer eighth edition cancer staging manual. CA Cancer J Clin. 2017;67:122-37
11. Danaher P, Warren S, Lu R. et al. Pan-cancer adaptive immune resistance as defined by the Tumor Inflammation Signature (TIS): results from The Cancer Genome Atlas (TCGA). J IMMUNOTHER CANCER. 2018;6:63
12. Zhou C, Ye M, Ni S. et al. DNA methylation biomarkers for head and neck squamous cell carcinoma. EPIGENETICS-US. 2018;13:398-409
13. Kim H, Zeng P, Shaikh MH. et al. All HPV-negative head and neck cancers are not the same: Analysis of the TCGA dataset reveals that anatomical sites have distinct mutation, transcriptome, hypoxia, and tumor microenvironment profiles. ORAL ONCOL. 2021;116:105260
14. Jin Y, Qin X. Comprehensive analysis of transcriptome data for identifying biomarkers and therapeutic targets in head and neck squamous cell carcinoma. Ann Transl Med. 2020;8:282
15. Yang Z, Liang X, Fu Y. et al. Identification of AUNIP as a candidate diagnostic and prognostic biomarker for oral squamous cell carcinoma. EBIOMEDICINE. 2019;47:44-57
16. Zhang S, Li C, Zhang Z. et al. Analysis of differentially expressed genes in oral epithelial cells infected with Fusobacterium nucleatum for revealing genes associated with oral cancer. J CELL MOL MED. 2021;25:892-904
17. Song Y, Wang X, Wang F. et al. Identification of four genes and biological characteristics of esophageal squamous cell carcinoma by integrated bioinformatics analysis. CANCER CELL INT. 2021;21:123
18. Thul PJ, Lindskog C. The human protein atlas: A spatial map of the human proteome. PROTEIN SCI. 2018;27:233-44
19. Guo ZC, Jumatai S, Jing SL, Hu LL, Jia XY, Gong ZC. Bioinformatics and immunohistochemistry analyses of expression levels and clinical significance of CXCL2 and TANs in an oral squamous cell carcinoma tumor microenvironment of Prophyromonas gingivalis infection. ONCOL LETT. 2021;21:189
20. Xue Y, Du HD, Tang D. et al. Correlation Between the NLRP3 Inflammasome and the Prognosis of Patients With LSCC. FRONT ONCOL. 2019;9:588
21. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. 2020;70:7-30
22. Posner M, Vermorken JB. Induction therapy in the modern era of combined-modality therapy for locally advanced head and neck cancer. SEMIN ONCOL. 2008;35:221-28
23. Cohen RB. Current challenges and clinical investigations of epidermal growth factor receptor (EGFR)- and ErbB family-targeted agents in the treatment of head and neck squamous cell carcinoma (HNSCC). CANCER TREAT REV. 2014;40:567-77
24. Ouyang F, Liu J, Xia M. et al. GINS2 is a novel prognostic biomarker and promotes tumor progression in early-stage cervical cancer. ONCOL REP. 2017;37:2652-62
25. Das D, Maitra A, Panda CK. et al. Genes and pathways monotonically dysregulated during progression from normal through leukoplakia to gingivo-buccal oral cancer. NPJ GENOM MED. 2021;6:32
26. Leonard B, Brand TM, O'Keefe RA. et al. BET Inhibition Overcomes Receptor Tyrosine Kinase-Mediated Cetuximab Resistance in HNSCC. CANCER RES. 2018;78:4331-43
27. Liang Y, Chen X, Wu Y. et al. LncRNA CASC9 promotes esophageal squamous cell carcinoma metastasis through upregulating LAMC2 expression by interacting with the CREB-binding protein. CELL DEATH DIFFER. 2018;25:1980-95
28. Huo Z, Li X, Zhou J, Fan Y, Wang Z, Zhang Z. Hypomethylation and downregulation of miR-23b-3p are associated with upregulated PLAU: a diagnostic and prognostic biomarker in head and neck squamous cell carcinoma. CANCER CELL INT. 2021;21:564
Corresponding authors: Jun-bo Tu, Department of Oral and Maxillofacial Surgery, College of Stomatology, Xi'an Jiaotong University, No. 98 Xiwu Road, Xi'an 710004, China, Tel: 86-29-87215366; E-mail: tujunboedu.cn; ORCID: 0000-0002-8733-0824. Si-jia Na, Department of Oral and Maxillofacial Surgery, College of Stomatology, Xi'an Jiaotong University, No. 98 Xiwu Road, Xi'an 710004, China, Tel:86-29-87215366; E-mail: sijianaedu.cn.
Received 2023-3-17
Accepted 2023-5-12
Published 2023-6-4