Identification of diagnostic markers pyrodeath-related genes in non-alcoholic fatty liver disease based on machine learning and experiment validation.

Lei L^{1,2}, Li J², Liu Z², Zhang D²

Affiliations

1. Department of Geriatric Medicine, The Affiliated Hospital of Guilin Medical University, Guilin, 541001, Guangxi, China.
2. Division of Hepatobiliary Surgery, The Affiliated Hospital of Guilin Medical University, Guilin, 541001, Guangxi, China.
3. Department of Gastrointestinal Surgery, The Affiliated Hospital of Guilin Medical University, Guilin, 541001, Guangxi, China.
4. Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital of Guilin Medical University, Guilin, 541002, Guangxi, China.

Scientific Reports, 26 Oct 2024, 14(1):25541
https://doi.org/10.1038/s41598-024-77409-3 PMID: 39462099 PMCID: PMC11513955

Abstract

Non-alcoholic fatty liver disease (NAFLD) poses a global health challenge. While pyroptosis is implicated in various diseases, its specific involvement in NAFLD remains unclear. Thus, our study aims to elucidate the role and mechanisms of pyroptosis in NAFLD. Utilizing data from the Gene Expression Omnibus (GEO) database, we analyzed the expression levels of pyroptosis-related genes (PRGs) in NAFLD and normal tissues using R packages. We investigated protein interactions, correlations, and functional enrichment of these genes. Key genes were identified using multiple machine learning techniques. Immune infiltration analyses were conducted to discern differences in immune cell populations between NAFLD patients and controls. Key gene expression was validated using a cell model. Analysis of GEO datasets, comprising 206 NAFLD samples and 10 controls, revealed two key PRGs (TIRAP and GSDMD). Combining these genes yielded an area under the curve (AUC) of 0.996 for diagnosing NAFLD. In an external dataset, the AUC for the two key genes was 0.825. Nomogram, decision curve, and calibration curve analyses further validated their diagnostic efficacy. These genes were implicated in multiple pathways associated with NAFLD progression. Immune infiltration analysis showed significantly lower numbers of various immune cell types in NAFLD patient samples compared to controls. Single sample gene set enrichment analysis (ssGSEA) was employed to assess the immune microenvironment. Finally, the expression of the two key genes was validated in a cell model of NAFLD using qRT-PCR. We developed a prognostic model for NAFLD based on two PRGs, demonstrating robust predictive efficacy. Our findings enhance the understanding of pyroptosis in NAFLD and suggest potential avenues for therapeutic exploration.


Liping Lei,^#^1,² Jixue Li,^#² Zirui Liu,^#² Dongdong Zhang,² Zihan Liu,² Qing Wang,² Yi Gao,³ Biwen Mo,⁴ and Jiangfa Li^2,^5,⁶

Associated Data

Supplementary Materials:
Supplementary Material 1: 41598_2024_77409_MOESM1_ESM.docx (13K)
Supplementary Material 2: 41598_2024_77409_MOESM2_ESM.xlsx (241K)
Supplementary Material 3: 41598_2024_77409_MOESM3_ESM.xlsx (158K)
Supplementary Material 4: 41598_2024_77409_MOESM4_ESM.xlsx (23K)
Supplementary Material 5: 41598_2024_77409_MOESM5_ESM.xlsx (80K)
Supplementary Material 6: 41598_2024_77409_MOESM6_ESM.xlsx (10K)
Supplementary Material 7: 41598_2024_77409_MOESM7_ESM.xlsx (31K)
Supplementary Material 8: 41598_2024_77409_MOESM8_ESM.xlsx (2.2M)

Data Availability Statement: “The datasets in this study were enrolled from the GEO database (https://www.ncbi.nlm.nih.gov/geo/), with the following data accessions enrolled: GSE135251 and GSE89632. The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.”

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-024-77409-3.

Keywords: Non-alcoholic fatty liver disease, Pyroptosis, Diagnostic, Immune infiltration, Machine learning

Subject terms: Genomics, Computational biology and bioinformatics, Biomarkers, Gastroenterology

Introduction

Non-alcoholic fatty liver disease (NAFLD), representing the most prevalent liver condition, emerges as a significant global health dilemma, implicating up to a quarter of the adult populace worldwide. This disease not only underscores the escalating burden on healthcare infrastructures globally but also highlights an alarming trend of increased incidences among the pediatric demographic^¹. NAFLD is defined by the deposition of fat exceeding 5% in liver cells, in scenarios devoid of substantial alcohol intake or other secondary hepatic steatosis triggers such as obesity, dyslipidemia, type 2 diabetes, and assorted metabolic disorders^². NAFLD embodies a spectrum of hepatic abnormalities ranging from mere fat aggregation (benign steatosis) to more severe forms including inflammation (Non-alcoholic steatohepatitis [NASH]), fibrosis, cirrhosis, and ultimately, hepatocellular carcinoma^³,⁴. In the context of immune response, inflammation fundamentally serves as a protective measure against external pathogens^⁵. Nonetheless, it has been demonstrated through prior research that inflammation, particularly when provoked by immune cytokines, can inflict significant tissue damage^⁶.

Pyroptosis is a form of programmed cell death induced by a typical or atypical inflammasome, which is morphologically manifested as cell swelling and subsequent lysis, ultimately resulting in the release of intracellular contents^⁷,⁸. Pyroptosis is regulated by unique sets of critical inflammatory caspases that coordinate biological effects^⁹,¹⁰. Inflammasomes are multiprotein complexes that can sense danger signals and activate caspase-1 to mediate pro-inflammatory cytokine release and pyroptotic cell death. Inflammasome activation is triggered by two main signaling pathways, canonical and non-canonical^¹¹. Pyroptosis is involved in many pathophysiological processes^{⁸,¹²,¹³}, including liver fibrogenesis from various pathologies^¹⁴,¹⁵.

There is a relative scarcity of research delving into the connection between pyroptosis metabolism and the underlying pathophysiology of NAFLD. To address this, we extensively investigated the expression, diagnosis, immune correlation, and mechanism of pyroptosis-related genes (PRGs) in NAFLD. Utilizing NAFLD-related data downloaded from the Gene Expression Omnibus (GEO) database, we conducted a series of analyses, including machine learning, to elucidate the relationship between PRGs and NAFLD. This process allowed us to identify differentially expressed genes, pinpoint key genes among them, and construct a prediction model that was subsequently externally validated. In the final stages of our study, we carried out immune infiltration analyses, evaluated related drugs, and explored associated competing endogenous RNAs (ceRNAs).

Materials and methods

Patients and datasets

The transcriptomic analysis of NAFLD and normal liver specimens included GSE135251, downloaded from the GEO database (53 steatosis, 153 NASH, and 10 control samples). The flow chart of this study is shown in Fig. 1. Valid genes were identified in the following steps. Firstly, genes with missing values were deleted. Secondly, genes with multiple duplicate values were averaged. Furthermore, if a gene had a value of 0 in half or more of the samples, the gene was deleted. The remaining valid genes were then used for differential expression analysis.
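The three filtering steps can be illustrated with a minimal sketch. It is written in plain Python rather than R, and the function name `filter_genes`, the use of `None` for a missing value, and the toy expression values are assumptions made for illustration only; they are not taken from the study data.

```python
from collections import defaultdict
from statistics import mean


def filter_genes(rows):
    """rows: list of (gene_symbol, [value per sample]); None marks a missing value."""
    # Step 1: delete rows that contain any missing value.
    complete = [(gene, vals) for gene, vals in rows
                if all(v is not None for v in vals)]

    # Step 2: average duplicate rows of the same gene, sample by sample.
    grouped = defaultdict(list)
    for gene, vals in complete:
        grouped[gene].append(vals)
    averaged = {gene: [mean(col) for col in zip(*dups)]
                for gene, dups in grouped.items()}

    # Step 3: delete genes whose value is 0 in half or more of the samples.
    return {gene: vals for gene, vals in averaged.items()
            if sum(1 for v in vals if v == 0) < len(vals) / 2}


# Toy input (hypothetical values, four samples per gene).
toy = [
    ("GSDMD", [5.0, 6.0, 7.0, 8.0]),
    ("GSDMD", [7.0, 6.0, 9.0, 8.0]),   # duplicate row, averaged with the first
    ("TIRAP", [1.0, None, 2.0, 3.0]),  # missing value, deleted
    ("PJVK", [0.0, 0.0, 4.0, 5.0]),    # zero in half of the samples, deleted
    ("IL1B", [0.0, 2.0, 3.0, 4.0]),    # zero in one sample only, kept
]
filtered = filter_genes(toy)
```

The order of the steps follows the text: missing values are handled before duplicates are averaged, so a duplicate row with a missing value does not contribute to the average.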

Fig. 1

Flowchart of the present study.

Expression of DEGs and PRGs in NAFLD

We utilized the R package “limma” to conduct a differential expression analysis based on the processed data from the GEO database. This analysis allowed us to identify differentially expressed genes (DEGs) between NAFLD samples and healthy controls. The results were visualized as a volcano plot and a heatmap using the “ggplot2” and “heatmap” R packages. The screening criterion was an adjusted P<0.05. The intersection of the DEGs with the PRGs was obtained using the “VennDiagram” R package and defined as DE-PRGs for subsequent analysis; that is, the Venn diagram was used to identify the PRGs among the DEGs between NAFLD and healthy samples. The “ggpubr” R package was used to build a boxplot representing the differential expression of the DE-PRGs between the NAFLD and normal groups. The R package “heatmap” was used to generate a heatmap of the DE-PRGs. Then, we used multivariate analysis to screen for important DE-PRGs.
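The screening and intersection steps can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: it assumes the adjusted P values are Benjamini-Hochberg adjusted (the text does not name the adjustment method), it is written in plain Python instead of using “limma”, and the gene names `GENE_B` to `GENE_D`, the raw P values, and the PRG set are hypothetical.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the adjusted
    # values stay monotone in the rank of the raw p-values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical genes and raw p-values (not study data).
genes = ["GSDMD", "GENE_B", "GENE_C", "GENE_D"]
raw_p = [0.01, 0.04, 0.03, 0.20]
adjusted = bh_adjust(raw_p)

# DEGs: genes passing the adjusted P < 0.05 criterion.
degs = {g for g, q in zip(genes, adjusted) if q < 0.05}

# DE-PRGs: intersection of the DEGs with a (hypothetical) PRG list.
prgs = {"GSDMD", "GENE_D"}
de_prgs = degs & prgs
```

In this toy input only one gene survives the adjustment, which shows why a raw P value below 0.05 (for example 0.03 or 0.04 here) does not guarantee that a gene is retained.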

Correlation analysis and protein‒protein interaction (PPI) network construction

The PPI network of the DE-PRGs was constructed using the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING; https://string-db.org/) database. The “ggplot2” package of R (version 3.3.6) was used to carry out pairwise correlation analysis of the variables in the data, and the results were visualized as a heatmap.

Gene ontology (GO) and KEGG pathway enrichment analysis

To analyze the biological function of genes, we employed the “clusterProfiler” package in R, which facilitated the enrichment analysis of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways^{¹⁶–¹⁸}. The GO annotation encompassed three domains: biological processes (BP), cellular components (CC), and molecular functions (MF).

Three machine learning methods to identify key genes

The “glmnet” package in R was employed to perform least absolute shrinkage and selection operator (LASSO) regression on the selected linear model, a method that reduces data dimensionality while retaining valuable variables^¹⁹,²⁰. The principle of recursive feature elimination (RFE) is to iteratively build the model and then select the best (or worst) feature, determined by the coefficient^²¹. The sequential backward selection algorithm, known as support vector machine recursive feature elimination (SVM-RFE), is based on the maximum margin principle of the support vector machine (SVM)^²²–²⁴. SVM-RFE is a supervised machine learning model that differentiates between positive and negative instances by removing the feature vector generated by SVM. SVM is a powerful supervised learning algorithm used for classification and regression tasks. The primary goal of SVM is to find the optimal hyperplane that separates data points of different classes with the maximum margin. In bioinformatics, SVM is used for gene expression data analysis, protein classification, and cancer classification. SVM can extract features from complex biological data for effective classification prediction. RFE systematically removes features to improve the model’s performance, thus helping to identify a subset of genes that are most informative for distinguishing between classes. We used the “e1071” package in R to screen the most valuable genes for the SVM-RFE model construction^²⁵,²⁶. The SVM-RFE method was utilized to determine the optimal variables by searching for the point corresponding to the minimum cross-validation error. A random forest (RF) algorithm was also used to select the optimal genes. RF is a regression tree technique that leverages bootstrap aggregation and randomization of predictors to achieve high prediction accuracy^²⁷. The “randomForest” R package was utilized for RF. The genes selected by these three machine learning methods were intersected to obtain the final key genes.
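The final intersection step is simple enough to show directly. In this minimal sketch the three input sets stand in for the outputs of the three methods; the names `GENE_X`, `GENE_Y`, and `GENE_Z` are hypothetical placeholders, because the full per-method gene lists are not given in the text.

```python
# Hypothetical per-method selections; only TIRAP and GSDMD are named in the paper.
lasso_genes = {"TIRAP", "GSDMD", "GENE_X", "GENE_Y"}
svm_rfe_genes = {"TIRAP", "GSDMD"}
rf_genes = {"TIRAP", "GSDMD", "GENE_X", "GENE_Y", "GENE_Z"}

# A gene is retained only if all three methods selected it.
key_genes = lasso_genes & svm_rfe_genes & rf_genes
```

Requiring agreement across all three methods is a conservative design choice: the result can never be larger than the smallest of the three selections.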

The establishment and verification of the DE-PRGs diagnostic model

In the construction of the DE-PRGs diagnostic model, we utilized the GSE135251 dataset for model construction. Initially, we utilized the “rms” package in R to construct a nomogram model to predict the occurrence of NAFLD. The “pROC” package in R^²⁸ was employed to analyze the area under the curve (AUC), specificity, and sensitivity of diagnostic value for marker genes using a time-dependent ROC. Each central gene was assigned a score, which was then aggregated to form a total score. The external dataset, GSE89632, was used to verify the diagnostic capability of the model. The GSE89632 dataset included 20 cases of simple steatosis, 19 cases of nonalcoholic steatohepatitis, and 24 healthy controls.
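As a reminder of what the AUC measures, the sketch below computes it as the probability that a randomly chosen NAFLD sample receives a higher score than a randomly chosen control, with ties counted as one half. This rank-based definition is equivalent to the area under the empirical ROC curve; the code is plain Python rather than “pROC”, and the scores are hypothetical.

```python
def auc(scores_pos, scores_neg):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0   # positive sample scored higher: correct ordering
            elif p == n:
                wins += 0.5   # tie: counted as half
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical model scores for three NAFLD samples and two controls.
naflD_scores = [0.9, 0.8, 0.4]
control_scores = [0.5, 0.3]
example_auc = auc(naflD_scores, control_scores)
```

An AUC of 0.5 corresponds to random ordering and 1.0 to perfect separation of the two groups.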

Gene set enrichment analysis (GSEA) and gene set variation analysis (GSVA)

We conducted GSEA using the “clusterProfiler” package to explore the potential functions of the hub genes^²⁹. Additionally, we executed differential gene expression and pathway enrichment analyses using the “GSVA,” “clusterProfiler,” and “limma” packages. Enrichment results with a p-value less than 0.05 were considered statistically significant.

Assessment of Immune Infiltration

The “GSVA” package in R was utilized to apply a single-sample GSEA (ssGSEA) algorithm for determining enrichment scores between diverse immune cells, functions, or pathways in NAFLD and control groups. Reference gene collections were obtained from a public database (http://www.immport.org). The association between the four pivotal genes and the immune score was investigated using Spearman correlation analysis. The Wilcoxon test was employed to examine differences in immune cell and immune-related functional enrichment scores.
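Spearman correlation, as used here between gene expression and immune scores, is the Pearson correlation of the ranks of the two variables. The following plain-Python sketch shows the computation with average ranks for ties; the function names and the example vectors are illustrative assumptions, not study data.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson correlation applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical expression values and immune scores for five samples.
expression = [1.0, 2.0, 3.0, 4.0, 5.0]
immune_score = [2.0, 1.0, 4.0, 3.0, 5.0]
rho = spearman(expression, immune_score)
```

Because only ranks enter the calculation, any monotone transformation of either variable (for example a log transform of expression) leaves rho unchanged.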

Investigation of drug-gene interactions and construction of ceRNA network

The drug-gene interaction database (DGIdb, https://dgidb.genome.wustl.edu/) was used to probe drug-gene interactions^³⁰. The miRNA-mRNA relationship could be predicted by leveraging the two key genes in miRDB (http://www.mirdb.org/) and TargetScan (https://www.targetscan.org/vert_80). Direct interaction evidence between the miRNA and lncRNA was gathered using SpongeScan (http://spongescan.rc.ufl.edu/). The Cytoscape software (version 3.9.0) was used to visualize mRNA‒miRNA–lncRNA interactions through the ceRNA network.

Cell culture and treatments

The Human Liver-7702 (HL-7702) cell line was supplied by Cybkon Biotechnology Co., LTD. (Shanghai, China, Item number: iCell-h054). The complete culture medium for the HL-7702 cell line consisted of DMEM/F-12 (1:1) (Gibco, 11330-032), 89 mL; ITS liquid medium (Sigma, I3146), 1 mL; dexamethasone (Sigma, D4902-100 mg), 40 ng/mL; and FBS (Gibco), 10 mL. Cells were cultured in a cell incubator at 37 °C with 5% CO2. When the cells reached 60–70% confluence, they were divided into 2 groups (n=3): (1) control group (NS treatment for 24 h); (2) NAFLD group (cells treated with 1 mM oleic acid (OA; Sigma, USA) for 24 h). Cell condition was confirmed by Oil Red O staining.

Quantitative reverse transcription-polymerase chain reaction (qRT–PCR)

The RNA was extracted from the HL-7702 cell line with TRIzol reagent (VAZYME, China). RNA was extracted and eluted through an RNA binding column, yielding purified total RNA samples. The first strand cDNA synthesis kit was used for cDNA reverse transcription. SYBR green qRT–PCR premix was employed for qRT–PCR. The expression levels of the target genes were normalized and analyzed in relation to GAPDH expression. The PrimeScript™ RT Reagent Kit (VAZYME, China) was used for RNA reverse transcription, and qRT‒PCR was carried out with an FX Connect system (VAZYME, China) and SYBR® Green Supermix (VAZYME, China). qRT‒PCR was performed in triplicate. The primers used in this study are listed in Supplementary Table S1.
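Normalization to a reference gene such as GAPDH is commonly done with the 2^(-ΔΔCt) method. The text does not state which formula was used, so the sketch below is an assumption offered for orientation only; the function name and the Ct values are hypothetical.

```python
from statistics import mean


def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_control, ct_ref_control):
    """Fold change by the 2^(-ddCt) method (assumed; not stated in the text)."""
    # dCt: target Ct minus reference-gene Ct, using the mean of the replicates.
    d_ct_treated = mean(ct_target_treated) - mean(ct_ref_treated)
    d_ct_control = mean(ct_target_control) - mean(ct_ref_control)
    # ddCt: treated group relative to control group.
    dd_ct = d_ct_treated - d_ct_control
    return 2 ** (-dd_ct)


# Hypothetical triplicate Ct values (target gene vs. GAPDH).
fold_change = relative_expression(
    ct_target_treated=[24.0, 24.0, 24.0],
    ct_ref_treated=[18.0, 18.0, 18.0],
    ct_target_control=[25.0, 25.0, 25.0],
    ct_ref_control=[18.0, 18.0, 18.0],
)
```

A lower Ct means more template, so a target that crosses the threshold one cycle earlier in the treated group, with unchanged GAPDH, corresponds to a twofold increase.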

Statistical analyses

Continuous variables are expressed as mean±standard deviation. The Student’s t-test was used for comparing two groups, while the Wilcoxon rank-sum test was used for analyzing non-normally distributed variables. A p-value less than 0.05 was deemed to indicate a significant difference. The symbols *, **, and *** denote p-values less than 0.05, 0.01, and 0.001, respectively. All statistical analyses were conducted using R software (version 4.2.1).
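The mapping from p-values to the significance symbols defined above can be written as a small helper. The function name is a hypothetical choice for this sketch.

```python
def significance_stars(p):
    """Return the symbol for a p-value using the thresholds 0.05, 0.01, 0.001."""
    if p < 0.001:
        return "***"
    if p < 0.01:
        return "**"
    if p < 0.05:
        return "*"
    return ""  # not significant at the 0.05 level
```

The thresholds are checked from the strictest to the most lenient so that each p-value receives only its strongest symbol.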

Results

Identification of DE-PRGs associated with NAFLD

Using the “limma” package, 8586 DEGs (adj.p<0.05) were identified from the GSE135251 dataset consisting of 206 NAFLD and 10 control samples, as shown in Supplementary Table S2. Of these, 895 genes were up-regulated, and 1364 were down-regulated (Supplementary Table S3). The volcano plot of the differentially expressed genes is shown in Fig. 2A, and the heatmap of the top 50 genes in NAFLD and control samples is displayed in Fig. 2B. The top 50 genes included the 25 DEGs with the largest positive logFC values and the 25 DEGs with the largest absolute negative logFC values. Additionally, 33 PRGs^³¹ overlapped with the 8586 DEGs, revealing 10 DE-PRGs with significant differences between the NAFLD and control groups (Fig. 2C, Supplementary Table S4). Eight DE-PRGs (CASP3, CASP4, CASP8, CASP9, GSDMD, PLCG1, TIRAP, TNF) were highly expressed and two DE-PRGs (IL1B and PJVK) were expressed at low levels in NAFLD (Fig. 2D, Supplementary Table S5), and the heatmap of these 10 DE-PRGs is shown in Fig. 2E.

Fig. 2

Identification of DE-PRGs in NAFLD. (A) Volcano plot of DEGs between NAFLD and control samples. (B) Heatmaps of the top 50 genes are presented. The top 50 genes included the top 25 DEGs with the largest values for positive logFC and the top 25 DEGs with the largest absolute values for negative logFC. (C) Venn diagrams showing the intersection between DEGs and PRGs. (D) Ten DE-PRGs are presented with the boxplots illustrating the differential expression between NAFLD and control samples. (E) Heatmap showing the expression patterns of these 10 DE-PRGs. P values are displayed as follows: *p<0.05; **p<0.01; ***p<0.001. DEGs differentially expressed genes, DE-PRGs differentially expressed pyroptosis-related genes, PRGs pyroptosis-related genes, NAFLD nonalcoholic fatty liver disease.

A PPI analysis using STRING was conducted to explore potential interactions among these 10 DE-PRGs (Fig. 3A). The correlation among the 10 DE-PRGs is shown in Fig. 3B. GO enrichment analysis revealed that the DE-PRGs were related to response to lipopolysaccharide (LPS), molecule of bacterial origin, cobalt ion, and NF-kappaB signaling in BP; inflammasome complex, membrane raft, and membrane microdomain in CC; and cysteine-type endopeptidase activity and cytokine receptor binding in MF (Fig. 3C, Supplementary Table S6). KEGG pathway analysis showed involvement in Pathogenic Escherichia coli infection, lipid and atherosclerosis, liver disease, and NF-kappa B signaling pathway (Fig. 3C, Supplementary Table S6).

Fig. 3

PPI, GO, and KEGG analysis of 10 DE-PRGs. (A) Gene relationship network diagram of the 10 DE-PRGs. (B) Correlation analysis of the 10 DE-PRGs was conducted, with orange and blue representing positive and negative correlations, respectively. (C) GO and KEGG analysis of 10 DE-PRGs. P values are displayed as follows: *p<0.05. DEGs differentially expressed genes, DE-PRGs: differentially expressed pyroptosis-related genes, BP biological processes, CC cellular components, MF molecular functions, GO gene ontology, KEGG Kyoto Encyclopedia of Genes and Genomes.

Identification of diagnostic marker genes for NAFLD

Considering the individual complexity and heterogeneity of NAFLD patients and healthy controls, candidate key genes were identified from the 10 DE-PRGs using LASSO regression and two validated machine learning models (SVM-RFE and RF), which aided in predicting NAFLD diagnosis. Two features were identified by SVM-RFE (Fig. 4A and B). Four DE-PRGs were identified by the LASSO logistic regression algorithm; the coefficients of these four genes were nonzero in the LASSO regression model (Fig. 4C and D, Supplementary Table S7). All ten DE-PRGs were analyzed with RF, and five of them were selected (Fig. 4E). A Venn diagram was used to intersect the essential genes from the LASSO, SVM-RFE, and RF analyses, identifying two key genes (TIRAP and GSDMD) for further analysis (Fig. 4F).

Fig. 4

Machine learning identification of diagnostic marker genes for NAFLD. The accuracy and error rate of feature selection of the SVM algorithm reached the lowest cross-validation error of 0.02% (A) and the peak accuracy of 0.98% (B) when 2 genes were selected. (C) LASSO coefficient analysis. (D) Diagnostic performance of LASSO model. (E) Random forest analysis was used for 10 DE-PRGs, and 5 genes were included, with an accuracy of 0.99. (F) Venn diagram showing overlapping genes obtained using the three machine learning algorithms (SVM, LASSO, and RF). SVM support vector machine, LASSO least absolute shrinkage and selection operator, RF random forest, NAFLD nonalcoholic fatty liver disease.

Evaluation of the diagnostic performance of NAFLD diagnostic marker genes

A nomogram model for the diagnosis of NAFLD was constructed, which included two central genes, TIRAP and GSDMD (Fig. 5A). The nomogram model’s numerical value for each biomarker was used to predict NAFLD risk, with a calibration curve indicating a clear correlation between the predicted and actual probability (Fig. 5B). The DCA revealed that the net benefit from this model was significantly higher than 0, implying its remarkable accuracy and utility for clinical decision-making (Fig. 5C). The ROC curve analysis showed that the combined features of the two key genes demonstrated high performance in diagnosing NAFLD (AUC=0.996, Fig. 5D), and the individual AUC values for these two genes both exceeded 0.90 (Fig. 5E). The expression of the two key genes in different groups of the GSE89632 dataset is shown in Fig. 5F. The AUC for the combination of the two genes in the GSE89632 dataset was 0.825 (Fig. 5G), which was higher than the AUC of either gene alone (Fig. 5H). These results suggest that the model based on these two marker genes may have strong predictive efficacy for NAFLD.

Fig. 5

Establishment and verification of the marker gene diagnostic model. (A) Nomogram of marker genes. (B) Calibration curve. (C) The predictive efficiency of the nomogram model was illustrated by DCA. (D) The AUC of the ROC curve for the combination of the two key genes for the diagnosis of NAFLD was 0.996 (95% CI 0.976–1.0). (E) The ROC results of each of the two key genes for the diagnosis of NAFLD. The AUC values of TIRAP and GSDMD were 0.967 and 0.932, respectively. (F) Boxplots of GSE89632 revealed that the two DE-PRGs were significantly different between the NAFLD and control samples. (G) The two-gene model had an AUC value of 0.825 in GSE89632. (H) The AUC values of TIRAP and GSDMD were 0.774 and 0.745, respectively, in GSE89632. Model: A model that combines two key genes (TIRAP and GSDMD) to diagnose NAFLD. Note that “all” and “none” are the two reference strategies used to compare the benefits of the forecast model. The “all” reference strategy means treatment in all cases, while the “none” reference strategy means no treatment. The purpose of these two reference strategies is to help evaluate the benefit and clinical utility of the predictive model. NAFLD nonalcoholic fatty liver disease.
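The comparison against the “all” and “none” reference strategies in decision curve analysis rests on the net benefit at a chosen threshold probability. The sketch below uses the standard definition, net benefit = TP/n − (FP/n) × pt/(1 − pt), in plain Python; the function names, predicted probabilities, and labels are hypothetical and do not reproduce the analysis in Fig. 5C.

```python
def net_benefit(probs, labels, threshold):
    """Net benefit of treating samples whose predicted risk is >= threshold."""
    n = len(labels)
    tp = sum(1 for p, y in zip(probs, labels) if p >= threshold and y == 1)
    fp = sum(1 for p, y in zip(probs, labels) if p >= threshold and y == 0)
    # False positives are weighted by the odds of the threshold probability.
    return tp / n - fp / n * threshold / (1 - threshold)


def net_benefit_treat_all(labels, threshold):
    """Reference strategy "all": every sample is treated."""
    prevalence = sum(labels) / len(labels)
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)


def net_benefit_treat_none():
    """Reference strategy "none": no sample is treated, so net benefit is 0."""
    return 0.0


# Hypothetical predicted probabilities and true labels (1 = NAFLD, 0 = control).
probs = [0.9, 0.8, 0.7, 0.4, 0.2]
labels = [1, 1, 1, 0, 0]
nb_model = net_benefit(probs, labels, threshold=0.5)
nb_all = net_benefit_treat_all(labels, threshold=0.5)
```

A model is considered useful at a given threshold when its net benefit exceeds both reference values, which is the case for the toy input above.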

GSEA and ssGSEA analysis

We used GSEA to identify the major signaling pathways of the DEGs. GSEA of the KEGG pathways demonstrated that DEGs are implicated in NGF stimulated transcription, EIF2AK4 Gcn2 to amino acid deficiency, and metal ions (Fig. 6A and D, Supplementary Table S8). We also used GSEA to identify the major signaling pathways of the two genes in the above model. GSEA of the KEGG pathways demonstrated that these two genes are implicated in cellular response to starvation and infectious disease. GSVA revealed distinct activity pathways between low- and high-expression subtypes determined according to the levels of the two hub genes. Our analysis revealed that overexpression of GSDMD is involved in oncostatin M signaling, NGF stimulated transcription, and nuclear events kinase and transcription factor activation. Low GSDMD and TIRAP expression levels were linked to metabolism of lipids, small molecules metabolism of steroids, and metabolism of RNA (Fig. 6B, C, E, and F).

Fig. 6

An enrichment analysis of the DEGs and the two marker genes. Classical graphs for GSEA analysis of the signature based on (A) DEGs, (B) GSDMD, and (C) TIRAP. Histogram for GSEA analysis of the signature based on (D) DEGs, (E) GSDMD, and (F) TIRAP. GSEA gene set enrichment analysis, DEGs differentially expressed genes.

To examine whether pyroptosis could promote NAFLD progression by mediating immune infiltration, we conducted ssGSEA. According to the NAFLD and control grouping, the 206 NAFLD and 10 control samples were divided into two clusters (Fig. 7A). ssGSEA showed that NK CD56dim cells, iDC, and cytotoxic cells were significantly increased in NAFLD patients versus normal liver tissue (Fig. 7B), whereas CD8 T cells and T-helper cells showed the opposite trend. In addition, we investigated the relationship between immune cell infiltration and the two DE-PRGs by ssGSEA. For each of the two genes, samples were divided into a high-expression group and a low-expression group, and many kinds of immune cells differed significantly between the groups (Fig. 7C and D).

Fig. 7

The expression of immune cells. (A) Principal component analysis further revealed a significant difference between NAFLD and control (206 NAFLD and 10 control samples in GSE135251). (B) Expression of different immune cells in NAFLD and control. Expression of different immune cells in NAFLD with high and low expression of (C) GSDMD, (D) TIRAP. NAFLD nonalcoholic fatty liver disease.

Identification of drug candidates and ceRNA networks based on marker genes

To further explore drug therapy options for NAFLD, we analyzed the interactions between key genes and drugs using DGIdb. Cytoscape analysis revealed the interaction between genetic markers and drugs (Fig. 8A). A ceRNA network was constructed with the two essential genes using the TargetScan, miRanda, and miRDB databases, revealing one miRNA and 34 lncRNAs (Fig. 8B).

Fig. 8

mRNA–Drugs and ceRNA Network. (A) The green rectangle nodes symbolize the drugs, while the mRNA–drug interaction network is represented by blue dots. (B) The ceRNA network, based on marker gene, is depicted with yellow dots for miRNA and baby blue dots for lncRNA.

Expression of PRGs in a cell model of NAFLD

Oil red O staining showed large lipid deposits in the NAFLD group cells, which were characterized by the formation of more fat droplets (Fig. 9A). qRT‒PCR measurement of mRNA levels indicated that the expression levels of the two key genes were significantly increased in the NAFLD group compared with those in the control group (Fig. 9B).

Fig. 9

Expression of two key genes in Cell NAFLD Model. (A) Oil Red O staining. (B) The relative mRNA expression of the two hub genes in cell NAFLD model was verified by qRT‒PCR. N=3, **p<0.01.

Discussion

Non-alcoholic fatty liver disease (NAFLD) poses a significant global health challenge^{³²–³⁶}. However, the specific role of pyroptosis in the pathogenesis and regulation of NAFLD is still not fully understood. In this study, we investigated the potential role of PRGs in NAFLD, identified potential key genes, and explored possible target drugs.

We downloaded NAFLD and control liver data from the GEO database for statistical analysis to identify DEGs, resulting in the identification of 10 DEGs associated with pyroptosis levels. These findings suggest that PRGs may influence the progression of NAFLD. Our correlation analysis revealed that the identified DE-PRGs were closely related to each other; however, some showed no apparent correlation at the protein level, indicating heterogeneity in the interaction of PRGs at the gene and protein levels.

The important role of DE-PRGs in response to LPS, cysteine-type endopeptidase activity, membrane raft, and lipid and atherosclerosis was revealed by GO and KEGG enrichment analyses, respectively. LPS, also known as endotoxin, is a major component of the outer membrane of Gram-negative bacteria. LPS plays a critical role in the pathogenesis of various inflammatory diseases, including NAFLD^³⁷. In the context of NAFLD, this LPS-mediated inflammation is a key driver of liver damage. Recent studies have underscored the importance of the gut-liver axis in the progression of NAFLD^³⁷. Increased intestinal permeability allows for the translocation of LPS from the gut into the bloodstream, where it can reach the liver and exacerbate inflammation, contributing to the progression from simple steatosis to non-alcoholic steatohepatitis (NASH), a more severe form of NAFLD characterized by inflammation and fibrosis^³⁸.

Furthermore, interventions aimed at reducing LPS levels or blocking its signaling pathway have been shown to attenuate liver inflammation and fibrosis in NAFLD models^³⁹. The dysregulation of lipid metabolism, including increased de novo lipogenesis, impaired fatty acid oxidation, and altered lipid export, contributes significantly to hepatic fat accumulation^⁴⁰. An excess of saturated fatty acids can induce lipotoxicity, leading to hepatocyte injury, inflammation, and fibrosis^⁴¹. Additionally, the role of cholesterol and its metabolites in NAFLD has been increasingly recognized. Cholesterol accumulation in the liver exacerbates hepatic inflammation and fibrosis, further contributing to NASH progression^⁴². Cysteine-type endopeptidases, which belong to the family of proteases known as caspases, play a pivotal role in various cellular processes, including apoptosis, inflammation, and autophagy^⁴³. In the context of NAFLD, cysteine-type endopeptidase activity has been implicated in the progression of liver injury through mechanisms involving apoptosis and inflammation^⁴⁴.

Analyses of the 10 DE-PRGs using LASSO, RF, and SVM-RFE identified two key genes (TIRAP and GSDMD) that can effectively predict NAFLD, with an AUC value of 0.996. The validity of this two-gene model was confirmed using an external dataset, yielding an AUC of 0.825. The individual AUC values for the two key genes in the training dataset exceeded 0.9. The nomogram model, calibration curves, and DCA demonstrated that this model possesses strong predictive capability and significant clinical applicability. Therefore, a predictive model incorporating these two key genes could serve as a reliable and robust biomarker for the effective prediction of NAFLD. TIRAP affects liver inflammation and immune response mainly by regulating Toll-like receptor (TLR) signaling pathways^⁴⁵. In hepatitis, the expression of TIRAP is up-regulated, which may exacerbate the inflammatory response of the liver and lead to the aggravation of liver injury^⁴⁶. In addition, TIRAP is also involved in the development of liver fibrosis and promotes extracellular matrix accumulation by regulating the activation and proliferation of hepatic stellate cells^⁴⁷. GSDMD consists of an N-terminal domain (NTD, containing 242 amino acids) and a C-terminal domain (CTD, containing 43 amino acid splice and 199 amino acids)^⁴⁸. GSDMD (also known as GSDMDC1, DFNA5L, or FKSG10) was originally found in a congener of GSDMA^⁴⁹. Saeki et al. found that GSDMD is widely expressed in different tissues and immune cells^⁵⁰,⁵¹. The gasdermin protein family plays an important role in pyroptosis, and GSDMD is a key executive factor^⁵²,⁵³. In NAFLD and NASH, GSDMD-mediated inflammatory cell death may exacerbate liver inflammation and liver injury^⁵⁴. In addition, GSDMD is also involved in the development of liver fibrosis by promoting the activation and proliferation of hepatic stellate cells and the accumulation of extracellular matrix^⁵⁵.

We conducted gene-targeting drug analysis based on the two key genes identified. The drug identified as targeting the TIRAP gene was an immunomodulatory drug. Given the interaction and influence of lncRNAs, miRNAs, and mRNAs on cellular biosynthesis^{⁵⁶–⁵⁸}, we constructed an mRNA–miRNA–lncRNA regulatory network for NAFLD. This revealed that lncRNAs could regulate one of the key genes (GSDMD). Therefore, gene-targeted drug analysis offers a novel approach to searching for potential drugs to prevent and treat NAFLD, and ceRNA network analysis provides a new route for further exploring the pathogenesis of NAFLD. These findings, however, require further validation in cell and animal studies.
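The structure of an mRNA–miRNA–lncRNA network can be sketched as two lists of interaction pairs that are joined on the shared miRNA. The Python snippet below is a hedged illustration of that idea only: the miRNA and lncRNA names are placeholders rather than entries from the network built in this study, and the function `cerna_axes` is a hypothetical helper introduced for the example.

```python
from collections import defaultdict

# Placeholder interaction pairs, NOT taken from the paper.
mirna_targets_mrna = [
    ("miRNA_1", "GSDMD"),
    ("miRNA_2", "GSDMD"),
    ("miRNA_3", "TIRAP"),
]
lncrna_sponges_mirna = [
    ("lncRNA_A", "miRNA_1"),
    ("lncRNA_B", "miRNA_2"),
    ("lncRNA_B", "miRNA_4"),
]


def cerna_axes(mrna, mirna_mrna_pairs, lncrna_mirna_pairs):
    """Return sorted (lncRNA, miRNA, mRNA) triples linking a lncRNA to `mrna` via a shared miRNA."""
    lncrnas_by_mirna = defaultdict(set)
    for lncrna, mirna in lncrna_mirna_pairs:
        lncrnas_by_mirna[mirna].add(lncrna)

    axes = []
    for mirna, target in mirna_mrna_pairs:
        if target != mrna:
            continue
        # .get avoids creating empty entries for miRNAs with no known lncRNA partner
        for lncrna in lncrnas_by_mirna.get(mirna, ()):
            axes.append((lncrna, mirna, target))
    return sorted(axes)


gsdmd_axes = cerna_axes("GSDMD", mirna_targets_mrna, lncrna_sponges_mirna)
for lncrna, mirna, mrna in gsdmd_axes:
    print(f"{lncrna} -> {mirna} -> {mrna}")
```

In this toy input no lncRNA pairs with the miRNA that targets TIRAP, so a query for TIRAP returns an empty list; a gene with no lncRNA-level regulator would look like this in such a network.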

Our study does have some limitations. Firstly, we performed genetic analysis on data downloaded from the GEO database, which may contain certain biases. Secondly, the total number of cases was relatively small. Furthermore, we have not yet performed cellular or animal validation of the gene-targeting drugs we discovered.

Conclusions

We identified two key genes (TIRAP and GSDMD), and by combining these two genes, we can accurately diagnose patients with NAFLD. We then explored the relationship between these genes and infiltrating immune cells and analyzed the significant heterogeneity in immune responses between NAFLD patient and control liver samples. Our research sheds light on the role of pyroptosis in NAFLD, providing a new theoretical foundation for the potential pathogenesis of NAFLD and its therapeutic options.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1 (13K, docx)

Supplementary Material 2 (241K, xlsx)

Supplementary Material 3 (158K, xlsx)

Supplementary Material 4 (23K, xlsx)

Supplementary Material 5 (80K, xlsx)

Supplementary Material 6 (10K, xlsx)

Supplementary Material 7 (31K, xlsx)

Supplementary Material 8 (2.2M, xlsx)

Acknowledgements

We are grateful to the researchers who provided the datasets (GSE135251 and GSE89632). We thank the website https://www.xiantaozi.com, which was used for part of the data analysis. We are very grateful to the Kanehisa laboratory for granting us permission to perform the KEGG pathway analysis.

Abbreviations

ALT	Alanine aminotransferase
AST	Aspartate aminotransferase
AUC	Area under the curve
BP	Biological processes
CC	Cellular component
PRG	Pyroptosis-related gene
DCA	Decision curve analysis
DE-PRG	Differentially expressed pyroptosis-related gene
DEG	Differentially expressed gene
DGIdb	Drug–gene interaction database
DSigDB	Drug signatures database
GEO	Gene Expression Omnibus
GO	Gene ontology
GSEA	Gene set enrichment analysis
GSVA	Gene set variation analysis
KEGG	Kyoto Encyclopedia of Genes and Genomes
LASSO	Least absolute shrinkage and selection operator
MF	Molecular functions
NAFLD	Non-alcoholic fatty liver disease
PPI	Protein‒protein interaction
RF	Random forest
ROC	Receiver operating characteristic
ssGSEA	Single-sample gene set enrichment analysis
SVM-RFE	Support vector machine-recursive feature elimination

Author contributions

LPL, JXL, ZRL, DDZ, and JFL conducted the formal analysis and initial draft of the manuscript, with project administration being overseen by YG, BWM, and JFL. QW, JXL, ZHL, and JFL performed software analysis. Data curation was handled by JFL, LPL, and BWM, while the execution of experiments was carried out by LPL, ZRL, DDZ, JXL, and ZHL. YG, BWM, and JFL all contributed to the writing of the article. Funding was secured by JFL and YG. All authors participated in the editing process and approved the manuscript for submission.

Funding

This study was supported by the Affiliated Hospital of Guilin Medical University, the PhD start-up fund, the Science and Technology Project of Guangxi Province (No. guikeAD21220021), the Opening Project of the Key Laboratory of High-Incidence-Tumor Prevention & Treatment (Guangxi Medical University), Ministry of Education/Guangxi Key Laboratory of Early Prevention and Treatment for Regional High Frequency Tumor (GKE-KF202202), and the Guangxi Medical and Health Key Discipline Construction Project.

Data availability

The datasets in this study were obtained from the GEO database (https://www.ncbi.nlm.nih.gov/geo/) under the accession numbers GSE135251 and GSE89632. The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Declarations

Competing interests

The authors declare no competing interests.

Ethics approval and consent to participate

The patient data used in this article were downloaded from a public database, so approval by the institutional ethics committee and signed consent from participants were waived.

Consent for publication

The patient data used in this article were downloaded from a public database, so participants' consent for publication was waived.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Liping Lei, Jixue Li and Zirui Liu contributed equally to this work.

Contributor Information

Biwen Mo, Email: mobiwen2002@sohu.com.

Jiangfa Li, Email: 247546160@qq.com.

References

1. Cusi, K. et al. Non-alcoholic fatty liver disease (NAFLD) prevalence and its metabolic associations in patients with type 1 diabetes and type 2 diabetes. Diabetes Obes. Metab. 19 (11), 1630–1634 (2017).

2. Pimpin, L. et al. Burden of liver disease in Europe: epidemiology and analysis of risk factors to identify prevention policies. J. Hepatol. 69 (3), 718–735 (2018).

3. Cusi, K. Role of obesity and lipotoxicity in the development of nonalcoholic steatohepatitis: pathophysiology and clinical implications. Gastroenterology. 142 (4), 711–725e716 (2012).

4. Masoodi, M. et al. Metabolomics and lipidomics in NAFLD: biomarkers and non-invasive diagnostic tests. Nat. Rev. Gastroenterol. Hepatol. 18 (12), 835–856 (2021).

5. Choi, H. J. et al. The inhibitory effects of Geranium thunbergii on interferon-gamma- and LPS-induced inflammatory responses are mediated by Nrf2 activation. Int. J. Mol. Med. 35 (5), 1237–1245 (2015).

6. Nathan, C. Nonresolving inflammation redux. Immunity. 55 (4), 592–605 (2022).

7. Fang, Y. et al. Pyroptosis: A new frontier in cancer. Biomed. Pharmacother. 121, 109595 (2020).

8. Wei, X. et al. Role of pyroptosis in inflammation and cancer. Cell. Mol. Immunol. 19 (9), 971–992 (2022).

9. Man, S. M., Karki, R. & Kanneganti, T. D. Molecular mechanisms and functions of pyroptosis, inflammatory caspases and inflammasomes in infectious diseases. Immunol. Rev. 277 (1), 61–75 (2017).

10. Man, S. M. & Kanneganti, T. D. Converging roles of caspases in inflammasome activation, cell death and innate immunity. Nat. Rev. Immunol. 16 (1), 7–21 (2016).

11. de Carvalho Ribeiro, M. & Szabo, G. Role of the inflammasome in liver disease. Annu. Rev. Pathol. 17, 345–365 (2022).

12. Xia, X. et al. The role of pyroptosis in cancer: pro-cancer or pro-host? Cell Death Dis. 10 (9), 650 (2019).

13. Elias, E. E., Lyons, B. & Muruve, D. A. Gasdermins and pyroptosis in the kidney. Nat. Rev. Nephrol. 19 (5), 337–350 (2023).

14. Tsuchida, T. & Friedman, S. L. Mechanisms of hepatic stellate cell activation. Nat. Rev. Gastroenterol. Hepatol. 14 (7), 397–411 (2017).

15. Xiao, Y. et al. STING mediates hepatocyte pyroptosis in liver fibrosis by epigenetically activating the NLRP3 inflammasome. Redox Biol. 62, 102691 (2023).

16. Kanehisa, M. et al. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 51 (D1), D587–D592 (2023).

17. Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28 (1), 27–30 (2000).

18. Kanehisa, M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 28 (11), 1947–1951 (2019).

19. van Egmond, M. B. et al. Privacy-preserving dataset combination and Lasso regression for healthcare predictions. BMC Med. Inf. Decis. Mak. 21 (1), 266 (2021).

20. Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33 (1), 1–22 (2010).

21. Ding, L. et al. Identification of cuproptosis-related subtypes, cuproptosis-related gene prognostic index in hepatocellular carcinoma. Front. Immunol. 13, 989156 (2022).

22. Joshi, P., Vedhanayagam, M. & Ramesh, R. An ensembled SVM based approach for predicting adverse drug reactions. Curr. Bioinform. 16 (3), 422–432 (2021).

23. Xu, H. et al. Identification of miRNA signature associated with erectile dysfunction in type 2 diabetes mellitus by support vector machine-recursive feature elimination. Front. Genet. 12, 762136 (2021).

24. Yang, X-F. et al. Predicting LncRNA subcellular localization using unbalanced pseudo-k nucleotide compositions. Curr. Bioinform. 15 (6), 554–562 (2020).

25. Abinash, M. J. & Vasudevan, V. Boundaries tuned support vector machine (BT-SVM) classifier for cancer prediction from gene selection. Comput. Methods Biomech. Biomed. Eng. 25 (7), 794–807 (2022).

26. Sanz, H. et al. SVM-RFE: selection and visualization of the most relevant features through non-linear kernels. BMC Bioinform. 19 (1), 432 (2018).

27. Rigatti, S. J. Random Forest. J. Insur. Med. 47 (1), 31–39 (2017).

28. Robin, X. et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinform. 12, 77 (2011).

29. Yu, G. et al. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 16 (5), 284–287 (2012).

30. Cotto, K. C. et al. DGIdb 3.0: a redesign and expansion of the drug-gene interaction database. Nucleic Acids Res. 46 (D1), D1068–D1073 (2018).

31. Wu, J. et al. Comprehensive analysis of pyroptosis-related genes and Tumor Microenvironment Infiltration characterization in breast Cancer. Front. Immunol. 12, 748221 (2021).

32. Xiao, J. et al. Global liver disease burdens and research trends: analysis from a Chinese perspective. J. Hepatol. 71 (1), 212–221 (2019).

33. Eng, J. M. & Estall, J. L. Diet-Induced models of non-alcoholic fatty liver disease: food for thought on sugar, fat, and cholesterol. Cells. 10 (7), 1805 (2021).

34. Rao, Z. et al. Pyroptosis in inflammatory diseases and cancer. Theranostics. 12 (9), 4310–4329 (2022).

35. Zhaolin, Z. et al. Role of pyroptosis in cardiovascular disease. Cell Prolif. 52 (2), e12563 (2019).

36. Li, S. et al. NLRP3/caspase-1/GSDMD-mediated pyroptosis exerts a crucial role in astrocyte pathological injury in mouse model of depression. JCI Insight. 6 (23), e146852 (2021).

37. Jin, C. J. et al. Loss of lipopolysaccharide-binding protein attenuates the development of diet-induced non-alcoholic fatty liver disease in mice. J. Gastroenterol. Hepatol. 32 (3), 708–715 (2017).

38. Matsushita, N. et al. Effect of Lipopolysaccharide on the progression of non-alcoholic fatty liver Disease in High Caloric Diet-Fed Mice. Scand. J. Immunol. 83 (2), 109–118 (2016).

39. Li, Z. et al. Probiotics and antibodies to TNF inhibit inflammatory activity and improve nonalcoholic fatty liver disease. Hepatology. 37 (2), 343–350 (2003).

40. Softic, S., Cohen, D. E. & Kahn, C. R. Role of dietary fructose and hepatic de novo lipogenesis in fatty liver disease. Dig. Dis. Sci. 61 (5), 1282–1293 (2016).

41. Ioannou, G. N. The role of cholesterol in the pathogenesis of NASH. Trends Endocrinol. Metab. 27 (2), 84–95 (2016).

42. Minami, Y. et al. Liver lipophagy ameliorates nonalcoholic steatohepatitis through extracellular lipid secretion. Nat. Commun. 14 (1), 4084 (2023).

43. Priyanka, M. et al. Late stage specific Rv0109 (PE_PGRS1) protein of Mycobacterium tuberculosis induces mitochondria mediated macrophage apoptosis. Microb. Pathog. 176, 106021 (2023).

44. Wilson, C. H. & Kumar, S. Caspases in metabolic disease and their therapeutic potential. Cell Death Differ. 25 (6), 1010–1024 (2018).

45. Kawai, T. & Akira, S. TLR signaling. Cell Death Differ. 13 (5), 816–825 (2006).

46. Seki, E. & Brenner, D. A. Toll-like receptors and adaptor molecules in liver disease: update. Hepatology. 48 (1), 322–335 (2008).

47. Suganami, T. et al. Role of the toll-like receptor 4/NF-kappaB pathway in saturated fatty acid-induced inflammatory changes in the interaction between adipocytes and macrophages. Arterioscler. Thromb. Vasc. Biol. 27 (1), 84–91 (2007).

48. Liu, Z. et al. Caspase-1 engages full-length gasdermin D through two distinct interfaces that mediate caspase recruitment and substrate Cleavage. Immunity. 53 (1), 106–114e105 (2020).

49. Katoh, M. & Katoh, M. Identification and characterization of human DFNA5L, mouse Dfna5l, and rat Dfna5l genes in silico. Int. J. Oncol. 25 (3), 765–770 (2004).

50. Saeki, N. et al. Distinctive expression and function of four GSDM family genes (GSDMA-D) in normal and malignant upper gastrointestinal epithelium. Genes Chromosomes Cancer. 48 (3), 261–271 (2009).

51. Rieckmann, J. C. et al. Social network architecture of human immune cells unveiled by quantitative proteomics. Nat. Immunol. 18 (5), 583–593 (2017).

52. Shi, J. et al. Cleavage of GSDMD by inflammatory caspases determines pyroptotic cell death. Nature. 526 (7575), 660–665 (2015).

53. Kayagaki, N. et al. Caspase-11 cleaves gasdermin D for non-canonical inflammasome signalling. Nature. 526 (7575), 666–671 (2015).

54. Mridha, A. R. et al. NLRP3 inflammasome blockade reduces liver inflammation and fibrosis in experimental NASH in mice. J. Hepatol. 66 (5), 1037–1046 (2017).

55. Gaul, S. et al. Hepatocyte pyroptosis and release of inflammasome particles induce stellate cell activation and liver fibrosis. J. Hepatol. 74 (1), 156–167 (2021).

56. Wang, J. Y. et al. Potential regulatory role of lncRNA-miRNA-mRNA axis in osteosarcoma. Biomed. Pharmacother. 121, 109627 (2020).

57. Wang, L. et al. Long noncoding RNA (lncRNA)-Mediated competing endogenous RNA networks provide Novel potential biomarkers and therapeutic targets for colorectal cancer. Int. J. Mol. Sci. 20 (22), 5758 (2019).

58. Hu, B. et al. The mRNA-miRNA-lncRNA regulatory network and factors associated with prognosis prediction of hepatocellular carcinoma. Genom. Proteom. Bioinform. 19 (6), 913–925 (2021).

Articles from Scientific Reports are provided here courtesy of Nature Publishing Group


