Development and validation of a circulating tumor DNA-based machine learning model for predicting immunotherapy response in non-small cell lung cancer
Original Article

Development and validation of a circulating tumor DNA-based machine learning model for predicting immunotherapy response in non-small cell lung cancer

Ji Xia1,2,3#, Tianchu He4#, Yong Hu5, Daobin Zhou4, Dan Zou1, Ya Li1,2,3, Min Zhang1,2,3, Benlan Li1,2,3, Minfang Wang1,2,3, Xian Liu1,2,3, Zhongjun Huang1,2,3, Shengfa Su2,3*, Jie Peng1,3*

1Department of Oncology, The Second Affiliated Hospital, Guizhou Medical University, Kaili, China; 2Department of Oncology, Affiliated Hospital of Guizhou Medical University, Guiyang, China; 3Department of Oncology, School of Clinical Medicine, Guizhou Medical University, Guiyang, China; 4Department of Oncology, Qiandongnan Prefecture People’s Hospital, Kaili, China; 5Department of Oncology, Guiyang Pulmonary Hospital, Guiyang, China

Contributions: (I) Conception and design: J Xia, T He, J Peng; (II) Administrative support: S Su, J Peng; (III) Provision of study materials or patients: T He, Y Hu, D Zhou; (IV) Collection and assembly of data: J Xia, B Li, Z Huang, Y Li, M Zhang, X Liu, M Wang; (V) Data analysis and interpretation: J Peng, J Xia; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

#These authors contributed equally to this work.

*These authors contributed equally to this work.

Correspondence to: Jie Peng, MD. Department of Oncology, The Second Affiliated Hospital, Guizhou Medical University, No. 3 Kangfu Road, Kaili Guizhou Province, China; Department of Oncology, School of Clinical Medicine, Guizhou Medical University, Guiyang, China. Email: sank44@sina.com; Shengfa Su, MD. Department of Oncology, Affiliated Hospital of Guizhou Medical University, No. 28 Guiyi Street, Guiyang Guizhou Province, China; Department of Oncology, School of Clinical Medicine, Guizhou Medical University, Guiyang, China. Email: sushengfa2005@163.com.

Background: Current biomarkers, such as programmed death-ligand 1 (PD-L1) and tumor mutational burden (TMB), have limited predictive value for immune checkpoint inhibitor (ICI) response in non-small cell lung cancer (NSCLC). Machine learning analysis of circulating tumor DNA (ctDNA) can enhance patient stratification via liquid biopsy genomic signatures. We aimed to build a support vector machine (SVM) model using baseline ctDNA to predict ICI benefit in advanced NSCLC.

Methods: We trained an SVM model on pretreatment ctDNA whole-exome sequencing (WES) data from the OAK trial cohort (n=303) to predict durable clinical benefit (DCB): response or stable disease (SD) ≥6 months. We used least absolute shrinkage and selection operator (LASSO) regression to select 41 predictive somatic mutations. Model performance was validated in the independent POPLAR trial cohort (n=97) and a regional multicenter cohort (n=41). Receiver operating characteristic (ROC) curve analysis and Kaplan-Meier methods were used to evaluate predictive accuracy and survival outcomes.

Results: The ctDNA-based SVM model achieved high accuracy across cohorts: area under the ROC curve (AUC) =0.87 in OAK, 0.92 in POPLAR, and 0.83 in the local cohort. Patients classified as SVM-low (score >0.55) had significantly longer median progression-free survival (mPFS) [12.80 vs. 3.00 months; hazard ratio (HR) =0.191; P=0.002] and median overall survival (mOS) (17.30 vs. 6.40 months; HR =0.159; P=0.002) than SVM-high patients. In particular, SVM-low patients with initial SD exhibited better outcomes than SVM-high. This noninvasive model may reduce the need for repeat tissue biopsies and associated costs. Functional enrichment of the 41-gene signature highlighted VEGF, JAK-STAT, and Ras signalling pathways in ICI resistance.

Conclusions: Our ctDNA-based SVM model accurately predicts DCB and survival outcomes in NSCLC patients receiving ICIs. By using a single baseline liquid biopsy, this model could streamline immunotherapy decision-making without requiring longitudinal monitoring.

Keywords: Circulating tumor DNA (ctDNA); immune checkpoint inhibitors (ICIs); machine learning; non-small cell lung cancer (NSCLC); predictive biomarker


Submitted Jun 04, 2025. Accepted for publication Oct 24, 2025. Published online Dec 29, 2025.

doi: 10.21037/tcr-2025-1182


Highlight box

Key findings

• The model was validated in three cohorts [OAK: area under the receiver operating characteristic curve (AUC) =0.87; POPLAR: AUC =0.92; local cohort: AUC =0.83]. Support vector machine (SVM)-low patients (score >0.55) had significantly longer median progression-free survival [mPFS 12.80 vs. 3.00 months; hazard ratio (HR) =0.191, P=0.002] and medain overall survival (mOS 17.30 vs. 6.40 months; HR =0.159, P=0.002) than SVM-low patients. The model effectively distinguished prognostic differences among patients who initially presented with stable disease.

What is known and what is new?

• Existing biomarkers, including programmed death-ligand 1 (PD-L1) and blood-based tumor mutational burden, inadequately predict immune checkpoint inhibitor (ICI) response in non-small cell lung cancer (NSCLC). Longitudinal circulating tumor DNA (ctDNA) monitoring can detect emerging resistance but requires serial sampling.

• This study established the first SVM model using a single-timepoint baseline ctDNA whole-exome sequencing to predict durable clinical benefit and survival with high accuracy (AUC >0.83). This approach removes the need for repeated sampling. The 41-gene signature also revealed resistance pathways (VEGF, JAK-STAT, Ras), offering mechanistic insights into immunotherapy failure.

What is the implication, and what should change now?

• This model enables precise pretreatment stratification for ICI therapy, potentially guiding treatment decisions in advanced/metastatic and neoadjuvant NSCLC.

• Future work should include prospective validation in larger, ethnically diverse cohorts (especially East Asian populations) to confirm generalizability.

• Integrating this ctDNA model with other biomarkers (e.g., PD-L1, radiomics) may create multimodal predictive frameworks.

• Standardization of ctDNA analysis protocols will be important for clinical implementation.


Introduction

The advent of immune checkpoint inhibitors (ICIs) has transformed the treatment landscape for advanced non-small cell lung cancer (NSCLC) by producing durable responses in a subset of patients (1-3). Existing clinical biomarkers—such as programmed death-ligand 1 (PD-L1) expression, tumor mutational burden (TMB), and gene-expression signatures—show statistical associations with ICI benefit but lack sufficient accuracy for reliable patient stratification (4). Long-term trial data (for example, the PACIFIC study) have revealed substantial heterogeneity in survival outcomes across prognostic subgroups, underscoring the urgent need for improved predictive strategies (5).

Assessing immunotherapy efficacy poses unique challenges. Atypical response patterns (including delayed responses and pseudoprogression) complicate timely evaluation and can confound endpoint interpretation (6). Moreover, in immunotherapy trials for NSCLC, the relationship between progression-free survival (PFS) and overall survival (OS) is often discordant, which limits the utility of conventional endpoints for early decision-making (7). These factors highlight the necessity of predictive models that combine baseline molecular features with tumor-biology dynamics to guide earlier and more accurate therapeutic choices (8).

Circulating tumor DNA (ctDNA) offers a minimally invasive window into tumor genomics and has emerged as a promising source of predictive biomarkers (9-11). ctDNA captures somatic mutation and epigenetic landscapes and can be sampled serially from peripheral blood. Higher blood-based TMB (bTMB) measured from ctDNA has been associated with improved ICI response, likely reflecting increased neoantigen burden and enhanced immunogenicity (12). Specific driver alterations (for example, STK11, KEAP1, and lesions affecting JAK-STAT signaling) can reshape the tumor microenvironment and influence sensitivity to ICIs (13). Specific driver gene mutations influence immunotherapy responsiveness by modulating the tumor microenvironment—particularly through alterations in PD-L1 expression and immune cell infiltration—thereby offering molecular indicators for personalized therapeutic strategies (14). Empirical studies further demonstrate that ctDNA kinetic profiles function as early biomarkers of response in NSCLC immunotherapy (15). Patients achieving post-treatment ctDNA clearance, defined by mutant allele frequencies below detection thresholds, exhibited markedly prolonged median PFS (mPFS), with outcomes showing strong concordance with radiographic evaluations (16). Additionally, ctDNA profiling can reveal acquired resistance mechanisms (e.g., JAK1/2 loss, B2M inactivation, promoter methylation) that facilitate immune escape (17), and minimal-residual-disease (MRD) monitoring by ctDNA can identify patients at higher risk of recurrence who may benefit from adjuvant strategies (18).

Machine learning and deep-learning approaches have shown substantial promise for integrating high-dimensional genomic and clinical data to improve outcome prediction in oncology (19,20). For instance, deep networks trained on whole-exome sequencing (WES) or combined radiogenomic data can capture complex, non-linear relationships among TMB, neoantigen load, and the immune microenvironment, producing more accurate predictions of ICI response than many conventional methods (21,22).

Building on these observations, we developed a support vector machine (SVM) model using pretreatment ctDNA WES features to predict durable clinical benefit (DCB) from ICIs in advanced NSCLC. The model was trained on the OAK trial cohort externally validated in the POPLAR trial and a regional multicenter cohort to assess predictive performance and potential clinical utility. We present this article in accordance with the TRIPOD reporting checklist (available at https://tcr.amegroups.com/article/view/10.21037/tcr-2025-1182/rc).


Methods

Public clinical trial cohorts (OAK and POPLAR)

Individual patient-level data (IPD) were obtained from two multicenter randomized trials of atezolizumab in advanced/metastatic NSCLC (OAK and POPLAR), and the overall study design and cohort integration workflow are illustrated in Figure 1. After excluding patients who did not receive atezolizumab, specimens failing quality control, or records with incomplete clinical metadata or missing ctDNA/WES data, 303 patients from OAK were used as the training cohort and 97 patients from POPLAR served as an independent validation cohort (Figure 1). For reference on the clinical trials of atezolizumab used here, see Socinski et al. (23). For clarity, we defined blood TMB as the number of somatic mutations per megabase of coding sequence identified from ctDNA WES data.

Figure 1 Patient enrollment flowchart of training and validation cohorts. ctDNA, circulating tumor DNA; NSCLC, non-small cell lung cancer.

Local validation cohort

A prospective multicenter regional cohort was assembled between April 2023 and June 2024. Eligible patients had pathologically confirmed stage III–IV NSCLC and were scheduled to receive programmed death-1 (PD-1)/PD-L1 inhibitor therapy (monotherapy or combination). Of 52 initially enrolled patients, 2 discontinued before completing two cycles and 3 were lost to follow-up, leaving 47 evaluable cases; 6 were ctDNA-negative (exploratory) and 41 were ctDNA-positive and included in the primary validation analyses (Figure 1). Peripheral blood (20 mL) was collected 24 h before treatment in Streck cfDNA tubes, shipped at 2–8 °C, and processed at a central laboratory (iGeneTech, Wuhu, China) within 72h. This study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments. The study was approved by the Ethics Committee of Affiliated Hospital of Guizhou Medical University (No. 2023-LS-02), and all participating hospitals were informed and agreed to the study. All participants provided written informed consent.

Sample processing and sequencing

Plasma ctDNA was extracted using a magnetic bead-based protocol. DNA integrity and quantity were assessed by agarose gel electrophoresis and Qubit fluorometry. WES libraries were prepared with hybrid-capture exome probes following fragmentation (150–250 bp), end repair, adapter ligation, and polymerase chain reaction (PCR) enrichment. Libraries passing quality control (QC) (≥25 ng/µL, expected fragment distribution 220–450 bp) were sequenced on an Illumina NovaSeq 6000 platform with 2×150 bp paired reads to achieve median exome coverage adequate for variant calling. Somatic single-nucleotide variants (SNVs) and small insertions/deletions (InDels) were called using standard pipelines, filtered against population germline databases, and compared with matched normals when available (Figure 2).

Figure 2 SNP and InDel profiling in the local cohort. (A) InDel length distribution in Patient 1; (B) SNP variant spectrum in Patient 1. InDel, insertion-deletion; SNP, single-nucleotide polymorphism.

Feature selection and model development

The analytic workflow is shown in Figure 3. In the OAK training set, we performed high-dimensional feature reduction with the least absolute shrinkage and selection operator (LASSO) using five-fold cross-validation to select a parsimonious set of predictive somatic features (final panel: 41 features; Figure 4). These binary mutation features were then used to train an SVM classifier implemented in R (version 4.3.x; e1071 package). We used a radial basis function (RBF) kernel and optimized hyperparameters (penalty C and kernel width σ) via grid search on the training folds. An ε-insensitive loss (ε =0.015) was applied during SVM fitting. To avoid information leakage between feature selection and model evaluation, we employed nested five-fold cross-validation during model development (i.e., the inner loop for hyperparameter tuning/feature selection and the outer loop for performance estimation). The final model produced an SVM score for each patient (Table S1), interpreted as a continuous measure of predicted probability of DCB. For general machine-learning context on similar clinical prediction tasks (24).

Figure 3 Study workflow schematic. ctDNA, circulating tumor DNA; KEGG, Kyoto Encyclopedia of Genes and Genomes; OS, overall survival; PFS, progression-free survival; SVM, support vector machine.
Figure 4 Genomic feature selection using LASSO regression with five-fold cross-validation. (A,B) Optimal genomic features identified in immunotherapy-treated non-small cell lung cancer patients. (C) Final panel of 41 selected genomic signatures. AUC, area under the receiver operating characteristic curve; LASSO, least absolute shrinkage and selection operator.

Clinical endpoints

DCB was defined as complete response (CR), partial response (PR), or stable disease (SD) sustained for ≥6 months from treatment initiation; non-DCB was defined as progressive disease (PD) or SD <6 months. PFS was measured from the first ICI dose to radiographic progression (per RECIST v1.1) or death; OS was measured from treatment start to death from any cause. Tumor responses were assessed by computed tomography (CT) or magnetic resonance imaging (MRI) every 6–8 weeks and adjudicated using RECIST v1.1, with confirmation imaging used to limit misclassification from pseudoprogression.

Data sharing statement

The datasets analyzed in this study were derived from publicly available clinical trials and a prospectively collected regional cohort. IPD from the OAK (ClinicalTrials.gov identifier: NCT02008227) and POPLAR (NCT01903993) studies were obtained through authorized data-sharing agreements with the data-holding institution (F. Hoffmann-La Roche Ltd.). The regional validation cohort data were collected at our institution following ethical approval and patient-informed consent. Deidentified patient-level data supporting the findings of this study are available from the corresponding author upon reasonable request. Requests will be reviewed in accordance with institutional and data protection policies to ensure appropriate data use and privacy protection.

Statistical analysis

Model discrimination was evaluated by receiver operating characteristic (ROC) curve analysis and reported as area under the curve (AUC) with 95% confidence intervals (CI). The optimal threshold to dichotomize the SVM score into SVM-high and SVM-low groups was selected using the Youden index on the training ROC. Continuous variables are presented as medians with interquartile ranges (IQR) and compared using Mann-Whitney U tests; categorical variables are presented as counts (%) and compared using χ2 or Fisher’s exact tests as appropriate. Kaplan-Meier methods and log-rank tests were used to compare PFS and OS between groups. Functional enrichment of the 41-gene signature was performed using Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway annotation (via standard R packages); enrichment results were assessed with appropriate multiple-testing correction and presented for pathways with P<0.05 and false discovery rate (FDR) <0.25. For comparable ctDNA-based enrichment workflows and pathway interpretation in similar studies, see Peng et al. (25).


Results

Baseline characteristics of study cohorts

A total of 303 patients from the OAK trial were included in the training cohort, 97 from the POPLAR trial in the independent validation cohort, and 41 ctDNA-positive patients from the prospective regional cohort. Clinical and demographic features of the three cohorts are summarized in Tables 1-3. In all cohorts, DCB was observed in approximately one-third of patients. No significant associations were identified between baseline clinicopathologic variables (age, sex, smoking status, histology) and DCB.

Table 1

OAK cohort: baseline characteristics of patients with NSCLC

OAK cohort Total (n=303) Non-DCB (n=205) DCB (n=98) P
Age (years) 63.00 (57.00, 69.50) 63.00 (57.00, 68.00) 64.00 (57.00, 73.00) 0.13
Sex 0.84
   Female 115 (37.95) 77 (37.56) 38 (38.78)
   Male 188 (62.05) 128 (62.44) 60 (61.22)
Histological 0.75
   Non-squamous 223 (73.60) 152 (74.15) 71 (72.45)
   Squamous 80 (26.40) 53 (25.85) 27 (27.55)
bTMB, mut/Mb 7.00 (3.00, 15.00) 7.00 (3.00, 14.00) 8.50 (3.00, 19.00) 0.25
SVM-score 0.05 (0.05, 0.05) 0.05 (0.05, 0.05) 0.95 (0.05, 0.95) <0.001

Data are presented as number (%) or median (Q1, Q3). bTMB, blood tumor mutational burden; DCB, durable clinical benefit; NSCLC, non-small cell lung cancer; SVM-score, support vector machine-derived score.

Table 2

POPLAR: baseline characteristics of patients with NSCLC

POPLAR cohort Total (n=97) Non-DCB (n=65) DCB (n=32) P
Age (years) 61.00 (55.00, 68.00) 60.00 (55.00, 67.00) 62.00 (55.75, 69.25) 0.45
Sex 0.48
   Female 32 (32.99) 23 (35.38) 9 (28.12)
   Male 65 (67.01) 42 (64.62) 23 (71.88)
Histological 0.42
   Non-squamous 63 (64.95) 44 (67.69) 19 (59.38)
   Squamous 34 (35.05) 21 (32.31) 13 (40.62)
bTMB, mut/Mb 8.00 (4.00, 15.00) 8.00 (5.00, 13.00) 6.50 (3.75, 16.00) 0.65
SVM-score 0.05 (0.05, 0.05) 0.05 (0.05, 0.05) 0.95 (0.05, 0.95) <0.001

Data are presented as number (%) or median (Q1, Q3). bTMB, blood tumor mutational burden; DCB, durable clinical benefit; NSCLC, non-small cell lung cancer; SVM-score, support vector machine-derived score.

Table 3

Baseline characteristics of patients with NSCLC in the internal validation cohort

Internal cohort Total (n=41) Non-DCB (n=25) DCB (n=16) P
Age (years) 60.00 (56.00, 68.00) 60.00 (56.00, 68.00) 60.00 (57.50, 68.75) 0.54
Sex >0.99
   Female 32 (78.05) 20 (80.00) 12 (75.00)
   Male 9 (21.95) 5 (20.00) 4 (25.00)
Smoking status 0.66
   Never smoker 10 (24.39) 5 (20.00) 5 (31.25)
   Current smoker 31 (75.61) 20 (80.00) 11 (68.75)
Histological 0.52
   Non-squamous 13 (31.71) 7 (28.00) 6 (37.50)
   Squamous 28 (68.29) 18 (72.00) 10 (62.50)
TNM stage 0.52
   III 13 (31.71) 7 (28.00) 6 (37.50)
   IV 28 (68.29) 18 (72.00) 10 (62.50)
Treatment >0.99
   Chemo-ICIs 36 (87.80) 22 (88.00) 14 (87.50)
   ICI monotherapy 1 (2.44) 1 (4.00) 0 (0.00)
   Chemo-ICI + anti-VEGF 4 (9.76) 2 (8.00) 2 (12.50)
bTMB, mut/Mb 439.22 (389.18, 479.46) 435.79 (386.54, 479.46) 441.61 (410.16, 473.33) 0.70
SVM-score 0.05 (0.05, 0.95) 0.05 (0.05, 0.05) 0.95 (0.95, 0.95) <0.001

Data are presented as number (%) or median (Q1, Q3). bTMB, blood tumor mutational burden; DCB, durable clinical benefit; ICI, immune checkpoint inhibitor; NSCLC, non-small cell lung cancer; SVM-score, support vector machine-derived score; TNM, tumor-node-metastasis.

Baseline blood TMB did not significantly differ between DCB and non-DCB groups in either the OAK (median 7.0 vs. 8.5 mut/Mb, P=0.25) or POPLAR cohort (median 6.5 vs. 8.0 mut/Mb, P=0.65), confirming its modest predictive capacity compared with integrated ctDNA signatures. Similarly, in the regional cohort, median bTMB values were not significantly associated with outcome (P=0.70).

SVM model predictive performance

Using 41 ctDNA-derived genomic features selected by LASSO regression, the SVM model demonstrated strong discriminative performance in the OAK training cohort. The median SVM score was 0.05 in non-DCB patients vs. 0.95 in DCB patients (P<0.001), producing an AUC of 0.87 (95% CI: 0.82–0.91) (Figure 5A).

Figure 5 SVM model performance validation. (A) OAK cohort ROC curve; (B) POPLAR cohort ROC curve; (C) IVC ROC curve; (D) model performance across cohorts. AUC, area under the ROC curve; CI, confidence interval; IVC, internal validation cohort; ROC, receiver operating characteristic; SVM, support vector machine.

External validation in the POPLAR cohort yielded comparable results: the SVM model distinguished outcomes with an AUC of 0.92 (95% CI: 0.86–0.97) (Figure 5B). Similarly, in the regional cohort, the model achieved an AUC of 0.83 (95% CI: 0.69–0.98) (Figure 5C). The model achieved high predictive accuracy across cohorts, with ROC curves shown in Figure 5. These results confirm the generalizability of the model across independent datasets, despite differences in treatment regimens.

Of note, the distribution of SVM scores was highly bimodal across cohorts, with most non-DCB patients clustering at the lower extreme (~0.05) and DCB patients at the higher extreme (~0.95). This distribution underscores the model’s ability to act as a near-binary classifier, facilitating patient stratification in a clinically interpretable manner.

Survival stratification by SVM score

The SVM score threshold of 0.55 was determined by the Youden index from the ROC curve in the OAK training cohort, stratifying patients into SVM-low (>0.55, low-risk group) and SVM-high (≤0.55, high-risk group; reference group). Across all cohorts, SVM-low patients achieved significantly longer PFS and OS compared with SVM-high patients (Figures 6,7).

Figure 6 SVM survival prediction in immunotherapy-treated NSCLC. (A) OAK cohort PFS prediction; (B) OAK cohort OS prediction; (C) POPLAR cohort PFS prediction; (D) POPLAR cohort OS prediction. CI, confidence interval; HR, hazard ratio; NSCLC, non-small cell lung cancer; OS, overall survival; PFS, progression-free survival; SVM, support vector machine.
Figure 7 SVM-based survival prediction in local cohort subgroups. (A) PFS in ctDNA-positive patients; (B) OS in ctDNA-positive patients; (C) PFS in SD-evaluated patients; (D) OS in SD-evaluated patients. CI, confidence interval; ctDNA, circulating tumor DNA; HR, hazard ratio; Inf, infinity; OS, overall survival; PFS, progression-free survival; SD, stable disease; SVM, support vector machine.

In OAK, mPFS was 11.6 vs. 1.7 months [hazard ratio (HR) =0.35; P<0.001] (Figure 6A), and median OS (mOS) was 19.5 vs. 9.3 months (HR =0.62, P<0.001) (Figure 6B). In POPLAR, mPFS was 11.5 vs. 1.9 months (HR =0.36, P<0.001) (Figure 6C), and mOS was 20.7 vs. 9.7 months (HR =0.475, P=0.003) (Figure 6D). In the regional cohort, which included 47 patients who completed the treatment and had adequate follow-up (Table S2), the mPFS was 12.8 vs. 3.0 months (HR =0.191, P=0.002) (Figure 7A), and the mOS was 17.3 vs. 6.4 months (HR =0.159, P=0.002) (Figure 7B).

Importantly, among patients with initial SD at the first radiologic evaluation, the SVM model stratified long-term outcomes: SVM-low patients had markedly longer survival than SVM-high patients [mPFS 12.8 vs. 4.2 months, P=0.04; mOS 15.6 vs. 13.4 months, P=0.03] (Figure 7C,7D). This suggests that the model may help distinguish pseudoprogression from true non-response, a recognized clinical challenge in immunotherapy (26).

Gene enrichment analysis

Functional annotation of the 41 selected genes revealed significant enrichment in biologically relevant pathways (Table S3). KEGG analysis identified VEGF signaling, Ras signaling, cell cycle regulation, JAK-STAT signaling, and neurotrophin signaling as key enriched pathways (Figure 8A,8B). These pathways are known to influence tumor immune evasion and therapeutic resistance.

Figure 8 Enriched signature genes and core pathways via KEGG profiling. (A) Key gene-pathway associations. (B) Enriched signaling pathways (bubble plot). FDR, false discovery rate; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Notably, genes such as STK11, KEAP1, TP53, RB1, EGFR, MTOR, PIK3R1, RAF1, FLT1, and CDKN2A were included, many of which have been implicated in immune suppression and poor outcomes with ICIs in NSCLC (27). The enrichment results provide mechanistic support for the predictive value of the identified ctDNA signature.


Discussion

In this study, we developed and validated a ctDNA-based machine learning model that predicts immunotherapy benefit in advanced NSCLC. Using baseline ctDNA WES data, we trained an SVM model in the OAK trial cohort and externally validated it in both the POPLAR trial and a regional prospective cohort. Across all datasets, the model demonstrated robust and reproducible predictive performance, with AUC exceeding 0.80. Patients categorized as SVM-low exhibited significantly longer PFS and OS compared with SVM-high patients.

These findings highlight the potential of ctDNA-derived mutational profiles as a noninvasive, pretreatment biomarker for predicting DCB from ICIs. The use of baseline ctDNA signatures offers a practical advantage over dynamic monitoring approaches, which require serial sampling and longitudinal follow-up (28). By leveraging a single timepoint sample, the model facilitates early patient stratification and may assist clinicians in optimizing ICI selection and avoiding ineffective therapy.

Traditional biomarkers such as PD-L1 expression and TMB have limited accuracy in predicting ICI response. The modest predictive power of blood TMB observed in the OAK and POPLAR studies was consistent with our findings (29). In contrast, our model integrates multiple somatic genomic features using a data-driven approach, thereby capturing a broader range of molecular determinants related to immune sensitivity and resistance.

Previous studies have suggested that certain driver mutations—such as STK11, KEAP1, PTEN, and EGFR—are associated with a non-inflamed tumor microenvironment and poor outcomes with ICIs (30). Our model incorporated several of these genes, which likely contributed to its predictive strength. Our model incorporated several of these genes, which likely contributed to its predictive strength. Emerging data suggest that NTRK1-related signaling may also modulate immune responsiveness and checkpoint inhibitor efficacy (31). Moreover, the enriched signaling pathways identified in this study, including VEGF, JAK-STAT, Ras, and cell-cycle regulation, are consistent with known immunosuppressive and oncogenic mechanisms. For instance, VEGF signaling can inhibit dendritic cell maturation and promote T-cell exhaustion, whereas JAK-STAT alterations may impair interferon-γ signaling and antigen presentation (32). These convergent findings reinforce the biological relevance of the ctDNA-derived gene set.

In addition, the bimodal distribution of SVM scores observed in our model suggests clear separation between responders and non-responders, which aligns with the clinical dichotomy frequently observed in ICI-treated NSCLC populations (33). The reproducibility of this distribution across three independent cohorts underscores the model’s stability and the potential to generalize beyond specific datasets or treatment regimens.

The clinical utility of the proposed SVM model lies in its ability to identify patients likely to derive benefit from ICIs prior to treatment initiation. This approach is particularly valuable given the heterogeneous and sometimes unpredictable responses observed with immunotherapy. By using pretreatment ctDNA data, our model could complement or even surpass traditional biomarkers such as PD-L1 and TMB in predictive accuracy.

Furthermore, among patients who initially achieved SD after two cycles of ICI therapy, the model successfully stratified long-term outcomes, distinguishing those with true disease control from those with eventual progression. This capability could help resolve the diagnostic ambiguity of pseudoprogression, a phenomenon that complicates early treatment decisions in immunotherapy. Consequently, the model may serve as a decision-support tool, helping clinicians determine whether to continue, adjust, or discontinue treatment.

Given that the regional validation cohort included patients receiving combination regimens (immunotherapy plus chemotherapy and/or antiangiogenic therapy), the model’s ability to retain discriminative power in this heterogeneous clinical context suggests potential applicability across therapeutic settings. Nevertheless, further validation in large-scale, prospective, and ethnically diverse populations is warranted to confirm its clinical robustness.

Several limitations should be acknowledged. First, the OAK and POPLAR cohorts represent Western populations, while our local validation cohort consisted primarily of East Asian patients. Ethnic and genomic differences may influence ctDNA mutational profiles and immune microenvironment characteristics (34). Therefore, future studies should include multiethnic and geographically diverse cohorts to enhance the generalizability of the model. Second, although the regional cohort introduced real-world heterogeneity in treatment regimens, including immunochemotherapy and anti-VEGF therapy, this variability might have confounded certain associations. Stratified analyses by treatment type in larger cohorts will be necessary to confirm the robustness of the predictive model under different therapeutic contexts. Third, the SVM score distribution appeared bimodal, indicating that the model primarily behaves as a binary classifier. While this enhances interpretability, it may limit its use for probabilistic predictions in intermediate cases. Future model iterations could incorporate calibration techniques or probabilistic frameworks to improve granularity. Fourth, the current study relied exclusively on baseline ctDNA features without integrating other potential biomarkers such as PD-L1 expression, tumor-infiltrating lymphocytes, or radiomic features. A multimodal approach combining ctDNA, imaging, and immune profiling might further improve predictive accuracy. Finally, the sample size of the local validation cohort was modest. Larger, prospective trials with serial sampling would allow the integration of dynamic ctDNA kinetics, providing a comprehensive framework for assessing both baseline prediction and early response monitoring (35).

In summary, our ctDNA-based SVM model provides a novel, noninvasive strategy for predicting ICI benefit in advanced NSCLC using a single pretreatment blood sample. The model demonstrated consistent predictive accuracy across multiple independent cohorts and revealed biologically relevant gene signatures associated with immune responsiveness. By improving pretreatment stratification, this approach could facilitate personalized immunotherapy and reduce unnecessary exposure to ineffective treatments. Further multicenter validation and integration with complementary biomarkers are ongoing to accelerate its clinical translation.

In this multi-cohort study, we developed and validated a ctDNA-based machine learning model capable of predicting immunotherapy benefit in patients with advanced NSCLC. By integrating pretreatment ctDNA mutational features using an SVM algorithm, the model accurately stratified patients into high- and low-benefit subgroups across three independent cohorts, achieving high reproducibility and robust predictive power. Unlike conventional biomarkers such as PD-L1 expression and TMB, which show limited predictive value, our model leverages the complexity of the ctDNA mutational landscape to capture multiple mechanisms influencing immune responsiveness. The identified 41-gene signature was enriched in key oncogenic and immune-regulatory pathways—including VEGF, JAK-STAT, and Ras signaling—consistent with prior studies on immune resistance in NSCLC. The main strength of this approach lies in its reliance on a single baseline liquid biopsy, enabling early patient selection without the need for serial sampling or invasive tissue biopsies. This feature makes the model practical for real-world applications. In addition, its predictive validity across both clinical trial and regional real-world cohorts supports potential generalizability. However, further work remains necessary. Prospective validation in larger, ethnically diverse populations is essential to confirm the model’s robustness and transferability. Future studies should also explore multimodal integration—combining ctDNA data with PD-L1 expression, radiomic features, and tumor microenvironment markers—to further enhance prediction accuracy.


Conclusions

In conclusion, this ctDNA-based SVM model offers a powerful and noninvasive tool for personalizing immunotherapy in NSCLC. With continued validation, it may support precision treatment decisions, improve clinical outcomes, and help optimize the deployment of ICIs in oncology practice.


Acknowledgments

We sincerely thank all patients and their families for participating in the clinical trials and regional cohort that made this study possible. We also thank the clinical staff and bioinformatics teams for their valuable technical assistance in data collection and processing.


Footnote

Reporting Checklist: The authors have completed the TRIPOD reporting checklist. Available at https://tcr.amegroups.com/article/view/10.21037/tcr-2025-1182/rc

Data Sharing Statement: Available at https://tcr.amegroups.com/article/view/10.21037/tcr-2025-1182/dss

Peer Review File: Available at https://tcr.amegroups.com/article/view/10.21037/tcr-2025-1182/prf

Funding: This work was supported by the Qian Dong Nan Science and Technology Program (No. qdnkhJz [2023] 14), Scientific Research Project of Guizhou Provincial Health and Wellness Commission (Nos. gzwkj2024-099 and gzwkj2025-608), Guizhou Medical University National Natural Science Foundation Cultivation Project (No. 25NSFCP35), Public Hospital High-Quality Development Research Public Welfare Project Fund (No. GL-A014), Cultivation of High-Level Innovative Talents in Guizhou Province (No. qian qian ceng ren cai[2024]202215), Spark Program (No. XHJH-0048), Chen Xiao-Ping Foundation for the Development of Science and Technology of Hubei Province (No. CXPJJH125009-05) and Wu Jieping Medical Foundation (No. 320.6750.2025-16-21).

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://tcr.amegroups.com/article/view/10.21037/tcr-2025-1182/coif). The authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. This study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments. The study was approved by the Ethics Committee of Guizhou Medical University Affiliated Hospital (No. 2023-LS-02), and all participating hospitals were informed and agreed the study. All participants provided written informed consent.

Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.


References

  1. Wolf E, Sacchi de Camargo Correia G, Li S, et al. Emerging Immunotherapies for Advanced Non-Small-Cell Lung Cancer. Vaccines (Basel) 2025;13:128. [Crossref] [PubMed]
  2. Remon J, Bortolot M, Bironzo P, et al. De-Escalation Strategies With Immune Checkpoint Blockers in Non-Small Cell Lung Cancer: Do We Already Have Enough Evidence? J Clin Oncol 2025;43:1148-56. [Crossref] [PubMed]
  3. Liguori L, Giorgio G, Polcaro G, et al. Checkpoint based immunotherapy in non-small cell lung cancer: a real-world retrospective study. Front Immunol 2024;15:1419544. [Crossref] [PubMed]
  4. Liu R, Rizzo S, Wang L, et al. Characterizing mutation-treatment effects using clinico-genomics data of 78,287 patients with 20 types of cancers. Nat Commun 2024;15:10884. [Crossref] [PubMed]
  5. Antonia SJ, Villegas A, Daniel D, et al. Durvalumab after Chemoradiotherapy in Stage III Non-Small-Cell Lung Cancer. N Engl J Med 2017;377:1919-29. [Crossref] [PubMed]
  6. Saad ED, Coart E, Deltuvaite-Thomas V, et al. Trial Design for Cancer Immunotherapy: A Methodological Toolkit. Cancers (Basel) 2023;15:4669. [Crossref] [PubMed]
  7. Ye J, Ji X, Dennis PA, et al. Relationship Between Progression-Free Survival, Objective Response Rate, and Overall Survival in Clinical Trials of PD-1/PD-L1 Immune Checkpoint Blockade: A Meta-Analysis. Clin Pharmacol Ther 2020;108:1274-88. [Crossref] [PubMed]
  8. Su H, Yu C, Sun G, et al. Prognostic value of immunotherapy in advanced NSCLC based on baseline and dynamic changes in HALP. Biomol Biomed 2024;25:29-41. [Crossref] [PubMed]
  9. Sogbe M, Aliseda D, Sangro P, et al. Prognostic value of circulating tumor DNA in different cancer types detected by ultra-low-pass whole-genome sequencing: a systematic review and patient-level survival data meta-analysis. Carcinogenesis 2025;46:bgae073. [Crossref] [PubMed]
  10. Marinello A, Tagliamento M, Pagliaro A, et al. Circulating tumor DNA to guide diagnosis and treatment of localized and locally advanced non-small cell lung cancer. Cancer Treat Rev 2024;129:102791. [Crossref] [PubMed]
  11. Hitchen N, Shahnam A, Tie J. Circulating Tumor DNA: A Pan-Cancer Biomarker in Solid Tumors with Prognostic and Predictive Value. Annu Rev Med 2025;76:207-23. [Crossref] [PubMed]
  12. Gandara DR, Paul SM, Kowanetz M, et al. Blood-based tumor mutational burden as a predictor of clinical benefit in non-small-cell lung cancer patients treated with atezolizumab. Nat Med 2018;24:1441-8. [Crossref] [PubMed]
  13. Alessi JV, Elkrief A, Ricciuti B, et al. Clinicopathologic and Genomic Factors Impacting Efficacy of First-Line Chemoimmunotherapy in Advanced NSCLC. J Thorac Oncol 2023;18:731-43. [Crossref] [PubMed]
  14. Liu F, Zhang X, Lu M, et al. The association of genomic alterations with PD-L1 expression in Chinese patients with EGFR/ALK wild-type lung adenocarcinoma and potential predictive value of Hippo pathway mutations to immunotherapy. Cancer Med 2024;13:e7038. [Crossref] [PubMed]
  15. Anagnostou V, Moore S, Ho C, et al. A biomarker-directed, multi-center phase II/III study of ctDNA molecular response adaptive immuno-chemotherapy in patients with non-small cell lung cancer. J Clin Oncol 2024;42:TPS8669.
  16. Assaf ZJF, Zou W, Fine AD, et al. A longitudinal circulating tumor DNA-based model associated with survival in metastatic non-small-cell lung cancer. Nat Med 2023;29:859-68. [Crossref] [PubMed]
  17. Bayle A, Belcaid L, Palmieri LJ, et al. Circulating tumor DNA landscape and prognostic impact of acquired resistance to targeted therapies in cancer patients: a national center for precision medicine (PRISM) study. Mol Cancer 2023;22:176. [Crossref] [PubMed]
  18. Xia L, Mei J, Kang R, et al. Perioperative ctDNA-Based Molecular Residual Disease Detection for Non-Small Cell Lung Cancer: A Prospective Multicenter Cohort Study (LUNGCA-1). Clin Cancer Res 2022;28:3308-17. [Crossref] [PubMed]
  19. Reis TC. Deep learning in oncology: transforming cancer diagnosis, prognosis, and treatment. Emerg Trends Drugs Addict Health 2025;5:100171.
  20. Prelaj A, Miskovic V, Zanitti M, et al. Artificial intelligence for predictive biomarker discovery in immuno-oncology: a systematic review. Ann Oncol 2024;35:29-65. [Crossref] [PubMed]
  21. Yang W, Chen C, Ouyang Q, et al. Machine learning models for predicting of PD-1 treatment efficacy in Pan-cancer patients based on routine hematologic and biochemical parameters. Cancer Cell Int 2024;24:258. [Crossref] [PubMed]
  22. Mu W, Jiang L, Shi Y, et al. Non-invasive measurement of PD-L1 status and prediction of immunotherapy response using deep learning of PET/CT images. J Immunother Cancer 2021;9:e002118. [Crossref] [PubMed]
  23. Socinski MA, Jotte RM, Cappuzzo F, et al. Atezolizumab for First-Line Treatment of Metastatic Nonsquamous NSCLC. N Engl J Med 2018;378:2288-301. [Crossref] [PubMed]
  24. Peng J, Zou D, Han L, et al. A Support Vector Machine Based on Liquid Immune Profiling Predicts Major Pathological Response to Chemotherapy Plus Anti-PD-1/PD-L1 as a Neoadjuvant Treatment for Patients With Resectable Non-Small Cell Lung Cancer. Front Immunol 2021;12:778276. [Crossref] [PubMed]
  25. Peng J, Zhang X, Hu Y, et al. Deep learning to estimate response of concurrent chemoradiotherapy in non-small-cell lung carcinoma. J Transl Med 2024;22:896. [Crossref] [PubMed]
  26. Nabet BY, Esfahani MS, Moding EJ, et al. Noninvasive Early Identification of Therapeutic Benefit from Immune Checkpoint Inhibition. Cell 2020;183:363-376.e13. [Crossref] [PubMed]
  27. Provencio-Pulla M, Pérez-Parente D, Olson S, et al. Identification of non-actionable mutations with prognostic and predictive value in patients with advanced or metastatic non-small cell lung cancer. Clin Transl Oncol 2024;26:1384-94. [Crossref] [PubMed]
  28. Ai X, Jia B, He Z, et al. Noninvasive early identification of durable clinical benefit from immune checkpoint inhibition: a prospective multicenter study (NCT04566432). Signal Transduct Target Ther 2024;9:350. [Crossref] [PubMed]
  29. Chen YT, Seeruttun SR, Wu XY, et al. Maximum Somatic Allele Frequency in Combination With Blood-Based Tumor Mutational Burden to Predict the Efficacy of Atezolizumab in Advanced Non-small Cell Lung Cancer: A Pooled Analysis of the Randomized POPLAR and OAK Studies. Front Oncol 2019;9:1432. [Crossref] [PubMed]
  30. Gutiérrez-Babativa L, Wagner-Gutiérrez N, Rojas L, et al. Overcoming immunotherapy resistance in non-small cell lung cancer: a narrative review of related factors. Immunotherapy 2025;17:823-33. [Crossref] [PubMed]
  31. Smith MR, Dixon CB, Wang Y, et al. Targeting NTRK1 Enhances Immune Checkpoint Inhibitor Efficacy in NTRK1 Wild-Type Non-Small Cell Lung Cancer. Cancer Res 2024;84:4002-16. [Crossref] [PubMed]
  32. Lee WS, Yang H, Chon HJ, et al. Combination of anti-angiogenic therapy and immune checkpoint blockade normalizes vascular-immune crosstalk to potentiate cancer immunity. Exp Mol Med 2020;52:1475-85. [Crossref] [PubMed]
  33. Molina-Arcas M, Downward J. Exploiting the therapeutic implications of KRAS inhibition on tumor immunity. Cancer Cell 2024;42:338-57. [Crossref] [PubMed]
  34. Chen J, Yang H, Teo ASM, et al. Genomic landscape of lung adenocarcinoma in East Asians. Nat Genet 2020;52:177-86. [Crossref] [PubMed]
  35. Wang Y, Wang W, Zhang T, et al. Dynamic bTMB combined with residual ctDNA improves survival prediction in locally advanced NSCLC patients with chemoradiotherapy and consolidation immunotherapy. J Natl Cancer Cent 2024;4:177-87. [Crossref] [PubMed]
Cite this article as: Xia J, He T, Hu Y, Zhou D, Zou D, Li Y, Zhang M, Li B, Wang M, Liu X, Huang Z, Su S, Peng J. Development and validation of a circulating tumor DNA-based machine learning model for predicting immunotherapy response in non-small cell lung cancer. Transl Cancer Res 2025;14(12):8778-8791. doi: 10.21037/tcr-2025-1182

Download Citation