Development and validation of nomograms predicting survival in operable breast cancer patients at reproductive age after breast conserving surgery and postoperative radiotherapy based on SEER database
Original Article

Development and validation of nomograms predicting survival in operable breast cancer patients at reproductive age after breast conserving surgery and postoperative radiotherapy based on SEER database

Sirui Zhu1#, Ke Zhang1#, Wei Lu1#, Huaiyu Yang1, Chenxuan Yang1, Changyuan Guo2, Lei Guo2, Xuemin Xue2, Zhongzhao Wang1, Lixue Xuan1

1Department of Breast Surgical Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China; 2Department of Pathology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China

Contributions: (I) Conception and design: S Zhu, K Zhang; (II) Administrative support: W Lu, H Yang; (III) Provision of study materials or patients: Z Wang, L Xuan; (IV) Collection and assembly of data: C Yang, C Guo; (V) Data analysis and interpretation: L Guo, X Xue; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

#These authors contributed equally to this work.

Correspondence to: Lixue Xuan, PhD. Department of Breast Surgical Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, No. 17 Panjiayuan Nanli, Chaoyang District, Beijing 100021, China. Email: xuanlx@hotmail.com.

Background: Breast cancer (BC) in women of reproductive age (<45 years) is characterized by aggressive biology and elevated recurrence risk despite curative breast-conserving surgery (BCS) and radiotherapy. Current staging systems inadequately predict outcomes in this population, necessitating precision tools to guide therapy. This study aims to develop validated nomograms integrating clinicopathologic variables to improve survival prediction and therapeutic personalization for young operable BC patients.

Methods: Using the Surveillance, Epidemiology, and End Results (SEER) database (2010–2016), we analyzed 9,477 women aged 20–45 years with operable BC (T1–3N0–1M0) treated with BCS and radiotherapy. Prognostic factors for overall survival (OS) and cancer-specific survival (CSS) were identified via Cox regression. Nomograms integrating clinicopathologic variables were developed (training cohort: n=6,633) and validated (test cohort: n=2,844) using C-index, time-dependent receiver operating characteristic (ROC) curves, and decision curve analysis. Chemotherapy benefits were assessed across different risk subgroups.

Results: Tumor grade [poorly differentiated: hazard ratio (HR) =4.46 for OS; HR =6.01 for CSS], lymph node metastasis (N1: HR =1.96 for OS; HR =2.25 for CSS), and multiple primaries which was defined as 2nd of two or more primaries (HR =2.33 for OS) independently predicted poorer survival. Human epidermal growth factor receptor 2 (HER2) positivity (HR =0.54 for OS) and progesterone receptor (PR) positivity (HR =0.50 for CSS) were protective factors. Nomograms outperformed the American Joint Committee on Cancer (AJCC) 7th edition staging system, with C-indices of 0.77 (OS) and 0.82 (CSS) in training, and 0.75 (OS) and 0.78 (CSS) in validation. Chemotherapy worsened outcomes in low-risk patient but benefited high-risk subgroups, particularly neoadjuvant chemotherapy for patients with complete response (HR =0.41 for OS).

Conclusions: This study establishes validated nomograms that improve survival prediction for young, operable BC patients, identifying high-risk subgroups likely to benefit from neoadjuvant chemotherapy. Conversely, low-risk patients may be spared unnecessary treatment. Prospective validation integrating molecular biomarkers is warranted to refine therapeutic personalization.

Keywords: Operable breast cancer (OBC); reproductive age; Surveillance, Epidemiology, and End Results (SEER); nomogram


Submitted May 21, 2025. Accepted for publication Aug 26, 2025. Published online Oct 29, 2025.

doi: 10.21037/tcr-2025-1059


Highlight box

Key findings

• This study developed and validated prognostic nomograms for operable breast cancer (BC) patients aged 20–45 years, integrating clinicopathological variables (tumor grade, lymph node status, hormone receptor status, etc.), with superior predictive accuracy (C-index: 0.77–0.82) over the American Joint Committee on Cancer (AJCC) 7th staging system.

• This study identified high-risk subgroups benefiting from neoadjuvant chemotherapy [complete responders: hazard ratio (HR) =0.41 for overall survival (OS)], while low-risk patients showed worsened outcomes with chemotherapy.

• Human epidermal growth factor receptor 2 (HER2) positivity [HR =0.54 for OS] and progesterone receptor (PR) positivity [HR =0.50 for cancer-specific survival (CSS)] were protective factors, whereas poorly differentiated tumors (HR =6.01 for CSS) and multiple primaries (HR =2.33 for OS) predicted poorer survival.

What is known and what is new?

• Young BC patients face aggressive biology and high recurrence risk despite breast-conserving surgery (BCS) and radiotherapy. Current staging systems inadequately predict outcomes.

• This study firstly developed validated nomograms specifically for young operable patients, incorporating chemotherapy response stratification (neoadjuvant vs. adjuvant) and identifying differential benefits by risk group.

What is the implication, and what should change now?

• Nomograms enable personalized risk assessment, guiding chemotherapy decisions: high-risk patients (especially complete responders) should prioritize neoadjuvant therapy, while low-risk patients may avoid overtreatment.

• Prospective validation integrating molecular biomarkers is warranted to refine therapeutic strategies. Clinicians should adopt these tools for young patients to optimize survival outcomes and reduce unnecessary toxicity.


Introduction

Breast cancer (BC) is the most commonly diagnosed malignancy and one of the major causes of cancer-related mortality globally, with a higher incidence in women (1,2). While the risk of BC increases with age, particularly in women over 50 years, it remains the most common malignancy among younger women of reproductive age (<45 years). In fact, over 40% of cancers diagnosed in women aged 45 and younger are BCs (3). Age is a critical determinant of long-term BC survival, with younger patients typically facing a worse prognosis than older age groups (4,5). Furthermore, several studies have shown that BC in young patients is often more aggressive, characterized by factors such as poorer tumor grade, more frequent breast cancer susceptibility gene 1/2 (BRCA1/2) mutations and lymphovascular invasion, all of which are associated with a poorer prognosis (6). The recent report indicates that the BC incidence rates among young women are 0.1, 5.7, and 46.6 per 100,000 for those aged 15–19, 20–29, and 30–39 years, respectively (7). Moreover, approximately 43% and 7% of young women present with regional or distant-stage disease, while BC-specific mortality reaches 22% in those aged 15–45 years (8).

Operable breast cancer (OBC) encompassing stages I, II, and III, represents more than 90% of all BC diagnoses (9). Despite the availability of curative treatment options for OBC—including surgery, radiation, chemotherapy, and adjuvant endocrine therapy—more than 30% of patients will experience recurrence, often with distant metastases, which remains incurable (10,11). Advancements in surgical techniques, improved collaboration with pathology, and the integration of radiotherapy and systemic therapies have led to increasing recognition of breast-conserving surgery (BCS) for its safety, cosmetic outcomes, reduced complications, and enhanced quality of life, as endorsed by both patients and surgeons (12). BCS combined with postoperative radiotherapy has become the standard surgical approach for the majority of early-stage BC cases (13). However, some studies have identified that patients with high-risk clinical features—such as larger tumor size, more advanced stage, extensive axillary lymph node involvement, and poorer histologic grade—are at an elevated risk of recurrence (14,15). Furthermore, some patients with early-stage and locally advanced BC may benefit from neoadjuvant chemotherapy, as tumor response during and after treatment can be assessed and guide subsequent treatment decisions (16).Thus, identifying patients at high risk of recurrence can help optimize treatment strategies (17,18).

To our knowledge, no systematic study has yet identified survival risk factors in OBC patients of reproductive age. Thus, this study aims to identify independent prognostic factors for overall survival (OS) and cancer-specific survival (CSS) in this cohort, developing a visual predictive model to enable high-risk patient identification for personalized management. We present this article in accordance with the TRIPOD reporting checklist (available at https://tcr.amegroups.com/article/view/10.21037/tcr-2025-1059/rc).


Methods

Data collection and patient selection

Patients with BC were identified from the Surveillance, Epidemiology, and End Results (SEER) Research Plus Data 22 registry (2000–2019). To ensure staging consistency using the 7th American Joint Committee on Cancer (AJCC) system, we restricted analysis to cases diagnosed between January 2010 and December 2016. Inclusion criteria comprised: (I) age 20–45 years; (II) completion of BCS with postoperative radiotherapy; (III) tumor, node, metastasis (TNM) stage T1–3N0–1M0. This cohort enabled analysis of survival rates and prognostic factors.

Each patient’s comprehensive information encompassed a range of age, race, laterality, TNM stage, grade, location, histological type, lymph node metastasis, BC subtype, tumor size, estrogen receptor (ER) status, progesterone receptor (PR) status, human epidermal growth factor receptor 2 (HER2) status, cause-specific death, tumor sequence, response to neoadjuvant therapy and survival months (more than 0 days of survival). Specifically, patients confirmed as achieving pathologic complete response (CR) upon postoperative specimen analysis are defined as having a CR. Patients stated as having no response or partial response are defined as having a non-complete response (NCR).

Patients were randomly allocated to training and validation cohorts (7:3 ratio) to ensure analytical robustness. Exclusion criteria comprised: (I) incomplete survival/follow-up data; (II) zero survival time or missing survival data. The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments.

Statistical analysis

Through the utilization of Cox regression models, we performed calculations to determine a 95% confidence interval (CI) and hazard ratio (HR). In order to identify potential prognostic factors, those showing significant differences in the univariate Cox regression analysis were further examined through multivariate analysis.

Using R software, we developed multivariate analysis-based nomograms to predict 3-, 5-, and 7-year OS and CSS. Model discrimination was evaluated using the concordance index (C-index), time-dependent receiver operating characteristic (ROC) curves, and area under the curve (AUC). Calibration plots compared predicted versus observed survival. Clinical utility was assessed via decision curve analysis (DCA), quantifying net benefits across threshold probabilities. The development cohort was segmented into risk groups based on total points, and survival differences were analyzed using the Kaplan-Meier method with log-rank tests. Propensity score matching (PSM) was applied to evaluate clinical intervention effects on outcomes. Statistical analyses used R (v3.6.1) and Statistical Package for the Social Sciences​(SPSS v25.0; International Business Machines Corporation). The “rms”, “survival”, “magick”, “timeROC”, “ggplotify”, and “cowplot” R packages facilitated nomogram development and validation. Statistical significance was defined as P<0.05.


Results

Baseline clinical features

In summary, this study included 9,477 women patients, categorized into a training cohort of 6,633 and a test cohort of 2,844. The majority of patients are between the ages of 40 to 44 years, with a median survival time of 66 months. All patients underwent BCS with postoperative radiotherapy, with more than half (61.06%) receiving chemotherapy. The majority of tumors were classified as grade III/IV (poorly differentiated and undifferentiated), accounting for 41.19% of cases, while only 18.99% and 39.81% of patients exhibited grade I (well-differentiated) and grade II (moderately differentiated) tumors, respectively. Infiltrating ductal carcinoma was the predominant histologic subtype, observed in 85.92% of patients. Chemotherapy was administered to over half of the patients (61.06%). The most common BC subtype was hormone receptor-positive (HR+)/HER2-negative (HER2−) (68.19%). ER positivity was present in 80.16% of cases, and PR positivity in 74.52%, while HER2 negativity was found in the majority (82.69%). In terms of tumor staging, stage I was the most frequent (53.76%), with T1 being the most common T stage (61.89%) and N0 being the most prevalent N stage (74.29%). Basic characteristics and variance analyses are detailed in Table 1. No significant differences in feature distributions were found between the training and test sets using Mann-Whitney U and Chi-squared (χ2) tests.

Table 1

Baseline characteristics of operable breast cancer patients in the training and test sets

Variables Total (n=9,477) Test (n=2,844) Train (n=6,633) P
Age 0.27
   20–24 years 31 (0.33) 9 (0.32) 22 (0.33)
   25–29 years 246 (2.60) 88 (3.09) 158 (2.38)
   30–34 years 799 (8.43) 226 (7.95) 573 (8.64)
   35–39 years 2,100 (22.16) 637 (22.40) 1,463 (22.06)
   40–44 years 6,301 (66.49) 1,884 (66.24) 4,417 (66.59)
Race 0.06
   Black 1,200 (12.66) 380 (13.36) 820 (12.36)
   Others 1,488 (15.70) 474 (16.67) 1,014 (15.29)
   White 6,789 (71.64) 1,990 (69.97) 4,799 (72.35)
Primary site 0.40
   Upper-inner quadrant of breast 1,446 (15.26) 448 (15.75) 998 (15.05)
   Lower-inner quadrant of breast 537 (5.67) 144 (5.06) 393 (5.92)
   Upper-outer quadrant of breast 3,753 (39.60) 1,139 (40.05) 2,614 (39.41)
   Lower-outer quadrant of breast 842 (8.88) 258 (9.07) 584 (8.80)
   Others 2,899 (30.59) 855 (30.06) 2,044 (30.82)
Grade 0.61
   I 1,800 (18.99) 538 (18.92) 1,262 (19.03)
   II 3,773 (39.81) 1,153 (40.54) 2,620 (39.50)
   III/IV 3,904 (41.19) 1,153 (40.54) 2,751 (41.47)
Histology 0.62
   Infiltrating duct carcinoma 8,143 (85.92) 2,436 (85.65) 5,707 (86.04)
   Others 1,334 (14.08) 408 (14.35) 926 (13.96)
Laterality 0.75
   Left 4,725 (49.86) 1,425 (50.11) 3,300 (49.75)
   Right 4,752 (50.14) 1,419 (49.89) 3,333 (50.25)
Chemotherapy 0.35
   No/unknown 3,690 (38.94) 1,087 (38.22) 2603 (39.24)
   Yes 5,787 (61.06) 1,757 (61.78) 4,030 (60.76)
Breast subtype 0.23
   HR−/HER2− 1,375 (14.51) 391 (13.75) 984 (14.83)
   HR−/HER2+ 385 (4.06) 130 (4.57) 255 (3.84)
   HR+/HER2− 6,462 (68.19) 1,947 (68.46) 4,515 (68.07)
   HR+/HER2+ 1,255 (13.24) 376 (13.22) 879 (13.25)
ER 0.49
   Negative 1,880 (19.84) 552 (19.41) 1,328 (20.02)
   Positive 7,597 (80.16) 2,292 (80.59) 5,305 (79.98)
PR 0.76
   Negative 2,415 (25.48) 719 (25.28) 1,696 (25.57)
   Positive 7,062 (74.52) 2,125 (74.72) 4,937 (74.43)
HER2 0.41
   Negative 7,837 (82.69) 2,338 (82.21) 5,499 (82.90)
   Positive 1,640 (17.31) 506 (17.79) 1,134 (17.10)
Sequence number 0.61
   2nd of two or more primaries 371 (3.91) 107 (3.76) 264 (3.98)
   One primary only 9,106 (96.09) 2,737 (96.24) 6,369 (96.02)
Marital status 0.608
   Married 5,821 (61.42) 1,758 (61.81) 4,063 (61.25)
   Unmarried 3,656 (38.58) 1,086 (38.19) 2,570 (38.75)
Stage 0.41
   I 5,095 (53.76) 1,555 (54.68) 3,540 (53.37)
   II 4,257 (44.92) 1,249 (43.92) 3,008 (45.35)
   IIIA 125 (1.32) 40 (1.41) 85 (1.28)
T stage 0.60
   T1 5,865 (61.89) 1,780 (62.59) 4,085 (61.59)
   T2 252 (2.66) 77 (2.71) 175 (2.64)
   T3 3,360 (35.45) 987 (34.70) 2,373 (35.78)
N stage 0.78
   N0 7,040 (74.29) 2,118 (74.47) 4,922 (74.20)
   N1 2,437 (25.71) 726 (25.53) 1,711 (25.80)

Data are presented as n (%). ER, estrogen receptor; HER2−, human epidermal growth factor receptor 2-negative; HER2+, human epidermal growth factor receptor 2-positive; HR−, hormone receptor-negative; HR+, hormone receptor-positive; N, lymph node; PR, progesterone receptor; T, tumor.

Variable feature importance of survival prediction

To identify prognostic factors for OS, we conducted univariate and multivariate Cox regression analyses in the training set (Table 2). Both analyses indicated several risk factors for OS outcomes: grade, PR status, HER2 status, tumor sequence, marital status, T stage, and N stage. Specifically, moderately differentiated (HR =2.79; 95% CI: 1.48–5.26), poorly differentiated and undifferentiated (HR =4.46; 95% CI: 2.35–8.45), unmarried (HR =4.46; 95% CI: 2.35–8.45), T2 stage (HR =1.90; 95% CI: 1.03–3.51), T3 stage (HR =2.11; 95% CI: 1.63–2.74) and N1 stage (HR =1.96; 95% CI: 2.35–8.45). In contrast, PR-positive status (HR =0.59; 95% CI: 0.45–0.76), HER2-positive status (HR =0.54; 95% CI: 0.39–0.76) and only one primary tumor (HR =0.43; 95% CI: 0.28–0.68) were linked to improved OS.

Table 2

Univariate and multivariate Cox analysis for OS of patients

Characteristics Univariate analysis Multivariate analysis
Hazard ratio (95% CI) P value Hazard ratio (95% CI) P value
Grade
   I Ref
   II 3.59 (1.92–6.73) <0.001* 2.79 (1.48–5.26) 0.002
   III/IV 8.42 (4.59–15.45) <0.001* 4.46 (2.35–8.45) <0.001*
Histology
   Infiltrating duct carcinoma Ref Ref
   Others 0.61 (0.42–0.90) 0.01 0.82 (0.56–1.22) 0.33
Chemotherapy
   No/unknown Ref Ref
   Yes 3.14 (2.31–4.27) <0.001* 1.02 (0.70–1.48) 0.91
PR
   Negative Ref Ref
   Positive 0.34 (0.27–0.43) <0.001* 0.59 (0.45–0.76) <0.001*
HER2
   Negative Ref Ref
   Positive 0.74 (0.53–0.95) 0.045 0.54 (0.39–0.76) <0.001*
ER
   Negative Ref Ref
   Positive 0.36 (0.29–0.45) <0.001* 0.75 (0.52–1.07) 0.11
Sequence number
   2nd of two or more primaries Ref Ref
   One primary only 0.58 (0.37–0.90) 0.01 0.43 (0.28–0.68) <0.001*
Marital status
   Married Ref Ref
   Unmarried 1.49 (1.18–1.87) <0.001* 1.37 (1.09–1.73) 0.007
T stage
   T1 Ref Ref
   T2 3.27 (1.80–5.94) <0.001* 1.90 (1.03–3.51) 0.03
   T3 3.28 (2.58–4.18) <0.001* 2.11 (1.63–2.74) <0.001*
N stage
   N0 Ref Ref
   N1 2.44 (1.94–3.07) <0.001* 1.96 (1.55–2.48) <0.001*
Race
   Black Ref Ref
   White 0.56 (0.42–0.74) <0.001* 0.85 (0.63–1.14) 0.28
   Others 0.38 (0.24–0.59) <0.001* 0.56 (0.36–1.24) 0.054
Primary site
   Upper-inner quadrant of breast Ref
   Lower-inner quadrant of breast 1.39 (0.83–2.34) 0.20
   Upper-outer quadrant of breast 1.09 (0.76–1.56) 0.62
   Lower-outer quadrant of breast 0.59 (0.33–1.07) 0.08
   Others 1.26 (0.88–1.81) 0.21
Laterality
   Left Ref
   Right 0.84 (0.67–1.05) 0.13
Stage
   I Ref Ref
   II 3.20 (2.48–4.13) <0.001* 0.80 (0.47–1.38) 0.42
   IIIA 4.77 (2.31–9.86) <0.001* 0.84 (0.21–3.39) 0.81

*, statistical significance. CI, confidence interval; ER, estrogen receptor; HER2, human epidermal growth factor receptor 2; N, lymph node; OS, overall survival; PR, progesterone receptor; T, tumor.

For CSS, independent prognostic factors included grade, PR status, HER2 status, tumor sequence, T stage and N stage (Table 3). Notably, moderately differentiated (HR =3.43; 95% CI: 1.47–8.00), poorly differentiated and undifferentiated (HR =6.01; 95% CI: 2.57–14.04), T2 stage (HR =2.54; 95% CI: 1.87–3.43), T3 stage (HR =3.44; 95% CI: 1.93–6.11) and N1 stage (HR =2.25; 95% CI: 1.73–2.91) correlated with poorer CSS. Furthermore, PR-positive status (HR =0.50; 95% CI: 0.37–0.68), HER2-positive status (HR =0.36; 95% CI: 0.23–0.56) only one primary tumor (HR =0.48; 95% CI: 0.29–0.79) were associated with better CSS outcomes.

Table 3

Univariate and multivariate Cox analysis for CSS of patients

Characteristics Univariate analysis Multivariate analysis
Hazard ratio (95% CI) P value Hazard ratio (95% CI) P value
Grade
   I Ref Ref
   II 6.18 (2.49–15.37) <0.001* 3.43 (1.47–8.00) 0.004
   III/IV 8.42 (4.59–15.45) <0.001* 6.01 (2.57–14.04) <0.001*
Histology
   Infiltrating duct carcinoma Ref Ref
   Others 0.49 (0.31–0.78) 0.003 0.68 (0.44–1.07) 0.096
Chemotherapy
   No/unknown Ref Ref
   Yes 5.26 (3.53–7.85) <0.001* 1.51 (0.95–2.41) 0.082
PR
   Negative Ref Ref
   Positive 0.24 (0.19–0.31) <0.001* 0.50 (0.37–0.68) <0.001*
HER2
   Negative Ref Ref
   Positive 0.46 (0.30–0.73) <0.001* 0.36 (0.23–0.56) <0.001*
ER
   Negative Ref Ref
   Positive 0.27 (0.21–0.35) <0.001* 0.77 (0.53–1.11) 0.15
Sequence number
   2nd of two or more primaries Ref Ref
   One primary only 0.54 (0.34–0.88) 0.01 0.48 (0.29–0.79) 0.004
Marital status
   Married Ref Ref
   Unmarried 1.35 (1.05–1.74) 0.02 1.18 (0.91–1.53) 0.20
T stage
   T1 Ref Ref
   T2 4.77 (2.58–8.79) <0.001* 2.54 (1.87–3.43) <0.001*
   T3 4.25 (3.21–5.62) <0.001* 3.44 (1.93–6.11) <0.001*
N stage
   N0 Ref Ref
   N1 3.23 (2.51–4.15) <0.001* 2.25 (1.73–2.91) <0.001*
Race
   Black Ref Ref
   White 0.54 (0.39–0.73) <0.001* 0.88 (0.64–1.22) 0.45
   Others 0.41 (0.26–0.65) <0.001* 0.70 (0.44–1.12) 0.13
Primary site
   Upper-inner quadrant of breast Ref
   Lower-inner quadrant of breast 0.91 (0.47–1.74) 0.76
   Upper-outer quadrant of breast 1.01 (0.69–1.48) 0.97
   Lower-outer quadrant of breast 0.65 (0.35–1.20) 0.16
   Others 1.14 (0.77–1.69) 0.50
Laterality
   Left Ref
   Right 1.03 (0.80–1.33) 0.80
Stage
   I Ref Ref
   II 4.40 (3.24–5.97) <0.001* 0.80 (0.44–1.45) 0.45
   IIIA 7.49 (3.56–15.75) <0.001* 0.83 (0.20–3.46) 0.79

*, statistical significance. CI, confidence interval; CSS, cancer-specific survival; ER, estrogen receptor; HER2, human epidermal growth factor receptor 2; N, lymph node; OS, overall survival; PR, progesterone receptor; T, tumor.

Nomogram construction

Based on multivariate analysis of the training cohort, we constructed a nomogram predicting OS and CSS (Figures 1,2). Each prognostic factor was assigned a score between 0 and 100, reflecting its impact on the model’s predictive accuracy. By aggregating these scores for each patient, we derived a total point value that facilitated the estimation of 3-, 5- and 7-year OS and CSS probabilities. Importantly, higher total scores were associated with a worse prognosis for patients.

Figure 1 Nomogram for OS prediction in the patients. HER2, human epidermal growth factor receptor 2; N, lymph node; OS, overall survival; PR, progesterone receptor; T, tumor.
Figure 2 Nomogram for CSS prediction in the patients. CSS, cancer-specific survival; HER2, human epidermal growth factor receptor 2; N, lymph node; PR, progesterone receptor; T, tumor.

Model validation

In the training cohort, the OS nomogram achieved a significantly higher C-index than the AJCC 7th staging system [0.77 (95% CI: 0.75–0.80) vs. 0.69 (95% CI: 0.66–0.71)]. Time-dependent ROC analysis yielded AUCs of 0.81 (3-year), 0.78 (5-year), and 0.76 (7-year) (Figure 3). Validation confirmed these findings: C-index =0.75 (0.73–0.77) with corresponding 3-/5-/7-year AUCs of 0.81, 0.76 and 0.76 (Figure 3). Calibration plots demonstrated excellent agreement between predicted and observed outcomes in both cohorts (training: Figure 4A-4C; validation: Figure 4D-4F). DCA revealed superior clinical utility of the nomogram over AJCC 7th staging in training (Figure 5A) and validation sets (Figure 5B).

Figure 3 ROC curves of the nomogram for 3-year (A), 5-year (B) and 7-year (C) OS in the training set and validation set. AUC, area under the curve; CI, confidence interval; OS, overall survival; ROC, receiver operating characteristic.
Figure 4 Calibration plots of the training set (A-C) and the validation set (D-F) for 3-year (A,D), 5-year (B,E) and 7-year (C,F) OS. OS, overall survival.
Figure 5 Decision curve analysis of nomogram and AJCC 7th staging system for the OS prediction in the training cohort (A) and validation cohort (B). None: none of the patients have a bad outcome; All: bad outcomes occur in all patients. AJCC, American Joint Committee on Cancer; OS, overall survival.

The CSS nomogram demonstrated robust validation. In the training cohort, its C-index reached 0.82 (95% CI: 0.80–0.84), exceeding the AJCC system’s 0.62 (95% CI: 0.60–0.66). Time-dependent ROC analysis yielded AUCs of 0.86 (3-year), 0.82 (5-year), and 0.81 (7-year) for CSS (Figure S1). Validation cohort results confirmed this superiority: C-index =0.78 (95% CI: 0.74–0.81) with corresponding 1-/3-/5-year AUCs of 0.739, 0.730, and 0.730 (Figure S1). Calibration plots showed high concordance between predicted and observed outcomes in both cohorts (training: Figure S2A-S2C; validation: Figure S2D-S2F). DCA of the training set confirmed the model’s superior clinical utility (Figure S3A), and DCA of the validation cohort demonstrated superior clinical applicability versus the AJCC system (Figure S3B).

Therapeutic efficacy across risk-stratified subgroups

Patients were categorized into three risk groups based on scores derived from the X-tile prediction model: low-risk (OS <159, CSS <202), middle-risk (159< OS <228, 202< CSS <250), and high-risk (OS ≥228, CSS ≥250) (Figure S4). Kaplan-Meier analysis revealed significant survival differences among these groups, with higher-risk patients showing poorer OS and CSS, while the low-risk group exhibited improved outcomes (Figure 6). After PSM, we assessed differential treatment effects on OS and CSS in subgroup analyses (Tables S1-S12). Chemotherapy did not provide significant survival benefits in either the low- or middle-risk groups (Figure 7), and in fact, worsened outcomes in the low-risk group. In the middle-risk group, comparison of chemotherapy modalities revealed that adjuvant chemotherapy did not significantly improve OS or CSS (Figure 8). In contrast, neoadjuvant chemotherapy showed a notable benefit for OS in patients with CR, but not in those with NCR (Figure 8). Interestingly, in the high-risk group, NCR patients had better OS and CSS with adjuvant chemotherapy compared to neoadjuvant treatment, while CR patients fared better with neoadjuvant chemotherapy (Figure 9).

Figure 6 Kaplan-Meier survival curves with different risk group stratified by the nomogram. Kaplan-Meier OS (A) and CSS (B) curves with three risk groups stratified by the nomogram. CSS, cancer-specific survival; OS, overall survival.
Figure 7 The adjuvant chemotherapy effects in the low-risk and middle-risk groups. The comparison between patients who received adjuvant chemotherapy and those who did not receive adjuvant chemotherapy in the low-risk group for OS (A) and CSS (B). The comparison between patients who received adjuvant chemotherapy and those who did not receive adjuvant chemotherapy in the middle-risk group for OS (C) and CSS (D). CI, confidence interval; CSS, cancer-specific survival; HR, hazard ratio; OS, overall survival.
Figure 8 Kaplan-Meier analysis for different chemotherapy strategies in middle-risk patients group. Comparative analysis of chemotherapy recipients versus non-recipients in the middle-risk group: OS (A) and CSS (D). The comparison between patients who received neoadjuvant chemotherapy and those who did not receive neoadjuvant chemotherapy in the middle-risk patients group of NCR for OS (B) and CSS (E). The comparison between patients who received adjuvant chemotherapy and those who receive neoadjuvant chemotherapy in the middle-risk patients group of CR for OS (C) and CSS (F). CI, confidence interval; CR, complete response; CSS, cancer-specific survival; HR, hazard ratio; NCR, non-complete response; OS, overall survival.
Figure 9 Kaplan-Meier analysis for different chemotherapy strategies in the high-risk patients group. Adjuvant vs. neoadjuvant chemotherapy in high-risk CR patients: OS (A) and CSS (B). The comparison between patients who received adjuvant chemotherapy and those who receive neoadjuvant chemotherapy in the high-risk patients group of NCR for OS (C) and CSS (D). CI, confidence interval; CR, complete response; CSS, cancer-specific survival; HR, hazard ratio; NCR, non-complete response; OS, overall survival.

Discussion

The incidence of BC in young women, especially those of reproductive age, has been increasing steadily (19). These patients often present with higher-grade tumors and more aggressive biological characteristics than older individuals (20). While BCS followed by postoperative radiotherapy has gained increasing acceptance as a standard approach for early-stage and locally advanced disease in younger women, this cohort remains at heightened risk of recurrence, driven by adverse clinicopathological features such as hormone receptor negativity, lymphovascular invasion, and multifocality (21,22). Although adjuvant systemic therapies have improved outcomes, their benefits are often constrained by inherent tumor resistance and treatment-related toxicities (23). Recent data suggest that neoadjuvant chemotherapy may enhance surgical outcomes and provide prognostic insights through pathologic response assessment; however, its role in early-stage operable disease remains contentious, particularly among patients without overt high-risk features (24,25). These challenges underscore the critical need for refined risk stratification tools to identify subgroups of young patients who may derive maximal benefit from intensified or alternative treatment strategies.

Our study aimed to investigate the prognostic factors for OS and CSS in this population and to develop predictive tools to guide clinical decision-making. We identified several factors, including tumor grade, T stage, N stage, hormone receptor status and tumor sequence as independent prognostic indicators. Tumor grade and lymph node were identified as key predictors of both OS and CSS, consistent with previous studies linking poorly differentiated histology and lymph node metastasis to more aggressive disease in younger patients (26). The observed protective effect of HER2 positivity contrasts with earlier reports but may reflect the significant impact of HER2-targeted therapies in current treatment regimens (27). Likewise, PR positivity correlated with better outcomes, likely due to increased sensitivity to endocrine therapies (28). Furthermore, our analysis firstly identified tumor sequence—specifically, the presence of multiple primary tumors—as an independent predictor of adverse survival outcomes.

While the role of chemotherapy in improving survival outcomes in early-stage BC has been established, the optimal use of chemotherapy, especially in high-risk younger patients, remains a subject of ongoing debate (29). In our study, we observed that chemotherapy provided limited survival benefits for both low- and middle-risk groups, and in some cases, was associated with worsened outcomes in the low-risk group. This observation aligns with the findings of recent clinical trials, which suggest that chemotherapy may have a less pronounced benefit in patients with early-stage disease and low-risk features (30). It is increasingly evident that overtreatment with chemotherapy in low-risk patients may lead to unnecessary toxicities without substantial improvements in survival. Additionally, we observed for the first time that neoadjuvant chemotherapy significantly improved OS in patients with CR, but not in those with NCR within the middle-risk group. The possible reason for this discrepancy may lie in the differential biological behavior of tumors in CR versus NCR patients, with the former likely achieving a more profound therapeutic response (31).

In contrast, neoadjuvant chemotherapy showed significant benefits for patients with CR, particularly in the middle-risk group, and offered improved OS for this subgroup. These results support the emerging consensus that neoadjuvant chemotherapy, by allowing for real-time assessment of tumor response, may guide more personalized treatment strategies. However, our findings also suggest that neoadjuvant chemotherapy does not benefit patients with NCR, who may fare better with adjuvant chemotherapy. The differential effectiveness of neoadjuvant versus adjuvant chemotherapy in high-risk subgroups raises important questions about tumor biology and the optimal timing for therapeutic intervention. Recent evidence suggests that neoadjuvant therapy not only reduces tumor size but also reveals underlying resistance mechanisms, allowing for more adaptive treatment strategies. CR patients saw the greatest benefit from neoadjuvant regimens, while those with NCR appeared to respond better to adjuvant chemotherapy. This may be due to the elimination of residual micrometastatic disease through postoperative systemic therapy (32). This differentiation highlights the importance of tailoring chemotherapy regimens based on individual tumor biology and response to treatment, a concept that has gained attention in the context of precision medicine (33).

Our study also highlights the utility of nomograms as a predictive tool for guiding clinical decisions in BC patients of reproductive age. The nomograms developed in this study demonstrated superior predictive accuracy compared to traditional staging AJCC 7th edition. This is particularly relevant in younger women with BC, who may present with diverse clinical features not fully captured by conventional staging methods. By integrating multiple clinical variables, the nomogram provides a personalized risk assessment, enabling clinicians to more effectively identify high-risk patients and make informed decisions regarding adjuvant therapy. Limitations of this study include its retrospective design and reliance on registry data. Critically, the absence of molecular profiling (such as genomic classifiers, Ki-67 indices, and immune markers) precludes biologically driven risk stratification. Furthermore, while internal validation demonstrated robust discrimination, external validation in independent, multi-institutional cohorts is needed to confirm generalizability before clinical translation. Future research integrating clinicopathological variables with tumor molecular subtyping may refine prediction accuracy and guide therapy de-escalation.

In conclusion, while current treatment strategies like BCS and postoperative radiotherapy have improved outcomes for early-stage BC, high-risk patients still face significant challenges in terms of recurrence and metastasis. Chemotherapy remains an important component of treatment, but its benefit must be carefully evaluated in individual patients, with particular attention to tumor biology and response. The development of predictive nomograms offers a promising approach for optimizing treatment strategies and improving survival outcomes in young BC patients. Future studies, including prospective clinical trials, are needed to further validate these models and refine our understanding of the role of chemotherapy in this patient population.


Conclusions

Our study highlights the importance of personalized treatment strategies for reproductive age women with OBC. The nomograms constructed in this study, based on prognostic factors, offer improved predictive accuracy over traditional staging systems, aiding in the identification of high-risk patients. While chemotherapy remains essential, its benefit must be tailored to individual tumor biology and response, with neoadjuvant therapy showing significant advantages for complete responders. Future prospective studies are needed to validate these models and further refine treatment strategies.


Acknowledgments

We would like to thank all the researchers for the SEER program.


Footnote

Reporting Checklist: The authors have completed the TRIPOD reporting checklist. Available at https://tcr.amegroups.com/article/view/10.21037/tcr-2025-1059/rc

Peer Review File: Available at https://tcr.amegroups.com/article/view/10.21037/tcr-2025-1059/prf

Funding: This research was funded in by the National Natural Science Foundation of China (Grant No. 82372078 and No. 82203688).

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://tcr.amegroups.com/article/view/10.21037/tcr-2025-1059/coif). The authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments.

Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.


References

  1. Silva JDDE, de Oliveira RR, da Silva MT, et al. Breast Cancer Mortality in Young Women in Brazil. Front Oncol 2020;10:569933. [Crossref] [PubMed]
  2. Siegel RL, Giaquinto AN, Jemal A. Cancer statistics, 2024. CA Cancer J Clin 2024;74:12-49. [Crossref] [PubMed]
  3. Lambertini M, Peccatori FA, Demeestere I, et al. Fertility preservation and post-treatment pregnancies in post-pubertal cancer patients: ESMO Clinical Practice Guidelines Ann Oncol 2020;31:1664-78. [Crossref] [PubMed]
  4. Tzikas AK, Nemes S, Linderholm BK. A comparison between young and old patients with triple-negative breast cancer: biology, survival and metastatic patterns. Breast Cancer Res Treat 2020;182:643-54. [Crossref] [PubMed]
  5. Walsh SM, Zabor EC, Flynn J, et al. Breast cancer in young black women. Br J Surg 2020;107:677-86. [Crossref] [PubMed]
  6. Kataoka A, Iwamoto T, Tokunaga E, et al. Young adult breast cancer patients have a poor prognosis independent of prognostic clinicopathological factors: a study from the Japanese Breast Cancer Registry. Breast Cancer Res Treat 2016;160:163-72. [Crossref] [PubMed]
  7. Li W, Liang H, Wang W, et al. Global cancer statistics for adolescents and young adults: population based study. J Hematol Oncol 2024;17:99. [Crossref] [PubMed]
  8. Wilkinson L, Gathani T. Understanding breast cancer as a global health concern. Br J Radiol 2022;95:20211033. [Crossref] [PubMed]
  9. Gradishar WJ, Moran MS, Abraham J, et al. Breast Cancer, Version 3.2024, NCCN Clinical Practice Guidelines in Oncology. J Natl Compr Canc Netw 2024;22:331-57. [Crossref] [PubMed]
  10. Slamon D, Lipatov O, Nowecki Z, et al. Ribociclib plus Endocrine Therapy in Early Breast Cancer. N Engl J Med 2024;390:1080-91. [Crossref] [PubMed]
  11. Sopik V, Sun P, Narod SA. Predictors of time to death after distant recurrence in breast cancer patients. Breast Cancer Res Treat 2019;173:465-74. [Crossref] [PubMed]
  12. Adesunkanmi AO, Wuraola FO, Fagbayimu OM, et al. Oncoplastic Breast-Conserving Surgery in African Women: A Systematic Review. JCO Glob Oncol 2024;10:e2300460. [Crossref] [PubMed]
  13. Liu J, Tan Y, Bi Z, et al. RecurIndex-Guided postoperative radiotherapy with or without Avoidance of Irradiation of regional Nodes in 1-3 node-positive breast cancer (RIGAIN): a study protocol for a multicentre, open-label, randomised controlled prospective, phase III trial. BMJ Open 2024;14:e078049. [Crossref] [PubMed]
  14. Kolberg-Liedtke C, Gluz O, Heinisch F, et al. Association of TILs with clinical parameters, Recurrence Score® results, and prognosis in patients with early HER2-negative breast cancer (BC)-a translational analysis of the prospective WSG PlanB trial. Breast Cancer Res 2020;22:47. [Crossref] [PubMed]
  15. Roussot N, Constantin G, Desmoulins I, et al. Prognostic stratification ability of the CPS+EG scoring system in HER2-low and HER2-zero early breast cancer treated with neoadjuvant chemotherapy. Eur J Cancer 2024;202:114037. [Crossref] [PubMed]
  16. Magbanua MJM, Brown Swigart L, Ahmed Z, et al. Clinical significance and biology of circulating tumor DNA in high-risk early-stage HER2-negative breast cancer receiving neoadjuvant chemotherapy. Cancer Cell 2023;41:1091-1102.e4. [Crossref] [PubMed]
  17. Curigliano G, Burstein HJ, Gnant M, et al. Understanding breast cancer complexity to improve patient outcomes: The St Gallen International Consensus Conference for the Primary Therapy of Individuals with Early Breast Cancer 2023. Ann Oncol 2023;34:970-86. [Crossref] [PubMed]
  18. Verreck EEF, Kuijer A, van Steenhoven JEC, et al. 70-Gene signature-guided adjuvant systemic treatment adjustments in early-stage ER+ breast cancer patients: 7-year follow-up of a prospective multicenter cohort study. Breast Cancer Res Treat 2025;209:331-40. [Crossref] [PubMed]
  19. Katsura C, Ogunmwonyi I, Kankam HK, et al. Breast cancer: presentation, investigation and management. Br J Hosp Med (Lond) 2022;83:1-7. [Crossref] [PubMed]
  20. Corey B, Smania MA, Spotts H, et al. Young Women With Breast Cancer: Treatment, Care, and Nursing Implications. Clin J Oncol Nurs 2020;24:139-47. [Crossref] [PubMed]
  21. Pu S, Song S, Chen H, et al. A nomogram to identify appropriate candidates for breast-conserving surgery among young women with breast cancer: A large cohort study. Front Oncol 2022;12:1012689. [Crossref] [PubMed]
  22. Scardina L, Carnassale B, Di Leone A, et al. Young Women with Early-Stage Breast Cancer Treated with Upfront Surgery: Overview of Oncological Outcomes. J Clin Med 2024;13:3966. [Crossref] [PubMed]
  23. Li Y, Chen H, He J, et al. The outcome of neoadjuvant chemotherapy and the current trend of surgical treatment in young women with breast cancer: A multicenter real-world study (CSBrS-012). Front Public Health 2023;11:1100421. [Crossref] [PubMed]
  24. Kim HJ, Dominici L, Rosenberg SM, et al. Surgical Treatment After Neoadjuvant Systemic Therapy in Young Women With Breast Cancer: Results From a Prospective Cohort Study. Ann Surg 2022;276:173-9. [Crossref] [PubMed]
  25. Golshan M, Loibl S, Wong SM, et al. Breast Conservation After Neoadjuvant Chemotherapy for Triple-Negative Breast Cancer: Surgical Results From the BrighTNess Randomized Clinical Trial. JAMA Surg 2020;155:e195410. [Crossref] [PubMed]
  26. Hamedi SZ, Emami H, Khayamzadeh M, et al. Application of machine learning in breast cancer survival prediction using a multimethod approach. Sci Rep 2024;14:30147. [Crossref] [PubMed]
  27. Mao X, Omeogu C, Karanth S, et al. Association of reproductive risk factors and breast cancer molecular subtypes: a systematic review and meta-analysis. BMC Cancer 2023;23:644. [Crossref] [PubMed]
  28. Song C, Kendi AT, Shim JY, et al. ER-/PR+ breast cancer is controlled more effectively with an inflammatory inhibitor than hormonal inhibitor. Breast Cancer 2023;30:436-52. [Crossref] [PubMed]
  29. Sun L, Jia X, Wang K, et al. Unveiling the future of breast cancer therapy: Cutting-edge antibody-drug conjugate strategies and clinical outcomes. Breast 2024;78:103830. [Crossref] [PubMed]
  30. Chen XC, Jiao DC, Qiao JH, et al. De-escalated neoadjuvant weekly nab-paclitaxel with trastuzumab and pertuzumab versus docetaxel, carboplatin, trastuzumab, and pertuzumab in patients with HER2-positive early breast cancer (HELEN-006): a multicentre, randomised, phase 3 trial. Lancet Oncol 2025;26:27-36. [Crossref] [PubMed]
  31. Bardia A, Mayer I, Winer E, et al. The oral selective estrogen receptor degrader GDC-0810 (ARN-810) in postmenopausal women with hormone receptor-positive HER2-negative (HR + /HER2 -) advanced/metastatic breast cancer. Breast Cancer Res Treat 2023;197:319-31. [Crossref] [PubMed]
  32. Villacampa G, Navarro V, Matikas A, et al. Neoadjuvant Immune Checkpoint Inhibitors Plus Chemotherapy in Early Breast Cancer: A Systematic Review and Meta-Analysis. JAMA Oncol 2024;10:1331-41. [Crossref] [PubMed]
  33. Liu S, Jiang C, Wu D, et al. Development of predictive models for pathological response status in breast cancer after neoadjuvant therapy based on peripheral blood inflammatory indexes. BMC Womens Health 2024;24:560. [Crossref] [PubMed]
Cite this article as: Zhu S, Zhang K, Lu W, Yang H, Yang C, Guo C, Guo L, Xue X, Wang Z, Xuan L. Development and validation of nomograms predicting survival in operable breast cancer patients at reproductive age after breast conserving surgery and postoperative radiotherapy based on SEER database. Transl Cancer Res 2025;14(10):6300-6315. doi: 10.21037/tcr-2025-1059

Download Citation