食管癌根治术后预后影响因素分析及机器学习预测模型的应用价值

Analysis of prognostic factors for esophageal cancer after radical resection and the applica-tion value of machine learning prediction model

  • 摘要:
    探讨食管癌根治术后预后影响因素及机器学习预测模型的应用价值。
    采用回顾性队列研究方法。收集2018年1月至2022年3月山东大学齐鲁医院收治的406例食管癌患者的临床病理资料;男357例,女49例;年龄为(64±8)岁。患者均行食管癌根治术。406例患者依据随机数字表法以7∶3比例分为训练集285例和验证集121例。训练集用于构建预测模型,验证集用于验证模型效能。根据风险评分将患者分为高危组和低危组。观察指标:(1)患者随访情况及预后影响因素分析。(2)机器学习预测模型的构建及验证。正态分布的计量资料组间比较采用独立样本t检验。偏态分布的计量资料组间比较采用Mann⁃Whitney U检验。计数资料组间比较采用χ²检验。等级资料组间比较采用秩和检验。采用Kaplan⁃Meier法计算生存率并绘制生存曲线,Log‑rank检验进行生存分析。单因素和多因素分析采用Cox比例风险回归模型。纳入独立影响因素,应用基于R语言随机生存森林(RSF)、梯度提升机(GBM)、最小绝对收缩与选择算子Cox回归(LASSO‑Cox)、Cox比例风险模型提升法(CoxBoost)、生存支持向量机(survivalsvm)、极端梯度提升(XGBoost)、监督主成分分析(SuperPC)以及Cox偏最小二乘回归(plsRcox)程序包进行数据处理、机器学习预测模型的构建及结果可视化输出,绘制受试者工作特征(ROC)曲线并计算灵敏度、特异度和曲线下面积(AUC)。训练集中不同模型AUC差异采用Delong检验评估,并采用时间依赖性ROC比较不同预测模型的效能。采用校准曲线评估模型的准确性,决策曲线评估模型的总体净收益。
    (1)患者随访情况及预后影响因素分析。406例患者获得术后随访,随访时间为28(6~36)个月,术后1、3年总生存率分别为86.5%、40.9%。训练集285例患者随访时间为30(6~36)个月,术后1、3年总生存率分别为85.1%、35.5%;验证集121例患者随访时间为25(6~36)个月,术后1、3年总生存率分别为87.0%、43.2%。训练集和验证集术后总生存率比较,差异无统计学意义(χ²=3.20,P>0.05)。多因素分析结果显示:经左胸手术入路、术前中性粒细胞、肿瘤血管浸润、肿瘤神经浸润、病理学T2~4期、病理学N2~3期、术后肺炎是影响训练集285例患者术后生存的独立危险因素(风险比=1.466、1.037、1.482、1.549、5.268、7.727、22.202、2.539、2.686、1.425,95%可信区间为1.026~2.096、1.003~1.073、1.008~2.179、1.105~2.170、1.201~23.099、1.833~32.576、4.734~104.128、1.577~4.087、1.631~4.422、1.018~1.994,P<0.05)。(2)机器学习预测模型的构建及验证。纳入影响患者术后生存的独立危险因素构建RSF、GBM、LASSO‑Cox、CoxBoost、survivalsvm、XGBoost、SuperPC、plsRcox机器学习预测模型。Delong检验结果显示:RSF及GBM机器学习预测模型AUC分别与其他6种模型比较,差异均有统计学意义(P<0.05)。时间依赖性ROC曲线结果显示:8种机器学习预测模型在训练集中均具有良好的区分能力,其中RSF机器学习预测模型的预测效能最优。校准曲线结果显示:RSF机器学习预测模型训练集术后1、2、3年总生存率校准曲线拟合良好,与实际结果具有较高的一致性。决策曲线结果显示:阈值为0~0.80,RSF机器学习预测模型总体净收益较好。进一步分析结果显示:RSF机器学习预测模型验证集术后1、2、3年ROC曲线的AUC分别为0.786(95%可信区间为0.609~0.962)、0.774(95%可信区间为0.676~0.873)、0.750(95%可信区间为0.652~0.848)。校准曲线结果显示:RSF机器学习预测模型验证集术后1、2、3年总生存率校准曲线拟合良好,与实际结果具有较高的一致性。训练集中,RSF机器学习预测模型输出的风险评分最佳阈值为11.7分,风险评分≥11.7分为高危组,风险评分<11.7分为低危组,两组患者的中位生存时间分别为18.0个月和>36.0个月,两组比较,差异有统计学意义(χ2=73.30,P<0.05);验证集中,RSF机器学习预测模型输出的风险评分最佳阈值为11.7分,风险评分≥11.7分为高危组,风险评分<11.7分为低危组,两组患者的中位生存时间分别为17.0个月和>36.0个月,两组比较,差异有统计学意义(χ2=35.20,P<0.05)。
    经左胸手术入路、术前中性粒细胞、肿瘤血管浸润、肿瘤神经浸润、病理学T2~4期、病理学N2~3期、术后肺炎是影响食管癌患者根治术后生存的独立危险因素,基于此构建的RSF机器学习预测模型可有效区分高危和低危患者的生存预后。

     

    Abstract:
    Objective To investigate the prognostic factors for esophageal cancer after radical resection and the application value of machine learning prediction model.
    Methods The retrospective cohort study was conducted. The clinicopatholigical data of 406 esophageal cancer patients who were admitted to Qilu Hospital of Shandong University from January 2018 to March 2022 were collected. There were 357 males and 49 females, aged (64±8)years. All patients underwent radical resection of esophageal cancer. The 406 patients were randomly divided into a training set of 285 cases and a validation set of 121 cases at a 7∶3 ratio based on a random number table. The training set was used to construct prediction model, and the validation set was used to validate prediction model. Patients were divided into high-risk group and low-risk group based on risk scores. Observation indicators: (1) follow‑up of patients and analysis of influencing factors for prognosis; (2) construction and validation of machine learning prediction models. Comparison of measurement data with normal distribution between groups was conducted using the independent sample t test. Comparison of measurement data with skewed distribution between groups was conducted using the Mann‑Whitney U test. Comparison of count data between groups was conducted using the chi‑square test. Comparison of ordinal data between groups was conducted using the rank sum test. The Kaplan‑Meier method was used to calculate survival rate and plot survival curve, and the Log‑rank test was used for survival analysis. The Cox proportional hazard regression model was used for univariate and multivariate analyses. Independent influencing factors were included, and data processing, machine learning model construction, and visualization were performed using R packages including random survival forest (RSF), gradient boosting machine (GBM), least absolute shrinkage and selection operator Cox regression (LASSO‑Cox), Cox proportional hazards model boosting (CoxBoost), survival support vector machine (survivalsvm), extreme gradient boosting (XGBoost), supervised principal component analysis (SuperPC), and Cox partial least squares regression (plsRcox). Receiver operating characteristic (ROC) curves were drawn, and sensitivity, specificity, and area under the curve (AUC) were calculated. The Delong test was used to assess the differences in AUC among different models in the training set, and the time‑dependent ROC was used to compare the predictive performance of different models. Calibration curves were used to evaluate model accuracy, and decision curve analysis (DCA) was used to evaluate overall net benefit.
    Results (1) Follow‑up of patients and analysis of influencing factors for prognosis. All 406 patients were followed up postoperatively for 28(range, 6-36)months, with 1‑ and 3‑year overall survival rate of 86.5% and 40.9%, respectively. The 285 patients in the training set were followed up postoperatively for 30(range, 6-36)months, with 1‑ and 3‑year overall survival rate of 85.1% and 35.5%, respectively. The 121 patients in the validation set were followed up postoperatively for 25(range, 6-36)months, with 1‑ and 3‑year overall survival rate of 87.0% and 43.2%, respectively. There was no significant difference in postoperative overall survival rate between the training set and the validation set (χ2=3.20, P>0.05). Results of multivariate analysis showed that left thoracic surgical approach, preopera-tive neutrophil count, vascular invasion, perineural invasion, pathological T2-4 stage, pathological N2-3 stage, and postoperative pneumonia were independent risk factors affecting postoperative survival of 285 patients in the training set (hazard ratio=1.466, 1.037, 1.482, 1.549, 5.268, 7.727, 22.202, 2.539, 2.686, 1.425, 95% confidence interval as 1.026-2.096, 1.003-1.073, 1.008-2.179, 1.105-2.170, 1.201-23.099, 1.833-32.576, 4.734-104.128, 1.577-4.087, 1.631-4.422, 1.018-1.994, P<0.05). (2) Construction and validation of machine learning prediction models. Independent risk factors affecting postoperative survival were included to construct RSF, GBM, LASSO‑Cox, CoxBoost, survivalsvm, XGBoost, SuperPC, and plsRcox machine learning prediction models. Results of Delong test showed that there were significant differences in the AUC of RSF and GBM from the other six models (P<0.05). Results of time-dependent ROC curve showed that all 8 machine learning predic-tion models had good discriminative ability in the training cohort, among which the RSF machine learning prediction model had the best predictive performance. Results of calibration curve showed that the RSF machine learning prediction model fitted well for predicting postoperative 1‑, 2‑, and 3‑year overall survival in the training cohort, with high consistency with actual results. Results of decision curve analysis showed that within a threshold range of 0-0.80, the RSF machine learning prediction model provided a better overall net benefit. Further analysis showed that in the validation set, the AUC of RSF machine learning prediction model for postoperative 1‑, 2‑, and 3‑year survival prediction were 0.786 (95% confidence interval as 0.609-0.962), 0.774 (95% confidence interval as 0.676-0.873), and 0.750 (95% confidence interval as 0.652-0.848), respectively. Results of calibration curve showed that the RSF machine learning prediction model fitted well for predicting postopera-tive 1‑, 2‑, and 3‑year overall survival in the validation set, with high consistency with actual results. In the training set, the optimal cutoff value of the RSF machine learning prediction model risk score was 11.7. Patients with risk score ≥11.7 were classified as the high‑risk group, and those with risk score <11.7 as the low‑risk group. The median survival times of the two groups were 18.0 months and >36.0 months, respectively, showing a significant difference between them (χ²=73.30, P<0.05). In the validation set, the optimal cutoff value of the RSF machine learning prediction model risk score was 11.7. Patients with risk score ≥11.7 were classified as the high‑risk group, and those with risk score<11.7 as the low‑risk group. The median survival times of the two groups were 17.0 months and>36.0 months for the high‑risk and low‑risk groups, respectively, showing a significant difference between them (χ²=35.20, P<0.05).
    Conclusions Left thoracic surgical approach, preoperative neutrophil count, vascular invasion, perineural invasion, pathological T2-4 stage, pathological N2-3 stage, and postoperative pneumonia are independent risk factors affecting survival of esophageal cancer patients after radical resection. The RSF machine learning prediction model constructed based on these factors can effectively distinguish the survival prognosis of high‑risk and low‑risk patients.

     

/

返回文章
返回