基于增强CT检查联合临床特征构建胆囊癌神经浸润机器学习预测模型的应用价值

Application value of machine learning prediction model for neural invasion in gallbladder cancer based on enhanced CT and clinical characteristics

  • 摘要:
    目的 探讨基于增强CT检查联合临床特征构建胆囊癌神经浸润机器学习预测模型的应用价值。
    方法 采用回顾性队列研究方法。收集2010年1月至2024年6月新乡医学院第一附属医院收治的502例胆囊癌患者的临床影像资料;男171例,女331例;年龄为65(35~91)岁。患者均行术前腹部增强CT检查及胆囊癌根治性切除术。502例患者以7∶3比例随机分为训练集351例和测试集151例,训练集用于构建预测模型,测试集用于验证模型效能。观察指标:(1)胆囊癌神经浸润情况及影响因素分析。(2)胆囊癌神经浸润机器学习预测模型构建及评价。计数资料组间比较采用χ²检验。等级资料组间比较采用Mann⁃Whitney U检验。单因素和多因素分析采用Logistic回归模型。纳入独立影响因素应用基于python 3.9的标准库模块构建机器学习模型,绘制受试者工作特征(ROC)曲线并计算灵敏度、特异度、准确度、曲线下面积(AUC)、精确率、F1评分、阳性预测率、阴性预测率、Kappa值评估模型的预测效能。测试集中不同模型AUC差异采用Delong检验评估,使用Hosmer⁃Lemeshow检验和Brier分数判断模型的校准度。
    结果 (1)胆囊癌神经浸润情况及影响因素分析。502例胆囊癌患者中,131例有神经浸润,371例无神经浸润。多因素分析结果显示:总胆红素、癌胚抗原、CA199、CA125、中性粒细胞与淋巴细胞比值、CT检查肝侵犯、CT检查血管侵犯、CT检查肝门部或腹膜后淋巴结转移和肿瘤T分期为T3、T4期是胆囊癌患者神经浸润的独立影响因素比值比=3.747、2.395、3.917、3.596、2.805、2.377、3.523、2.774、5.080、6.809,95%可信区间(CI)为1.890~7.430,1.154~4.971,2.054~7.472,1.807~7.155,1.506~5.225,1.241~4.553,1.666~7.449,1.483~5.189,2.050~12.589,2.552~18.168,P<0.05。(2)胆囊癌神经浸润机器学习预测模型的构建与评价。纳入独立影响因素构建逻辑回归、K⁃最近邻、支持向量机、随机森林、决策树、反向传播神经网络和梯度提升机7种机器学习模型,绘制测试集ROC曲线,AUC分别为0.900(95%CI为0.851~0.948)、0.741(95%CI为0.646~0.829)、0.836(95%CI为0.762~0.895)、0.782(95%CI为0.701~0.855)、0.839(95%CI为0.770~0.901)、0.817(95%CI为0.738~0.887)、0.843(95%CI为0.770~0.909)。Delong检验结果显示:逻辑回归模型AUC最高。逻辑回归模型灵敏度和特异度分别为0.868和0.805,平衡性最佳。Hosmer⁃Lemeshow检验结果显示:逻辑回归模型拟合优度较好(χ²=5.320,P>0.05)。逻辑回归模型Brier分数较低,为0.168,验证了其校准优势。
    结论 总胆红素、癌胚抗原、CA199、CA125、中性粒细胞与淋巴细胞比值、CT检查肝侵犯、CT检查血管侵犯、CT检查肝门部或腹膜后淋巴结转移和肿瘤T分期为T3、T4期是胆囊癌患者神经浸润的独立影响因素。基于增强CT检查联合临床特征构建的7种机器学习模型可预测胆囊癌神经浸润,其中逻辑回归模型预测性能较佳。

     

    Abstract:
    Objective To explore the application value of machine learning prediction model for neural invasion in gallbladder cancer based on enhanced computed tomography (CT) and clinical characteristics.
    Methods The retrospective cohort study was conducted. The clinical and imaging data of 502 patients with gallbladder cancer who were admitted to The First Affiliated Hospital of Xinxiang Medical University from January 2010 to June 2024 were collected. There were 171 males and 331 females, aged 65(range, 35‒91)years. All patients underwent preoperative abdominal enhanced CT and radical resection. The 502 patients were randomly divided into a training set of 351 cases and a test set of 151 cases at a 7:3 ratio. The training set was used to construct prediction model, and the test set was used to validate prediction model. Observation indicators: (1)neural invasion in gallbladder cancer and influencing factor analysis; (2) construction and validation of machine learning prediction models for neural invasion in gallbladder cancer. Comparison of count data between groups was conducted using the chi-square test. Comparison of ordinal data between groups was conducted using the Mann-Whitney U test. Logistic regression model was performed for univariate and multivariate analyses. Independent influencing factors were incor-porated to construct machine learning models using the standard library modules based on Python 3.9. Receiver operating characteristic (ROC) curves were plotted, and the accuracy, sensitivity, specificity, area under the curve (AUC), precision, F1 score, positive predictive value, negative predic-tive value, and Kappa value were calculated to evaluate the predictive performance of the models. The Delong test was used to assess the differences in AUC among different models in the test set. The Hosmer-Lemeshow test and Brier score were used to evaluate the calibration of the models.
    Results (1) Neural invasion in gallbladder cancer and influencing factor analysis. Of the 502 patients with gallbladder cancer, 131 cases had neural invasion, and 371 cases had no neural invasion. Results of multivariate analysis showed that total bilirubin, carcinoembryonic antigen, CA199, CA125, neutrophil-lymphocyte ratio, liver invasion detected by CT, vascular invasion detected by CT, hilar or retroperi-toneal lymph node metastasis detected by CT, and tumor stages T3 and T4 were independent influencing factors for neural invasion in patients with gallbladder cancer odds ratios=3.747, 2.395, 3.917, 3.596, 2.805, 2.377, 3.523, 2.774, 5.080, 6.809, 95% confidence interval (CI) as 1.890‒7.430, 1.154‒4.971, 2.054‒7.472, 1.807‒7.155, 1.506‒5.225, 1.241‒4.553, 1.666‒7.449, 1.483‒5.189, 2.050‒12.589, 2.552‒18.168, P<0.05. (2) Construction and validation of machine learning predic-tion models for neural invasion in gallbladder cancer. Based on the independent influencing factors, seven machine learning models were constructed, including logistic regression, K-nearest neighbors, support vector machine, random forest, decision tree, back-propagation neural network, and gradient boosting machine. The ROC curves of seven machine learning models in the test set were plotted, and the AUC were 0.900(95%CI as 0.851‒0.948), 0.741(95%CI as 0.646‒0.829), 0.836(95%CI as 0.762‒0.895), 0.782(95%CI as 0.701‒0.855), 0.839(95%CI as 0.770‒0.901), 0.817(95%CI as 0.738‒0.887), 0.843(95%CI as 0.770‒0.909), respectively. Results of Delong test showed that the logistic regression model had the highest AUC. The sensitivity and specificity of the logistic regression model were 0.868 and 0.805 respectively, indicating the best balance. Results of Hosmer-Lemeshow test showed that the logistic regression model had a good goodness-of-fit (χ²=5.320, P>0.05). The Brier score of the logistic regression model was relatively low, as 0.168, which verified its calibration advantage.
    Conclusion Total bilirubin, carcinoembryonic antigen, CA199, CA125, neutrophil-to-lymphocyte ratio, liver invasion detected by enhanced CT, vascular invasion detected by enhanced CT, hilar or retroperitoneal lymph node metastasis detected by enhanced CT, and tumor stages T3 and T4 are independent influencing factors for nerve invasion in patients with gallbladder cancer. Seven machine learning models are constructed based on enhanced CT and clinical characteristics to predict neural invasion in gallbladder cancer, of which the logistic regression model demonstrates good predictive performance.

     

/

返回文章
返回