史瑀, 王琪, 郑智, 黄珊珊, 殷翠香. 医务人员血源性职业暴露危险因素分析:基于随机森林算法和logistic回归模型的效应对比[J]. 职业卫生与应急救援, 2024, 42(4): 440-445. DOI: 10.16369/j.oher.issn.1007-1326.2024.04.004
引用本文: 史瑀, 王琪, 郑智, 黄珊珊, 殷翠香. 医务人员血源性职业暴露危险因素分析:基于随机森林算法和logistic回归模型的效应对比[J]. 职业卫生与应急救援, 2024, 42(4): 440-445. DOI: 10.16369/j.oher.issn.1007-1326.2024.04.004
SHI Yu, WANG Qi, ZHENG Zhi, HUANG Shanshan, YIN Cuixiang. Analysis of risk factors for occupational blood-borne exposures among healthcare workers: a comparative study using random forest and logistic regression models[J]. Occupational Health and Emergency Rescue, 2024, 42(4): 440-445. DOI: 10.16369/j.oher.issn.1007-1326.2024.04.004
Citation: SHI Yu, WANG Qi, ZHENG Zhi, HUANG Shanshan, YIN Cuixiang. Analysis of risk factors for occupational blood-borne exposures among healthcare workers: a comparative study using random forest and logistic regression models[J]. Occupational Health and Emergency Rescue, 2024, 42(4): 440-445. DOI: 10.16369/j.oher.issn.1007-1326.2024.04.004

医务人员血源性职业暴露危险因素分析:基于随机森林算法和logistic回归模型的效应对比

Analysis of risk factors for occupational blood-borne exposures among healthcare workers: a comparative study using random forest and logistic regression models

  • 摘要:
    目的 分析医务人员血源性职业暴露风险因素,为及时提出针对性干预措施,预防血源性职业暴露提供依据。
    方法 选取2016年1月—2023年6月呼伦贝尔市某三级综合医院上报的发生血源性职业暴露的103名医务人员为暴露组,选取同班次未发生职业暴露的625名医务人员为对照组,收集研究对象的相关资料,采用随机森林算法和多因素logistic回归分析分别构建影响医务人员发生血源性职业暴露的预测模型。
    结果 103例血源性职业暴露人员的主要暴露部位为手,有89例(占86.41%);主要暴露方式为锐器伤,有83例(占80.58%);主要暴露源为乙型肝炎病毒,有65例(占63.11%)。暴露后立即上报的有65例(占63.11%),12 h以内上报的24例(占23.30%)。logistic回归分析结果显示:相比年龄≥ 30岁,年龄 < 30岁的医务人员发生血源性职业暴露的可能性增加至4.142倍(P < 0.05);相比工作年限≥ 5年,工作年限 < 5年的医务人员发生血源性职业暴露的可能性增加至1.696倍(P < 0.05);相比中级及以上职称,初级职称医务人员发生血源性职业暴露的可能性增加至5.989倍(P < 0.05);相比每年多次参加培训,每年只参加1次培训的医务人员发生血源性职业暴露的可能性增加至1.864倍(P < 0.05);科室类别为急诊、重症及手术室的医务人员较其他科室医务人员发生血源性职业暴露的可能性增加至2.205倍(P < 0.05)。随机森林算法结果显示,重要性评分排名前6的影响因素分别是职称、年龄、工作年限、年职业防护培训次数、职业类型、文化程度。随机森林预测模型的准确率、精确率、召回率及F1分数(精确率和召回率的调和平均数)均要高于logistic预测模型,且随机森林预测模型的ROC曲线下面积(AUC)为0.829(P < 0.001),也高于logistic回归模型的0.818(P < 0.001)。
    结论 随机森林模型对血源性职业暴露有更好的预测效能,但logistic回归模型有更直观的分析结果,两者联合使用能进一步提高预测的准确性。应加强对年龄小、工作年限短、职称低等重点人员的培训,制定标准化的措施来预防医务人员的血源性暴露。

     

    Abstract:
    Objective To analyze risk factors for occupational blood-borne exposures among healthcare workers, providing a basis for timely and targeted interventions to prevent occupational blood-borne exposure.
    Methods The study included 103 healthcare workers who reported occupational blood-borne exposures from January 2016 to June 2023 at a tertiary hospital in Hulunbuir as the exposure group and 625 healthcare workers from the same shifts who did not experience occupational exposures as the control group. Relevant information was collected for all subjects. Random Forest algorithms and multivariate logistic regression analysis were used to construct predictive models for occupational blood-borne exposure among healthcare workers.
    Results Among the 103 blood-borne exposure cases, the main exposure site was the hand in 89 cases (86.41%); the main mode of exposure was sharp force injury in 83 cases (80.58%); and the predominant exposure source was hepatitis B virus (HBV) in 65 cases (63.11%). Sixty-five cases (63.11%) were reported immediately after exposure, while 24 cases (23.30%) were reported within 12 hours. Logistic regression analysis revealed that compared to healthcare workers aged 30 years or more, likelihood of occupational blood-borne exposure increased to 4.142 times for those under 30 years (P < 0.05); compared with 5 or more years, the likelihood of occupational blood-borne exposure increased to 1.696 times for healthcare workers with less than 5 years of service (P < 0.05); the likelihood of occupational blood-borne exposure increased to 5.989 times for junior-title healthcare workers compared to intermediate and higher-level titles (P < 0.05); healthcare workers who attended occupational protection training only once per year were up to 1.864 times more likely to have occupational blood-borne exposure compared to those who participated multiple times annually (P < 0.05); the likelihood of occupational blood-borne exposure increased to 2.205 times for healthcare workers in the departmental categories of emergency, critical care, and operating room compared to those in other departments (P < 0.05). The Random Forest algorithm identified the top 6 influential factors in order of importance as professional title, age, years of work experience, annual training frequency of occupational protection, occupation type, and education level. The Random Forest predictive model demonstrated higher accuracy, precision, recall, and F1 score (the reconciled mean of precision and recall) compared to the logistic regression model. The area under the receiver operating characteristic curve (AUC) for the Random Forest model was 0.829 (P < 0.001), which was also higher than that of the logistic regression model of 0.818 (P < 0.001).
    Conclusions The Random Forest model showed superior predictive performance for occupational blood-borne exposures, while the logistic regression model provided more intuitive analytical results. The combined use of both models can further enhance prediction accuracy. Enhanced training should be provided for high-risk groups, including younger healthcare workers, those with less work experience, and those with lower professional titles. Standardized measures should be implemented to prevent blood-borne exposure among healthcare workers.

     

/

返回文章
返回