基于可解释机器学习构建基础教育教师职业倦怠预测模型

Building an Interpretable Machine Learning Model to Predict Occupational Burnout Among Basic Education Teachers

  • 摘要: 本研究基于河南省商丘市、周口市和信阳市334所中小幼学校的33 569名教师样本,构建可解释机器学习模型预测基础教育教师职业倦怠。通过逻辑回归、随机森林、XGBoost、CatBoost和LightGBM算法比较,发现CatBoost综合性能最优(AUC=0.88,特异性=0.86)。人口学分析显示村镇教师倦怠率高于区县,未婚教师及低经济水平群体风险突出。SHAP解释性分析揭示了基础教育教师职业倦怠的重要风险和保护因素排序,影响基础教育教师职业倦怠的前5个预测因子依次为:深层扮演、工作投入、角色过载、抑郁、神经质。其中,深层扮演与工作投入为核心保护因子,角色过载、抑郁和神经质为主要风险源。研究构建了可解释的教师倦怠预测模型和可视化预测工具,明确了“深层扮演—工作投入”双保护因子与“角色过载—抑郁—神经质”三风险源的核心作用,为区域教育管理部门实施分级干预提供了证据。

     

    Abstract: Based on a sample of 33, 569 teachers from 334 kindergartens, primary and secondary schools in Shangqiu, Zhoukou and Xinyang, Henan Province, an interpretable machine learning model is constructed to predict occupational burnout among basic education teachers. By comparing logistic regression, random forest, XGBoost, CatBoost and LightGBM algorithms, CatBoost demonstrated the best comprehensive performance (AUC=0.88, specificity=0.86). Demographic analysis revealed that burnout rates were higher among teachers in villages and towns compared to those in districts/counties, and unmarried teachers and teachers with low economic status faced a significantly higher risk. SHAP explanatory analysis identified and ranked the important risk and protective factors for burnout among basic education teachers. The top five predictors were: deep acting, job involvement, role overload, depression, and neuroticism, with deep play and job involvement serving as the core protective factors, and role overload, depression, and neuroticism acting as the main risk sources. An interpretable prediction model for teacher burnout is constructed, with'deep acting-job involvement'identified as the dual core protective factors and'role overload-depression-neuroticism'as the triple core risk sources, thereby providing evidence for regional education management departments to implement tiered interventions.

     

/

返回文章
返回