基于知识关系与试题价值权重的认知诊断模型

Cognitive Diagnostic Model Based on Knowledge Relationships and Exercise Value Weights

  • 摘要: 现有认知诊断方法忽略了试题中知识点之间存在的显性、隐性相关性以及试题的价值权重,为解决上述问题,文章提出了一种基于神经网络的诊断模型(QENCD)。该模型首先利用Jaccard算法探索知识点间的相似性,发现试题中的显性知识点,并根据特征向量中心性算法求其权重;其次,利用显性知识点信息作为先验信息来推断试题中的隐性知识点;然后,引入学生潜能因素和试题价值权重因素,重新构建交互函数;最后,结合猜测因素预测学生是否能正确回答试题,了解学生实际水平。在4个真实数据集(ASSIST0910、MathEC、Junyi和Math1)上,将QENCD模型与3个统计认知诊断模型(IRT、MIRT、DINA)、3个神经网络认知诊断模型(NCD、CDGK、ICD)进行对比实验;同时,为了验证每个组件的有效性,进行了消融试验,通过移除模型的不同部分(如知识矩阵、试题价值权重等)来评估各部分对模型性能的影响。此外,进行了基于一致性程度指标(DOA)的可解释性实验,以衡量模型的解释能力。对比实验结果表明在4个真实数据集上,与目前最佳的基线模型(ICD) 相比,QENCD模型的ACC、AUC平均提高了0.84%、1.12%,RMSE降低了0.38%。消融实验结果显示:模型的不同部分对模型的准确性和解释性都有重要贡献。由4个模型(DINA、NCDM、CDGK和QENCD)在4个真实数据集上的DOA值可知:QENCD模型能够更准确地识别和解释学生的知识状态。

     

    Abstract: Existing cognitive diagnosis methods overlooked the explicit and implicit correlations between knowledge concepts in exercises and the value weights of the exercises. To address these issues, a neural network-based diagnostic model (QENCD) was proposed in the paper. The model first employed the Jaccard algorithm to explore the similarity between knowledge concepts, identified the explicit knowledge concepts in exercises, and computed their weights using the feature vector centrality algorithm. Subsequently, it utilized the information from explicit know-ledge concepts as prior information to infer the implicit knowledge concepts in exercises. It then introduced factors of student potential and exercise value weights to reconstruct the interaction function. Finally, combined with the guessing factors, it was predicted whether the students could answer the questions correctly, and their actual levels were assessed. In comparative experiments conducted on four real-world datasets (ASSIST0910, MathEC, Junyi, and Math1), QENCD was compared with three statistical cognitive diagnosis models (IRT, MIRT, DINA) and three neural network cognitive diagnosis models (NCD, CDGK, ICD). Additionally, to validate the effectiveness of each component, ablation experiments were performed by removing different parts of the model (such as the know-ledge matrix and exercise value weights) to assess the impact of each part on the model's performance. In addition, interpretability experiments based on the Degree of Agreement (DOA) metric were conducted to assess the explanatory power of the models. The experimental results show that, across four real-world datasets, the QENCD model outperforms the current best baseline model (ICD) with an average improvement of 0.84% in ACC, 1.12% in AUC, and a reduction of 0.38% in RMSE. The results of the ablation experiments show that different components of the model contribute significantly to its accuracy and interpretability. Based on the DOA values of four models (DINA, NCDM, CDGK, and QENCD) on four real datasets, it is known that the QENCD model can more accurately identify and interpret students' knowledge states.

     

/

返回文章
返回