Pattern-based Knowledge Component Extraction from Student Code Using Representation Learning
This paper proposes an automated framework for extracting Knowledge Components (KCs) from student programming code using representation learning and pattern-based analysis of Abstract Syntax Trees, validated through learning curve analysis and Deep Knowledge Tracing to model student knowledge acquisition in CS education.
Effective personalized learning in computer science education depends on accurately modeling what students know and what they need to learn. While Knowledge Components (KCs) provide a foundation for such modeling, automated KC extraction from student code is inherently challenging due to insufficient explainability of discovered KCs and the open-endedness of programming problems with significant structural variability across student solutions and complex interactions among programming concepts.