ViScratch: Using Large Language Models and Gameplay Videos for Automated Feedback in Scratch
ViScratch is an automated feedback system for Scratch programming that uses vision-language models to diagnose and repair bugs by analyzing both block code and gameplay videos. It demonstrates substantial improvements over prior LLM-based debugging tools in identifying and fixing semantic errors in student projects.
Block-based programming environments such as Scratch are increasingly popular in programming education, particularly for young learners. While the use of blocks helps prevent syntax errors, semantic bugs remain common and difficult to debug. Existing tools for Scratch debugging rely heavily on predefined rules or manual user input and, crucially, ignore the platform's inherently visual nature. We introduce ViScratch, the first multimodal feedback generation system for Scratch that leverag