Sorry, you need to enable JavaScript to visit this website.

Deep Learning for Classroom Activity Detection from Audio

Citation Author(s):
Robin Cosbey, Allison Wusterbarth, Brian Hutchinson
Submitted by:
Robin Cosbey
Last updated:
10 May 2019 - 4:33pm
Document Type:
Poster
Document Year:
2019
Event:
Paper Code:
MLSP-P15
 

Increasingly, post-secondary instructors are incorporating innovative teaching practices into their classrooms to improve student learning outcomes. In order to assess the effect of these techniques, it is helpful to quantify the types of activity being conducted in the classroom. Unfortunately, self-reporting is unreliable and manual annotation is tedious and scales poorly. We introduce a set of deep learning classifiers for automatic activity annotation, evaluating them on a collection of classroom recordings, with frames labeled as “single-voice” (primarily lecture), “multi-voice” (primarily group discussion), “no-voice” (primarily silent work) or “other.” We
find that our best approach obtains a 7.1% frame error rate (92.7% weighted F-measure) on held out class sessions from previously seen instructors (a 7 hr test set), and 10.1% error (89.1% weighted F-measure) on previously unseen instructors (a separate 18 hr test set). These represent 32% and 45% relative reductions in error over the existing state-of-the-art for this task. We also show that our estimates of how much classroom time spent per task are better correlated with actual time spent than existing systems.

up
0 users have voted: