Developing initial notions of variability when learning about box plots

Feb 10, 2025·
Martin Abt
,
Katharina Loibl
,
Timo Leuders
,
Wim Van Dooren
Frank Reinhold
Frank Reinhold
· 0 min read
Abstract
In the boxplot, the box always represents - regardless of its area - the middle half of the data and thus a measure of variability (interquartile range). However, when students first learn about boxplots, they are usual already familiar with other forms of statistical representations (e.g., bar or circle graphs) in which a larger area represents a higher frequency of observations. If students erroneously apply this well-established area-represents-frequency schema to boxplots, it results in a systematic error which we describe as the consequence of an incomplete conceptual change. We empirically validated difficulty-generating characteristics that allow the differentiation between item types with varying complexity (item level) and aimed to identify profiles (person level) that differ depending on which schema was used in which item type. For this purpose, we conducted two cross-sectional studies with N = 100 university students (study 1) and N = 297 participants who finished secondary school or higher (study 2) and used generalized linear mixed models (item level) and k-means clustering with predefined cluster centers (person level) to test our hypotheses. We could replicate the systematic error that was described in previous research and found new difficulty-generating characteristics in boxplot items. Our results support the notion of different profiles potentially emerging based on varying degrees of conceptual change. From an instructional perspective, information about individual progress in conceptual change could be considered for tailoring individualized interventions.
Type
Publication
Educational Studies in Mathematics