|国家预印本平台
首页|统计学习能力测量的信度研究:基于模态、材料特征和测试任务的证据

统计学习能力测量的信度研究:基于模态、材料特征和测试任务的证据

Reliability Study of Statistical Learning Ability Measurement: Evidence from Modality, Material and Task

中文摘要英文摘要

统计学习能力常作为自变量预测个体语言能力的发展,但以组间差异为目标编制的实验任务普遍信度不高,难以满足心理测量学的基本要求。本研究使用混合长度的目标结构合成学习材料,并且使用了二选一迫选和熟悉度评分两种测试任务,本研究设计了听觉语音和视觉图形两种模态,计算了测验的内部一致性系数和分半信度。结果发现使用混合长度目标结构编制实验任务的信度普遍高于以往研究;同时,视觉模态下的信度要高于听觉模态;前者迫选任务的信度也要高于熟悉度评分任务。综上,本研究推荐在视觉模态下,使用混合长度的目标结构合成学习材料,同时以迫选任务考察被试的统计学习能力。

Research has considered statistical learning (SL) as a fundamental learning mechanism in cognition, for which individuals rely on the statistical regularities from visual and verbal input during information processing. Take the verbal SL task as an example, participants are first exposed to a nonsensical artificial language or visual sequence for 5~10 mins and then asked to finish a 2 alternative forced choice task (2AFC). Accuracy on each trial is coded in a dichotomous manner, with 0 for incorrect and 1 for correct, and aggregated across participants to generate the mean accuracy of the group. If it is higher than chance level, it is assumed that learning has occurred. This research perspective is called the perspective of inter group differences.However, this index?is the result from the perspective of inter-group differences, which is suitable for judging whether the test group exhibits statistical learning effects,?but not measuring the relationship between SL ability?and other cognitive ability. Some researchers criticized the low reliability of SL tasks and suggested that the task results are not psychometrically satisfactory. In the current study, we aimed to put forward a modified SL task that is relatively more comprehensive. Two aspects of traditional tasks have been modified; one is that we constructed learning materials with mixed-lengths targets, and another is that we employed a familiarity rating task to measure learning outcomes in addition to the 2AFC task.A total of 159?participants took part in our experiment. Two types of reliability Cronbachs alpha coefficient and split-half reliability were computed with the?reliability?function in R. The results of this study are divided into three aspects. Firstly, the index of two types of reliability in the current study are better than previous studies. This indicates that?the learning materials we constructed with mixed-length nonsensical words exhibit some advantages?in reliability. Secondly, the results revealed that both the Cronbachs alpha coefficient and split-half reliability of statistical learning tasks in the visual modality were higher than those in the auditory modality, which is consistent with the opinion of Siegelman?(2018a). Then, the reliability of forced-choice tasks in the visual modality was higher than that of familiarity rating tasks, suggesting the results obtained from 2AFC task are more stable and consistent across participants.The current study explored the task in measuring SL ability,?underscoring?the importance of using mixed-length learning materials and suggest employing visual stimuli in assessing statistical learning abilities in addition to the traditional utilization of forced-choice tasks during the testing phase. Future studies should not only focus on designing brief SL tasks for children and language disorder population that align with psychometric standards, but also rethink the cognitive mechanism underlying various SL task.

梁丹丹、于文勃、王天琳、亓鹤潼

科学、科学研究教育数学

统计学习能力信度迫选任务熟悉度评分任务

statistical learning2-alternative forced choice taskfamiliarity rating taskreliability

梁丹丹,于文勃,王天琳,亓鹤潼.统计学习能力测量的信度研究:基于模态、材料特征和测试任务的证据[EB/OL].(2025-01-02)[2025-08-21].https://chinaxiv.org/abs/202408.00250.点此复制

评论