Using Large Language Models for Automatic Item Generation: Development and Validation for TIMSS Fourth Grade
Full report, supplement, and annexes
Muszyński, Marek
(2025)
Download the report (3.13 MB)
Download supplement (37.53 MB)
Download Annex 1 (6.69 MB)
Download Annex 2 (1.53 MB)
Download Annex 3 (3.87 MB)
Download Annex 4 (227.52 KB)
Download Annex 5 (110.71 KB)
The study aimed to validate the quality of assessment items generated by Large Language Models for use in the TIMSS fourth-grade mathematics and science assessment. This publication includes the report, a psychometric analysis supplement, and five annex documents. The research for this publication was supported under the third IEA Research and Development call.
