Comparison ofe-rater® Automated Essay Scoring Model Calibration Methods Based on Distributional Targets
Autor: | David M. Williamson, Catherine Trapani, F. Jay Breyer, Mo Zhang |
---|---|
Rok vydání: | 2012 |
Předmět: | |
Zdroj: | International Journal of Testing. 12:345-364 |
ISSN: | 1532-7574 1530-5058 |
Popis: | This article describes two separate, related studies that provide insight into the effectiveness of e-rater score calibration methods based on different distributional targets. In the first study, we developed and evaluated a new type of e-rater scoring model that was cost-effective and applicable under conditions of absent human rating and small candidate volume. This new model type, called the Scale Midpoint Model, outperformed an existing e-rater scoring model that is often adopted by certain e-rater system users without modification. In the second study, we examined the impact of three distributional score calibration approaches on existing models’ performance. These approaches included percentile calibrations on e-rater scores in accordance with a human rating distribution, normal distribution, and uniform distribution. Results indicated that these score calibration approaches did not have overall positive effects on the performance of existing e-rater scoring models. |
Databáze: | OpenAIRE |
Externí odkaz: | |
Nepřihlášeným uživatelům se plný text nezobrazuje | K zobrazení výsledku je třeba se přihlásit. |