Zobrazeno 1 - 10
of 76
pro vyhledávání: '"David M. Williamson"'
Publikováno v:
Challenges in Mechanics of Time-Dependent Materials & Mechanics of Biological Systems and Materials, Volume 2 ISBN: 9783031174568
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::6eb89489f14af0326036c8206e76255a
https://doi.org/10.1007/978-3-031-17457-5_3
https://doi.org/10.1007/978-3-031-17457-5_3
Autor:
David M. Williamson
Publikováno v:
Handbook of Automated Scoring ISBN: 9781351264808
Handbook of Automated Scoring
Handbook of Automated Scoring
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::480d8e58dcebea2af246bb4c6cee3022
https://doi.org/10.1201/9781351264808-17
https://doi.org/10.1201/9781351264808-17
Publikováno v:
ETS Research Report Series. 2018:1-31
Publikováno v:
ETS Research Report Series. 2015:1-28
Automated scoring models were trained and evaluated for the essay task in the Praxis I® writing test. Prompt-specific and generic e-rater® scoring models were built, and evaluation statistics, such as quadratic weighted kappa, Pearson correlation,
Autor:
Matthew Duchnowski, Chaitanya Ramineni, F. Jay Breyer, April Harris, David M. Williamson, Yigal Attali, Laura Ridolfi-McCulla
Publikováno v:
ETS Research Report Series. 2014:1-66
In this research, we investigated the feasibility of implementing the e-rater® scoring engine as a check score in place of all-human scoring for the Graduate Record Examinations® (GRE®) revised General Test (rGRE) Analytical Writing measure. This
Autor:
David M. Williamson, Norbert Elliot
Publikováno v:
Assessing Writing. 18:1-6
Publikováno v:
Assessing Writing. 18:25-39
In this paper, we provide an overview of psychometric procedures and guidelines Educational Testing Service (ETS) uses to evaluate automated essay scoring for operational use. We briefly describe the e-rater system, the procedures and criteria used t
Publikováno v:
International Journal of Testing. 12:345-364
This article describes two separate, related studies that provide insight into the effectiveness of e-rater score calibration methods based on different distributional targets. In the first study, we developed and evaluated a new type of e-rater scor
Bayesian inference networks, a synthesis of statistics and expert systems, have advanced reasoning under uncertainty in medicine, business, and social sciences. This innovative volume is the first comprehensive treatment exploring how they can be app
Publikováno v:
Language Testing. 29:371-394
This paper compares two alternative scoring methods – multiple regression and classification trees – for an automated speech scoring system used in a practice environment. The two methods were evaluated on two criteria: construct representation a