I have had to evaluate written tests, and I went through "calibration" first, where I evaluated writing that an expert had already evaluated. I had to match the expert evaluations in order to do solo evaluations. If I only matched 30-60% of the time, I wouldn't have been allowed to evaluate AT ALL
add a skeleton here at some point
1 day ago