SUMMARY OF RESEARCH REVIEWED BY THE FCQ COMMITTEE

by Mark R. Malone

Over the past eighteen years three major studies have synthesized the results of research in the area of student ratings of college level instruction. This report will provide a synopsis of these three studies. Review of these studies assisted our committee in suggesting changes to instruments and procedures for evaluating and improving instruction at the University. The three studies are presented in reverse chronological order.

The Cashin Study

The purpose of this report is to provide an overview of a study summarizing the conclusions of major reviews the student rating literature. The study draws from a body of over 1500 studies conducted regarding student evaluations of college teaching. This report was a synthesis research in this area compiled by William E. Cashin is available for $2 by requesting Idea Paper No. 32 from Kansas State University, 1615 Anderson Avenue, Manhattan, KS 66502-4073. The article concludes that the large body of literature in this area makes it possible to find individual studies to support almost any conclusion. However the body of research is broad enough to begin to draw conclusions about overall trends. These trends generally indicate that student ratings are statistically reliable, valid and relatively free from bias. It cautions that they are only one of many possible sources of data about teaching. This type of information should be coupled with multiple sources of data if they are to be used to make judgments about the quality of science teaching.

STUDY ASSUMPTIONS BASED ON LITERATURE

VARIOUS APPROACHES TO EVALUATING TEACHING

VARIABLES THAT GENERALLY DO NOT CORRELATE WITH STUDENT EVALUATIONS

Instructors often suspect that variables beyond their control create bias in student ratings. The research indicates that most factors considered in the research do not contribute significantly to bias in student ratings. The major factors addressed by the research as potential sources of bias are listed below.

VARIABLES THAT GENERALLY DO CORRELATE WITH STUDENT EVALUATIONS

The Scriven Study

The purpose of this report is to summarize a report regarding the utility of student ratings of university courses and instructors. The study summarized is Ratings offer useful input to teacher evaluations. This report was a synthesis research in this area compiled by Michael Scriven and is available as an ERIC Document No. 398240. This document is available on line through the ERIC Search Wizard (http://ericae.net/scripts/ewiz/amain2.asp) using the key words Scriven + student + evaluations. The article addresses concerns about the validity of student ratings and presents a case for their use in teacher evaluation. The article cautions that while student evaluations of teaching have merit, they should not be the only sources used for determining the merit of teaching.

CONCERNS

ARGUMENTS FOR USING STUDENT RATINGS

PROBLEMS WITH EVALUATION FORMS

Most student evaluation forms are invalid as a basis for making personnel decisions because they tend to make a multitude of common errors including:

SUGGESTIONS FOR IMPROVING THE VALIDITY OF STUDENT RATINGS

If student ratings are to be valid, they must be properly administered, stringently controlled, and include a thorough analysis of test results. Common errors include:

The Cohen Study

The purpose of this report is to summarize the findings of the most recent meta-analysis of studies researching the validity of student ratings of instructors in college courses. The study summarized was Student ratings of instruction and student achievement: a meta-analysis of multisection validity studies. The study was conducted by Peter A. Cohen and was reported in the Review of Educational Research, Fall, 1981, vol. 51 (3), pp. 281-309. While this study is seventeen years old, it is the most recent comprehensive meta-analysis study available.

STUDY ASSUMPTIONS

LIMITATIONS

STUDY FINDINGS

The study calculated a correlation for each of the following dimensions of teaching with measures of student achievement. Each dimension of teaching is followed by its correlation with student achievement. The reporting of “n” indicates the number of studies summarized not the number of students in these studies.