An exhaustive search considered hundreds of published and unpublished articles. It included those that met the following criteria:
- Schools or classrooms using each program had to be compared to randomly assigned or well-matched control groups.
- Study duration had to be at least 12 weeks.
- Outcome measures had to be assessments of the reading content taught in all classes. Almost all are standardized tests or state assessments.
- The review placed particular emphasis on studies in which schools, teachers, or students were assigned at random to experimental or control groups.
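The first three criteria above can be read as a simple screening filter. The following is a minimal sketch only; the `CandidateStudy` record and its field names are hypothetical and are not taken from the review itself.

```python
from dataclasses import dataclass


@dataclass
class CandidateStudy:
    """Hypothetical record for one study considered by the review."""
    control_group: str           # "random", "matched", or "none" (assumed labels)
    duration_weeks: int          # length of the study in weeks
    measures_shared_content: bool  # outcome assesses reading content taught in all classes


def meets_inclusion_criteria(study: CandidateStudy) -> bool:
    """Return True if the study passes the three screening criteria listed above."""
    has_valid_control = study.control_group in {"random", "matched"}
    long_enough = study.duration_weeks >= 12
    return has_valid_control and long_enough and study.measures_shared_content


candidates = [
    CandidateStudy("random", 30, True),
    CandidateStudy("matched", 8, True),   # too short
    CandidateStudy("none", 36, True),     # no comparison group
]
included = [s for s in candidates if meets_inclusion_criteria(s)]
```

Only the first candidate passes all three checks in this illustration.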
Programs were rated according to the overall strength of the evidence supporting their effects on reading achievement. “Effect size” (ES) is the proportion of a standard deviation by which the treatment group's mean score exceeds the control group's. Mean effect sizes were computed by weighting each study's effect size by its sample size. The categories are as follows:
Strong Evidence of Effectiveness: At least two studies, one of which is a randomized or randomized quasi-experimental study, or multiple smaller studies, with a sample size-weighted effect size of at least +0.20, and a collective sample size across all studies of at least 250 students. To qualify for this category, effect sizes from the randomized studies must have a weighted mean effect size of at least +0.20.
Moderate Evidence of Effectiveness: At least two matched prospective studies, with a collective sample size of at least 250 students, and a weighted mean effect size of at least +0.20.
Limited Evidence of Effectiveness: Strong Evidence of Modest Effects: Studies meet the criteria for “moderate evidence of effectiveness” except that the weighted mean effect size is +0.10 to +0.19.
Limited Evidence of Effectiveness: Weak Evidence with Notable Effect: A weighted mean effect size of at least +0.20, based on one or more qualifying studies of any qualifying design that are insufficient in number or sample size to meet the criteria for “Moderate Evidence of Effectiveness.”
Insufficient Evidence of Effectiveness: Qualifying studies do not meet the criteria for “limited evidence of effectiveness.”
No Qualifying Studies: No studies meet inclusion standards.
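To make the rating rules concrete, the sketch below computes a sample size-weighted mean effect size and maps a set of qualifying studies to one of the categories above. It is an illustration under stated assumptions, not the reviewers' actual procedure: the `Study` record and the function names are hypothetical, study designs are reduced to two labels ("randomized" and "matched"), and the "multiple smaller studies" route to a Strong rating is not modeled.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Study:
    """Hypothetical record for one qualifying study."""
    n: int        # number of students
    es: float     # effect size in standard deviation units
    design: str   # "randomized" or "matched" (simplified labels)


def weighted_mean_es(studies: List[Study]) -> float:
    """Mean effect size, weighting each study by its sample size."""
    total_n = sum(s.n for s in studies)
    return sum(s.n * s.es for s in studies) / total_n


def rate_program(studies: List[Study]) -> str:
    """Assign an evidence category using the thresholds quoted in the text."""
    if not studies:
        return "No Qualifying Studies"

    total_n = sum(s.n for s in studies)
    mean_es = weighted_mean_es(studies)
    randomized = [s for s in studies if s.design == "randomized"]
    enough_studies = len(studies) >= 2 and total_n >= 250

    # Strong: also requires the randomized studies alone to average >= +0.20.
    if (enough_studies and randomized and mean_es >= 0.20
            and weighted_mean_es(randomized) >= 0.20):
        return "Strong Evidence of Effectiveness"
    # Moderate: simplification, any two qualifying studies are accepted here.
    if enough_studies and mean_es >= 0.20:
        return "Moderate Evidence of Effectiveness"
    if enough_studies and 0.10 <= mean_es < 0.20:
        return "Limited Evidence: Strong Evidence of Modest Effects"
    if mean_es >= 0.20:
        return "Limited Evidence: Weak Evidence with Notable Effect"
    return "Insufficient Evidence of Effectiveness"


example = [Study(200, 0.30, "randomized"), Study(100, 0.25, "matched")]
rating = rate_program(example)
```

In this example the two studies total 300 students, the weighted mean effect size is (200 × 0.30 + 100 × 0.25) / 300 ≈ +0.28, and the single randomized study has an effect size of +0.30, so the sketch returns the Strong category.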