Stanine (STAndard NINE) is a method of scaling test scores on a nine-point standard scale with a mean of five and a standard deviation of two.
Some web sources attribute stanines to the U.S. Army Air Forces during World War II. Psychometric legend has it that a 1–9 scale was used because of the compactness of recording the score as a single digit but Thorndike[1] claims that by reducing scores to just nine values, stanines "reduce the tendency to try to interpret small score differences (p. 131)". The earliest known use of stanines was by the U.S. Army Air Forces in 1942.[2]
Calculation
Test scores are scaled to stanine scores using the following algorithm:
- Rank results from lowest to highest
- Give the lowest 4% a stanine of 1, the next 7% a stanine of 2, etc., according to the following table:
Bracketed proportion | 4% | 7% | 12% | 17% | 20% | 17% | 12% | 7% | 4% |
Stanine | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
Standardized score | below −1.75 | −1.75 to −1.25 | −1.25 to −0.75 | −0.75 to −0.25 | −0.25 to +0.25 | +0.25 to +0.75 | +0.75 to +1.25 | +1.25 to +1.75 | above +1.75 |
Wechsler scale score | below 74 | 74 to 81 | 81 to 89 | 89 to 96 | 96 to 104 | 104 to 111 | 111 to 119 | 119 to 126 | above 126 |
Cumulative proportion | 4% | 11% | 23% | 40% | 60% | 77% | 89% | 96% | 100% |
The underlying basis for obtaining stanines is that a normal distribution is divided into nine intervals, each of which has a width of 0.5 standard deviations excluding the first and last, which are just the remainder (the tails of the distribution). The median lies at the centre of the fifth interval.
Use today
Today stanines are mostly used in educational assessment.
- The University of Alberta in Edmonton, Alberta, Canada used the stanine system until 2003, when it switched to a 4-point scale.[3]
- In the United States, the Educational Records Bureau (they administer the "ERBs") reports test scores as stanines and percentiles.
- The New Zealand Council for Educational Research uses stanines.[4]
- GL Assessment use stanines alongside SAS (Standardised Age Scores) to express the results of its CAT4 assessments, used in many UK and British international schools [5]
- The Otis-Lennon School Ability Test uses a stanine system along with percentiles.
- Average Korean High School uses stanine system to evaluate their students
- The IDF (Israeli Defense Force) uses the stanine grading system ranging from 10 to 90 (10,20,30 and so on) to rank intelligence ability relevant to the army's use, determined by a 100 question test divided to 4 categories having to do with different uses and implications of cognitive abilities
- The Polish Matura secondary-school exam results and university admissions utilise the stanine system
See also
- Sten scores – a similar system, but with 10 possible values
- Normal score
Notes
- ↑ Thorndike, R. L. (1982). Applied Psychometrics. Boston, MA: Houghton Mifflin
- ↑ Krueger Hussey, A. (2004). Air Force Flight Screening: Evolutionary Changes, 1917-2003. Office of History and Research Headquarters Air Education and Training Command Randolph AFB, Texas
- ↑ "Grade Comparison Guide". Archived from the original on 2006-12-12. Retrieved 2007-01-04.
- ↑ "Understanding Stanines", nzcersupport.org.nz
- ↑ "GL Assessment"
References
- Ballew, Pat (c. 2002). "Origins of some arithmetic terms". Math words. pballew.net. p. 3. Archived from the original on 2016-06-05. Retrieved 26 December 2004.
- Boydsten, Robert E. (27 February 2000). "Winning My Wings". boydstonfoundation.org. Archived from the original on 2008-05-14.
- Craven, Wesley Frank; Cate, James Lea, eds. (1983) [1955]. Men and Planes (PDF). The Army Air Forces in World War II. Vol. 6 (new imprint ed.). Chicago, IL: University of Chicago Press (1955) / Office of Air Force History (1983). ISBN 0-912799-03-X. LCCN 48-3657. AFD-101105-019. Archived from the original (PDF) on 2016-11-23 – via afhso.af.mil.