Threshold values for significant changes in test-retest difference scores for the Wechsler Intelligence Scale for Children – Fourth Edition

Lina Pezzuti, James Dawe, Marco Lauriola

Accepted August 31, 2022

First published August 30, 2022

https://doi.org/10.26387/bpa.2022.00004

Abstract

One of the purposes of administering intelligence scales is to assess changes in cognitive functioning
over time, from a few days to several years, to determine whether the examinee has progressed or regressed after
treatment or other events. (e.g., an accident, a rehabilitation, etc.). The present research aimed to study the short-term
practice effect of the Wechsler Intelligence Scales for Children – Fourth Edition and provide threshold values that allow
practitioners to assess whether there are true differences in individual performance or whether these differences are due
to chance. A sample of 440 subjects was administered the WISC-IV twice with an average interval of 30 days. The results
show that practice is more pronounced when using raw subtest scores than when using weighted scores. Threshold
values for assessing significant change in subtests and indices were obtained. For example, for the Full-Scale Intelligence
Quotient, a difference between 6 and 27 IQ points between the first and second administration indicates a practice effect.
Conversely, if the difference is equal to or less than 5 IQ points, then there was a decline, while if it is equal to or greater
than 28 IQ points, there was an increase in performance not due to the practice effect. Therefore, these data should
allow practitioners to more accurately assess the clinical significance of observed changes during a short-term dual
administration.

References

ANDERSON, P.L., CRONIN, M.E. & KAZMIERSKI, S. (1989).WISC-Rstabilityandre-evaluationoflearning-disabledstudents.Journal of Clinical Psychology, 45, 941-944.
doi.org/10.1002/1097-4679(198911)45:6_941: AID-JCLP2270450619_3.0.CO;2-P
BARTOI, M., ISSNER, J.B., HETTERSCHEIDT, L., JANUARY, A.M.,KUENTZEL, J.G. & BARNETT, D. (2015). Attention problemsand stability of WISC-IV scores among clinically referredchildren. Applied Neuropsychology: Child, 4, 133-140. http://dx.
doi.org/10.1080/21622965.2013.811075
BASSO, M.R., CARONA, F.D., LOWERY, N. & AXELROD, B.N.(2002). Practice effects on the WAIS-III across 3- and 6-monthintervals. The Clinical Neuropsychologist, 16, 57-63. http://dx.
doi.org/10.1076/clin.16.1.57.8329
BAUMAN, E. (1991). Determinants of WISC-R subtest stability inchildren with learning difficulties. Journal of Clinical Psychology,47, 430-435.
doi.org/10.1002/1097-4679(199105)47:3_430::AID-JCLP2270470317_3.0.CO;2-N
BROOKS, B.L., STRAUSS, E., SHERMAN, E.M.S., IVERSON, G.L.& SLICK, D.J. (2009). Developments in neuropsychologicalassessment: Refining psychometric and clinical interpretivemethods. Canadian Psychology, 50, 196-209. http://dx.
doi.org/10.1037/a0016066
CANIVEZ, G.L. & WATKINS, M.W. (1998). Long-term stabilityof the Wechsler Intelligence Scale for Children – ThirdEdition. Psychological Assessment, 10 (3), 285-291. https:/
doi.org/10.1037/1040-3590.10.3.285
CANIVEZ, G.L. & WATKINS, M.W. (1999). Long-term stabilityof the Wechsler Intelligence Scale for Children – Third Editionamong demographic subgroups: Gender, race/ethnicity,and age. Journal of Psychological Assessment, 17, 300-313.
doi.org/10.1177/073428299901700401
CANIVEZ, G.L. & WATKINS, M.W. (2001). Long-term stabilityof the Wechsler Intelligence Scale for Children – Third Editionamong students with disabilities. School Psychology Review, 30,438-453.
CHARTER, R.A. (2003). Study samples are too small to producesufficiently precise reliability coefficients. The Journal of GeneralPsychology, 130 (2), 117-129.
CHELUNE, G.J. (2003). Assessing reliable neuropsychologicalchange. In R.D. Franklin (Ed.), Prediction in forensic andneuropsychology: Sound statistical practices. Erlbaum.
CHELUNE, G.J., NAUGLE, R.I., LÜDERS, H., SEDLAK, J. & AWAD,I.A. (1993). Individual change after epilepsy surgery: Practiceeffects and base-rate information. Neuropsychology, 7, 41-52.https:/
doi.org/10.1037/0894-4105.7.1.41
CHEN, Z. & SIEGLER, R.S. (2000). Intellectual developmentin childhood. In R.J. Sternberg (Ed.), Handbook ofintelligence. Cambridge University Press.
doi.org/10.1017/CBO9780511807947.006
COHEN, J. (1994). The earth is round (p<.05). American Psychologist,4 (12), 997-1003.
<a title="COHEN, J. (1994). The earth is round (p
COHEN, J. (1988). Statistical power analysis for the behavioralsciences, 2nd ed. Lawrence Erlbaum Associates.
CONLEY, J.J. (1984). The hierarchy of consistency: A review andmodel of longitudinal findings on adult individual differencesin intelligence, personality and self-opinion. Personality andIndividual Differences, 5, 11-25.
DEARY, I.J., PATTIE, A. & STARR, J.M. (2013). The stability ofintelligence from age 11 to age 90 years: The Lothian birth cohortof 1921. Psychological Science, 24, 2361-2368. http://dx.
doi.org/10.1177/0956797613486487
DEARY, I.J., WHALLEY, L.J., LEMMON, H., CRAWFORD, J.R. &STARR, J.M. (2000). The stability of individual differences inmental ability from childhood to old age: Follow-up of the 1932Scottish Mental Survey. Intelligence, 28, 49-55. http://dx.
doi.org/10.1016/S0160-2896(99)00031-8
ELLZEY, J.T. & KARNES, F.A. (1990). Test-retest stability of WISC-RIQs among gifted students. Psychological Reports, 66, 1023-1026.
doi.org/10.2466/pr0.1990.66.3.1023
ESTEVIS, E., BASSO, M.R. & COMBS, D. (2012). Effects of practiceon the Wechsler Adult Intelligence Scale – IV across 3- and6-month intervals. The Clinical Neuropsychologist, 26, 239-254.http://dx.
doi.org/10.1080/13854046.2012.659219
HEILBRONNER, R.L., SWEET, J.J., ATTIX, D.K., KRULL, K.R.,HENRY, G.K. & HART, R.P. (2010). Official position of theAmerican Academy of Clinical Neuropsychology on serialneuropsychological assessments: The utility and challenges ofrepeat test administrations in clinical and forensic contexts. TheClinical Neuropsychologist, 24, 1267-1278.
doi.org/10.1080/13854046.2010.526785
HUNT, E. (2010). Human intelligence. Cambridge University Press.
JACOBSON, N.S., FOLLETTE, W.C. & REVENSTROF, D. (1984).Psychotherapy outcome research: Methods for reportingvariability and evaluating clinical significance. Behavior Therapy,15, 336-352.
JACOBSON, N.S. & TRUAX, P. (1991). Clinical significance:A statistical approach to defining meaningful change inpsychotherapy research. Journal of Consulting and ClinicalPsychology, 59, 12-19.
JOHNSON, W., GOW, A.J., CORLEY, S., STARR, J.M. & DEARY,I.J. (2010). Location in cognitive and residential space at age70 reflects a lifelong trait over parental and environmentalcircumstances: The Lothian birth cohort 1936. Intelligence, 38,402-411.
doi.org/10.1016/j.intell.2010.04.001
KAUFMAN, A.S. & KAUFMAN, N.L. (2008). KABC-II batterie pourl’examen psychologique de l’enfant (2 éd.). ECPA.
KIENG, S., ROSSIER, J., FAVEZ, N., GEISTLICH, S. & LECERF, T.(2015). Stabilité à long terme des scores du WISC-IV: Forcesetfaiblesses personnelles. Pratiques psychologiques, 21, 137-154.http://dx.
doi.org/10.1016/j.prps.2015.03.002
LANDER, J. (2010). Long-term stability of scores on the WechslerIntelligence Scale for Children – Fourth Edition in children withlearning disabilities. Fairleigh Dickinson University.
LECERF, T., KIENG, S. & GEISTLICH, S. (2017). WISC-IV: Valeursseuils pour des changements significatifs des scores de différencetest-retest [WISC-IV: Cutoff values for meaningful test-retestdifference scores]. Pratiques Psychologiques, 23 (4), 345-358.https:/
doi.org/10.1016/j.prps.2016.07.003
LEMAY, S., BEDARD, M.A., ROULEAU, I. & TREMBLAY, P.L.(2004). Practice effect and test-retest reliability of attentional andexecutive tests in middle-aged to elderly subjects. The ClinicalNeuropsychologist,18,284-302.
doi.org/10.1080/13854040490501718
MACKINTOSH, N.J. (1998). IQ and human intelligence. New York,NY: Oxford University Press.
MOFFITT, T.E., CASPI, A., HARKNESS, A.R. & SILVA, P.A. (1993).The natural history of change in intellectual performance:Who changes? How much? Is it meaningful? Child Psychology& Psychiatry & Allied Disciplines, 34 (4), 455-506. https:/
doi.org/10.1111/j.1469-7610.1993.tb01031.x
MOSIER, C.I. (1943). On the reliability of a weighted composite.Psychometrika, 8, 161-168. https:/
doi.org/10.1007/BF02288700
OKADA, S., KAWASAKI, Y., SHINOMIYA, M., HOSHINO, H.,INO, T., SAKAI, K., … & NIWA, S.I. (2021). Long-term stabilityof the WISC-IV in children with autism spectrum disorder.International Journal of School & Educational Psychology, 1-12.https:/
doi.org/10.1080/21683603.2021.1930307
ORSINI,A.,PEZZUTI,L.&PICONE,L.(2012).WISC-IV.Contributoalla taratura italiana. Firenze: Giunti O.S. OrganizzazioniSpeciali.
REEVE, C.L. & BONACCIO, S. (2011). On the myth and the realityof the temporal validity degradation of general mental ability testscores. Intelligence, 39, 255-272.
doi.org/10.1016/j.intell.2011.06.009
REUCHLIN, M. (1992). Introduction à la recherche en psychologie.Nathan.
REVELLE, W. (2010). An introduction to psychometric theory withapplications in R. Retrieved from http://www.personality-project.org/r/book/
RYAN, J.J., GLASS, L.A. & BARTELS, J.M. (2010). Stability ofthe WISC-IV in a sample of elementary and middle schoolchildren. Applied Neuropsychology, 17, 68-72. http://dx.
doi.org/10.1080/09084280903297933Saklofske
SALTHOUSE,T.(2014).Frequentassessmentsmayobscurecognitivedecline. Psychological Assessment, 26, 1063-1069. http://dx.
doi.org/10.1037/pas0000007
SATTLER, J.M. (2008). Assessment of children. Cognitive foundations(5th ed.). Jerome M. Sattler, Publisher, Inc.
SHERMAN, E.M.S., BROOKS, B.L., IVERSON, G.L., SLICK, D.J. &STRAUSS, E. (2011). Reliability and validity in neuropsychology.In M.R. Schoenberg & J.G. Scott (Eds.), The little black book ofneuropsychology. Springer.
SHROUT, P.E. & FLEISS, J.L. (1979). Intraclass correlations: Uses inassessing rater reliability. Psychological Bulletin, 86 (2), 420-428.
SIMONTON, D.K. (2011). Exceptional talent and genius. In T.Chamorro-Premuzic, S. von Stumm & A. Furnham (Eds.), Wiley-Blackwell handbook of individual differences. Blackwell.
STRAUSS, E., SHERMAN, E.M.S. & SPREEN, O. (2006). Acompendium of neuropsychological tests: Administration, norms,and commentary. New York, NY: Oxford University Press.
TRUSCOTT, S.D., NARRETT, C.M. & SMITH, S.E. (1994).WISC-R subtest reliability overtime: Implications for practiceand research. Psychological Reports, 74, 147-156. http://dx.
doi.org/10.2466/pr0.1994.74.1.147
WATKINS, M.W. & CANIVEZ, G.L. (2004). Temporal stabilityof WISC-III subtest composite: Strengths and weaknesses.Psychological Assessment, 16, 133-138.
WATKINS, M.W. & SMITH, L.G. (2013). Long-term stabilityof the Wechsler Intelligence Scale for Children – FourthEdition. Psychological Assessment, 25 (2), 477-483. https:/
doi.org/10.1037/a0031653
WATSON, D. (2004). Stability versus change, dependabilityversus error: Issues in the assessment of personality over time.Journal of Research in Personality, 38, 319-350.
doi.org/10.1016/j.jrp.2004.03.001
WECHSLER, D. (1949). Manual for the Wechsler Intelligence Scale forChildren. Psychological Corporation.
WECHSLER, D. (1974). Manual for the Wechsler Intelligence Scale forChildren – Revised. Psychological Corporation.
WECHSLER, D. (1991). WISC-III manual. PsychologicalCorporation.WECHSLER, D. (2003a). WISC-IV administration and scoringmanual. Psychological Corporation.WECHSLER, D. (2003b). WISC-IV technical and interpretive manual.Psychological Corporation.
WECHSLER, D. (2012). WISC-IV. Manuale di somministrazione escoring. It. ad. A. Orsini & L. Pezzuti (Eds.). Firenze: Giunti O.S.Organizzazioni Speciali.
WRIGHT, A. J. (2011). Conducting psychological assessment: A guidefor practitioners.Declaration of conflicting interests. One of the authors (L. Pezzuti) receivesroyalties from sales of the WISC-IV (Wechsler, 2012, It ad. Orsini & Pezzuti).

SHOW ALL REFERENCES (53)HIDE REFERENCES

Intelligence profiles of children and adolescents with High functioning autism spectrum disorder
Riccardo Alessandrelli, Claudia Di Bucchianico, Valeria Mancini, Dominga Marfisi, Tatiana Bortolatto, Candida Marchione, Luana Pitturelli, Maria Elena Di Bucchianico, Antonietta Vassalli, Morena Farese, James Dawe, Lina Pezzuti
First published November 20, 2025
Relation between parents’ education and sons’ intellectual profile on Wechsler Intelligence Scale for Children – Fourth Edition
Lina Pezzuti, James Dawe
First published November 20, 2025

DOWNLOAD PDF

Issue:

Issue 294

Keywords:

Views:

2825

Downloads:

550

Cite the article:

Pezzuti Lina. Dawe James. Lauriola Marco. Threshold values for significant changes in test-retest difference scores for the Wechsler Intelligence Scale for Children – Fourth Edition. BPA Applied Psychology Bulletin. 2022;294(1):14-27. doi:10.26387/bpa.294.1.

CITE THE ARTICLE

5527

Threshold values for significant changes in test-retest difference scores for the Wechsler Intelligence Scale for Children – Fourth Edition

Abstract

References

Related articles

Intelligence profiles of children and adolescents with High functioning autism spectrum disorder

Relation between parents’ education and sons’ intellectual profile on Wechsler Intelligence Scale for Children – Fourth Edition

Article info

Issue:

Keywords:

Views:

Downloads:

Cite the article:

BPA - BULLETIN OF APPLIED PSYHOLOGY

MANAGEMENT AND EDITORIAL OFFICE

ABOUT US

Citation tool

How to cite this article