Threshold values for significant changes in test-retest difference scores for the Wechsler Intelligence Scale for Children – Fourth Edition

Lina Pezzuti, James Dawe, Marco Lauriola

Accepted August 31, 2022

First published August 30, 2022

https://doi.org/10.26387/bpa.2022.00004

Abstract

One of the purposes of administering intelligence scales is to assess changes in cognitive functioning
over time, from a few days to several years, to determine whether the examinee has progressed or regressed after
treatment or other events (e.g., an accident or rehabilitation). The present research aimed to study the short-term
practice effect of the Wechsler Intelligence Scale for Children – Fourth Edition (WISC-IV) and to provide threshold values that allow
practitioners to assess whether differences in individual performance reflect true change or are due
to chance. A sample of 440 subjects was administered the WISC-IV twice, with an average interval of 30 days. The results
show that the practice effect is more pronounced for raw subtest scores than for weighted scores. Threshold
values for assessing significant change in subtests and indices were obtained. For example, for the Full-Scale Intelligence
Quotient, a difference of 6 to 27 IQ points between the first and second administration indicates a practice effect.
Conversely, a difference of 5 IQ points or fewer indicates a decline, while a difference of 28 IQ points or more
indicates an increase in performance not due to the practice effect. These data should therefore
allow practitioners to assess more accurately the clinical significance of changes observed across a short-term dual
administration.
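
As a minimal illustration of how the Full-Scale IQ cut-offs quoted in the abstract could be applied, the Python sketch below classifies a retest difference. The function name `classify_fsiq_change`, the category labels, and the convention that the difference is the second score minus the first are assumptions made for illustration only. The cut-offs (5, 6 to 27, 28) are the only values taken from the abstract, and they refer to the FSIQ over a short retest interval (about 30 days on average); thresholds for subtests and other indices are not given here.

```python
def classify_fsiq_change(first_fsiq: int, second_fsiq: int) -> str:
    """Classify a short-term FSIQ retest difference.

    Assumption: difference = second administration minus first.
    Cut-offs are the FSIQ values quoted in the abstract.
    """
    diff = second_fsiq - first_fsiq
    if diff <= 5:
        # Gain smaller than the expected practice effect
        return "decline"
    if diff >= 28:
        # Gain larger than the practice effect can account for
        return "increase beyond practice"
    # 6 to 27 points: consistent with a practice effect
    return "practice effect"


if __name__ == "__main__":
    # Hypothetical score pairs, chosen only to exercise each branch
    for first, second in [(100, 103), (100, 112), (100, 130)]:
        print(first, second, classify_fsiq_change(first, second))
```

Note that under these cut-offs a small positive gain (for example, 3 points) falls in the "decline" category, because it is lower than the gain expected from practice alone.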

References

  • ANDERSON, P.L., CRONIN, M.E. & KAZMIERSKI, S. (1989). WISC-R stability and re-evaluation of learning-disabled students. Journal of Clinical Psychology, 45, 941-944. https://doi.org/10.1002/1097-4679(198911)45:6<941::AID-JCLP2270450619>3.0.CO;2-P

  • BARTOI, M., ISSNER, J.B., HETTERSCHEIDT, L., JANUARY, A.M., KUENTZEL, J.G. & BARNETT, D. (2015). Attention problems and stability of WISC-IV scores among clinically referred children. Applied Neuropsychology: Child, 4, 133-140. http://dx.doi.org/10.1080/21622965.2013.811075

  • BASSO, M.R., CARONA, F.D., LOWERY, N. & AXELROD, B.N. (2002). Practice effects on the WAIS-III across 3- and 6-month intervals. The Clinical Neuropsychologist, 16, 57-63. http://dx.doi.org/10.1076/clin.16.1.57.8329

  • BAUMAN, E. (1991). Determinants of WISC-R subtest stability in children with learning difficulties. Journal of Clinical Psychology, 47, 430-435. https://doi.org/10.1002/1097-4679(199105)47:3<430::AID-JCLP2270470317>3.0.CO;2-N

  • BROOKS, B.L., STRAUSS, E., SHERMAN, E.M.S., IVERSON, G.L. & SLICK, D.J. (2009). Developments in neuropsychological assessment: Refining psychometric and clinical interpretive methods. Canadian Psychology, 50, 196-209. http://dx.doi.org/10.1037/a0016066

  • CANIVEZ, G.L. & WATKINS, M.W. (1998). Long-term stability of the Wechsler Intelligence Scale for Children – Third Edition. Psychological Assessment, 10 (3), 285-291. https://doi.org/10.1037/1040-3590.10.3.285

  • CANIVEZ, G.L. & WATKINS, M.W. (1999). Long-term stability of the Wechsler Intelligence Scale for Children – Third Edition among demographic subgroups: Gender, race/ethnicity, and age. Journal of Psychoeducational Assessment, 17, 300-313. https://doi.org/10.1177/073428299901700401

  • CANIVEZ, G.L. & WATKINS, M.W. (2001). Long-term stability of the Wechsler Intelligence Scale for Children – Third Edition among students with disabilities. School Psychology Review, 30, 438-453.

  • CHARTER, R.A. (2003). Study samples are too small to produce sufficiently precise reliability coefficients. The Journal of General Psychology, 130 (2), 117-129.

  • CHELUNE, G.J. (2003). Assessing reliable neuropsychological change. In R.D. Franklin (Ed.), Prediction in forensic and neuropsychology: Sound statistical practices. Erlbaum.

  • CHELUNE, G.J., NAUGLE, R.I., LÜDERS, H., SEDLAK, J. & AWAD, I.A. (1993). Individual change after epilepsy surgery: Practice effects and base-rate information. Neuropsychology, 7, 41-52. https://doi.org/10.1037/0894-4105.7.1.41

  • CHEN, Z. & SIEGLER, R.S. (2000). Intellectual development in childhood. In R.J. Sternberg (Ed.), Handbook of intelligence. Cambridge University Press. https://doi.org/10.1017/CBO9780511807947.006

  • COHEN, J. (1994). The earth is round (p<.05). American Psychologist, 49 (12), 997-1003.

  • COHEN, J. (1988). Statistical power analysis for the behavioral sciences, 2nd ed. Lawrence Erlbaum Associates.

  • CONLEY, J.J. (1984). The hierarchy of consistency: A review and model of longitudinal findings on adult individual differences in intelligence, personality and self-opinion. Personality and Individual Differences, 5, 11-25.

  • DEARY, I.J., PATTIE, A. & STARR, J.M. (2013). The stability of intelligence from age 11 to age 90 years: The Lothian birth cohort of 1921. Psychological Science, 24, 2361-2368. http://dx.doi.org/10.1177/0956797613486487

  • DEARY, I.J., WHALLEY, L.J., LEMMON, H., CRAWFORD, J.R. & STARR, J.M. (2000). The stability of individual differences in mental ability from childhood to old age: Follow-up of the 1932 Scottish Mental Survey. Intelligence, 28, 49-55. http://dx.doi.org/10.1016/S0160-2896(99)00031-8

  • ELLZEY, J.T. & KARNES, F.A. (1990). Test-retest stability of WISC-R IQs among gifted students. Psychological Reports, 66, 1023-1026. https://doi.org/10.2466/pr0.1990.66.3.1023

  • ESTEVIS, E., BASSO, M.R. & COMBS, D. (2012). Effects of practice on the Wechsler Adult Intelligence Scale – IV across 3- and 6-month intervals. The Clinical Neuropsychologist, 26, 239-254. http://dx.doi.org/10.1080/13854046.2012.659219

  • HEILBRONNER, R.L., SWEET, J.J., ATTIX, D.K., KRULL, K.R., HENRY, G.K. & HART, R.P. (2010). Official position of the American Academy of Clinical Neuropsychology on serial neuropsychological assessments: The utility and challenges of repeat test administrations in clinical and forensic contexts. The Clinical Neuropsychologist, 24, 1267-1278. https://doi.org/10.1080/13854046.2010.526785

  • HUNT, E. (2010). Human intelligence. Cambridge University Press.

  • JACOBSON, N.S., FOLLETTE, W.C. & REVENSTORF, D. (1984). Psychotherapy outcome research: Methods for reporting variability and evaluating clinical significance. Behavior Therapy, 15, 336-352.

  • JACOBSON, N.S. & TRUAX, P. (1991). Clinical significance: A statistical approach to defining meaningful change in psychotherapy research. Journal of Consulting and Clinical Psychology, 59, 12-19.

  • JOHNSON, W., GOW, A.J., CORLEY, S., STARR, J.M. & DEARY, I.J. (2010). Location in cognitive and residential space at age 70 reflects a lifelong trait over parental and environmental circumstances: The Lothian birth cohort 1936. Intelligence, 38, 402-411. https://doi.org/10.1016/j.intell.2010.04.001

  • KAUFMAN, A.S. & KAUFMAN, N.L. (2008). KABC-II batterie pour l’examen psychologique de l’enfant (2 éd.). ECPA.

  • KIENG, S., ROSSIER, J., FAVEZ, N., GEISTLICH, S. & LECERF, T. (2015). Stabilité à long terme des scores du WISC-IV: Forces et faiblesses personnelles. Pratiques psychologiques, 21, 137-154. http://dx.doi.org/10.1016/j.prps.2015.03.002

  • LANDER, J. (2010). Long-term stability of scores on the Wechsler Intelligence Scale for Children – Fourth Edition in children with learning disabilities. Fairleigh Dickinson University.

  • LECERF, T., KIENG, S. & GEISTLICH, S. (2017). WISC-IV: Valeurs seuils pour des changements significatifs des scores de différence test-retest [WISC-IV: Cutoff values for meaningful test-retest difference scores]. Pratiques Psychologiques, 23 (4), 345-358. https://doi.org/10.1016/j.prps.2016.07.003

  • LEMAY, S., BEDARD, M.A., ROULEAU, I. & TREMBLAY, P.L. (2004). Practice effect and test-retest reliability of attentional and executive tests in middle-aged to elderly subjects. The Clinical Neuropsychologist, 18, 284-302. https://doi.org/10.1080/13854040490501718

  • MACKINTOSH, N.J. (1998). IQ and human intelligence. New York, NY: Oxford University Press.

  • MOFFITT, T.E., CASPI, A., HARKNESS, A.R. & SILVA, P.A. (1993). The natural history of change in intellectual performance: Who changes? How much? Is it meaningful? Child Psychology & Psychiatry & Allied Disciplines, 34 (4), 455-506. https://doi.org/10.1111/j.1469-7610.1993.tb01031.x

  • MOSIER, C.I. (1943). On the reliability of a weighted composite. Psychometrika, 8, 161-168. https://doi.org/10.1007/BF02288700

  • OKADA, S., KAWASAKI, Y., SHINOMIYA, M., HOSHINO, H., INO, T., SAKAI, K., … & NIWA, S.I. (2021). Long-term stability of the WISC-IV in children with autism spectrum disorder. International Journal of School & Educational Psychology, 1-12. https://doi.org/10.1080/21683603.2021.1930307

  • ORSINI, A., PEZZUTI, L. & PICONE, L. (2012). WISC-IV. Contributo alla taratura italiana. Firenze: Giunti O.S. Organizzazioni Speciali.

  • REEVE, C.L. & BONACCIO, S. (2011). On the myth and the reality of the temporal validity degradation of general mental ability test scores. Intelligence, 39, 255-272. https://doi.org/10.1016/j.intell.2011.06.009

  • REUCHLIN, M. (1992). Introduction à la recherche en psychologie. Nathan.

  • REVELLE, W. (2010). An introduction to psychometric theory with applications in R. Retrieved from http://www.personality-project.org/r/book/

  • RYAN, J.J., GLASS, L.A. & BARTELS, J.M. (2010). Stability of the WISC-IV in a sample of elementary and middle school children. Applied Neuropsychology, 17, 68-72. http://dx.doi.org/10.1080/09084280903297933

  • SALTHOUSE, T. (2014). Frequent assessments may obscure cognitive decline. Psychological Assessment, 26, 1063-1069. http://dx.doi.org/10.1037/pas0000007

  • SATTLER, J.M. (2008). Assessment of children. Cognitive foundations (5th ed.). Jerome M. Sattler, Publisher, Inc.

  • SHERMAN, E.M.S., BROOKS, B.L., IVERSON, G.L., SLICK, D.J. & STRAUSS, E. (2011). Reliability and validity in neuropsychology. In M.R. Schoenberg & J.G. Scott (Eds.), The little black book of neuropsychology. Springer.

  • SHROUT, P.E. & FLEISS, J.L. (1979). Intraclass correlations: Uses in assessing rater reliability. Psychological Bulletin, 86 (2), 420-428.

  • SIMONTON, D.K. (2011). Exceptional talent and genius. In T. Chamorro-Premuzic, S. von Stumm & A. Furnham (Eds.), Wiley-Blackwell handbook of individual differences. Blackwell.

  • STRAUSS, E., SHERMAN, E.M.S. & SPREEN, O. (2006). A compendium of neuropsychological tests: Administration, norms, and commentary. New York, NY: Oxford University Press.

  • TRUSCOTT, S.D., NARRETT, C.M. & SMITH, S.E. (1994). WISC-R subtest reliability over time: Implications for practice and research. Psychological Reports, 74, 147-156. http://dx.doi.org/10.2466/pr0.1994.74.1.147

  • WATKINS, M.W. & CANIVEZ, G.L. (2004). Temporal stability of WISC-III subtest composite: Strengths and weaknesses. Psychological Assessment, 16, 133-138.

  • WATKINS, M.W. & SMITH, L.G. (2013). Long-term stability of the Wechsler Intelligence Scale for Children – Fourth Edition. Psychological Assessment, 25 (2), 477-483. https://doi.org/10.1037/a0031653

  • WATSON, D. (2004). Stability versus change, dependability versus error: Issues in the assessment of personality over time. Journal of Research in Personality, 38, 319-350. https://doi.org/10.1016/j.jrp.2004.03.001

  • WECHSLER, D. (1949). Manual for the Wechsler Intelligence Scale for Children. Psychological Corporation.

  • WECHSLER, D. (1974). Manual for the Wechsler Intelligence Scale for Children – Revised. Psychological Corporation.

  • WECHSLER, D. (1991). WISC-III manual. Psychological Corporation.

  • WECHSLER, D. (2003a). WISC-IV administration and scoring manual. Psychological Corporation.

  • WECHSLER, D. (2003b). WISC-IV technical and interpretive manual. Psychological Corporation.

  • WECHSLER, D. (2012). WISC-IV. Manuale di somministrazione e scoring. It. ad. A. Orsini & L. Pezzuti (Eds.). Firenze: Giunti O.S. Organizzazioni Speciali.

  • WRIGHT, A.J. (2011). Conducting psychological assessment: A guide for practitioners.

Declaration of conflicting interests. One of the authors (L. Pezzuti) receives royalties from sales of the WISC-IV (Wechsler, 2012, It. ad. Orsini & Pezzuti).

Cite the article:

Pezzuti Lina. Dawe James. Lauriola Marco. Threshold values for significant changes in test-retest difference scores for the Wechsler Intelligence Scale for Children – Fourth Edition. BPA Applied Psychology Bulletin. 2022;294(1):14-27. doi:10.26387/bpa.294.1.
