Scientific Publications Database

Article Title: Analytical validation of a standardised scoring protocol for Ki67 immunohistochemistry on breast cancer excision whole sections: an international multicentre collaboration
Authors: Leung, Samuel C. Y.; Nielsen, Torsten O.; Zabaglo, Lila A.; Arun, Indu; Badve, Sunil S.; Bane, Anita L.; Bartlett, John M. S.; Borgquist, Signe; Chang, Martin C.; Dodson, Andrew; Ehinger, Anna; Fineberg, Susan; Focke, Cornelia M.; Gao, Dongxia; Gown, Allen M.; Gutierrez, Carolina; Hugh, Judith C.; Kos, Zuzana; Laenkholm, Anne-Vibeke; Mastropasqua, Mauro G.; Moriya, Takuya; Nofech-Mozes, Sharon; Osborne, C. Kent; Penault-Llorca, Frederique M.; Piper, Tammy; Sakatani, Takashi; Salgado, Roberto; Starczynski, Jane; Sugie, Tomoharu; van der Vegt, Bert; Viale, Giuseppe; Hayes, Daniel F.; McShane, Lisa M.; Dowsett, Mitch
Journal: HISTOPATHOLOGY Volume 75 Issue 2
Date of Publication:2019
Abstract:
Aims The nuclear proliferation marker Ki67 assayed by immunohistochemistry has multiple potential uses in breast cancer, but an unacceptable level of interlaboratory variability has hampered its clinical utility. The International Ki67 in Breast Cancer Working Group has undertaken a systematic programme to determine whether Ki67 measurement can be analytically validated and standardised among laboratories. This study addresses whether acceptable scoring reproducibility can be achieved on excision whole sections. Methods and results Adjacent sections from 30 primary ER+ breast cancers were centrally stained for Ki67 and sections were circulated among 23 pathologists in 12 countries. All pathologists scored Ki67 by two methods: (i) global: four fields of 100 tumour cells each were selected to reflect observed heterogeneity in nuclear staining; (ii) hot-spot: the field with highest apparent Ki67 index was selected and up to 500 cells scored. The intraclass correlation coefficient (ICC) for the global method [confidence interval (CI) = 0.87; 95% CI = 0.799-0.93] marginally met the prespecified success criterion (lower 95% CI >= 0.8), while the ICC for the hot-spot method (0.83; 95% CI = 0.74-0.90) did not. Visually, interobserver concordance in location of selected hot-spots varies between cases. The median times for scoring were 9 and 6 min for global and hot-spot methods, respectively. Conclusions The global scoring method demonstrates adequate reproducibility to warrant next steps towards evaluation for technical and clinical validity in appropriate cohorts of cases. The time taken for scoring by either method is practical using counting software we are making publicly available. Establishment of external quality assessment schemes is likely to improve the reproducibility between laboratories further.