A novel strategy for interpreting the T-SPOT.TB test results read by an ELISPOT plate imager
Authors:
Tae Yeul Kim aff001; Ho Eun Chang aff001; Seong-Wook Lee aff002; Soo Hyun Seo aff001; Yun Ji Hong aff001; Jeong Su Park aff001; Kyoung Un Park aff001
Authors place of work:
Department of Laboratory Medicine, Seoul National University Bundang Hospital, Seongnam-si, Gyeonggi-do, Korea
aff001; Department of Molecular Biology, Dankook University, Yongin-si, Gyeonggi-do, Korea
aff002; Department of Laboratory Medicine, Seoul National University College of Medicine, Seoul, Korea
aff003
Published in the journal:
PLoS ONE 14(9)
Category:
Research Article
doi:
https://doi.org/10.1371/journal.pone.0222920
Summary
Background
The T-SPOT.TB can be read by an ELISPOT plate imager as an alternative to a labor-intensive and time-consuming manual reading, but its accuracy has not been sufficiently discussed to date.
Methods
1,423 test results obtained from manual reading using a microscope and an ELISPOT plate imager were compared. The agreement of qualitative test results was assessed using Cohen's kappa coefficient. The relationship of spot counts was studied using Bland-Altman analysis.
Results
The overall percent agreement of the qualitative test results was 95.43% with a kappa coefficient of 0.91. Positive test results with the maximum net spot count of 8 and borderline test results showed relatively high discordance. The agreement of spot counts in panel A, panel B, and nil control was good, and variability did not increase with higher spot counts. On the basis of study findings, a novel strategy for interpreting the test results by an ELISPOT plate imager was proposed.
Conclusions
To increase diagnostic accuracy, positive test results with the maximum net spot count of 8 and borderline test results should be manually confirmed. Our strategy could be a practical guide for laboratories to build their own strategies for interpreting the test results by an ELISPOT plate imager.
Keywords:
tuberculosis – T cells – Qualitative studies – Enzyme-linked immunoassays – Technicians – Decision making – Granulocytes – nontuberculous mycobacteria
Introduction
Tuberculosis (TB) remains a major public health threat worldwide, with 10.0 million new cases and 1.6 million deaths estimated to have occurred in 2017 [1]. One of the most daunting challenges for global TB control is to contain the reservoir of latent TB infection (LTBI), affecting an estimated 1.7 billion individuals, just shy of a quarter of the global population [2]. Without treatment, 5 to 10 percent of individuals with LTBI progress to active TB in their lifetime [3]. World Health Organization recommends that either a tuberculin skin test (TST) or interferon-gamma release assay (IGRA) can be used to test for LTBI [4]. IGRAs appear to be more specific than the TST for the diagnosis of LTBI, measuring interferon-gamma produced by T-cells in response to Mycobacterium tuberculosis-specific antigens absent from all Bacille Calmette-Guérin strains and most non-tuberculous mycobacteria [5].
The T-SPOT.TB (Oxford Immunotec, Abingdon, UK), the fourth IGRA to be approved by the U.S. Food and Drug Administration (FDA), uses an enzyme-linked immunospot (ELISPOT) method in which interferon-gamma released by antigen-specific T-cells is captured in the immediate vicinity of the T-cells by specific antibodies on the surface of a membrane and then visualized as spots. The number of spots can be counted by laboratory technicians using a hand held magnifying glass or a suitable microscope. However, such manual spot counting is a highly laborious and subjective process. Alternatively, automated ELISPOT plate imagers can be used to reduce the workload and subjectivity of manual spot counting. Each ELISPOT plate imager has a given set of reading parameters that define spots, which can be adjusted by the user’s preference [6]. Evaluating the accuracy of ELISPOT plate imagers is a very crucial task as inaccurate spot counting could potentially complicate clinical decision making for LTBI treatment. Few studies have addressed the variability of the T-SPOT.TB test results read by technicians and ELISPOT plate imagers [7, 8]. No studies, however, have investigated how to interpret the T-SPOT.TB test results by an ELISPOT plate imager.
In this study, the T-SPOT.TB test results read by laboratory technicians using a microscope and by an ELISPOT plate imager were compared. On the basis of study findings, we recommended a novel strategy for interpreting the T-SPOT.TB test results by an ELISPOT plate imager.
Materials and methods
T-SPOT.TB
The T-SPOT.TB was performed according to the manufacturer’s instructions. The assay requires four wells with different stimulations to be used for each patient sample: one as an antigen-free nil control, one as a positive control containing phytohemagglutinin, and two as panels A and B containing M. tuberculosis-specific antigens early secretory antigenic target-6 and culture filtrate protein-10, respectively. Spot counting was performed by one technician using a Nikon Eclipse E600 microscope (Nikon Corp., Tokyo, Japan) and then automatically on an ELISPOT plate imager (AID ELISPOT reader classic; Autoimmun Diagnostika GmbH, Strassberg, Germany) with software version 7.0, build 14733. A total of four technicians conducted manual spot counting during the study period. Reading parameters of the ELISPOT plate imager preset by the manufacturer were not adjusted throughout the study. Technicians did not quantify spot counts above 20 in a given sample well, annotating the well as ‘too many to count (TMTC)’. Positive controls giving less than 20 spot counts on the ELISPOT plate imager were sent for manual review and annotated as ‘TMTC’ when the spots were confluent. Test results were qualitative and reported as positive, negative, borderline, or invalid depending on the number of spots counted. The test result was interpreted as positive when the spot count in net panel A (panel A minus nil control) and/or net panel B (panel B minus nil control) was greater than or equal to 8. The test result was interpreted as negative when the spot count in both net panel A and net panel B was less than or equal to 4. A borderline test result was determined when the higher of the net panel A and net panel B spot count was 5, 6, or 7. The test result was classified as invalid when the spot count in the positive control was less than 20 and the spot count in net panel A and net panel B was less than or equal to 4. An invalid test result was also determined when the spot count in the nil control was greater than or equal to 10.
Statistical analyses
Statistical analyses were performed using MedCalc for Windows, version 18.11.3 (MedCalc Software, Ostend, Belgium). The agreement of qualitative test results by the technicians’ manual reading and ELISPOT plate imager was assessed using Cohen's kappa coefficient (0.81 < κ < 1.00, very good; 0.61 < κ < 0.80, good; 0.41 < κ < 0.60, moderate; 0.21 < κ < 0.40, fair; κ < 0.2, poor). Cohen’s kappa coefficient was also calculated for each technician. The agreement and relationship of spot counts read by the technicians and ELISPOT plate imager were studied using Bland-Altman analysis. For samples with spot counts less than 20 in panel A, panel B, and nil control, the spot count differences between the ELISPOT plate imager and technicians’ manual reading were plotted against their means. The mean difference and its limits of agreement (±1.96 SD) were computed. The Institutional Review Board of Seoul National University Bundang Hospital approved the study protocol and waived the need for informed consent because no patients were at risk (IRB no. B-1907-550-102). Patient records were anonymized and de-identified prior to analysis.
Results
Between January and November 2018, a total of 1,423 samples from 1,343 individuals were tested by the T-SPOT.TB at Seoul National University Bundang Hospital. Among 74 individuals retested at different time points, 68 individuals were tested twice, and 6 individuals were tested three times. The qualitative test results obtained from the technicians’ manual reading and ELISPOT plate imager were compared in Table 1. The qualitative test results read by each technician and ELISPOT plate imager were compared in S1 Table. The overall percent agreement of the qualitative test results was 95.43% (1358/1423) with a kappa coefficient of 0.91 (95% CI, 0.89 to 0.93), indicating very good agreement. Invalid test results showed the highest concordance (100%, 43/43), followed by negative and positive test results (99.00%, 893/902 and 97.82%, 359/367). Borderline test results showed the lowest concordance (56.76%, 63/111). Each technician’s kappa coefficient ranged from 0.87 to 0.93, indicating very good agreement (Table 2). The Bland-Altman analysis showed agreement of spot counts between the technicians’ manual reading and ELISPOT plate imager with a mean difference of 0.12 (limits of agreement: -1.91 to 2.15) for panel A, 0.13 (limits of agreement: -2.03 to 2.28) for panel B, and 0.01 (limits of agreement: -1.44 to 1.46) for nil control. Visual inspection of the Bland-Altman plot revealed that variability did not increase with higher spot counts (Fig 1).
On the basis of study findings, we developed a new strategy for interpreting the T-SPOT.TB results read by an ELISPOT plate imager (Fig 2). Samples with the maximum net spot count greater than or equal to 9 are reported as positive because only one discordant test result (an invalid test result by manual reading) occurred in those samples. Samples with the maximum net spot count of 8 require manual confirmation due to the great variability (2 negative and 5 borderline test results by manual reading). Samples with borderline test results also warrant manual confirmation by virtue of the lowest concordance (8 positive, 33 negative, and 7 invalid test results by manual reading). Samples with negative test results are reported as negative, as only 9 out of 902 samples showed discordant test results (5 borderline and 4 invalid test results by manual reading). Retesting was conducted on 6 out of these 9 samples, producing 5 negative test results and one invalid test result. Invalid test results are reported as such because no discordant test results were observed in this study.
Discussion
Accurate identification of individuals with LTBI is essential for clinical decision making. False positive results could result in unnecessary treatment, while false negative results would mean a missed opportunity to prevent active TB [9]. Therefore, variability of the T-SPOT.TB test results arising from different counting methods might have a huge economic impact. In the absence of a gold standard, agreement between different counting methods is a measure of diagnostic accuracy. This study aimed at investigating the diagnostic accuracy of an ELISPOT plate imager in comparison with manual reading using a microscope. The overall agreement of the qualitative test results read by the technicians and ELISPOT plate imager was very good with a kappa coefficient of 0.91, exceeding those reported in the previous studies [7, 8]. Additionally, the agreement of spot counts was found between the technicians and ELISPOT plate imager. The overall agreement of qualitative test results can be affected by the proportion of LTBI in the study population [7]. As positive and negative test results showed comparable concordance (97.82% and 99.00%, respectively), the overall agreement was unlikely to be affected by the prevalence of LTBI in the study population.
There was a very good agreement of the qualitative test results between each individual technician and the ELISPOT plate imager with kappa coefficients ranging from 0.87 to 0.93. Such consistent agreement might indicate that spot counting tendency of each individual technician does not significantly affect test results and clinical decision making. However, a limitation of this study was that we did not directly measure the inter-technician variability of the T-SPOT.TB test results. Although technicians are trained to distinguish true spots from artifacts, the quality and intensity of the training vary among laboratories, and even technicians working in the same laboratory often undergo different training, contributing to the systematic variation of spot counts between technicians. One Taiwanese laboratory determines test results by calculating mean values from spot counts manually read by two technicians to address inter-technician variability [10]. In one Spanish laboratory, test results were determined manually by one technician or, in case of doubt, by two technicians [11]. Franken et al. found that some technicians tended to count more or fewer spots than others, but they did so consistently in all four wells of a given sample [8]. This study showed that the systematic variation in spot counts read by different technicians was nullified by subtracting the spot count in the nil control [8]. Random variation is another source of inter-technician variability and also inherent to any ELISPOT plate imager [12]. Even one spot count difference between different technicians due to random variation could yield different test results, particularly in samples with spot counts around the cutoff.
IGRAs are prone to biological and technical variability, and hence, the borderline category has been recommended to define genuine conversions and reversions [13]. Our study showed that the borderline category had the lowest concordance (56.76%), which was consistent with the previous study [7]. Such low concordance is probably attributable to the narrow range (5 to 7 spots) of the FDA-recommended borderline category. Higher concordance is expected in laboratories adopting the borderline category with the wider range (4 to 8 spots) suggested by van Zyl-Smit et al. [14]. Due to high variability, we recommend that borderline test results by an ELISPOT plate imager should be manually confirmed. Most borderline results resolve into positive or negative test results on retesting [15, 16], but manual confirmation of borderline test results could be more cost-effective than retesting by drawing another blood sample. Manual confirmation is expected to determine 8 positive and 33 negative test results from 111 borderline test results read by the ELISPOT plate imager. The workload of manual confirmation varies depending on the proportion of borderline test results read by an ELISPOT plate imager. Although borderline test results read by an ELISPOT plate imager are manually confirmed as positive or negative, clinical decision making should consider the limitations of T-SPOT.TB and individual’s risks and benefits of LTBI treatment [17]. Also, laboratories performing the T-SPOT.TB should avoid reporting only qualitative test results and provide spot counts to referring clinicians [14].
ELISPOT plate imagers can produce false positive or false negative results, leading to increased healthcare costs despite the presence of borderline category. The possibility of false negative results is considered slim because in this study, most discordant test results resolved to negative test results on retesting. Thus, negative test results by an ELISPOT plate imager can be reported as such without significant risks. Oxford Diagnostic Laboratories manually confirms all positive test results by an ELISPOT plate imager to avoid false positive results [16]. In this study, two positive test results read by the ELISPOT plate imager were potentially false positive because they were classified by manual reading as negative. In those two test results, the higher of net panel A and net panel B spot count was 8 at the positive cutoff. According to Bland-Altman analysis, the agreement of spot counts was good, and variability did not increase with higher spot counts. This finding is consistent with that of Janetzki et al. [18], who demonstrated that the spot count difference between manual reading and an ELISPOT plate imager was the smallest in the range of 0 to 20 spots. Thus, manual confirmation should be confined to positive test results with the maximum net spot count of 8. Before introducing an ELISPOT plate imager, individual laboratories need to understand that different interpretation algorithms generate different test results of T-SPOT.TB. Our interpretation algorithm offers a practical guide to them.
ELISPOT plate imagers are programmed to distinguish spots from artifacts and other unwanted signals by using reading parameters such as spot size, intensity of staining, spot gradient, and spot shape [6]. Reading parameters and spot-recognition algorithms differ among ELISPOT plate imagers, thereby leading to disparity in T-SPOT.TB test results. Precision of ELISPOT plate imagers for spot counting is good, particularly in a sample well with low spot counts [14]; however, the absence of a gold standard for LTBI testing hampers the assessment of the accuracy of spot counting. ELISPOT plate imagers are fine-tuned to exclude artificial signals created by dead or apoptotic cells and other contaminating cell populations including granulocytes, platelet, or red blood cells [6]. Even with standardized reading parameters, however, perfectly discriminating spots from artifacts is inherently impossible due to the fact that sizes and densities of spots vary depending on quantities and kinetics of cytokines generated by the individual T-cells [19]. Moreover, disruption of spot formation by high granulocyte contamination impedes accurate spot counting [20]. ELISPOT plate imagers can also produce false positive or false negative results in the presence of high background reactivity. Lastly, the accuracy of spot counting is subject to technical factors affecting image acquisition of ELISPOT plate wells such as the quality of digital files, camera settings, and lighting system [6, 21]. Individual laboratories should unequivocally understand sources of undermining the accuracy of spot counting and establish their own standard operating procedure to address them.
Conclusions
Our study reveals that concordance between manual reading and automated evaluation is high. Our recommended strategy confines manual confirmation to positive test results with the maximum net spot count of 8 and borderline test results due to relatively high discordance. Our recommendations might not be directly applicable to laboratories having an ELISPOT plate imager with different algorithms and reading parameters but could be a practical guide to establishing their own strategies for interpreting the test results read by an ELISPOT plate imager.
Supporting information
S1 Table [docx]
Comparison of the qualitative test results read by each technician and ELISPOT plate imager.
Zdroje
1. Global tuberculosis report 2018. Geneva: World Health Organization; 2018.
2. Houben RM, Dodd PJ. The Global Burden of Latent Tuberculosis Infection: A Re-estimation Using Mathematical Modelling. PLoS Med. 2016;13(10):e1002152. doi: 10.1371/journal.pmed.1002152 27780211
3. Horsburgh CR Jr., Priorities for the treatment of latent tuberculosis infection in the United States. N Engl J Med. 2004;350(20):2060–2067. doi: 10.1056/NEJMsa031667 15141044
4. Latent tuberculosis infection: updated and consolidated guidelines for programmatic management. Geneva: World Health Organization; 2018.
5. Doan TN, Eisen DP, Rose MT, Slack A, Stearnes G, McBryde ES. Interferon-gamma release assay for the diagnosis of latent tuberculosis infection: A latent-class analysis. PloS one. 2017;12(11):e0188631. doi: 10.1371/journal.pone.0188631 29182688
6. Janetzki S, Price L, Schroeder H, Britten CM, Welters MJ, Hoos A. Guidelines for the automated evaluation of Elispot assays. Nat Protoc. 2015;10(7):1098–1115. doi: 10.1038/nprot.2015.068 26110715
7. Agarwal S, Nguyen DT, Lew JD, Teeter LD, Yamal JM, Restrepo BI, et al. Comparing TSPOT assay results between an Elispot reader and manual counts. Tuberculosis (Edinb). 2016;101S:S92–98.
8. Franken WP, Thijsen S, Wolterbeek R, Bouwman JJ, el Bannoudi H, Kik SV, et al. Variation in T-SPOT.TB spot interpretation between independent observers from different laboratories. Clin Vaccine Immunol. 2009;16(10):1439–1442. doi: 10.1128/CVI.00456-08 19710293
9. Al-Orainey IO. Diagnosis of latent tuberculosis: Can we do better? Ann Thorac Med. 2009;4(1):5–9. doi: 10.4103/1817-1737.44778 19561914
10. Wang JY, Chou CH, Lee LN, Hsu HL, Jan IS, Hsueh PR, et al. Diagnosis of tuberculosis by an enzyme-linked immunospot assay for interferon-gamma. Emerg Infect Dis. 2007;13(4):553–558. doi: 10.3201/eid1304.051195 17553269
11. Ramos JM, Robledano C, Masia M, Belda S, Padilla S, Rodriguez JC, et al. Contribution of interferon gamma release assays testing to the diagnosis of latent tuberculosis infection in HIV-infected patients: a comparison of QuantiFERON-TB Gold In Tube, T-SPOT.TB and tuberculin skin test. BMC Infect Dis. 2012;12:169. doi: 10.1186/1471-2334-12-169 22849726
12. Hawkins N, Self S, Wakefield J. The automated counting of spots for the ELISpot assay. J Immunol Methods. 2006;316(1–2):52–58. doi: 10.1016/j.jim.2006.08.005 17010368
13. Ringshausen FC, Schablon A, Nienhaus A. Interferon-gamma release assays for the tuberculosis serial testing of health care workers: a systematic review. J Occup Med Toxicol. 2012;7(1):6. doi: 10.1186/1745-6673-7-6 22537915
14. van Zyl-Smit RN, Pai M, Peprah K, Meldau R, Kieck J, Juritz J, et al. Within-subject variability and boosting of T-cell interferon-gamma responses after tuberculin skin testing. Am J Respir Crit Care Med. 2009;180(1):49–58. doi: 10.1164/rccm.200811-1704OC 19342414
15. King TC, Upfal M, Gottlieb A, Adamo P, Bernacki E, Kadlecek CP, et al. T-SPOT.TB Interferon-gamma Release Assay Performance in Healthcare Worker Screening at Nineteen U.S. Hospitals. Am J Respir Crit Care Med. 2015;192(3):367–373. doi: 10.1164/rccm.201501-0199OC 26017193
16. Rego K, Pereira K, MacDougall J, Cruikshank W. Utility of the T-SPOT((R)).TB test's borderline category to increase test resolution for results around the cut-off point. Tuberculosis (Edinb). 2018;108:178–185.
17. Tang P, Johnston J. Treatment of Latent Tuberculosis Infection. Curr Treat Options Infect Dis. 2017;9(4):371–379. doi: 10.1007/s40506-017-0135-7 29238270
18. Janetzki S, Schaed S, Blachere NE, Ben-Porat L, Houghton AN, Panageas KS. Evaluation of Elispot assays: influence of method and operator on variability of results. J Immunol Methods. 2004;291(1–2):175–183. doi: 10.1016/j.jim.2004.06.008 15345315
19. Sundararaman S, Karulin AY, Ansari T, BenHamouda N, Gottwein J, Laxmanan S, et al. High Reproducibility of ELISPOT Counts from Nine Different Laboratories. Cells. 2015;4(1):21–39. doi: 10.3390/cells4010021 25585297
20. De Rose R, Taylor EL, Law MG, van der Meide PH, Kent SJ. Granulocyte contamination dramatically inhibits spot formation in AIDS virus-specific ELISpot assays: analysis and strategies to ameliorate. J Immunol Methods. 2005;297(1–2):177–186. doi: 10.1016/j.jim.2004.12.009 15777941
21. Ashoor I, Najafian N, Korin Y, Reed EF, Mohanakumar T, Ikle D, et al. Standardization and cross validation of alloreactive IFNgamma ELISPOT assays within the clinical trials in organ transplantation consortium. Am J Transplant. 2013;13(7):1871–1879. doi: 10.1111/ajt.12286 23710568
Článok vyšiel v časopise
PLOS One
2019 Číslo 9
- Metamizol jako analgetikum první volby: kdy, pro koho, jak a proč?
- Nejasný stín na plicích – kazuistika
- Masturbační chování žen v ČR − dotazníková studie
- Těžké menstruační krvácení může značit poruchu krevní srážlivosti. Jaký management vyšetření a léčby je v takovém případě vhodný?
- Fixní kombinace paracetamol/kodein nabízí synergické analgetické účinky
Najčítanejšie v tomto čísle
- Graviola (Annona muricata) attenuates behavioural alterations and testicular oxidative stress induced by streptozotocin in diabetic rats
- CH(II), a cerebroprotein hydrolysate, exhibits potential neuro-protective effect on Alzheimer’s disease
- Comparison between Aptima Assays (Hologic) and the Allplex STI Essential Assay (Seegene) for the diagnosis of Sexually transmitted infections
- Assessment of glucose-6-phosphate dehydrogenase activity using CareStart G6PD rapid diagnostic test and associated genetic variants in Plasmodium vivax malaria endemic setting in Mauritania