Comparison of the abilities of staging and risk stratification systems to predict the long-term structural recurrence in patients with differentiated thyroid carcinoma after total thyroidectomy and radioactive iodine remnant ablation
Introduction
In patients with differentiated thyroid carcinoma (DTC), various staging and risk of recurrence stratification systems have been used to predict disease-specific mortality or estimate the risk of recurrence because DTC has no unique clinicopathological risk factors, unlike minimally invasive follicular thyroid carcinoma, in which the number of vascular invasion foci indicates the risk of recurrence (1-13).
The American Joint Committee on Cancer/Union for International Cancer Control (AJCC/UICC) Tumor Node Metastasis (TNM) staging system is commonly used to predict disease-specific mortality. However, it is unreliable for predicting the risk of structural recurrence (14-17), which is the primary concern in DTC. Other risk stratification systems have been developed based on international guidelines, such as the 2015 American Thyroid Association (ATA) initial risk estimate, which has been used to tailor initial therapy and guide early postoperative management strategies based on the expected risk of structural recurrence in individual patients (11,18,19). Nonetheless, risk stratification systems used during initial diagnosis and treatment cannot reliably predict long-term structural recurrence because they only estimate risk based on data available at initial treatment and fail to account for data obtained during follow-up such as initial treatment response (14-17,19). In contrast, the dynamic risk stratification (DRS) system reclassifies the initial risk of structural recurrence based on response to initial therapy using data obtained during follow-up. The DRS system better predicts long-term outcomes and helps guide treatment intensity and long-term follow-up strategies (14-17).
Recently, the AJCC/UICC TNM staging system was revised; the age cut-off was increased, lymph node (LN) metastases were removed from the stage III definition, and microscopic extrathyroidal extension was removed from the staging criteria. The 2015 ATA guidelines were also modified, with changes to the extent of LN involvement, mutational status, and specific histology of follicular thyroid carcinoma. However, few studies have compared these revised staging and risk stratification systems in terms of their ability to predict long-term outcomes.
The present study aimed to compare the recently revised staging and risk stratification systems in terms of their ability to predict long-term structural recurrence in patients with DTC who underwent total thyroidectomy and subsequent radioactive iodine (RAI) remnant ablation. We present the following article in accordance with the STROBE reporting checklist (available at https://dx.doi.org/10.21037/gs-21-8).
Methods
Study population
We retrospectively reviewed the data of 510 patients with DTC who underwent total thyroidectomy and subsequent RAI remnant ablation at Wonju Severance Christian Hospital (Wonju, South Korea) between 2000 and 2008. We excluded patients with distant metastases (n=2) detected at the initial presentation or within 12 months of the initial surgery. All enrolled patients had sufficient information available to assess the response to initial therapy, as well as at least 3 years of follow-up after initial treatment. We evaluated the clinicopathological characteristics, including the 8th edition of TNM stage (TNM-8), the 2015 ATA initial risk estimate, and the DRS. We also assessed other risk factors for structural recurrence. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The study was approved by the institutional/regional/national ethics/committee/ethics board of the Yonsei University Wonju College of Medicine (CR320068) and informed consent was taken from all the patients.
ATA initial risk stratification and DRS
Patients were classified as low-, intermediate-, or high-risk based on the 2015 ATA initial risk stratification system, and they were divided into four categories (excellent, indeterminate, biochemically incomplete, and structurally incomplete) based on the response to initial therapy using the DRS system. The detailed definitions of each category were based on those proposed and revised by Tuttle et al. (17).
The level of non-stimulated thyroglobulin (Tg) at a similar thyroid-stimulating hormone (TSH) level was classified as stable, declining, or increasing based on a >20% change in two consecutive measurements. Positivity for anti-Tg antibody (Ab) was defined at a serum concentration >115 IU/mL. The trend in anti-Tg Ab change was also defined as stable, declining, or increasing based on three consecutive values over the follow-up period.
Preoperative diagnosis and staging work-up
The preoperative diagnosis was determined using ultrasonography-guided fine-needle aspiration cytology (FNAC). When patients were diagnosed with papillary thyroid carcinoma (PTC), a preoperative staging work-up was performed using both high-resolution neck ultrasonography and computed tomography (CT) to evaluate tumor characteristics and cervical LN status. Patients confirmed as having follicular thyroid carcinoma after initial diagnostic thyroid lobectomy underwent a staging work-up postoperatively using the same imaging methods.
Surgical strategy
During the study period, total thyroidectomy was recommended to nearly all patients with a PTC >1 cm, based on the 2009 ATA guidelines. Patients who refused total thyroidectomy were provided with sufficient information to understand the potential risks and benefits of thyroid lobectomy versus total thyroidectomy, and they were recommended regular follow-up examinations.
At our institution, central compartment neck dissection (CCND) is routinely performed to treat patients with PTC. At a minimum, unilateral prophylactic CCND is performed, even when no suspicious LNs are present on preoperative imaging or during surgery. Bilateral CCND is only performed in patients who have suspicious LNs or LN enlargement in the contralateral central compartment, or in those with bilateral cancer. Compartment-based nodal dissection was performed. Prophylactic lateral neck dissection is not preferred to treat patients with PTC. Therapeutic lateral neck dissection is only performed in patients with biopsy-confirmed lateral LN metastasis.
RAI remnant ablation protocol
Subsequent RAI remnant ablation was performed in indicated patients 4–6 weeks after the initial operation. At our institution, high-dose (>100 mCi) remnant ablation is preferred.
At the time of remnant ablation, following thyroid hormone withdrawal or recombinant human TSH (rh-TSH) administration, the serum stimulated Tg level (reference range: 3.5–77.0 ng/mL) and anti-Tg Ab level (reference: <115 IU/mL) were measured when the TSH level (reference range: 0.35–5.50 mIU/L) was >30 mIU/L. A post-ablation whole-body scan (WBS) was performed 2 days after the administration of iodine 131.
Postoperative follow-up protocol
All patients underwent follow-up examinations at the outpatient clinic. In those who underwent total thyroidectomy with RAI remnant ablation, diagnostic WBS was performed following thyroid hormone withdrawal or rh-TSH administration 6–12 months after remnant ablation. At the same time, serum stimulated Tg was measured. Serum Tg/anti-Tg Ab measurements and neck ultrasonography were performed on all patients during the follow-up period. When the stimulated Tg was ≥1 ng/mL and neck ultrasonography showed no evidence of disease 6–12 months after remnant ablation, 18F-deoxyglucose positron emission tomography (FDG-PET) or chest CT imaging was considered to localize persistent or remnant disease. Any patients suspected to have locoregional recurrence underwent ultrasonography-guided FNAC. Distant metastasis was diagnosed using WBS, chest CT, or 18F-FDG-PET/CT, and confirmed using serial imaging or biopsy. Structural recurrence was defined as the following: appearance of cytopathologically, or histopathologically-proven malignant tissue after a period of no evidence of disease for a minimum of one year after the initial treatment. Unlike extracervical structural lesions (distant metastasis), even highly suspicious locoregional structural lesions on cross-sectional or functional imaging studies were not defined as a structural recurrence until they were confirmed using FNAC or surgical biopsy. Lesions that had not been confirmed using cytopathology or histopathology, but which looked suspicious under radiology, were re-evaluated using serial ultrasonography with repeat FNAC or surgical biopsy.
Statistical analyses
Continuous variables are presented as means ± standard deviations or as medians and ranges. Categorical variables are presented as percentages and absolute numbers. Logistic regression was conducted to compare categorical variables between patients with and without structural recurrence. Univariate and multivariate Cox proportional hazard modeling analyses were used to identify the risk factors for structural recurrence. Hazard ratios (HRs) with 95% confidence intervals (CIs) were calculated. Only potentially significant risk factors in univariate models were entered into the multivariate models. The proportion of variance explained (PVE) was used to evaluate the ability of the 8th TNM staging system, the ATA initial risk stratification system, and the DRS subgroups to predict structural recurrence. Survival outcomes were analyzed using the Kaplan-Meier method and log-rank tests. A two-sided P value <0.05 was considered statistically significant. Statistical analyses were performed using R version 3.1.1 software (R Project; Vienna, Austria).
Results
Baseline demographic characteristics
The clinicopathological characteristics of the patients are summarized in Table 1. The median follow-up duration was 108 months (range: 49–223 months). Patients were allocated to a low-risk group (189 patients; 37.1%), an intermediate-risk group (311 patients; 60.9%), or a high-risk group (10 patients; 2.0%) based on the 2015 ATA initial risk stratification system. DRS showed that 435 patients (85.2%) achieved excellent response, 35 (6.9%) indeterminate or biochemically incomplete response, while only five showed structurally incomplete response.
Full table
Structural recurrence occurred in 39 patients (7.6%). Two patients (0.4%) had distant metastasis (lung and tonsil, respectively). The sites of locoregional structural recurrence were previously undissected lateral LNs in 24 patients (4.7%), previously dissected lateral LNs in eight patients (1.6%), previously dissected central LNs in eight patients (1.6%), and previously undissected central LNs in three patients (0.6%), with overlapping recurrent sites in some patients. None of the patients died from DTC during the follow-up period. All patients who experienced locoregional structural recurrence underwent re-operation. Two patients with distant metastasis received high-dose RAI treatment.
Risk factor analysis for structural recurrence
As shown in Table 2, minimal extrathyroidal extension [HR: 5.68 (95% CI: 1.71–26.68), P=0.011], LN metastasis [HR: 3.853 (95% CI: 1.25–11.35), P=0.023], and biochemically incomplete response in the DRS [HR: 13.76 (95% CI: 5.35–36.78), P<0.001] were independent risk factors for structural recurrence (Table 2). Structurally incomplete response was not an independent risk factor for structural recurrence (Table 2).
Full table
Comparison of the abilities of the TNM staging system, the ATA initial risk estimate, and the DRS based on response to initial therapy to predict long-term structural recurrence
Twenty-four patients (5.2%) in stage I and 15 (33.3%) in stage II of the TNM-8 staging system had structural recurrence (Table 3). The risk of structural recurrence was significantly greater in stage II [HR: 8.74 (95% CI: 4.54–16.82), P<0.001] than in stage I (Table 3).
Full table
According to the 2015 ATA initial risk stratification, one patient (0.5%) in the low-risk group, 37 (11.9%) in the intermediate-risk group, and one (10.0%) in the high-risk group had structural recurrence (Table 3). The risk of structural recurrence in the intermediate-risk [HR: 19.30 (95% CI: 2.64–140.96), P=0.004) and high-risk groups [HR: 24.22 (95% CI: 1.51–388.10), P=0.024] were significantly higher than that in the low-risk group (Table 3).
Based on initial therapy response in the DRS, structural recurrence occurred in 3.4% of patients (n=15/435) with an excellent response, 14.3% of those (n=5/35) with indeterminate response, 51.4% of those (n=18/35) with incomplete biochemical response, and 20.0% of those (n=1/5) with incomplete structural response (Table 3). The risk of structural recurrence was significantly greater in the indeterminate response group [HR: 4.15 (95% CI: 1.51–11.44), P=0.006], biochemical incomplete response group [HR: 18.78 (95% CI: 9.36–37.67), P<0.001), and structural incomplete response group [HR 7.83 (95% CI: 1.03–59.63), P=0.047] than in the excellent response group (Table 3).
The disease-free survival (DFS) curves of the patients differed significantly among the TNM-8, 2015 ATA initial risk stratification system, and DRS system based on initial therapy response (Figure 1). The PVE of the DRS system (20.7%) was greater than those of the TNM staging system or ATA initial risk estimates (Table 3).
Discussion
In the present study, we aimed to compare the ability of commonly used staging and risk of recurrence stratification systems, all of which have recently been revised, to predict long-term structural recurrence among patients with DTC who underwent a total thyroidectomy and subsequent RAI remnant ablation. The DFS curves of each category within the TNM-8, 2015 ATA initial risk estimates, and DRS system based on initial therapy response showed significant differences. Biochemically incomplete response in the DRS system independently predicted structural recurrence in our cohort, whereas the ATA risk categories did not. Moreover, the ability of the DRS to predict long-term structural recurrence, as assessed by PVE, was much higher than those of the TNM-8 and 2015 ATA initial risk estimates.
Recent studies have revealed that the DRS system can be used to modify initial risk estimates of structural recurrence and may provide valuable information to predict long-term structural recurrence (15-20). Several studies have validated the DRS system in patients treated using total thyroidectomy and RAI remnant ablation, reporting that the PVE of the DRS system ranges from 44.6% to 62.1%, which is considerably greater than the 25% to 34% PVE of the ATA risk stratification system (17-21). In the present study, the PVE of the DRS system was only 20.7%, but this was appreciably higher than those of the ATA risk estimates (4.6%) and TNM-8 (6.1%) (17-21).
Furthermore, the present study demonstrated a high proportion of excellent response than previous studies (85.2% vs. 64.2–68.8%), as well as a lower proportion of indeterminate (6.9% vs. 19.8–24.6%), biochemically incomplete (6.9% vs. 9.1–9.9%), and structurally incomplete responses (1.0% vs. 1.7–2.1%) (15,22). We speculate that these results were influenced by the low rate of long-term structural recurrence in our patients, which may in turn have been due to our institution’s preference for prophylactic CCND and high-dose RAI remnant ablation.
Interestingly, structurally incomplete response within the DRS was not an independent risk factor for long-term structural recurrence in the present study, whereas biochemically incomplete response was. In addition, the prevalence of long-term recurrence after re-operation in patients with structurally incomplete response was only 20.0%, which was lower than that of patients with biochemically incomplete response (51.4%). This suggests that appropriate surgical treatment, combined with additional RAI therapy and rigorous TSH suppression, improves outcomes after re-operation, even among patients with structurally incomplete response.
The present study failed to provide compelling evidence, perhaps because the rate of structural recurrence was so low, and because retrospective studies contain inherent bias. Nonetheless, among the patients who underwent total thyroidectomy and RAI remnant ablation, only two patients were excluded, and all enrolled participants had sufficient information to assess initial therapy response, as well as at least 3 years of follow-up after initial surgery. Furthermore, the study included all patients who were confirmed to have histologic variants of PTC as well as classical PTC, suggesting that our findings could be applied generally to all patients with DTC. On the other hand, our institution’s preference for prophylactic CCND and high-dose RAI remnant ablation may have caused bias, because these procedures are not recommended by many international guidelines.
Based on the results of the present study, the DRS system may be more useful than the TNM-8 and ATA initial risk stratification systems for predicting long-term structural recurrence, as well as for tailoring long-term management and follow-up strategies in patients with DTC undergoing total thyroidectomy followed by RAI remnant ablation.
Acknowledgments
This work was supported by the Department of Surgery, Wonju Severance Christian Hospital, South Korea.
Funding: None.
Footnote
Reporting Checklist: The authors have completed the STROBE reporting checklist. Available at https://dx.doi.org/10.21037/gs-21-8
Data Sharing Statement: Available at https://dx.doi.org/10.21037/gs-21-8
Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://dx.doi.org/10.21037/gs-21-8). The authors have no conflicts of interest to declare.
Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The study was approved by the institutional/regional/national ethics/committee/ethics board of the Yonsei University Wonju College of Medicine (CR320068) and informed consent was taken from all the patients.
Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.
References
- Beenken S, Roye D, Weiss H, et al. Extent of surgery for intermediate-risk well-differentiated thyroid cancer. Am J Surg 2000;179:51-6. [Crossref] [PubMed]
- Cady B, Rossi R. An expanded view of risk-group definition in differentiated thyroid carcinoma. Surgery 1988;104:947-53. [PubMed]
- Edge SB, Compton CC. The American Joint Committee on Cancer: The 7th edition of the AJCC cancer staging manual and the future of TNM. Ann Surg Oncol 2010;17:1471-4.
- Hay ID, Bergstralh EJ, Goellner JR, et al. Predicting outcome in papillary thyroid carcinoma: Development of a reliable prognostic scoring system in a cohort of 1779 patients surgically treated at one institution during 1940 through 1989. Surgery 1993;114:1050-7; discussion 7-8. [PubMed]
- Hay ID, Grant CS, Taylor WF, et al. Ipsilateral lobectomy versus bilateral lobar resection in papillary thyroid carcinoma: A retrospective analysis of surgical outcome using a novel prognostic scoring system. Surgery 1987;102:1088-95. [PubMed]
- Lerch H, Schober O, Kuwert T, et al. Survival of differentiated thyroid carcinoma studied in 500 patients. J Clin Oncol 1997;15:2067-75. [Crossref] [PubMed]
- Mazzaferri EL, Jhiang SM. Long-term impact of initial surgical and medical therapy on papillary and follicular thyroid cancer. Am J Med 1994;97:418-28. [Crossref] [PubMed]
- Noguchi S, Murakami N, Kawamoto H. Classification of papillary cancer of the thyroid based on prognosis. World J Surg 1994;18:552-7; discussion 8. [Crossref] [PubMed]
- Shaha AR, Loree TR, Shah JP. Intermediate-risk group for differentiated carcinoma of thyroid. Surgery 1994;116:1036-40; discussion 40-1. [PubMed]
- Sherman SI, Brierley JD, Sperling M, et al. Prospective multicenter study of thyroid carcinoma treatment: initial analysis of staging and outcome. National thyroid cancer treatment cooperative study registry group. Cancer 1998;83:1012-21. [Crossref] [PubMed]
- Haugen BR, Alexander EK, Bible KC, et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: The American thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid 2016;26:1-133. [Crossref] [PubMed]
- Pacini F, Schlumberger M, Dralle H, et al. European consensus for the management of patients with differentiated thyroid carcinoma of the follicular epithelium. Eur J Endocrinol 2006;154:787-803. [Crossref] [PubMed]
- Pitoia F, Ward L, Wohllk N, et al. Recommendations of the Latin American Thyroid Society on diagnosis and management of differentiated thyroid cancer. Arq Bras Endocrinol Metabol 2009;53:884-7. [Crossref] [PubMed]
- Momesso DP, Tuttle RM. Update on differentiated thyroid cancer staging. Endocrinol Metab Clin North Am 2014;43:401-21. [Crossref] [PubMed]
- Momesso DP, Vaisman F, Yang SP, et al. Dynamic risk stratification in patients with differentiated thyroid cancer treated without radioactive iodine. J Clin Endocrinol Metab 2016;101:2692-700. [Crossref] [PubMed]
- Tarasova VD, Tuttle RM. A risk-adapted approach to follow-up in differentiated thyroid cancer. Rambam Maimonides Med J 2016;7:e0004 [Crossref] [PubMed]
- Tuttle RM, Tala H, Shah J, et al. Estimating risk of recurrence in differentiated thyroid cancer after total thyroidectomy and radioactive iodine remnant ablation: Using response to therapy variables to modify the initial risk estimates predicted by the new American Thyroid Association staging system. Thyroid 2010;20:1341-9. [Crossref] [PubMed]
- Pitoia F, Bueno F, Urciuoli C, et al. Outcomes of patients with differentiated thyroid cancer risk-stratified according to the American thyroid association and Latin American thyroid society risk of recurrence classification systems. Thyroid 2013;23:1401-7. [Crossref] [PubMed]
- Castagna MG, Maino F, Cipri C, et al. Delayed risk stratification, to include the response to initial treatment (surgery and radioiodine ablation), has better outcome predictivity in differentiated thyroid cancer patients. Eur J Endocrinol 2011;165:441-6. [Crossref] [PubMed]
- Vaisman F, Momesso D, Bulzico DA, et al. Spontaneous remission in thyroid cancer patients after biochemical incomplete response to initial therapy. Clin Endocrinol (Oxf) 2012;77:132-8. [Crossref] [PubMed]
- Vaisman F, Tala H, Grewal R, et al. In differentiated thyroid cancer, an incomplete structural response to therapy is associated with significantly worse clinical outcomes than only an incomplete thyroglobulin response. Thyroid 2011;21:1317-22. [Crossref] [PubMed]
- Park S, Kim WG, Song E, et al. Dynamic Risk Stratification for Predicting Recurrence in Patients with Differentiated Thyroid Cancer Treated Without Radioactive Iodine Remnant Ablation Therapy. Thyroid 2017;27:524-30. [Crossref] [PubMed]