FormalPara Key Summary Points

Why carry out this study?

Regorafenib is a multikinase inhibitor used as late-line therapy for metastatic colorectal cancer. Despite being associated with various adverse events, including hand–foot skin reactions, the objective response rate is 1%, and a potential biomarker is required.

What did the study ask?/What was the hypothesis of the study?

We investigated the utility of FDG-PET as an imaging metabolic biomarker for the efficacy of regorafenib to detect responders/nonresponders.

What were the study outcomes/conclusions?

The observed metabolic response rate using centrally reviewed FDG-PET was 5.9%, below our expected early detection rate of 10%. Therefore, the JACCRO CC-12 study failed to demonstrate the usefulness of FDG-PET as an imaging biomarker of regorafenib.

What has been learned from the study?

The observed metabolic response (5.9%) was higher than the objective response rate (0%) using RECIST, and patients who were diagnosed with partial response or stable disease using FDG-PET showed better outcomes than those diagnosed with progressive disease. These data suggest the potential for early metabolic diagnosis using FDG-PET. To evaluate the metabolic response, whole-body total lesion glycolysis might be more useful than the SUVmax of target lesions.

Introduction

Regorafenib, an oral multikinase inhibitor that inhibits the activity of angiogenic (VEGFR1, VEGFR2, VEGFR3, TIE2), stromal (PDGFR, FGFR), and oncogenic (KIT, RET, RAF-1, BRAF, BRAFV600E) receptor tyrosine kinases, has been approved for the treatment of metastatic colorectal cancer (mCRC), gastrointestinal stromal tumor, and hepatocellular carcinoma [1, 2]. Two randomized phase III trials, CORRECT and CONCUR, demonstrated a clinically modest prolongation of overall survival (OS) ranging from 1.4 to 2.5 months in patients treated with regorafenib combined with best supportive care compared with placebo combined with best supportive care [3, 4]. In the CORRECT trial, even though the objective response rate (ORR) was only 1%, treatment-related adverse events, including hand–foot skin reactions, fatigue, diarrhea, hypertension, and rash, occurred in 93% of the patients assigned to receive regorafenib [3]. Therefore, the identification of potential biomarkers might help select those patients most likely to benefit from regorafenib and avoid unnecessary adverse effects in patients who are not likely to respond.

RadioCORRECT, a post hoc analysis of a cohort of patients within the CORRECT trial, investigated the association between tumor response and early radiological parameters [5]. The results of multivariate analyses and baseline and week 8 lung cavitation were found to be significant predictors of progression-free survival (PFS), and baseline lung cavitation, RECIST (Response Evaluation Criteria in Solid Tumours) 1.1 responses, and change in the sum of target lesion diameters (≥ 9% vs. < 9%) were found to be predictors of OS. Assessment of liver metastases did not predict clinical outcomes. That analysis suggested that response to multikinase inhibitors depended not only on changes in the sum of target lesion diameters, but also on non-morphological changes.

Here, we focused on [18F]fluorodeoxyglucose (FDG) positron emission tomography (PET) as an imaging biomarker for the efficacy of regorafenib. The standard definition of tumor response to cytotoxic agents is based on the measurement of changes in tumor size as determined using morphological imaging methods such as contrast-enhanced (CE) computed tomography (CT). Despite the widely accepted practice of using these criteria, the introduction of molecularly targeted agents such as angiogenesis inhibitors, including regorafenib, requires a new modality for monitoring antitumor effects, because they have different biological mechanisms compared with classic cytotoxic chemotherapy. These antiangiogenic agents inhibit the growth of new blood vessels in cancer tissue, which does not immediately result in the dissolution of tumor masses. Considering these mechanisms, FDG-PET has recently been widely introduced not only for preoperative examinations and diagnosis, but also for the prediction and evaluation of treatment response in various solid tumors [7, 8]. Regarding colorectal cancer, the usefulness of FDG-PET as a predictive/prognostic biomarker after locoregional therapies, including tumor ablation, has already been reported [9, 10].

Therefore, we conducted a prospective multicenter phase II trial to examine whether FDG-PET can aid the early prediction of the efficacy of regorafenib.

Methods

Study design

The JACCRO CC-12, a single-arm, prospective, open-label phase II study, was conducted at five centers in Japan in accordance with the Ethical Guidelines for Medical and Health Research Involving Human Subjects and the Declaration of Helsinki. The institutional review board and/or ethics committee at each site approved the protocol, and all patients provided written informed consent to participate.

Patients

The eligibility criteria were as follows: histologically confirmed colorectal cancer; mCRC or inoperable locally advanced colorectal cancer (regardless of RAS mutation); measurable lesions according to RECIST (ver. 1.1); assessable target lesion by FDG-PET/CT; refractory to fluorinated pyrimidine, oxaliplatin, irinotecan, bevacizumab, and anti-EGFR antibody therapy; Eastern Cooperative Oncology Group performance status 0–1; age ≥ 20 years; life expectancy of more than 3 months; able to take oral medications; adequate organ function for study treatment within 14 days before enrollment (white blood cell count ≥ 3000/mm3, < 12,000/mm3, neutrophil count ≥ 1500/mm3, platelet count ≥ 100,000/mm3, hemoglobin level ≥ 9.0 g/dL, total bilirubin ≤ 1.5 × upper limit of normal (ULN), aspartate aminotransferase ≤ 2.5 × ULN, alanine aminotransferase ≤ 2.5 × ULN, creatinine ≤ 1.5 × ULN, prothrombin time-international normalized ratio ≤ 1.5 × ULN); and provision of written informed consent.

The main exclusion criteria were as follows: synchronous or metachronous multiple malignancy within 5 years of disease-free survival; serious infection; any of the following severe comorbidities: (1) diabetes mellitus requiring insulin, (2) uncontrolled hypertension (systolic blood pressure [BP] > 150 mmHg, and/or diastolic BP > 90 mmHg), (3) unstable angina, myocardial infarction, cerebral infarction, or pulmonary embolism within 6 months, (4) abnormal cardiac rhythm requiring antiarrhythmic agent, (5) congestive heart failure (New York Heart Association ≥ class 2), (6) active or chronic hepatitis B and/or hepatitis C, (7) active interstitial pneumonia, (8) hemorrhage (Common Terminology Criteria for Adverse Events Grade ≥ 3) within 4 weeks before enrollment, or (9) nonhealing wound, ulcer, or bone fracture, brain metastases, history of treatment with regorafenib, history of allergy to regorafenib, extended-field radiotherapy within 4 weeks or local radiotherapy within 2 weeks before enrollment, major surgery, skin-open biopsy, or severe injury within 4 weeks before enrollment, sustained proteinuria (3+), unwilling to avoid pregnancy, pregnancy or breastfeeding, a positive pregnancy test, and any other patients regarded as inadequate for study enrollment by the investigators.

Treatment Procedures

Patients received regorafenib at a standard dose of 160 mg once daily for the first 3 weeks, followed by 1 week of rest until disease progression, death, occurrence of unacceptable toxic effects, withdrawal of consent by the patient, or a decision by the treating physician that discontinuation would be in the patient’s best interest. Reduction of dosage to 120 mg/day or 80 mg/day due to adverse effects was permitted.

FDG-PET/CT Imaging

Pre-treatment FDG-PET/CT imaging was performed ≤ 21 days before the start of regorafenib. Subsequent FDG-PET/CT imaging was performed on day 28 (ideal range: days 24–32). All FDG-PET images were centrally reviewed by an independent monitoring committee. The lesion with the highest maximum standardized uptake value (SUVmax) at baseline FDG-PET was defined as the target site for the primary end point.

Response Criteria of FDG-PET/CT

The response criteria assessed using FDG-PET in this study were defined based on EORTC (European Organisation for Research and Treatment of Cancer) and PERCIST (Positron Emission Tomography Response Criteria in Solid Tumors) criteria [11, 12]. The FDG-PET criteria used in this study were as follows: complete response (CR) (complete resolution of FDG uptake in the main target lesion); partial response (PR) (a > 25% reduction in the SUVmax in the main target lesion); stable disease (SD) (non-progressive disease [PD] or CR/PR); PD corresponds to (1) an increase of > 20% in the longest diameter of the main target lesion on CT, (2) an increase of > 25% in the SUVmax in at least one of the top five target lesions, or (3) new FDG uptake in metastatic lesions (new lesions and/or the progression of existing non-target lesions).

CE-CT

Change in tumor diameter was assessed using CE-CT. CE-CT scans were performed ≤ 21 days before the start of regorafenib and on day 28 of the first course. All CE-CT images were centrally reviewed by an independent monitoring committee using RECIST version 1.1 [12].

Statistical Considerations

The primary end point was the change in the SUVmax of the lesion with the highest uptake on baseline FDG-PET. The secondary end points were PFS, OS, ORR, correlation between changes in SUVmax and PFS/OS, and safety. The severity of adverse events was graded according to the National Cancer Institute Common Terminology Criteria for Adverse Events, v.4 [13].

Based on previous studies, we assumed a response rate by CE-CT to regorafenib of 0%, and expected a metabolic early detection rate of 10% by FDG-PET with a clinically meaningful response rate. Given a two-sided alpha of 0.05 and statistical power of 95%, with about 20% dropout from adverse events and/or early tumor progression, we set 20 patients as the target sample size in this study. The full analysis set was defined as patients who fulfilled the eligibility criteria and received protocol therapy at least once. The per-protocol set was defined as patients who received protocol therapy at least once and in whom FDG-PET/CT was performed after one course of regorafenib. The safety population was defined as all eligible patients who received treatment at least once. PFS and OS curves were calculated using the Kaplan–Meier method, and 95% confidence intervals (CIs) were estimated using Greenwood’s formula.

Results

Patient Background Characteristics

From November 2014 to March 2016, 20 patients were registered from three institutions. Intention-to-treat and safety analyses were carried out for all 20 patients. Table 1 shows the patients’ background characteristics. Thirteen patients were male, and seven were female, with a median age of 66 years (range: 42–79 years). All patients had received more than two prior regimens of chemotherapy, including oxaliplatin, irinotecan, and 5-fluorouracil. Four patients (20%) had received four prior therapies before accrual. The median duration from initiation of first-line chemotherapy to enrollment was 26.7 months (range: 11.5–44.5 months). The most common metastatic sites were lung and liver (13 patients, 65%). The data cutoff was November 2017.

Table 1 Patient characteristics

Treatment Exposure

Table 2 shows the treatment exposure. Treatment ranged from one to seven cycles. The total dose of regorafenib administered ranged from 320 to 16,800 mg, and the median total dose was 2080 mg. The calculated relative dose intensity of regorafenib was 59.3%.

Table 2 Treatment exposure

Efficacy

Metabolic response assessed using FDG-PET and CE-CT at the main target lesion

Three patients were excluded from the FDG-PET efficacy analysis because of a need for over 28 days of rest in one patient and mortality in two; therefore, per-protocol analysis for response was performed in 17 patients. Table 3 shows the metabolic response assessed based on the FDG-PET criteria used in this study. The SUVmax of the main target lesion in one patient was changed from 10.47 to 7.58, and this patient was diagnosed as PR by an independent monitoring committee. The response rate using centrally reviewed FDG-PET was 5.9% (95% CI: 0.1%–28.7%). Table 4 shows the morphological change in target lesions evaluated using CE-CT. No patient with CR or PR was found on CE-CT based on RECIST version 1.1, and the ORR based on CE-CT by an independent monitoring committee was 0%.

Table 3 FDG-PET response of the lesion with the highest uptake in the baseline FDG-PET
Table 4 Morphological response rate assessed using contrast-enhanced computed tomography

Figure 1 shows the best percentage change in the SUVmax between before and after regorafenib treatment. Although two of 17 patients showed a greater than 25% decrease in SUVmax, one was assessed as PD because of the appearance of new FDG uptake. The median percentages of change in SUVmax were −8.1% in PR + SD patients and +3.9% in PD patients.

Fig. 1
figure 1

Best percentage change in SUVmax of the lesion with the highest uptake in each patient at baseline FDG-PET

Survival

After a median follow-up of 8.3 months, the median PFS was 1.7 months (95% CI: 0.95–3.68 months) (Fig. 2). Regarding the median PFS among subgroups based on FDG-PET response, patients with PR + SD in the main target lesion showed a better tendency than did patients with PD (3.68 months [95% CI: 1.81–NR] vs. 1.00 months [95% CI: 0.82–3.84], respectively, p = 0.13) (Fig. 3). The median OS was 9.8 months (95% CI: 3.29–19.94) (Fig. 4). Regarding the median OS among subgroups based on FDG-PET response, patients with PR + SD in the main target lesion showed a slightly better tendency than did those with PD (12.98 months [95% CI: 6.77–NR] vs. 10.61 months [95% CI: 2.04–22.21], respectively, p = 0.43) (Fig. 5).

Fig. 2
figure 2

Progression-free survival

Fig. 3
figure 3

Overall survival

Fig. 4
figure 4

Progression-free survival divided by the PET response (PR + SD vs. PD)

Fig. 5
figure 5

Overall survival divided by the PET response (PR + SD vs. PD)

Safety

Adverse events are summarized in Table 5. Toxicity was assessed in all 20 patients. Palmar–plantar erythrodysesthesia syndrome was observed in 12 patients (60%) and grade 3 proteinuria in three (15%). Among all 20 patients, two patients died: one as a result of liver failure and the other because of tumor lysis syndrome.

Table 5 Summary of adverse events (n = 20)

Discussion

In this study, the ORR using centrally reviewed FDG-PET was 5.9%, below our expected early detection rate of 10%. Therefore, the JACCRO CC-12 study failed to demonstrate the usefulness of FDG-PET as an imaging biomarker of regorafenib.

Possible explanations for the negative results include the small number of patients, the single-arm design, and mortality. In the present study, three patients (15%) could not undergo FDG-PET after regorafenib treatment, so the efficacy analysis was performed using data from 17 patients. Because we had estimated that four patients (20%) would drop out because of adverse events and/or other reasons when setting the sample size in the study protocol, this decrease did not affect the statistical robustness of the results.

Two patients died during the treatment period: one from liver failure and the other from what was believed to be tumor lysis syndrome. The mortality rate of 10% was much higher than the reported 2% in the CORRECT trial [3]. This high mortality rate is partly as a result of the small number of patients. Liver failure is known to be a severe adverse event related to regorafenib treatment [3, 4]. A post-marketing survey in Japan noted that fatal outcomes resulting from serious hepatic-related adverse reactions were reported in 27 (0.3%) of an estimated 8380 patients with exposure to regorafenib [14]. The multivariate analysis in that survey revealed that the baseline total bilirubin level (1.2 vs. < 1.2 mg/dL) and the starting dose (< 160 vs. 160 mg) significantly affected the occurrence of serious adverse reactions in the liver. Regarding the patient who died because of liver failure in the present study, the total bilirubin level before regorafenib treatment was 1.54 mg/dL and the starting dose was 160 mg/day. This patient was considered to be at high risk for liver failure. As a future measure, a dose-escalation strategy starting with a lower dose could reduce these adverse events in such populations [15, 16].

Although JACCRO CC-12 ended as a negative study, some interesting findings were observed. First, an FDG-PET response of 5.9% was observed despite the fact that CE-CT showed an objective response of 0% based on RECIST. The PFS and OS of the patient diagnosed as PR using FDG-PET were 4.7 and 13.0 months, respectively. These values were about twice as high as those reported in the CORRECT trial, in which the PFS and OS were 1.9 and 6.4 months, respectively [3]. Also, in the present study, patients diagnosed as PR or SD using FDG-PET showed a better tendency than did patients diagnosed as PD, although this difference was not significant. These facts may allow us to consider that metabolic changes in the target lesion could help justify the continuation of regorafenib treatment. A second interesting finding is related to the definition of metabolic response. In one patient in this study, the SUVmax of the target lesion was dramatically decreased, from 13.3 to 7.1, but a new lesion was detected and this patient was diagnosed as PD using FDG-PET. This result suggests that the FDG-PET response should be assessed based not only on the SUVmax of one target lesion, but also on whole-body total lesion glycolysis (TLG). Recently, metabolic tumor volume (MTV) and TLG have been widely used as tumor metabolic and volumetric parameters in FDG-PET. MTV represents the volume of a tumor with active FDG uptake, and TLG is calculated by multiplying MTV by the mean SUV; these parameters indicate the volume of burden and metabolic activity of tumors, respectively [18,19,20], and should be used in future FDG-PET studies.

The first limitation to note is the large dropout rate of patients. JACCRO CC-12 was designed in 2013, when the dosage of regorafenib was limited to 160 mg/day, and frequent adverse events were observed at that dosage. In recent years, low-dose administration of regorafenib has been proposed [15]. ReDOS study, an open-label phase II study comparing a dose-escalation strategy group (a starting dose of 80 mg/day with weekly escalation, in 40 mg increments, to 160 mg/day regorafenib) and a standard-dose strategy group (160 mg/day orally) showed non-inferiority of the dose-escalation strategy, with less toxicity. Starting with a lower dose of regorafenib might prevent dropout, increase continuity, and more reliably confirm the efficacy of FDG-PET. A secondary point of limitation is the timing of the FDG-PET. In many studies, FDG-PET was performed 7 to 14 days after the initiation of each therapy. In this study, we set the second PET at 28 days after the start of treatment. This setting was dependent on the Japanese medical insurance system, which does not allow two FDG-PET scans in the same month. Considering that the reported PFS of patients treated with regorafenib was 1.7 months, the timing of an FDG-PET-based diagnosis should be earlier.

Conclusion

In summary, JACCRO CC-12 failed to demonstrate the usefulness of FDG-PET as an early imaging biomarker of regorafenib in patients with mCRC. Therefore, more effective methods need to be established for the early detection of responders and nonresponders to regorafenib.