Scolaris Content Display Scolaris Content Display

Positron emission tomography (PET) and magnetic resonance imaging (MRI) for assessing tumour resectability in advanced epithelial ovarian, fallopian tube and/or primary peritoneal cancer

Esta versión no es la más reciente

Contraer todo Desplegar todo

Abstract

This is a protocol for a Cochrane Review (Diagnostic test accuracy). The objectives are as follows:

To assess the diagnostic test accuracy of PET(‐CT), conventional and diffusion‐weighted MRI as an replacement or an add‐on to abdominal CT, for predicting tumour resectability at primary debulking surgery in patients with stage III – IV epithelial ovarian, fallopian tube and/or primary peritoneal cancer.

Background

Epithelial ovarian, fallopian tube and primary peritoneal cancers are malignancies of the internal female genital tract. Clinically, these tumours are often regarded as a single entity due to their similarity and overlap in pathophysiology, symptomatology, diagnostic approach, staging, treatment and prognosis (Prat 2014). Ovarian cancer affects 239,000 women each year (Ferlay 2012). It is most commonly identified at an advanced stage due to the absence of symptoms in early stage disease. When symptoms do occur, they are often non‐specific including abdominal pain or discomfort and fatigue (Olson 2001). The extent of ovarian cancer is categorised using the International Federation of Gynaecology and Obstetrics (FIGO) staging criteria. In advanced stage disease, the tumour is not confined to the ovaries (stage I) or true pelvis (stage II), but has spread outside the pelvis through the peritoneal cavity or towards regional lymph nodes (stage III), or towards extra‐abdominal lymph nodes and/or with haematogenous spread resulting in distant metastasis (e.g. liver parenchyma, stage IV) (Mutch 2014; Prat 2014). This late presentation makes ovarian cancer the leading cause of death from gynaecologic malignancies in developed countries worldwide with an absolute mortality of 152,000 women each year (Ferlay 2012).

In women with advanced stage epithelial ovarian, fallopian tube and primary peritoneal cancer, a combination of chemotherapy and debulking surgery is considered the mainstay of treatment. Debulking surgery (i.e. surgical efforts to remove the bulk of tumour) usually encompasses removal of the uterus (hysterectomy) and adnexa, resection of the omentum (an apron of fatty tissue containing veins, arteries, lymphatics), and the attempted resection of all visible tumour deposits (NCI 2015). The actual feasibility of the latter is in reality limited by the location of lesions (e.g. around blood vessels) and the potential morbidity that each resection induces. At the end of each surgical procedure a conclusion can be drawn on the completeness of debulking (cytoreductive) surgery, divided in debulking with no visible tumour deposits left (i.e. macroscopic (complete) debulking), debulking with residual disease ≤ 1 cm (in the past often called 'optimal debulking'), or debulking with residual disease > 1 cm (i.e. incomplete debulking). This distinction is important since, along with tumour response to chemotherapy, the completeness of debulking surgery is the most important prognostic factor for survival in patients with advanced stage epithelial ovarian cancer (Bristow 2002; Elattar 2011; NCI 2015; Vergote 2010). Unfortunately, despite chemotherapy and macroscopic debulking surgery, the majority of patients still develop recurrent disease (du Bois 2009). As 'macroscopic complete debulking' is determined by the naked eye of the surgeon, this does not imply that the resections are radical in the sense of radical surgical margins determined by histopathological examination of the specimen. Therefore, recurrences can be partly due to remaining microscopic disease (i.e. occult disease) after treatment.

Preoperative diagnostic imaging is used to estimate tumour extension and thus the feasibility of surgical debulking. If macroscopic debulking seems feasible based on imaging, primary debulking surgery is attempted. If imaging indicates that the chance of macroscopic debulking is small, patients receive neoadjuvant chemotherapy (in order to reduce tumour load) and subsequently debulking surgery (i.e. interval debulking). Currently, diagnostic imaging is predominantly based on abdominal computed tomography (CT). Unfortunately, this preoperative assessment is imperfect since small tumour deposits can be missed and distinguishing malignant from benign tissue can be challenging. This can lead to cases wherein primary surgery is attempted but deemed unfeasible, which causes unnecessary morbidity and negatively influences prognosis. In contrast, macroscopic debulking is the strongest independent predictor of patient outcome and should be attempted whenever deemed possible (Vergote 2010). Therefore, it is undesirable to select woman for neoadjuvant chemotherapy followed by interval debulking surgery when the tumour load would have been primary resectable. In conclusion, it is important to conscientiously select patients for either primary debulking surgery with adjuvant chemotherapy or neoadjuvant chemotherapy followed by interval debulking.

Bristow 2002 demonstrated the extensive heterogeneity between centres on their percentage of macroscopically debulking and incomplete debulking with residual disease limited to 1 cm in diameter, or 2 cm in the earlier studies (Baker 1994), which ranged from 0% to 100% with a weighted mean of only 41.9%. Even with careful patient selection using laparoscopy, the percentage of patients with residual tumour after primary debulking surgery still range up to 31% (Rutten 2014). Recent randomised controlled trials have demonstrated equivalence in survival between primary surgery and the alternative approach with neoadjuvant chemotherapy and interval debulking surgery, with reduced morbidity in the latter (Kehoe 2015; Morrison 2012; Vergote 2010).

Target condition being diagnosed

The target condition is the outcome of primary debulking surgery for advanced stage epithelial ovarian, fallopian tube and/or primary peritoneal cancer. The outcome is defined by the diameter of the largest tumour deposit remaining after surgery and is determined by the surgeon performing the procedure. The term 'primary' specifies those patients in whom no treatment, surgical or chemotherapy, has been given prior to this surgery. Three target condition categories will be considered.

  • Macroscopic debulking, which is defined as no macroscopically visible tumour deposits at the end of surgery. Debulking of all deposits is the objective, though not always clinically feasible (NICE 2011). This can be due to their location (e.g. situated on the mesentery or liver hilum) or when the number of (small) metastasis is innumerable (i.e. miliary pattern of spread). In general, deposit resection needs to be abandoned when continuing would induce unacceptable morbidity (e.g. compromising the blood supply to the entire small bowel in case of mesenterial resections). Consequently, this leads to an incomplete debulking with residual deposits of ovarian cancer.

  • Incomplete debulking with residual disease, divided into two subcategories, depending on whether macroscopically visible tumour deposits:

    • ≤ 1 cm in diameter remain at the end of surgery; or

    • >1 cm in diameter remain at the end of surgery.

Index test(s)

In this systematic review, we will consider the following three non‐invasive and commonly available index tests.

  • Whole body fluorodeoxyglucose‐18 (FDG) positron emission tomography (PET), with or without a parallel conventional CT for anatomical reference (PET‐CT).

  • Conventional T1w/T2w (i.e. anatomical) magnetic resonance imaging (MRI), with or without intravenously administered gadolinium contrast.

  • Diffusion weighted MRI (DWI), in addition to conventional MRI, an imaging method that uses the diffusion of water molecules to generate contrast.

Clinical pathway

With (subtle) symptoms, or based on accidental discovery of an abdominal mass, patients suspected of ovarian cancer preferably present to a gynaecological oncologist. Here, a standard diagnostic work‐up is performed starting with obtaining information about i.e. medical history, symptoms, family history, known allergies, use of medication and the social background. This is followed by a general physical and pelvic examination (Roett 2009). In most centres, ultrasound (transvaginal and/or abdominal) is routinely added to assess the size and composition of the adnexal mass as well as the presence of free fluid in the rectouterine excavation (i.e. pouch of Douglas) (NICE 2011).

Blood tests are performed to assess both general health as well as specific tumour marker levels and a CT scan of the pelvis, abdomen and optionally the chest is performed (NICE 2011). The presence, location and extent of the adnexal mass, ascites, peritoneal tumour deposits, omental caking (abnormally thickened greater omentum which indicates infiltration of tumour tissue), lymph node enlargement, pleural effusion and haematogenous metastases are specifically assessed. In some centres, chest CT is substituted by two‐directional plain film chest radiography.

A multidisciplinary tumour board of experts discuss all findings and determine the diagnosis, stage and treatment plan and in particular the feasibility of (complete) tumour debulking. When considered feasible, primary debulking surgery followed by adjuvant chemotherapy is preferred. The tumour stage is macroscopically estimated at surgery and definitively after histopathological examination. When the feasibility of debulking surgery is questionable, women are commonly treated with three or six cycles of neoadjuvant chemotherapy (usually a combination of carboplatin and paclitaxel) and subsequently, in the case of no disease progression, with interval debulking surgery.

Alternative test(s)

When studies do not follow an add‐on design (i.e. substitute CT by one of the index tests), contrast enhanced CT of the abdomen can be considered as an alternative test (to the index tests) in patients with stage III – IV epithelial ovarian, fallopian tube and/or primary peritoneal cancer.

Laparoscopy, performed either as ambulatory surgery or directly before the laparotomy, can also be considered as an alternative test. A recent Cochrane systematic review on laparoscopy for the assessment of tumour resectability in ovarian cancer remained inconclusive (Rutten 2014). However, a randomised controlled trial which assessed whether the proportion of incomplete debulking surgeries with residual disease ≤ 1 cm or > 1 cm in diameter can be reduced by laparoscopic assessment has been performed and we are now awaiting results (Rutten 2012).

Rationale

Abdominal CT is imperfect in assessing the (non‐)resectability of advanced stage ovarian cancer in primary debulking surgery (Borley 2015; Suidan 2014, Vergote 2008). Alternative imaging options, such as PET(‐CT), conventional and diffusion‐weighted MRI, are currently widely available in the developed world and may possibly yield a superior diagnostic test accuracy (DTA) to assess preoperatively if macroscopic debulking can be achieved. First, PET(‐CT) provides information on tumour extension based on the enhanced glucose metabolism of cancer cells, and is particularly useful for identification of distant metastases. Second, MRI has good soft tissue image contrast and gives a detailed view of structures and its position towards surrounding tissue. These imaging tests can be added to the preoperative work‐up (if the healthcare system permits with respect to costs), either as an alternative to abdominal CT (i.e. replacement test) or in combination with CT (i.e. as an add‐on test). Adding an alternative imaging method can be considered in women with a tumour load determined resectable by CT, in an attempt to filter out false‐negatives (i.e. resectable based on CT, not resectable according to the alternative method). In these women with non‐resectable tumours, additional imaging studies such as MRI or PET(‐CT) may possibly reduce the percentage of patients with residual disease outcome after primary debulking surgery. If PET(‐CT) and/or MRI show superior accuracy, more adequate selection of patients for either primary debulking or neoadjuvant chemotherapy can be performed.

Unfortunately, there is currently no systematic review which addresses the DTA of these imaging modalities (see; Index test(s)) in this context.

Objectives

To assess the diagnostic test accuracy of PET(‐CT), conventional and diffusion‐weighted MRI as an replacement or an add‐on to abdominal CT, for predicting tumour resectability at primary debulking surgery in patients with stage III – IV epithelial ovarian, fallopian tube and/or primary peritoneal cancer.

Secondary objectives

To investigate the year of study initiation, the annual surgical caseload and whether surgery is performed by a gynaecological oncologist as possible sources of heterogeneity. For further details, please see Investigations of heterogeneity.

Methods

Criteria for considering studies for this review

Types of studies

We will include randomised comparisons of diagnostic tests, cross‐sectional, retro‐ and prospective cohort studies, that address the DTA of preoperative PET(‐CT), conventional or (additional) diffusion‐weighted MRI on predicting tumour resectability in women who are planned to undergo primary debulking surgery. Studies which add‐on to CT or when the index test replaces CT, will be included. To evaluate the add‐on effect, the alternative imaging test should be performed within four weeks before or after the CT scan. Studies following a case‐control design, which carry an inherent high risk of bias in a DTA research objective, will be excluded.

Participants

Studies have to include adult (18 years of age or more) female patients diagnosed with advanced stage (stage III ‐ IV) epithelial ovarian, fallopian tube and/or primary peritoneal cancer, considered eligible for primary debulking surgery (i.e. no adjuvant chemotherapy treatment or prior surgery to assess tumour extension is performed). Also, studies with participants in stage I‐IV disease will be included if data from patients with stage III‐IV disease can be extracted.

Index tests

The index tests of interest are preoperatively performed fluorodeoxyglucose‐18 PET(‐CT), conventional and diffusion‐weighted MRI (see; Index test(s)). All these imaging modalities may be used as a replacement or as an add‐on to abdominal CT in patients with advanced epithelial ovarian, fallopian tube and/or primary peritoneal cancer.

A positive index test is defined as an assessment of tumour spread in which resection at primary debulking surgery is judged to be unfeasible (i.e. index test indicates ‘tumour is not resectable’). Conversely, a negative index test is defined as a tumour for which resection by primary debulking surgery is considered feasible.

Target conditions

The target condition is defined as the resectability of all deposits from epithelial ovarian, fallopian tube and/or primary peritoneal cancer at primary debulking surgery. This target condition has three categories (see; Target condition being diagnosed) which makes two commonly studied and clinically relevant dichotomisations possible (see; Statistical analysis and data synthesis).

Reference standards

The reference standard will be primary debulking surgery. This is most commonly performed via a laparotomy, although in recent years laparoscopy has also performed in cases of limited disease volume. During such a procedure the abdomen is systematically explored to assess the tumour spread and its resectability. The outcome category is determined by the surgeon at the end of this surgery.

Search methods for identification of studies

Our search for relevant literature will involve electronic databases (see Electronic searches) as well as from additional sources (see Searching other resources).

Electronic searches

We will search MEDLINE (Ovid), Embase (Ovid), Science citation index expanded (web of knowledge), social sciences citation index (web of knowledge), and arts & humanities citation index (web of knowledge). See Appendix 1 for a proposed draft search strategy to be run in MEDLINE (OVID). Similarly, structured search strategies will be designed using search terms appropriate for each database. Controlled vocabulary such as MeSH terms and EMTREE will be used where appropriate. Search filters (collections of terms aimed at reducing the number needed to screen) will not be used as an overall limiter because those published have not proved sensitive enough (Beynon 2013). No language restriction will be applied to the electronic searches, and translation services will be used as necessary.

Initial searches will be performed by one review author with extensive experience in systematic reviews. Screening of abstracts and titles will be conducted independently by two review authors.

Searching other resources

We will search both clinicaltrials.gov and WHO‐ICTRP to identify prospectively registered trials. The reference lists of all relevant studies will be searched for additional relevant studies. These studies will also be used to search the electronic databases to identify additional studies through the use of the related article feature and to identify newly citing articles by way of cited reference search. We will also contact research groups authoring studies used in the analysis for unpublished data.

Data collection and analysis

The data collection and analysis will adhere to the guidelines provided in the Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy (Deeks 2013).

Selection of studies

All titles and abstracts retrieved by electronic searching will be downloaded to a reference management database (Endnote) and duplicates will be removed. The remaining references will be independently examined by two review authors (JFR and JPH) on title and abstract using the pre‐set inclusion and exclusion criteria as stated above. Afterwards, discrepancies in judgement between both review authors will be discussed in an attempt to reach consensus. When the possible inclusion or exclusion of an individual study remains unclear, full‐text assessment will be independently performed by the same two review authors for a final decision. Articles considered directly eligible based on title and abstract screening will also be read in full text to definitively confirm adherence to the inclusion and exclusion criteria. When discrepancies in judgement between both investigators persist following full‐text assessment, RJPMS will act as a referee to reach a final verdict. Excluded studies will be documented and the reasons for exclusion stated according to the guidance provided in the Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy.

Data extraction and management

Two review authors (JFR and JPH) will independently perform data extraction. Missing data will be sought, if possible, by contacting the study authors concerned. Data will be checked and entered into RevMan 5 by one review author and checked by another review author.

For included studies, data on characteristics of patients (inclusion criteria, age, stage, co‐morbidity, previous treatment, number enrolled in each arm) and interventions (type, formulation, dose, duration, regimen), risk of bias, duration of follow‐up, outcomes and deviations from protocol will be abstracted independently by two review authors (JFR and JPH) onto a data abstraction form specially designed for the review.

Assessment of methodological quality

The QUADAS‐2 assessment tool for diagnostic accuracy studies in the context of systematic reviews will be completed for all included studies (Whiting 2011). This assessment will be performed independently by two review authors, one of whom has a clinical background (JFR), and the other both a clinical and methodological background (JPH). Final results will be based on consensus discussion. When consensus cannot be reached or uncertainty persists, RJPMS will act as a referee to reach a final decision. Operational definitions of QUADAS items are derived from Rutten 2014 and described in Appendix 2.

Statistical analysis and data synthesis

We will perform separate analyses for two different definitions of resectability versus non‐resectability, based on the three common categories of the target condition (see; Target condition being diagnosed):

  • Macroscopic debulking versus incomplete debulking with residual disease of any size (i.e. a dichotomisation of tumour deposits of 0 cm in diameter).

  • Macroscopic debulking or incomplete debulking with residual disease ≤ 1 cm in diameter versus incomplete debulking with residual disease > 1 cm in diameter (i.e. a dichotomisation of tumour deposits at 1 cm).

Figure 1 and Figure 2 outline the definitions of the two by two table for these analyses.


Definitions of the two by two table, wherein the index tests are tabulated against the reference standard outcome, on the analysis: macroscopic debulking versus incomplete debulking with residual disease of any size (i.e. consisting of deposits ≤ 1 cm and > 1 cm in diameter ). TP= true positive, FP= false positive, FN= false negative, TN=true negative.

Definitions of the two by two table, wherein the index tests are tabulated against the reference standard outcome, on the analysis: macroscopic debulking versus incomplete debulking with residual disease of any size (i.e. consisting of deposits ≤ 1 cm and > 1 cm in diameter ). TP= true positive, FP= false positive, FN= false negative, TN=true negative.


Definitions of the two by two table, wherein the index tests are tabulated against the reference standard outcome, on the analysis: macroscopic debulking or incomplete debulking with residual disease ≤ 1 cm in diameter versus incomplete resection with residual disease > 1 cm in diameter. TP= true positive, FP= false positive, FN= false negative, TN=true negative.

Definitions of the two by two table, wherein the index tests are tabulated against the reference standard outcome, on the analysis: macroscopic debulking or incomplete debulking with residual disease ≤ 1 cm in diameter versus incomplete resection with residual disease > 1 cm in diameter. TP= true positive, FP= false positive, FN= false negative, TN=true negative.

We will perform separate analyses for the two different roles of the index tests: a) as a replacement of CT, and b) as an add‐on test in women who are considered resectable by CT.

We will perform meta‐analyses of DTA studies according to the guidelines described in the Chapter 10 of the Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy (Macaskill 2010). Review Manager software (Review Manager 2014) will be used to prepare forest plots of paired sensitivity and specificity of the included studies, the distribution of studies in the receiver operating characteristics (ROC)‐space and the graphical presentation of the results of the meta‐analyses, whereas the actual meta‐analyses will be done by the use of SAS (module METADAS). This module includes random‐effects methods for meta‐analysis of DTA studies in which overall sensitivity and specificity (or a summary ROC‐curve) are jointly estimated, whilst taking account of the existing covariance of those two parameters and the existing heterogeneity between studies, which is the rule rather than the exception in meta‐analyses of DTA studies (Macaskill 2010).

We assume, that implicit uniform thresholds will have been applied for each index test. Therefore, we will use pairs of sensitivity and specificity to calculate a summary estimate of sensitivity and specificity using a generalised linear mixed model (binomial family) (Reitsma 2005). If thresholds vary across studies, we will estimate the underlying ROC curve by the use of a hierarchical summary receiver operating characteristics (HROC) model (Rutter 1995; Rutter 2001).

To compare the accuracy of different index tests, we will extend the model with covariates indicating the type of index test. This allows for formal comparison of differences in mean sensitivity and specificity between index tests. In addition, we will restrict the analysis to studies that applied more than one index test in the same patients.

We will perform all statistical analyses with the macro METADAS using the statistical software SAS (Macaskill 2010).

We will assign levels of evidence to the various outcome categories (true positive (TP), false positive (FP), false negative (FN) and true negative (TN), see Figure 1 and Figure 2) according to GRADE and prepare 'Summary of findings' tables (Schünemann 2008). Labelling the tumour status erroneously as resectable ('false negatives') is considered worse than labelling the tumour status erroneously as non‐resectable ('false positives'), For GRADE, therefore, the DTA outcome ‘false negative’ is considered 'critical' (9) and the DTA outcome 'false positive' as less critical (8). The other outcomes (TP and TN) are considered 'important'.

The methods for GRADEing the level of evidence for DTA studies are still under development. We will apply the methods described in 2008 (Schünemann 2008) and in a more recent draft paper by Schünemann and colleagues (Schünemann personal communication). To create the GRADE‐profiles and 'Summary of findIngs' tables, we will use the Guideline Development Tool (http://gdt.guidelinedevelopment.org/central_prod/_design/client/index.html).

Investigations of heterogeneity

We will explore heterogeneity by adding covariates to the statistical model for the following characteristics. The following covariates will be considered.

  • Year of study initiation. Rapid advances have been made over the past decade(s) in the imaging sciences. Thus, heterogeneity caused by time‐dependent qualitative differences in the index test will be explored by adding the year of study initiation to the model.

  • Annual caseload at the study centre. Studies have suggested that better outcomes are achieved in hospitals with a high volume of debulking surgeries for advanced ovarian cancer (Mercado 2010; Schrag 2006).

  • Whether primary debulking surgery is performed by a subspecialised gynaecological oncologist. Quality of care and associated outcomes (including the probability to undergo debulking surgery, survival, etc.) have been reported to be dependent on whether a general surgeon, general gynaecologist or gynaecological oncologist performs the surgery (Earle 2006; Mercado 2010).

  • Percentage of patients with stage IIIC/IV ovarian cancer. It could be more difficult to achieve macroscopic debulking in these patients compared to stage IIIA/IIIB patients.

Sensitivity analyses

Sensitivity analyses will be performed by excluding studies at high risk of bias for each of the QUADAS domains.

Assessment of reporting bias

No assessment of reporting bias will be performed. Currently, no uniformly accepted and validated method for assessing this type of bias, in the context of a review based on DTA studies, exists (van Enst 2014).

Definitions of the two by two table, wherein the index tests are tabulated against the reference standard outcome, on the analysis: macroscopic debulking versus incomplete debulking with residual disease of any size (i.e. consisting of deposits ≤ 1 cm and > 1 cm in diameter ). TP= true positive, FP= false positive, FN= false negative, TN=true negative.
Figuras y tablas -
Figure 1

Definitions of the two by two table, wherein the index tests are tabulated against the reference standard outcome, on the analysis: macroscopic debulking versus incomplete debulking with residual disease of any size (i.e. consisting of deposits ≤ 1 cm and > 1 cm in diameter ). TP= true positive, FP= false positive, FN= false negative, TN=true negative.

Definitions of the two by two table, wherein the index tests are tabulated against the reference standard outcome, on the analysis: macroscopic debulking or incomplete debulking with residual disease ≤ 1 cm in diameter versus incomplete resection with residual disease > 1 cm in diameter. TP= true positive, FP= false positive, FN= false negative, TN=true negative.
Figuras y tablas -
Figure 2

Definitions of the two by two table, wherein the index tests are tabulated against the reference standard outcome, on the analysis: macroscopic debulking or incomplete debulking with residual disease ≤ 1 cm in diameter versus incomplete resection with residual disease > 1 cm in diameter. TP= true positive, FP= false positive, FN= false negative, TN=true negative.