Scolaris Content Display Scolaris Content Display

Symptom screening for active tuberculosis in pregnant women living with HIV

Collapse all Expand all

Abstract

This is a protocol for a Cochrane Review (Diagnostic test accuracy). The objectives are as follows:

To assess the accuracy of the four‐symptom screen (cough, fever, night sweats, or weight loss) for identifying active TB in pregnant PLHIV who are screened in an outpatient or community setting.

Background

Tuberculosis (TB) is the leading cause of mortality in people living with HIV (PLHIV) (UNAIDS 2016); and the third leading cause of death among women of child‐bearing age in high TB burden areas (Mathad 2012). Every year, over 200,000 pregnant women are estimated to develop TB during pregnancy (Sugarman 2014), though this is likely an underestimation due to administrative, diagnostic, or possible immunologic‐related delays in diagnosis (Llewelyn 2000; Zenner 2012). Earlier detection and treatment of TB in pregnant PLHIV is crucial in improving outcomes for both mothers and their young children (Getahun 2012). The World Health Organization (WHO) recommends that PLHIV, including pregnant women, should be routinely screened for symptoms of active TB at every health facility visit using a four‐symptom screen (cough, fever, night sweats, and weight loss (WHO 2011a). Routine TB screening in settings of high HIV and TB burden offers opportunity to identify PLHIV who require further diagnostic work‐up to evaluate for active TB, as well as PLHIV without symptoms who are unlikely to have TB and may be candidates for isoniazid preventive therapy (IPT) (WHO 2011a). Recent data suggest that the performance of the four‐symptom screen for active TB may perform differently in pregnant PLHIV compared to PLHIV in general (Hoffmann 2013; Kancheya 2014; LaCourse 2016; Modi 2016). For this systematic review, we will review the accuracy of symptom screening in pregnant PLHIV for active TB.

Target condition being diagnosed

TB is an airborne infectious disease caused by organisms within the complex Mycobacterium tuberculosis (Mtb) (Pai 2016). Although typically a disease of the lungs, Mtb can disseminate throughout the body. Infection primarily occurs after close and prolonged exposure to a person with active TB, with Mtb spread often occurring through coughing. The majority of people infected with Mtb will not develop active TB and are often described as having latent TB infection (LTBI). They are largely asymptomatic and not infectious. Although previous descriptions have categorized LTBI and active TB as binary states, in reality there is likely a spectrum: from clearance of the infection with evidence of immune response, to LTBI and granuloma formation, to subclinical disease (typically without symptoms but potentially infectious), to active (symptomatic) disease (Lawn 2011). Active TB can occur after recent exposure and infection, or much later in the setting of progression of previously quiescent LTBI. Approximately 5% to 15% of people will develop active TB in the setting of recent infection (within a few months or years), with the remaining described as having LTBI who are then at risk of developing TB in the future (Pai 2016).

Progression to active TB is more likely to occur in the setting of immune suppression with HIV or malnutrition (Horsburgh 2011). HIV significantly increases the risk for progression from LTBI to active TB (Lawn 2011), from an approximately 10% lifetime risk to over 10% per year (Corbett 2003). Given this elevated risk of active TB, the WHO recommends that PLHIV in TB‐endemic areas be provided IPT to reduce the risk of progression from LTBI to active TB (WHO 2011a). The risk of active TB also appears to be elevated in pregnant and postpartum women (Zenner 2012). The time from LTBI to active TB (when it occurs) can be variable, and it can take months to years for individuals to develop symptoms and bacteriologically detectable TB. People with active TB who are not on treatment can continue to be infectious for prolonged periods of time. Untreated maternal active TB is associated with poor maternal and infant outcomes (Getahun 2012; Mathad 2012). TB diagnosis may be delayed in pregnant women in part due to masking of symptoms from physiologic (Hamadeh 1992) or immunologic changes (Kourtis 2014; Mathad 2012; Singh 2007) associated with pregnancy. Maternal HIV and TB co‐disease is associated with adverse infant outcomes including prematurity, small for gestational age, vertical HIV transmission, neonatal TB, and death (Getahun 2012; Mathad 2012). Treatment for TB diagnosed during pregnancy should not be delayed (Nahid 2016; WHO 2010). In a recent large case‐control study in South Africa, mothers with HIV and TB were more likely to experience poor maternal and infant outcomes despite widespread access to antiretroviral therapy (ART), TB treatment and infant IPT (Salazar‐Austin 2017).

TB symptom screening for PLHIV

The WHO defines screening for TB as the “systematic identification of people with suspected active TB, in a predetermined target group, using tests, examinations or other procedures that can be applied rapidly” with the goal to “efficiently distinguish people with a high probability of having active TB from those who are unlikely to have active TB” (WHO 2013). Screening tests are not intended to be diagnostic, rather after a positive screen result, diagnosis is established using additional diagnostic tests. Systematic screening for TB is primarily provider‐initiated, and allows for earlier detection of TB and permits earlier treatment and improved outcomes (WHO 2013). In contrast, “passive‐case finding” often relies on people with potential TB reporting their symptoms to providers prior to further work‐up.

As part of the guidelines for intensified TB case finding and isoniazid preventive therapy (IPT) for PLHIV in resource‐limited settings, the WHO recommends that PLHIV should be routinely screened for active TB at every health facility visit using a four‐symptom screen (cough, fever, night sweats, and weight loss: CFSW) (WHO 2011a). The presence of any one of these four symptoms is considered a positive screen. The absence of all four symptoms is considered a negative screen. The goal of TB symptom screening in PLHIV is two‐fold: 1) to identify those with a positive symptom screen who should undergo further evaluation for active TB; and 2) to identify those with a negative symptom screen who are unlikely to have TB and therefore should be offered IPT. The recommendation regarding the use of the TB four‐symptom screen in PLHIV is based primarily on the individual participant data meta‐analysis performed by Getahun and colleagues of 12 observational studies including 8148 PLHIV (Getahun 2011). The focus of that meta‐analysis was to identify a combination of clinical symptoms that could be readily identified at any level of health system with the combination of greatest sensitivity and specificity, as well as negative predictive value, for active TB among PLHIV. The best performing rule included the presence of any one of CFSW with a sensitivity of 78.9% (95% CI 58.3% to 90.9%) and specificity of 49.6% (95% CI 29.2% to 70.1%). Negative predictive values were 99.6% (95% CI 99.5% to 99.6%) and 97.7% (95% CI 97.4% to 98.0%) for prevalences of 1% and 5%, respectively.

TB symptom screening for pregnant PLHIV

Integrating TB case‐finding within maternal and child health settings is considered a fundamental part of the Global Plan to End TB 2016‐2020 (Stop TB Partnership 2015). The WHO guidelines specifically include TB screening of pregnant PLHIV using the four‐symptom screen (WHO 2011a). However, a number of recently published studies report that the four‐symptom screen appears to have lower sensitivity among pregnant PLHIV (Hoffmann 2013; Kancheya 2014; LaCourse 2016; Modi 2016), compared to PLHIV in general. Previously published estimates of the high negative predictive value in pregnant PLHIV (Gupta 2011), may be different in the current era of widespread ART (Ahmad Khan 2014; Rangaka 2012), including ART initiated for prevention of mother‐to‐child transmission (PMTCT) of HIV in pregnant and breastfeeding women and continued for life regardless of CD4 count (referred to as "Option B+" by the WHO) (WHO 2015a).

Index test(s)

The primary index test that we will evaluate in this review is the WHO‐recommended TB four‐symptom screen, which consists of questions regarding the presence of any cough (of any duration), fever, night sweats, or weight loss (CFSW) to identify pregnant PLHIV with active TB. The four‐symptom screen consists of “Yes or No” type questions regarding the presence of any CFSW symptoms. The presence of any one of these symptoms is considered a positive screen. The absence of all four of these symptoms is considered a negative screen. The four‐symptom screen is designed to be easily asked by healthcare providers in an outpatient or community setting and can be done efficiently in low‐resource settings, allowing for further diagnostics to be reserved for those individuals who are most likely to have TB.

Clinical pathway

The four‐symptom screen is the initial test for PLHIV in a clinical pathway that includes subsequent confirmatory testing (Xpert MTB/RIF (Xpert), and in some cases chest radiograph or mycobacterial culture (solid or liquid)), to establish the diagnosis of active TB (Figure 1) (WHO 2011a; WHO 2013). PLHIV with a negative four‐symptom screen are considered to be unlikely to have TB, and are assessed for IPT eligibility to prevent active TB (WHO 2011a; WHO 2013). PLHIV with a positive four‐symptom screen continue through a testing pathway and undergo further diagnostic tests for TB and other diseases. The WHO currently recommends that PLHIV with a positive screen have an Xpert (rapid nucleic acid amplification test for Mtb), performed on expectorated sputum as a primary diagnostic test for TB (WHO 2011b). If Xpert positive, individuals should be started on multi‐drug anti‐tuberculosis treatment. In some settings, a culture is performed if Xpert positive to confirm TB diagnosis or to confirm whether the Mtb is sensitive to first‐line anti‐tuberculosis treatment. In most resource‐limited settings, routine confirmatory culture is not performed unless the patient has failed first‐line therapy or had a relapse. When available, chest radiography can also be used as an additional screening test to improve the pretest probability of the subsequent diagnostic test, and to reduce the number of people who need to undergo further diagnostic evaluation (WHO 2013). However in many settings, chest radiograph is unavailable (Saito 2012), therefore the WHO does not recommend that it be required in the evaluation of individuals suspected of TB. Additionally, clinicians may be reluctant to perform a chest radiograph in pregnancy, despite the low risk of this amount of radiation exposure to a pregnant woman or her fetus (Ratnapalan 2004).


Algorithm for TB screening in adults and adolescents living with HIV in HIV‐prevalent and resource‐constrained settings. Reprinted from: Guidelines for intensified tuberculosis case‐finding and isoniazid preventive therapy for people living with HIV in resource‐constrained settings. Copyright: WHO 2011a.

Algorithm for TB screening in adults and adolescents living with HIV in HIV‐prevalent and resource‐constrained settings. Reprinted from: Guidelines for intensified tuberculosis case‐finding and isoniazid preventive therapy for people living with HIV in resource‐constrained settings. Copyright: WHO 2011a.

Role of index test(s)

The role of the four‐symptom screen is a triage test as part of an algorithm to identify PLHIV who require further investigations for TB (those with positive symptom screens), as well as those who should be assessed for IPT (those with negative symptom screens) (Figure 1). Triage tests are used before diagnostic tests with only those patients with a positive result continuing through a testing pathway (Bossuyt 2006). Triage tests may be less accurate than existing diagnostics tests and are not meant to replace them, but may have additional advantages such as ease of use or low cost.

A true positive (TP) on the four‐symptom TB screen allows for referral for further diagnostics leading to early diagnosis and treatment of active TB. A true negative (TN) allows for potentially costly diagnostics to be avoided, and for evaluation of IPT for those truly without TB. The consequences of false positives (FP) are further diagnostics with which active TB would be ruled out. The consequences of false negatives (FN) are a possible delay in diagnosis and treatment for active TB as well as the potential for the inappropriate initiation of IPT. Individuals with active TB who receive isoniazid as opposed to multi‐drug anti‐tuberculosis therapy could be at risk of developing drug‐resistant TB. However rates of isoniazid resistance among PLHIV receiving IPT in TB prevention trials are similar to baseline rates of isoniazid resistance, and this risk is thought to be relatively low (WHO 2015c).

Alternative test(s)

The four‐symptom screen is considered the standard of care for the initial TB screening test for PLHIV (WHO 2011a). Alternative approaches to screening include passive case‐finding, alternate symptom screening, sputum microscopy, chest radiograph, and urine lateral flow lipoarabinomannan (LAM) assay (LF‐LAM).

Passive‐case finding relies on identifying TB among people who actively seek care for TB (WHO 2013). This is effective in identifying people who are highly symptomatic, but is less effective in identifying people who may have less pronounced or protean symptoms which may often be the case in PLHIV.

Alternate symptom screening includes the presence of chronic cough (for example longer than two weeks). Chronic cough has low sensitivity for TB among PLHIV, leading to missed cases and diagnostic delays (Getahun 2011).

Similarly, sputum microscopy has poor sensitivity in PLHIV, who often have paucibacillary disease. Chest radiographs can be negative or inconclusive in PLHIV, and require expert trained staff to read the images. Therefore the WHO does not recommend or require the use of sputum microscopy or chest radiograph for the initial screening test for TB in PLHIV (WHO 2011a).

LAM antigen is a lipopolysaccharide present in mycobacterial cell walls, and can be detected in the urine of people with active TB using urine LF‐LAM (Lawn 2012). Urine‐based testing has potential advantages over sputum‐based testing due to ease of collection and lower infection control risks compared to sputum collection. However, due to low sensitivity and specificity (Shah 2016), the WHO strongly recommends against using urine LF‐LAM for TB screening in PLHIV in general, except for those PLHIV with low CD4 or who are seriously ill (WHO 2015b). Portable digital chest radiographs with computer‐assisted diagnosis, and non‐sputum based biomarkers may have a role for initial screening in the future, but are not currently recommended in the initial screening or triage for TB in PLHIV (UNITAID 2015).

Rationale

The aim of this systematic review is to synthesize evidence regarding the accuracy of the four‐symptom screen for active TB in pregnant PLHIV.

This subject is particularly relevant as identifying PLHIV with TB to initiate timely TB treatment, and identifying PLHIV who are safe to initiate IPT, are cornerstones of key strategies of the Global Plan to End TB 2016‐2020 (Stop TB Partnership 2015). The Maternal‐Child Tuberculosis Working Group of the International Union Against Tuberculosis and Lung Disease has identified the performance of TB screening tools in pregnant PLHIV as an important research gap (Modi 2017 [pers comm]).

This review will focus on pregnant PLHIV for a number of reasons. The risk of TB in women, especially those with HIV, appears higher during pregnancy and postpartum periods (Deluca 2009; Hoffmann 2013; Kancheya 2014; LaCourse 2016; Modi 2016). Maternal TB is associated with poor maternal and infant outcomes, particularly among HIV‐infected women and their children (Getahun 2012: Mathad 2012; Salazar‐Austin 2017). Timely detection of TB in peripartum PLHIV may reduce TB‐associated morbidity and mortality. Antenatal and PMTCT settings provide opportunity for routine TB symptom screening in pregnant PLHIV (Deluca 2009; Getahun 2012).

Objectives

To assess the accuracy of the four‐symptom screen (cough, fever, night sweats, or weight loss) for identifying active TB in pregnant PLHIV who are screened in an outpatient or community setting.

Secondary objectives

  • To investigate potential sources of heterogeneity of the accuracy of the four‐symptom screen between studies including: ART status, CD4 cell count, gestational age, pregnancy stage (pregnancy vs. postpartum), screening test definition of cough (any cough vs. cough greater than 2 weeks).

  • To describe the accuracy of single symptoms included within the four‐symptom screen, additional symptoms or symptom combinations, for identifying active TB in pregnant PLHIV. For example, additional symptoms may include failure to gain weight or fatigue.

Methods

Criteria for considering studies for this review

Types of studies

We will include cross‐sectional and cohort studies in which pregnant PLHIV are tested with both TB symptom screening as well as at least one of the reference standards (Xpert or culture). In addition, we will include diagnostic case control studies. Case control studies may overestimate sensitivity and specificity (Lijmer 1999); however, as we anticipate identifying few relevant studies, we will include them. We will also include randomized controlled trials with each arm as a separate study. In addition, we will include published TB prevalence surveys; however we will exclude participants with known prevalent TB (i.e. on anti‐tuberculosis treatment at time of screening). Data from baseline measurement in longitudinal cohorts, as well as interventional trials in which persons with TB need to be excluded, will be eligible. For longitudinal studies where incident TB is identified after enrolment, data regarding symptom screening which occurs at the time of incident TB diagnosis will be eligible.

We will focus on studies that clearly describe TB symptom screening in pregnant PLHIV either in their methods or results. We will include studies that report data from which we can extract true positives (TP), false positives (FP), false negatives (FN), and true negatives (TN) for the four‐symptom screen. Studies with zero TB cases will be eligible for inclusion for estimates of specificity. Studies in which not all participants undergo the reference standard will be eligible for inclusion, as long as it is clear which participants underwent the index (symptom screening) and reference tests (Xpert or culture). Studies that screened PLHIV or pregnant women in general but included pregnant PLHIV will be eligible, as long as data can be extracted from the study specifically for pregnant PLHIV.

Participants

We will include pregnant PLHIV of all ages eligible for TB screening and not yet known to have TB at time of screening, who were screened in the outpatient or community setting. This may include pregnant PLHIV from all types of populations including the general population of PLHIV (included in mass case finding in TB prevalence surveys) as well as specifically targeted populations (screening of household members of known TB cases).

Index tests

For the primary analyses, we will include studies which use the four‐symptom screen (cough, fever, night sweats, weight loss: CFSW). A positive index test is the presence of any one of the four symptoms. A negative index test is the absence of all four of these symptoms. For our secondary analyses, we will also include studies which include additional symptoms (e.g. fatigue, failure to gain weight), and symptom combinations for identifying active TB.

Target conditions

The target condition is active TB.

Reference standards

Our primary reference standards will be sputum mycobacterial culture (liquid or solid) and Xpert, where ‘TB’ is defined as a positive culture for Mtb or a positive Xpert result, and ‘No TB’ is defined as a negative culture for Mtb or negative Xpert result (or if both tests are performed, then both tests are negative). In general, mycobacterial culture is considered the reference standard by which other TB diagnostics are assessed. For the purposes of this systematic review, mycobacterial culture will include both liquid (automated reading of mycobacterial growth inhibitor tubes (MGIT)) and solid medium (Löwenstein‐Jensen (LJ)). MGIT is thought to have increased sensitivity for identifying Mtb, but potentially lower specificity due to higher rates of contamination (Somoskovi 2000). Xpert is a WHO‐endorsed nucleic acid amplification (NAAT) for the diagnosis of TB, and is the recommended primary diagnostic test for PLHIV in resource‐limited settings (WHO 2011b). In a recent systematic review, Xpert has a pooled sensitivity of 89% (95% Credible Interval (CrI) 85% to 92%) and pooled specificity of 99% (95% CrI 98% to 99%) as an initial diagnostic test as measured against mycobacterial culture as the reference standard, with slightly reduced pooled sensitivity among PLHIV, 79% (95% CrI 70% to 86%) (Steingart 2014). For the purpose of this review, participants with mycobacterial culture negative for Mtb, but positive for nontuberculous mycobacteria will also be considered to have ‘No TB’.

Search methods for identification of studies

We will attempt to identify all relevant studies regardless of language or publication status (published, unpublished, in press, and in progress).

Electronic searches

We will search the following databases using the search terms and strategy described in Appendix 1: Cochrane Infectious Diseases Group Specialized Register; Central Register of Controlled Trials (CENTRAL), published in the Cochrane Library; MEDLINE (OVID); Embase (OVID); CINAHL (EBSCOHost); Science Citation Index‐Expanded (Web of Science) and Scopus. We will search the WHO International Clinical Trials Registry Platform (ICTRP) (www.who.int/ictrp/en/), and ClinicalTrials.gov (https://clinicaltrials.gov/ct2/home), for trials in progress, using 'tuberculosis' and 'symptom screening' as search terms.

Searching other resources

We will contact researchers and experts in the field to identify any additional eligible studies. We will also check the references of all included studies to identify additional studies.

Data collection and analysis

Selection of studies

Two review authors will independently screen all titles and abstracts to determine potentially eligible studies. We will then obtain the full‐text articles of these potentially eligible studies and two review authors will independently assess whether they should be included based on pre‐defined inclusion and exclusion criteria. We will resolve disagreements by discussion between the two review authors, as well as a third author if necessary. We will contact study authors for clarification of methods and other information if necessary. Reasons for exclusion will be recorded and summarized in a ‘Characteristics of excluded studies’ table. We will illustrate the study selection process in a PRISMA diagram.

Data extraction and management

We have drafted a data extraction form (Appendix 2) and will pilot the form with at least two included studies. We will finalize the form based on this pilot. Two review authors will independently extract data from each study using the finalized form. Both review authors will discuss any inconsistencies to obtain consensus. Any disagreements will be resolved either through consensus or by a third review author if necessary. We will enter into a database (either Excel or Research Electronic Data Capture [REDCap]) (Harris 2009). Extracted data will be stored on password‐protected computers, or in the case of REDCap (www.iths.org/investigators/services/bmi/redcap/), in a password protected online web application. Extracted data will be stored for review updates and we will seek Cochrane and CDC approval prior to update.

Data extraction will include the following characteristics.

  • Authors, title, publication year, journal, email address of corresponding author.

  • Whether author contacted, dates of contact, author response.

  • Year(s) study conducted.

  • Language of the publication, publication status.

  • Country where study conducted.

  • Reference standard(s): sputum Xpert, culture (solid or liquid), number of samples per individual.

  • Whether participants unable to produce sputum included in study.

  • Clinical setting (outpatient, community screening), participant selection.

  • Study design (including direction of study data collection i.e. prospective, retrospective).

  • Index test(s): four‐symptom screen (CFSW), additional symptoms (cough > 2 weeks, failure to gain weight, fatigue, etc.).

  • Purpose of screening as described in the study.

  • Number after screening by study inclusion and exclusion criteria.

  • Number included in analysis (include if available # in study, # pregnant PLHIV, # screened ‐ # withdrew).

  • Patient characteristics: age (range, mean (SD), median (IQR));

  • HIV characteristics: HIV status of participants (#, %), ART status (combination ART vs. ART for PMTCT alone), CD4 (range, mean (SD), median (IQR)), HIV viral load (VL) (% with undetectable VL, range, mean (SD), median (IQR)).

  • Pregnancy characteristics: gestational age (range, mean (SD), median (IQR)), postpartum age (range, mean (SD), median (IQR)), pregnancy vs. postpartum.

  • Patients with a previous history TB (#, %).

  • Details of outcomes: # of true positives (TP), true negatives (TN), false positives (FP), false negatives (FN); # participants with missing and/or unavailable results.

We will classify country income status as ‘low and middle income’ or ‘high income’, according to the World Bank List of Economies (World Bank 2016). Additional tables may be created for other symptoms or symptom combinations if they are reported. Review authors who are also authors of primary studies will recuse themselves from the screening and data extraction of their own studies and an alternate reviewer will perform the screening and data extraction.

Assessment of methodological quality

We will assess the methodological quality of included studies using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS‐2) instrument (Whiting 2011), which we will tailor to this review. The QUADAS‐2 consists of four domains: (1) patient selection; (2) index test(s); (3) reference standard(s); and (4) flow and timing. We will assess all domains for risk of bias and the first three domains for concerns regarding applicability. As recommended, we will first develop guidance on how to appraise each question and interpret this information. Then, one review author will pilot the tool with two of the included studies. Based on experience gained from the pilot, we will finalize the tool. Two review authors will independently complete QUADAS‐2. We will resolve disagreements through discussion or, failing that, arbitration by a third review author. We will present the results of the quality assessment in the text, table, and graphs. The preliminary tool with signalling questions tailored to this review is in Appendix 3.

Statistical analysis and data synthesis

We will perform descriptive analyses of the characteristics of the included studies using Stata 14 (StataCorp 2015), and present key study characteristics in ‘Characteristics of included studies’ tables.

We will use data reported in the two‐by‐two tables to calculate sensitivity and specificity estimates and 95% confidence intervals (CI) for the accuracy of the four‐symptom screen from individual studies using Review Manager 5 (RevMan 5) (RevMan 2014). When possible, TB cases diagnosed by CXR or clinical suspicion alone will be excluded from the extracted data. We will present individual study results graphically by plotting the estimates of sensitivity and specificity (and their 95% CIs) in forest plots and receiver operating characteristic (ROC) space using RevMan 5. If data allow, we will include descriptive analyses of the performance of single symptoms, additional symptoms, or different symptom combinations other than the four‐symptom screen.

We will fit a bivariate random‐effects meta‐analysis model (Chu 2010; Macaskill 2010; Reitsma 2005) to estimate the pooled sensitivity and specificity and corresponding 95% CIs using the meqrlogit and metandi commands in Stata (version 14). We have selected the bivariate model because the four‐symptom TB screen uses a common positivity criteria or threshold, i.e. the presence of any one of the four symptoms is considered a positive result. The bivariate random‐effects approach will allow us to calculate the pooled estimates of sensitivity and specificity while dealing with potential sources of variation caused by: (1) variability of sensitivity and specificity estimates within individual studies; (2) correlation between sensitivity and specificity across studies; and (3) variation in sensitivity and specificity between studies. For some subgroups or screen definitions we may not be able to give meaningful summary estimates of sensitivity and specificity, therefore they will be evaluated using descriptive methods.

If we find that, in using hierarchical models, the analyses fail to converge due to a small number of studies or sparse data, we will consider simplifying the models into fixed‐effect models by eliminating the random‐effects parameters for sensitivity or specificity (Takwoingi 2017).

Investigations of heterogeneity

We will examine the forest plots and ROC plots through visual examination for heterogeneity. If the data allow, we will analyse potential determinants or sources of heterogeneity as covariates in the models using meta‐regression (Macaskill 2010). We will include the following pre‐specified categorical study‐level covariates.

  • ART status (on ART vs. not ART).

  • Pregnancy stage (pregnant vs. postpartum).

  • Screening test definition of cough (any cough vs. cough longer than two weeks).

Sensitivity analyses

If there are sufficient data, we will perform sensitivity analyses to explore the contribution of risk of bias and study characteristics on accuracy of the four‐symptom TB screen by including only studies that meet the following criteria in the meta‐analysis.

  • Studies that avoided case‐control design.

  • Studies that avoided inappropriate exclusions.

  • Studies that interpreted the result of symptom screen (index test) without knowledge to the result of the reference standard.

  • Studies where TB diagnosis is based solely on the reference of Xpert or culture (e.g. studies did not use CXR or clinical diagnosis as the basis for classifying patients as having “TB” or “No TB”).

Assessment of reporting bias

We will not formally assess reporting bias using funnel plots or regression tests as these have not been reported as helpful for diagnostic test accuracy studies (Macaskill 2010).

Algorithm for TB screening in adults and adolescents living with HIV in HIV‐prevalent and resource‐constrained settings. Reprinted from: Guidelines for intensified tuberculosis case‐finding and isoniazid preventive therapy for people living with HIV in resource‐constrained settings. Copyright: WHO 2011a.
Figures and Tables -
Figure 1

Algorithm for TB screening in adults and adolescents living with HIV in HIV‐prevalent and resource‐constrained settings. Reprinted from: Guidelines for intensified tuberculosis case‐finding and isoniazid preventive therapy for people living with HIV in resource‐constrained settings. Copyright: WHO 2011a.