Scolaris Content Display Scolaris Content Display

Pruebas séricas del primer y segundo trimestres con y sin ecografía del primer trimestre para el cribado del síndrome de Down

Contraer todo Desplegar todo

Resumen

Antecedentes

El síndrome de Down ocurre cuando una persona tiene tres copias del cromosoma 21 (o del área específica del cromosoma 21 que causa el síndrome de Down), en lugar de dos. Es la causa congénita más frecuente de discapacidad mental. El cribado no invasivo basado en el análisis bioquímico del suero o la orina de la madre, o las mediciones de la ecografía fetal, permiten calcular el riesgo de que un embarazo esté afectado, y proporcionan información para guiar las decisiones acerca de una prueba definitiva.

Antes de aceptar las pruebas de cribado, los padres necesitan contar con toda la información acerca de los posibles riesgos y ,efectos beneficiosos, así como sus consecuencias. Lo anterior incluye elecciones subsiguientes para pruebas adicionales que se puedan necesitar, y las implicaciones de las pruebas de cribado con falsos positivos (pruebas diagnósticas invasivas y la posibilidad de que se produzca el aborto espontáneo de un feto con cromosomas normales) y falsos negativos (es decir, que pueda no detectarse un feto con síndrome de Down). Las decisiones que pueden enfrentar los padres que esperan un niño generan inevitablemente un nivel alto de ansiedad en todos los estadios del proceso de cribado, y los resultados del cribado se pueden asociar con considerable morbilidad física y psicológica. Ninguna prueba de cribado puede predecir la gravedad de los problemas que tendrá un paciente con síndrome de Down.

Objetivos

Calcular y comparar la exactitud de los marcadores séricos del primer y segundo trimestres con y sin marcadores ecográficos del primer trimestre para la detección del síndrome de Down en el período prenatal, como combinaciones de los marcadores.

Métodos de búsqueda

Se realizó una búsqueda bibliográfica sensible y exhaustiva en MEDLINE (1980 hasta 25 de agosto 2011), Embase (1980 hasta 25 de agosto 2011), BIOSIS vía EDINA (1985 hasta 25 de agosto 2011), CINAHL vía OVID (1982 hasta 25 de agosto 2011), la Database of Abstracts of Reviews of Effectiveness (The Cochrane Library 25 de agosto 2011), MEDION (25 de agosto 2011), la Database of Systematic Reviews and Meta‐Analyses in Laboratory Medicine (25 de agosto 2011), el National Research Register (archivado 2007), en la base de datos de los Health Services Research Projects in Progress (25 de agosto 2011). No se aplicó un filtro de búsqueda de pruebas de diagnóstico. Se realizaron búsquedas prospectivas de las citas de "artículos relacionados" en los índices de citación ISI, Google Scholar y PubMed. También se hicieron búsquedas en las listas de referencias de los artículos recuperados

Criterios de selección

Estudios que evaluaran pruebas que combinan marcadores séricos maternos del primer y segundo trimestre del síndrome de Down en pacientes con hasta 24 semanas de gestación, con o sin marcadores ecográficos del primer trimestre, en comparación con un estándar de referencia, la comprobación cromosómica o la inspección macroscópica posnatal.

Obtención y análisis de los datos

Los datos se extrajeron como resultados negativos o positivos de las pruebas para embarazos con y sin síndrome de Down, lo que permitió el cálculo de las tasas de detección (sensibilidad) y las tasas de falsos positivos (1‐especificidad). La evaluación de la calidad se realizó según los criterios QUADAS. Se utilizaron los métodos jerárquicos metanalíticos resumen de ROC para analizar el rendimiento de las pruebas y comparar su exactitud. Se realizó el análisis de los estudios para permitir la comparación directa entre las pruebas. El impacto de la edad materna en el rendimiento de las pruebas se investigó en análisis de subgrupos.

Resultados principales

Se incluyeron 22 estudios (informados en 25 publicaciones) con 228 615 embarazos (incluidos 1067 con síndrome de Down). Generalmente los estudios fueron de calidad alta, aunque fue frecuente la verificación diferencial con las pruebas invasivas solamente en los embarazos de alto riesgo. Diez estudios realizaron comparaciones directas entre las pruebas. Se evaluaron 32 combinaciones de pruebas diferentes formadas por combinaciones de ocho pruebas diferentes y la edad de la madre; la translucencia nucal (TN) del primer trimestre y los marcadores séricos AFP, uE3, hCG total, βhCG libre, inhibina A, PAPP‐A y ADAM 12. Se examinaron las pruebas que combinaran marcadores del primer y segundo trimestre con o sin ecografía como pruebas completas, y también se examinaron estrategias graduales y contingentes.

El metanálisis de las seis combinaciones evaluadas con mayor frecuencia mostró que una estrategia que incluye la edad materna y una combinación de PAPP‐A y TN del primer trimestre, y hCG, uE3, AFP e inhibina A del segundo trimestre superó significativamente otras combinaciones que solo incluyeron un marcador sérico o la TN del primer trimestre, y se detectaron cerca de nueve de cada diez embarazos con síndrome de Down, a una tasa de falsos positivos del 5%. Sin embargo, la evidencia estuvo limitada en cuanto al número de estudios que evaluaron esta estrategia, y por lo tanto, no es posible recomendar una única estrategia de cribado.

Conclusiones de los autores

Las pruebas que incluyen la ecografía del primer trimestre con los marcadores séricos del primer y segundo trimestre en combinación con la edad materna son significativamente mejores que las que no incluyen la ecografía, o las que evalúan la ecografía del primer trimestre en combinación con los marcadores séricos del segundo trimestre, sin marcadores séricos del primer trimestre. No es posible realizar recomendaciones acerca de una estrategia específica debido al número pequeño de estudios disponibles.

Resumen en términos sencillos

Pruebas de cribado para el síndrome de Down en las primeras 24 semanas de embarazo

Antecedentes
El síndrome de Down (también conocido como Trisomía 21) es un trastorno genético incurable que causa problemas de salud físicos y mentales significativos, así como discapacidades. Sin embargo, hay una variación amplia en cómo el síndrome de Down afecta a los pacientes. Algunos individuos tienen una afectación grave, aunque otros tienen problemas leves y pueden llevar una vida relativamente normal. No hay una manera de predecir en qué magnitud se verá afectado un niño.

A los padres que esperan un niño se les ofrece la opción de la prueba del síndrome de Down durante el embarazo para ayudarlos a tomar decisiones. Si una madre está embarazada de un niño con síndrome de Down, entonces debe tomar la decisión de interrumpir o continuar el embarazo. La información les ofrece a los padres la oportunidad de planificar la vida con un hijo con síndrome de Down.

Las pruebas más exactas para detectar el síndrome de Down incluyen la obtención de líquido de alrededor del feto (amniocentesis) o del tejido de la placenta (toma de muestras de las vellosidades coriónicas [TMVC]) para detectar los cromosomas anormales asociados con el síndrome de Down. Ambas pruebas incluyen insertar agujas a través del abdomen de la madre y se conoce que aumentan el riesgo de aborto espontáneo. Por lo tanto, estas pruebas pueden no ser adecuadas para todas las embarazadas. En su lugar, se utilizan pruebas que miden los marcadores en la sangre o la orina de la madre, o ecografías del feto, para el cribado. Estas pruebas de cribado no son perfectas, pueden omitir casos de síndrome de Down y también proporcionar resultados de prueba de alto riesgo a varias mujeres cuyos niños no están afectados por el síndrome. Por lo tanto, los embarazos identificados como de alto riesgo mediante estas pruebas de cribado requieren de pruebas adicionales con amniocentesis o TMVC para confirmar un diagnóstico de síndrome de Down.

Qué se hizo
Se evaluaron las combinaciones de pruebas de cribado séricas del primer trimestre (hasta las 14 semanas de gestación) y del segundo trimestre (hasta las 24 semanas de gestación), con o sin ecografía del primer trimestre. El objetivo fue identificar la o las pruebas que predigan con mayor exactitud el riesgo de tener un embarazo afectado por el síndrome de Down. Se examinó un marcador ecográfico (translucencia nucal) y siete marcadores séricos diferentes (PAPP‐A, hCG total, βhCG libre, uE3, AFP, inhibina A, ADAM 12) que se pueden utilizar solos, como cocientes o en combinación, medidos antes de las 24 semanas de gestación, por lo que se cuenta con 32 pruebas de cribado del síndrome de Down. Se encontraron 22 estudios con 228 615 embarazos (incluidos 1067 fetos afectados por el síndrome de Down).

Datos encontrados
Para el cribado del síndrome de Down, en el que se realizan pruebas en el primer y segundo trimestres y luego se combinan para estimar el riesgo general, se encontró que la prueba que incluyó PAPP‐A y translucencia nucal del primer trimestre y la inhibina A, la AFP, la uE3 y la hCG total del segundo trimestre, fue la prueba más sensible, que detecta nueve de cada diez embarazos afectados por el síndrome de Down. De las embarazadas que reciben el resultado de un examen de alto riesgo basado en esta combinación, el 5% no se vería afectada por el síndrome de Down. Hubo relativamente pocos estudios que evaluaran estas pruebas y, por lo tanto, no es posible hacer una recomendación firme acerca de la mejor prueba.

Otra información importante a considerar
La ecografía no causa efectos adversos a la paciente; los análisis de sangre pueden causar malestar, equimosis y, rara vez, infección. Sin embargo, algunas mujeres que tienen un resultado de la prueba de cribado de "alto riesgo" y que son sometidas a amniocentesis o a TMVC tienen riesgo de aborto de un feto no afectado por el síndrome de Down. Los padres deberán sopesar este riesgo cuando decidan realizar una amniocentesis o una TMVC después de un resultado de una prueba de cribado de "alto riesgo".

Authors' conclusions

Implications for practice

The evidence supports the use of the six marker maternal age‐adjusted combination of first trimester nuchal translucency (NT) and pregnancy‐associated plasma protein‐A (PAPP‐A) with second trimester total human chorionic gonadotrophin (hCG), unconjugated oestriol (uE3), alpha‐fetoprotein (AFP) and inhibin A, which outperformed other test strategies. However the evidence was limited, based on small numbers of studies and the finding was not demonstrated in direct comparison of markers in the same populations of women. We cannot recommend a single test combination for Down’s syndrome screening. The choice of multiple markers will depend on the availability of certain assays in local laboratories. There is little evidence to recommend the use of first and second trimester serum markers without the addition of first trimester ultrasound. We would not recommend that these tests be introduced into wider clinical practice without careful consideration of cost.

Implications for research

Further evaluation of test combinations involving contingent and stepwise strategies are required to determine whether they offer superior test performance.

Future studies should ensure that adequate sample sizes are recruited, and take opportunities to make comparisons of test performance testing several alternative test combinations on the same population. Such direct comparison removes issues of confounding when making test comparisons, and allows a clear focus on testing the incremental benefit of increasingly complex and expensive testing strategies. The reporting of studies of test accuracy can be improved and more closely adhere to the standards for the reporting of diagnostic accuracy studies (STARD) guideline. Three key aspects of this are: 1) formally testing the statistical significance of differences in test performance in direct comparisons and estimating incremental changes in detection rates (together with confidence intervals); 2) clearly reporting the number of mothers studied and their results; and 3) reporting the numbers of women who are lost to follow‐up. Many authors reported results of extrapolating findings to age‐standardised national cohorts to demonstrate the performance of the test, and failed to report the actual numbers studied and evaluated.

For the purposes of meta‐analysis and to allow for comparisons to be made between different tests and combinations, we would recommend the publication of consensus standard algorithms for estimating risk, and reporting of test performance at a standard set of thresholds. This would be difficult to achieve and implement, but an attempt at consensus should be made.

Summary of findings

Open in table viewer
Summary of findings 1. Performance of the six most evaluated first and second trimester serum test strategies with or without ultrasound

Test strategy (with maternal age)

Studies

Women (cases)

Sensitivity (95% CI) at a 5% FPR

Test*

First trimester PAPP‐A and second trimester total hCG, uE3 and AFP

4

2474 (236)

85 (78, 89)

P = 0.014

First trimester PAPP‐A and second trimester total hCG, uE3, AFP and inhibin A

3

35,361 (217)

87 (81, 91)

First trimester NT and second trimester total hCG and AFP

4

22,793 (135)

85 (77, 91)

First trimester NT and second trimester total hCG, uE3 and AFP

4

13,708 (136)

86 (78, 92)

First trimester NT and PAPP‐A, and second trimester total hCG, uE3, AFP and inhibin A

3

39,670 (184)

95 (90, 97)

First trimester NT and PAPP‐A, and second trimester free ßhCG, uE3, AFP and inhibin A

4

40,348 (266)

92 (88, 95)

*Likelihood ratio test for the difference in accuracy between the six test strategies compared in a single meta‐analytic model

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate;hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol

CI = confidence interval

Open in table viewer
Summary of findings 2. Performance of other first and second trimester serum strategies with or without ultrasound

Test

Studies

Women (cases)

Sensitivity* (95% CI)

Specificity* (95% CI)

Threshold

Without maternal age and ultrasound

Single tests

ADAM 12 second trimester to first trimester ratio

1

579 (17)

53 (28, 77)

95 (93, 97)

5% FPR

With maternal age and without ultrasound

Triple tests

First trimester PAPP‐A and second trimester total hCG and AFP

1

1188 (98)

83 (74, 90)

95 (93, 96)

5% FPR

First trimester PAPP‐A and second trimester free ßhCG and AFP

2

2197 (94)

83 to 85

94 to 95

5% FPR, 1:300 risk

Quadruple tests

First trimester PAPP‐A and second trimester free ßhCG, uE3 and AFP

1

1188 (98)

86 (77, 92)

95 (93, 96)

5% FPR

Quintuple tests

First trimester PAPP‐A and second trimester free ßhCG, uE3, AFP and inhibin A

1

1188 (98)

90 (82, 95)

95 (93, 96)

5% FPR

First trimester PAPP‐A and second trimester total hCG, uE3, AFP and PAPP‐A

2

707 (121)

78 (66, 86)

98 (96, 99)

1:200 risk

First trimester PAPP‐A and total hCG, and second trimester total hCG, uE3 and AFP

2

707 (121)

80 (68, 88)

97 (94, 98)

1:200 risk

First trimester PAPP‐A and uE3, and second trimester total hCG, uE3 and AFP

2

707 (121)

80 (68, 88)

96 (93, 98)

1:200 risk

Sextuple tests

First trimester AFP, free ßhCG and uE3, and second trimester total hCG, uE3 and AFP

1

12,339 (34)

82 (65, 93)

94 (93, 94)

1:250 risk

First trimester PAPP‐A and second trimester total hCG, uE3, AFP, inhibin A and PAPP‐A

1

540 (32)

84 (67, 95)

96 (94, 98)

1:250 risk

Septuple tests

First trimester PAPP‐A, total hCG and uE3, and second trimester total hCG, uE3, AFP and PAPP‐A

2

707 (121)

49 (36, 61)

98 (96, 99)

1:200 risk

With maternal age and ultrasound

Triple tests

First trimester NT and second trimester free ßhCG and AFP

2

6616 (105)

83 (70, 91)

95

5% FPR

Quadruple tests

First trimester NT and second trimester free ßhCG, uE3 and AFP

1

1110 (85)

88 (79, 94)

95 (94, 96)

5% FPR

First trimester NT and PAPP‐A, and second trimester total hCG and AFP

1

1110 (85)

91 (82, 96)

95 (94, 96)

5% FPR

First trimester NT and PAPP‐A, and second trimester free ßhCG and AFP

2

3400 (93)

88 to 91

95 to 98

5% FPR, 1:300 risk

Quintuple tests

First trimester NT and second trimester total hCG, uE3, AFP and inhibin A

1

1110 (85)

91 (82, 96)

95 (94, 96)

5% FPR

First trimester NT and second trimester free ßhCG, uE3, AFP and inhibin A

1

1110 (85)

91 (82, 96)

95 (94, 96)

5% FPR

First trimester NT and PAPP‐A, and second trimester free ßhCG, uE3 and AFP

1

1100 (85)

92 (84, 97)

95 (94, 96)

5% FPR

First trimester NT and PAPP‐A, and second trimester total hCG, uE3 and AFP

2

33,337 (171)

88 to 92

95 to 97

5% FPR, 1:200 risk

Sextuple tests

First trimester NT, PAPP‐A and free ßhCG, and second trimester total hCG, uE3 and AFP

1

5060 (13)

100 (75, 100)

97 (96, 97)

1:250 risk

Septuple tests

First trimester NT, PAPP‐A and free ßhCG, and second trimester uE3, AFP, total hCG and inhibin A

1

33,546 (87)

94 (87, 98)

89 (89, 89)

1:150 risk

Contingent tests

First trimester NT, PAPP‐A and free ßhCG, if risk 1:30‐1:1500, second trimester total hCG, uE3, AFP and inhibin A

1

32,355 (86)

91 (82, 96)

95 (95, 96)

1:270 risk

First trimester NT, PAPP‐A and free ßhCG, if risk 1:30‐1:1500, second trimester free ßhCG, uE3, AFP and inhibin A

1

7842 (59)

95 (86, 99)

95 (94, 95)

5% FPR

Stepwise tests

First trimester NT and PAPP‐A, if risk < 1:100, second trimester free ßhCG, uE3 and AFP

1

1507 (12)

92 (62, 100)

97 [(96, 98)

1:250 risk

First trimester NT, PAPP‐A and free ßhCG, if risk < 1:30, second trimester total hCG, uE3, AFP and inhibin A

1

32,355 (86)

92 (84, 97)

95 (95, 95)

1:270 risk

First trimester NT, PAPP‐A and free ßhCG, if risk < 1:30, second trimester free ßhCG, uE3, AFP and 2T inhibin A

1

7842 (59)

97 (88, 100)

95 (94, 95)

5% FPR

*Tests evaluated by at least one study are presented in the table. Where there were two studies at the same threshold, estimates of summary sensitivity and summary specificity were obtained by using univariate fixed‐effect logistic regression models to pool sensitivities and specificities separately. if the threshold used was a 5% FPR, then only the sensitivities were pooled. The range of sensitivities and specificities are presented where there were two studies and the thresholds used were different.

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate; hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol

CI = confidence interval

Background

This is one of a series of reviews on antenatal screening for Down's syndrome following a generic protocol (Alldred 2010) ‐ see Published notes for more details.

Target condition being diagnosed

Down’s syndrome

Down’s syndrome affects approximately one in 800 live born babies (Cuckle 1987a).  It results from a person having three, rather than two, copies of chromosome 21 – or the specific area of chromosome 21 implicated in causing Down's syndrome – as a result of trisomy (an additional copy of the whole chromosome) or translocation (duplication of part of the chromosome caused by rearrangements of parts of different chromosomes, resulting in three copies of information responsible for Down's syndrome). If not all cells are affected, the pattern is described as 'mosaic'. Down’s syndrome can cause a wide range of physical and mental problems. It is the commonest cause of mental disability, and is also associated with a number of congenital malformations, notably affecting the heart. There is also an increased risk of cancers such as leukaemia, and numerous metabolic problems including diabetes and thyroid disease. Some of these problems may be life‐threatening, or lead to considerable ill health, while some individuals with Down’s syndrome have only mild problems and can lead a relatively normal life. 

There is no cure for Down’s syndrome, and antenatal diagnosis allows for preparation for the birth and subsequent care of a baby with Down’s syndrome, or for the offer of a termination of pregnancy. Having a baby with Down’s syndrome is likely to have a significant impact on family and social life, relationships and parents’ work. Special provisions may need to be made for education and care of the child, as well as accommodating the possibility of periods of hospitalisation.

Definitive invasive tests (amniocentesis and chorionic villus sampling (CVS)) exist that allow the diagnosis of Down's syndrome before birth but carry a risk of miscarriage. No test can predict the severity of problems a person with Down’s syndrome will have. Non‐invasive screening tests based on biochemical analysis of maternal serum or urine, or fetal ultrasound measurements, allow an estimate of the risk of a pregnancy being affected and provide parents with information to enable them to make choices about definitive testing.  Such screening tests are used during the first and second trimester of pregnancy.

Screening tests for Down's syndrome

Initially, screening was determined solely by using maternal age to classify a pregnancy as high or low risk for trisomy 21, as it was known that older women had a higher chance of carrying a baby with Down’s syndrome (Penrose 1933).

Further advances in screening were made in the early 1980s, when Merkatz and colleagues investigated the possibility that low maternal serum alpha‐fetoprotein (AFP), obtained from maternal blood in the second trimester of pregnancy could be associated with chromosomal abnormalities in the fetus. Their retrospective case‐control study showed a statistically significant relationship between fetal trisomy, such as Down’s syndrome, and lowered maternal serum AFP (Merkatz 1984). This was further explored by Cuckle and colleagues in a larger retrospective trial using data collected as part of a neural tube defect (NTD) screening project (Cuckle 1984a). This work was followed by calculation of risk estimates using maternal serum AFP values and maternal age, which ultimately led to the introduction of the two screening parameters in combination (Alfirevic 2004).

In 1987, in a small case‐control study of women carrying fetuses with known chromosomal abnormalities, Bogart and colleagues investigated maternal serum levels of human chorionic gonadotrophin (hCG) as a possible screening tool for chromosomal abnormalities in the second trimester (Bogart 1987). This followed the observations that low hCG levels were associated with miscarriages, which are commonly associated with fetal chromosomal abnormalities. They concluded that high hCG levels were associated with Down’s syndrome and because hCG levels plateau at 18 to 24 weeks, that this would be the most appropriate time for screening. Later work suggested that the ß sub‐unit of hCG (free βhCG) was a more effective marker than total hCG (Macri 1990; Macri 1993).

Second trimester unconjugated oestriol (uE3), produced by the fetal adrenals and the placenta, was also evaluated as a potential screening marker.  In another retrospective case‐control study, uE3 was shown to be lower in Down’s syndrome pregnancies compared with unaffected pregnancies.  When used in combination with AFP and maternal age, it appeared to identify more pregnancies affected by Down’s syndrome than AFP and age alone (Canick 1988). Further work suggested that all three serum markers (AFP, hCG and uE3) showed even higher detection rates when combined with maternal age (Wald 1988a; Wald 1988b) and appeared to be a cost‐effective screening strategy (Wald 1992a).

Three other serum markers, produced by the placenta, have been linked with Down’s syndrome, namely pregnancy‐associated plasma protein A or PAPP‐A, inhibin A and a disintegrin and metalloprotease 12 (ADAM12). PAPP‐A has been shown to be reduced in the first trimester of Down’s syndrome pregnancies, with its most marked reduction in the early first trimester (Bersinger 1995). Inhibin A is high in the second trimester in pregnancies affected by Down’s syndrome (Cuckle 1995; Wallace 1995). There are some issues concerning the biological stability ‐ for example, delay in samples arriving in the laboratory ‐ and hence reliability of this marker, and the effect this will have on individual risk. ADAM 12 has been shown to be a potential first trimester marker with reduced maternal serum levels in pregnancy prior to 10 weeks (Laigaard 2003; Spencer 2008a).

In 1992, Nicolaides and colleagues demonstrated an association between increased nuchal translucency (NT) and chromosomal abnormalities (Nicolaides 1992). Nuchal translucency measurement requires an ultrasound scan of the fluid at the fetal neck between 10 and 13+6 weeks' gestation. If the amount is large, it suggests an increased risk of Down’s syndrome. This study was small (827 women), but led to further research into the use of NT scanning and its value when combined with serum tests. Other first trimester ultrasound markers, such as absent nasal bone, abnormal ductus venosus flow velocity and tricuspid regurgitation, have also been investigated.

In addition to serum and ultrasound markers for Down’s syndrome, work has been carried out looking at urinary markers. These markers include invasive trophoblast antigen, ß‐core fragment, free ßhCG and total hCG (Cole 1999). There is controversy about their value (Wald 2003a).

Screening and parental choice

Antenatal screening is used for several reasons (Alfirevic 2004), but the most important is to enable parental choice regarding pregnancy management and outcome. Before a woman and her partner opt to have a screening test, they need to be fully informed about the risks, benefits and possible consequences of such a test. This includes the choices they may have to face should the result show that the woman has a high risk of carrying a baby with Down’s syndrome and the implications of both false positive and false negative screening tests. They need to be informed of the risk of a miscarriage due to invasive diagnostic testing, and the possibility that a miscarried fetus may be chromosomally normal. If, following invasive diagnostic testing, the fetus is shown to have Down’s syndrome, further decisions need to be made about continuation or termination of the pregnancy, the possibility of adoption and finally, preparation for parenthood. Equally, if a woman has a test that shows she is at a low risk of carrying a fetus with Down’s syndrome, it does not necessarily mean that the baby will be born with a normal chromosomal make up. This possibility can only be excluded by an invasive diagnostic test (Alfirevic 2003). The decisions that may be faced by expectant parents inevitably engender a high level of anxiety at all stages of the screening process, and the outcomes of screening can be associated with considerable physical and psychological morbidity. No screening test can predict the severity of problems a person with Down's syndrome will have.

Index test(s)

This review examined serum screening tests used in the first and second trimester of pregnancy (up to 24 weeks' gestation) with and without first trimester ultrasound tests (up to 14 weeks' gestation). The review examined the following individual markers; NT measurement in the first trimester, ADAM 12, AFP, uE3, total hCG, free βhCG, Inhibin A and PAPP‐A. These markers can be used individually, in combination with age, and can also be used in combination with each other. The risks are calculated by comparing a woman's test result for each marker with values for an unaffected population, and multiplying this with her age‐related risk. Where several markers are combined, risks are computed using risk equations (often implemented in commercial software) that take into account the correlational relationships between the different markers and marker distributions in affected and unaffected populations.

Stepwise testing allows for triage of women into risk categories at two stages. Women found to be very high risk at the end of first trimester screening are offered invasive testing, whereas those women deemed to be lower risk are then screened again in the second trimester and a further overall risk is calculated once both stages of the test are completed.

Contingent screening is similar, however at the completion of first trimester screening women are classified into three groups – high risk, medium risk and low risk. High risk women are offered invasive testing at this stage, low risk women undergo no further screening and medium risk women are offered second trimester serum tests and calculation of a further overall risk once both stages of the test are completed.

Alternative test(s)

Down’s syndrome can be detected during pregnancy with invasive diagnostic tests such as amniocentesis or CVS, with or without prior screening. The ability to determine fetal chromosomal make up (also known as a karyotype) from amniotic fluid samples was demonstrated in 1966 by Steele and Breg (Steele 1966), and the first antenatal diagnosis of Down’s syndrome was made in 1968 (Valenti 1968).  Amniocentesis is an invasive procedure which involves taking a small sample of the amniotic fluid (liquor) surrounding the baby, using a needle which goes through the abdominal wall into the uterus, and is usually performed after 15 weeks' gestation.  CVS involves taking a sample of the placental tissue using a needle which goes through the abdominal wall and uterus or a cannula through the cervix.  It is usually performed between 10 and 13 weeks' gestation.  Amniocentesis and CVS are both methods of obtaining fetal chromosomes material, which are then used to diagnose Down’s syndrome.  Both tests use ultrasound scans to guide placement of the needle.  Amniocentesis carries a risk of miscarriage in the order of 1%; transabdominal CVS may carry a similar risk (Alfirevic 2003). Recent developments in the use of cell‐free fetal DNA detection in maternal serum are paving the way for non‐invasive diagnosis of Down's syndrome and other trisomies, however these tests were not used as reference standards in any of the studies examined.

There are many different screening tests which are available and offered which are the subject of additional Cochrane reviews and there are other reviews looking at this area. Tests being assessed in the other Cochrane reviews include first trimester serum tests (Alldred 2015); urine tests (Alldred 2015a); second trimester serum markers (Alldred 2012); and first trimester ultrasound tests alone, or in combination with first trimester serum tests (in press). Second trimester ultrasound markers have been assessed in a previous systematic review (Smith‐Bindman 2001).

Rationale

This is one of a suite of Cochrane reviews, the aim of which is to identify all screening tests for Down's syndrome used in clinical practice, or evaluated in the research setting, in order to try to identify the most accurate test(s) available, and to provide clinicians, policy makers and women with robust and balanced evidence on which to base decisions about interpreting test results and implementing screening policies to triage the use of invasive diagnostic testing. The full set of reviews is described in the generic protocol (Alldred 2010).

A systematic review of second trimester ultrasound markers for detection of Down’s syndrome concluded that nuchal fold thickening may be useful in detecting Down’s syndrome, but that it was not sensitive enough to be used as a screening test (Smith‐Bindman 2001). The review concluded that other second trimester ultrasound markers did not usefully distinguish between Down’s syndrome and pregnancies without Down’s syndrome. There has been no systematic review and meta‐analysis of serum, urine and first trimester ultrasound markers to enable rigorous and robust conclusions to be made about the diagnostic accuracy of available Down’s syndrome screening tests.

The topic has been split into several different reviews to allow for greater ease of reading and greater accessibility of data, and also to allow the reader to focus on separate groups of tests, for example, first trimester serum tests alone, first trimester ultrasound alone, first trimester serum and ultrasound, second trimester serum alone, first and second trimester serum, combinations of serum and ultrasound markers and urine markers alone. An overview review will compare the best tests, focusing on commonly used strategies, from each of these groups to give comparative results between the best tests in the different categories. This review is written with the global perspective in mind, rather than to conform with any specific local or national policy, as not all tests will be available in all areas where screening for Down's syndrome is carried out.

Objectives

The aim of this review was to estimate and compare the accuracy of first and second trimester serum markers with and without first trimester ultrasound markers for the detection of Down’s syndrome in the antenatal period, as combinations of markers. Individual markers are described in the other reviews belonging to this suite. Accuracy is described by the proportion of fetuses with Down’s syndrome detected by screening before birth (sensitivity or detection rate) and the proportion of women with a low risk (normal) screening test result who subsequently had a baby unaffected by Down's syndrome (specificity).

Investigation of sources of heterogeneity

We planned to investigate whether a uniform screening test is suitable for all women, or whether different screening methods are more applicable to different groups, defined by advanced maternal age, ethnic groups and aspects of the pregnancy and medical history such as multiple pregnancy, diabetes and family history of Down's syndrome. We also considered whether there existed evidence of overestimation of test accuracy in studies evaluating risk equations in the derivation sample rather than in a separate validation sample.

Methods

Criteria for considering studies for this review

Types of studies

We included studies in which all women from a given population had one or more index test(s) compared to a reference standard. Both consecutive series and diagnostic case‐control study designs were included. Randomised trials where individuals were randomised to different screening strategies and all verified using a reference standard were also eligible for inclusion. Studies in which test strategies were compared head‐to‐head either in the same women, or between randomised groups were identified for inclusion in separate comparisons of test strategies. Studies were excluded if they included less than five Down's syndrome cases, or more than 20% of participants were not followed up.

Participants

Pregnant women up to 24 weeks' gestation confirmed by ultrasound, who had not undergone previous testing for Down’s syndrome in their pregnancy were eligible. Studies were included if the pregnant women were unselected, or if they represented groups with increased risk of Down’s syndrome, or difficulty with conventional screening tests including maternal age greater than 35 years old, multiple pregnancy, diabetes mellitus and family history of Down’s syndrome.

Index tests

The following index tests were examined; nuchal translucency (NT) scanning, ADAM12, AFP, uE3,total hCG, free βhCG, Inhibin A, PAPP‐A, and combinations of these markers with maternal age. Combinations without maternal age were excluded.

We looked at comparisons of tests in isolation and in various combinations. All strategies included first and second trimester serum tests, and some included additional first trimester ultrasound markers. The maximum number of markers in any one test was seven, in combination with maternal age.

Where tests were used in comparison we looked at the performance of test comparisons according to predicted probabilities computed using risk equations and dichotomised into high risk and low risk (and medium risk, where applicable).

Target conditions

Down's syndrome in the fetus due to trisomy, translocation or mosaicism.

Reference standards

We considered several reference standards, involving chromosomal verification and postnatal macroscopic inspection.

Amniocentesis and chorionic villus sampling (CVS) are invasive chromosomal verification tests undertaken during pregnancy. They are highly accurate, but the process carries a 1% miscarriage rate, and therefore they are only used in pregnancies considered to be at high risk of Down's, or at the mother's request. All other types of testing (postnatal examination, postnatal karyotyping, birth registers and Down’s syndrome registers) are based on information available at the end of pregnancy. The greatest concern is not their accuracy, but the loss of the pregnancy to miscarriage between the serum test and the reference standard. Miscarriage with cytogenetic testing of the fetus is included in the reference standard where available. We anticipated that older studies, and studies undertaken in older women are more likely to have used invasive chromosomal verification tests in all women.

Studies undertaken in younger women and more recent studies were likely to use differential verification as they often only used prenatal karyotypic testing on fetuses considered screen positive/high risk according to the screening test; the reference standard for most unaffected infants being observing a phenotypically normal baby. Although the accuracy of this combined reference standard is considered high, it is methodologically a weaker approach as pregnancies that miscarry between the index test and birth are likely to be lost from the analysis, and miscarriage is more likely to occur in Down's than normal pregnancies. We investigated the impact of the likely missing false negative results in sensitivity analyses.

Search methods for identification of studies

We used one generic search strategy to identify studies for all reviews in this series.

Electronic searches

We applied a sensitive search strategy to search the following databases using the text words and MeSH terms detailed in Appendix 1, adapting the search strategy for each different database.

The following databases were searched.

  1. MEDLINE via OVID (1980 to 25 August 2011)

  2. Embase via Dialog Datastar (1980 to 25 August 2011)

  3. BIOSIS via EDINA (1985 to 25 August 2011)

  4. CINAHL via OVID (1982 to 25 August 2011)

  5. The Database of Abstracts of Reviews of Effectiveness (25 August 2011)

  6. MEDION (25 August 2011)

  7. The Database of Systematic Reviews and Meta‐Analyses in Laboratory Medicine (www.ifcc.org/) (25 August 2011)

  8. The National Research Register (Archived 2007)

  9. Health Services Research Projects in Progress database (HSRPROJ) (25 August 2011)

The search strategy combined three sets of search terms (see Appendix 1). The first set was made up of named tests, general terms used for screening/diagnostic tests and statistical terms. Note that the statistical terms were used to increase sensitivity and were not used as a methodological filter to increase specificity. The second set was made up of terms that encompass Down's syndrome and the third set made up of terms to limit the testing to pregnant women. All terms within each set were combined with the Boolean operator OR and then the three sets were combined using AND. The terms used were a combination of subject headings and free‐text terms. The search strategy was adapted to suit each database searched.

We attempted to identify cumulative papers that reported data from the same data set, and contacted authors to obtain clarification of the overlap between data presented in these papers, in order to prevent data from the same women being analysed more than once. 

Searching other resources

In addition, we examined references cited in studies identified as being potentially relevant, and those cited by previous reviews.  We contacted authors of studies where further information was required. 

We carried out forward citation searching of relevant items, using the search strategy in ISI citation indices, Google scholar and Pubmed ‘related articles’.

We did not apply language restrictions to the search.

Data collection and analysis

Selection of studies

Two review authors screened the titles and abstracts (where available) of all studies identified by the search strategy.  Full‐text versions of studies identified as being potentially relevant were obtained and independently assessed by two review authors for inclusion, using a study eligibility screening pro forma according to the pre‐specified inclusion criteria.  Any disagreement between the two review authors was settled by consensus, or where necessary, by a third party.

Data extraction and management

A data extraction form was developed and piloted using a subset of 20 identified studies (from all identified studies in this suite of reviews). Two review authors independently extracted data, and where disagreement or uncertainty existed, a third review author validated the information extracted.

Data on each marker were extracted as binary test positive/test negative results for Down's and non‐Down's pregnancies, with a high risk result ‐ as defined by each individual study ‐ being regarded as test positive (suggestive or diagnostic of Down's syndrome), and a low risk result being regarded as test negative (suggestive of absence of Down's syndrome). Where results were reported at several thresholds, we extracted data at each threshold.

We noted those in special groups that posed either increased risk of Down’s syndrome or difficulty with conventional screening tests including maternal age greater than 35 years old, multiple pregnancy, diabetes mellitus and family history of Down’s syndrome. 

Assessment of methodological quality

We used a modified version of the QUADAS tool (Whiting 2003), a quality assessment tool for use in systematic reviews of diagnostic accuracy studies, to assess the methodological quality of included studies. We anticipated that a key methodological issue would be the potential for bias arising from the differential use of invasive testing and follow‐up for the reference standard according to index test results, bias arising due to higher loss to miscarriage in false negatives than true negatives. We chose to code this issue as originating from differential verification in the QUADAS tool: we are aware that it could also be coded under delay in obtaining the reference standard, and reporting of withdrawals. We omitted the QUADAS item assessing quality according to length of time between index and reference tests, as Down's syndrome is either present or absent rather than a condition that evolves and resolves, and disregarding the differential reference standard issue thus any length of delay is acceptable. Two review authors assessed each included study separately. Any disagreement between the two authors was settled by consensus, or where necessary, by a third party. Each item in the QUADAS tool was marked as ‘yes’, ‘no’ or ‘unclear’, and scores were summarised graphically. We did not use a summary quality score. 

QUADAS criteria included the following 10 questions.

  1. Was the spectrum of women representative of the women who will receive the test in practice? (Criteria met if the sample was selected from a wide range of childbearing ages, or selected from a specified ‘high risk’ group such as over 35s, family history of Down’s syndrome, multiple pregnancy or diabetes mellitus, provided all affected and unaffected fetuses included that could be tested at the time point when the screening test would be applied; criteria not met if the sample taken from a select or unrepresentative group of women (i.e. private practice), was an atypical screening population or recruited at a later time point when selection could be affected by selective fetal loss.)

  2. Is the reference standard likely to correctly classify the target condition? (Amniocentesis, chorionic villus sampling, postnatal karyotyping, miscarriage with cytogenetic testing of the fetus, a phenotypically normal baby or birth registers are all regarded as meeting this criteria.)

  3. Did the whole sample or a random selection of the sample receive verification using a reference standard of diagnosis?

  4. Did women receive the same reference standard regardless of the index test result?

  5. Was the reference standard independent of the index test result (i.e. the index test did not form part of the reference standard)?

  6. Were the index test results interpreted without knowledge of the results of the reference standard?

  7. Were the reference standard results interpreted without knowledge of the results of the index test?

  8. Were the same clinical data (i.e. maternal age and weight, ethnic origin, gestational age) available when test results were interpreted as would be available when the test is used in practice?

  9. Were uninterpretable/intermediate test results reported?

  10. Were withdrawals from the study explained?

Statistical analysis and data synthesis

We initially examined each test or test strategy at each of the common risk thresholds used to define test positivity by plotting estimates of sensitivity and specificity from each study on forest plots and in receiver operating characteristic (ROC) space. Test strategies were selected for further investigation if they were evaluated in four or more studies or, if there were three or fewer studies, but the individual study results indicated performance likely to be superior to a sensitivity of 70% and specificity of 90%.

Estimation of average sensitivity and specificity

The analysis for each test strategy was undertaken first, by restricting to studies that reported a common threshold to estimate average sensitivity and specificity for each test at each threshold. Although data on all thresholds were extracted, we present only key common thresholds close to risks of 1:384, 1:250 and the 5% false positive rate (FPR), unless other thresholds were more commonly reported. Where combinations of tests were used in a risk score, we extracted the result for the test combination using the risk score and not the individual components that made up the test.

Meta‐analyses were undertaken using hierarchical summary ROC (HSROC) models, which included estimation of random‐effects in accuracy and threshold parameters when there were four or more studies. Otherwise, average sensitivity and specificity values were computed by using univariate random‐effects logistic regression models to average logit sensitivity and logit specificity separately because of insufficient number of studies to reliably estimate all the parameters in the HSROC model. It is common in this field for studies to report sensitivity for a fixed specificity (usually a 5% FPR). This removes the requirement to account for the correlation between sensitivity and specificity across studies by using a bivariate meta‐analytical method since all specificities are the same value. Thus, at a fixed specificity value, logit sensitivities were pooled using a univariate random‐effects model. This model was further simplified to a fixed‐effect model when there were only two or three studies and heterogeneity was not observed on the SROC plot. All analyses were undertaken using the NLMIXED procedure in SAS (version 9.2; SAS Institute, Cary, NC) and the xtmelogit command in Stata version 11.2 (Stata‐Corp, College Station, TX, USA).

Comparisons between tests

Comparisons between tests were first made utilising all available studies, selecting one threshold from each study to estimate a summary ROC curve without restricting to a common threshold. The threshold was chosen for each study according to the following order of preference: a) the risk threshold closest to one in 250; b) a multiples of the median (MoM) or presence/absence threshold; c) the performance closest to a 5% FPR or 95th percentile. The 5% FPR was chosen as a cut‐off point as this is the cut‐off most commonly reported in the literature. The analysis that used all available studies was performed by including the six most evaluated test strategies in a single HSROC model. The model included two indicator terms for each test to allow for differences in accuracy and threshold. As there were few studies for each test, a single summary ROC shape parameter was included in the model such that the fitted summary ROC curves did not cross.  An estimate of the sensitivity of each test for a 5% FPR was derived from the summary ROC curve, and associated confidence intervals were obtained using the delta method.

Direct comparisons between tests were based on results of very few studies, and were analysed using a fixed‐effect HSROC model with symmetrical underlying summary ROC curves because the number of studies was insufficient to estimate between‐study heterogeneity in accuracy and threshold or asymmetry in the shape of the summary ROC curves.  A separate model was used to make each pair‐wise comparison. Comparisons between tests were assessed by using likelihood ratio tests to test if the differences in accuracy were statistically significant or not. The differences were expressed as relative diagnostic odds ratios and were reported with 95% confidence intervals.  As studies rarely report data cross‐classified by both tests for Down's and normal pregnancies, the analytical method did not take full account of the pairing of test results, but the restriction to direct head‐to‐head comparisons should have removed the potential confounding of test comparisons with other features of the studies. The strength of evidence for differences in performance of test strategies relied on evidence from both the direct and indirect comparisons.

Investigations of heterogeneity

If there were 10 or more studies available for a test, we planned to investigate heterogeneity by adding covariate terms to the HSROC model to assess the effect of a covariate on accuracy and threshold.

Sensitivity analyses

Mothers with pregnancies identified as high risk for Down's syndrome by ultrasound and serum testing are often offered immediate definitive testing by amniocentesis, whereas those considered low risk are assessed for Down's syndrome by inspection at birth. Such delayed and differential verification will introduce bias, most likely through there being greater loss to miscarriage in the Down's syndrome pregnancies that were not detected by the ultrasound and serum testing (the false negative diagnoses). Testing and detection of miscarriages is impractical in many situations, and no clear data are available on the magnitude of these miscarriage rates.

To account for the possible bias introduced by such a mechanism, we planned to perform sensitivity analyses by increasing the percentage of false negatives in studies where delayed verification in test negatives occurred (Mol 1999). We planned to incrementally increase the percentage from 10% to 50%, the final value representing a scenario where a third of more Down's pregnancies than normal pregnancies were likely to miscarry, thought to be higher than the likely value. We intended to conduct the sensitivity analyses on the analysis investigating the effect of maternal age on test sensitivity.

Assessment of reporting bias

Assessment of reporting bias was not performed.

Results

Results of the search

The search for the whole suite of reviews identified a total of 15,394 papers, once the results from each bibliographic database were combined and duplicates were removed. After screening out obviously irrelevant papers based on their title and abstract, 1145 papers remained and we obtained full‐text copies for formal assessment of eligibility. From these a total of 269 papers were deemed eligible and were included in the suite of reviews. A total of 22 studies (reported in 25 publications) were included in this review of first and second trimester serum screening, with and without ultrasound, involving 228,615 pregnancies including 1067 Down's syndrome pregnancies.

A total of 32 different test strategies combinations were evaluated in the 22 studies. The tests were produced from combinations of eight different tests, with and without maternal age; first trimester nuchal translucency (NT) and the serum markers AFP, uE3, total hCG, free βhCG, Inhibin A, PAPP‐A and ADAM 12. We examined tests combining first and second trimester markers with or without ultrasound as complete tests, and also examined stepwise and contingent strategies. The studies evaluated the following serum‐only tests: one single test without maternal age, and one septuple test, two sextuple tests, five quintuple tests, two quadruple tests and two triple test in combination with maternal age. Serum and ultrasound markers were evaluated in combination with maternal age: one study of seven markers, three studies of six markers, four studies of five markers, four studies of four markers and two studies of three markers. In addition, there were two contingent and three stepwise test strategies. Twelve of the 22 studies only evaluated the performance of a single test or test strategy, five compared two tests, two compared three tests, two compared five tests, and one compared 20 tests (Wald 2003b).

The following test combinations were the most evaluated and were each evaluated in four studies.

Six markers

  • First trimester NT, first trimester PAPP‐A , second trimester free ßhCG, second trimester uE3, second trimester AFP, second trimester Inhibin A, and maternal age (four studies; 40,348 women including 266 Down's syndrome pregnancies)

Four markers

  • First trimester PAPP‐A, second trimester total hCG, second trimester uE3, second trimester AFP and maternal age (four studies; 2474 women, including 236 Down's syndrome pregnancies)

  • First trimester NT, second trimester total hCG, second trimester uE3, second trimester AFP and maternal age (four studies; 13,708 women, including 136 Down's syndrome pregnancies)

Three markers

  • First trimester NT, second trimester total hCG, second trimester AFP and maternal age (four studies; 22,793 women, including 135 Down's syndrome pregnancies)

Of the remaining 28 test combinations, two were evaluated in three studies, eight were evaluated in two studies and the remaining 18 in single studies only.

Methodological quality of included studies

Methodological quality of the studies was judged to be high in half of the categories (Figure 1). Due to the nature of testing for Down's syndrome screening and the potential side effects of invasive testing, differential verification is almost universal in the general screening population, as most women whose screening test result is defined as low risk will have their screening test verified at birth, rather than by invasive diagnosis in the antenatal period. Additionally, it was not possible to ascertain from the included studies whether or not the results of index tests and reference standards were blinded. It would be difficult to blind clinicians performing invasive diagnostic tests (reference standards) to the index test result, unless all women received the same reference standard, which would not be appropriate in most scenarios. Any biases secondary to a lack of clinician blinding are likely to be minimal.


Methodological quality graph: review authors' judgements about each methodological quality item presented as percentages across all included studies.

Methodological quality graph: review authors' judgements about each methodological quality item presented as percentages across all included studies.

Although not explicitly stated, most studies seemed to indicate 100% follow‐up. Inevitably there will be losses to follow‐up due to, for example, women moving out of the area of a study. It was therefore difficult to measure reporting of uninterpretable tests and hence reporting of withdrawals. Studies usually accounted for these and it is unlikely to have introduced significant bias. There was definitely under‐ascertainment of miscarriage, and very few papers accounted for miscarriage or performed tissue karyotyping in pregnancies resulting in miscarriage. Some studies attempted to adjust for predicted miscarriage rate and the incidence of Down's syndrome in this specific population, but most did not. We have not attempted to adjust for expected miscarriage rate in this review. This issue has the potential to have more influence with first trimester testing due to a higher miscarriage rate per se in this trimester.

Some studies that provided estimates of risk using multivariable equations used the same data set to evaluate performance of the risk equation as was used to derive the equation. This is often thought to lead to over‐estimation of test performance.

Findings

The results for the six most evaluated test strategies are presented in summary of findings Table 1. Additional information is provided below.

1) First trimester nuchal translucency, first trimester PAPP‐A, second trimester free ßhCG, second trimester uE3, second trimester AFP, second trimester Inhibin A, and maternal age

Four studies (Aagaard‐Tillery 2009; Bestwick 2010; Wald 2003b; Wald 2009) evaluated this test strategy. The studies included 40,348 women in whom 266 pregnancies were affected by Down's syndrome. Over half the data were provided by Bestwick 2010 (22,746 women, including 106 Down's syndrome pregnancies). Studies presented data for different cut‐points but three (Aagaard‐Tillery 2009; Bestwick 2010; Wald 2003b) of the four studies also presented data for a 5% false positive rate (FPR). At a fixed cut‐point of 5% FPR on the summary ROC curve, the estimated sensitivity based on all four studies was 92% (95% confidence interval (CI) 88 to 95).

2) First trimester PAPP‐A, second trimester total hCG, second trimester uE3, second trimester AFP and maternal age

Four studies (Baviera 2010; Wald 2003b; Wright 2010 FASTER trial; Wright 2010 North York) evaluated this test strategy. The studies included 2474 women in whom 236 pregnancies were affected by Down's syndrome. Most of the data were provided by Wald 2003b (118 women, including 98 Down’s syndrome pregnancies). Studies presented data for cut‐points of 5% FPR (two studies Baviera 2010; Wald 2003b) and 1:250 risk (two studies Wright 2010 FASTER trial; Wright 2010 North York). At a fixed cut‐point of 5% FPR, the estimated sensitivity was 85% (95% CI 78 to 89).

3) First trimester nuchal translucency, second trimester total hCG, second trimester uE3, second trimester AFP and maternal age

Results for this test strategy were derived from four studies (Babbur 2005; Herman 2002; Schuchter 2001; Wald 2003b) and included 13,708 women in whom 136 pregnancies were known to be affected by Down's syndrome. Schuchter 2001 contributed 9342 pregnancies to the data. Studies presented data for cut‐points of 5% FPR (two studies: Schuchter 2001; Wald 2003b) and 1:250 risk (two studies:Babbur 2005; Herman 2002). At a fixed cut‐point of 5% FPR, the estimated sensitivity was 86% (95% CI 78 to 92).

4) First trimester nuchal translucency, second trimester total hCG, second trimester AFP and maternal age

Results were derived from four studies (Audibert 2001; Benattar 1999; Lam 2002; Wald 2003b) and included 22,793 women in whom 135 pregnancies were known to be affected by Down's syndrome. Lam 2002 contributed 16,237 pregnancies to the data. Studies presented data for cut‐points of 5% FPR (two studies: Lam 2002; Wald 2003b;) and 1:250 risk (two studies: Audibert 2001; Benattar 1999). At a fixed cut‐point of 5% FPR, the estimated sensitivity was 85% (CI 77 to 91).

5) Other test combinations

Of the 28 test combinations evaluated in three or fewer studies, 25 test combinations demonstrated estimated sensitivities of at least 70% and estimated specificities of more than 90%. Sixteen of these were evaluated in single studies only (see summary of findings Table 2). Of the remaining nine test combinations evaluated in two or three studies, data were pooled for the following six tests.

  • First trimester PAPP‐A and second trimester total hCG, uE3, AFP and PAPP‐A, and maternal age evaluated in two studies (Wright 2010 FASTER trial; Wright 2010 North York) estimated a sensitivity of 78% (CI 66 to 86) and specificity of 98% (CI 96 to 99) at a cut‐point of 1:200 risk.

  • First trimester PAPP‐A and second trimester total hCG, uE3, AFP and inhibin A, and maternal age evaluated in three studies (Malone 2005; Palomaki 2006; Wald 2003b) estimated a sensitivity of 87% (CI 81 to 91) at a cut‐point of 5% FPR.

  • First trimester PAPP‐A and total hCG, and second trimester total hCG, uE3 and AFP evaluated in two studies (Wright 2010 FASTER trial; Wright 2010 North York) estimated a sensitivity of 80% (CI 68 to 88) and specificity of 97% (CI 94 to 98) at a cut‐point of 1:200 risk.

  • First trimester PAPP‐A and uE3, and second trimester total hCG, uE3 and AFP evaluated in two studies (Wright 2010 FASTER trial; Wright 2010 North York) estimated a sensitivity of 80% (CI 68 to 88) and specificity of 96% (CI 93 to 98) at a cut‐point of 1:200 risk.

  • First trimester NT and second trimester free ßhCG and AFP, and maternal age evaluated in two studies (Rozenberg 2002; Wald 2003b) estimated a sensitivity of 83% (CI 70 to 91) at a cut‐point of 5% FPR.

  • First trimester NT and PAPP‐A, and second trimester total hCG, uE3, AFP and Inhibin A, and maternal age evaluated in three studies (Malone 2005; Wald 2003b; Wald 2009) estimated a sensitivity of 95% (CI 90 to 97) at a cut‐point of 5% FPR.

Comparative analysis of the six selected test strategies

For each test, we obtained the detection rate (sensitivity) for a fixed false positive rate (FPR) (1‐specificity), a metric which is commonly used in Down’s syndrome screening to describe test performance. We chose to estimate detection rates at a 5% FPR in common with much of the literature. Figure 2 shows point estimates of the detection rate (and their 95% CIs) at a 5% FPR based on all available data for the six test strategies; the test strategies are ordered according to decreasing detection rates. The plot shows that all six test strategies have detection rates of 85% and above. The six marker maternal age‐adjusted combination of first trimester NT and PAPP‐A with second trimester total hCG, uE3, AFP and inhibin A showed the highest detection rate with an estimated detection rate of 95% (95% CI 90 to 97) based on data from three studies with 184 affected cases out of a total of 39,670 pregnancies. The next best performing strategy was a test combination with the same markers except that it included free ßhCG instead of total hCG. For this combination, the estimated detection rate was 92% (95% CI 88 to 95) based on data from four studies with 266 affected cases out of a total of 40,348 pregnancies. The remaining four test strategies showed similar detection rates.


Detection rates (% sensitivity) at a 5% false positive rate for the six most evaluated test strategies (estimates from summary ROC curves).A = First trimester NT and PAPP‐A , second trimester total hCG, uE3, AFP and inhibin A;B = First trimester NT and PAPP‐A , second trimester free ßhCG, uE3, AFP and inhibin A; C = First trimester PAPP‐A , second trimester total hCG, uE3, AFP and inhibin A; D = First trimester NT, second trimester total hCG, uE3 and AFP; E = First trimester NT, second trimester total hCG and AFP; and F = First trimester PAPP‐A , second trimester total hCG, uE3 and AFP.All test combinations include maternal age. Each circle represents the summary sensitivity for a test strategy at a 5% false positive rate. The size of each circle is proportional to the number of Down's cases. The estimates are shown with 95% confidence intervals. The test strategies are ordered on the plot according to decreasing detection rate. The number of studies, cases and women included for each test strategy are shown on the horizontal axis.

Detection rates (% sensitivity) at a 5% false positive rate for the six most evaluated test strategies (estimates from summary ROC curves).

A = First trimester NT and PAPP‐A , second trimester total hCG, uE3, AFP and inhibin A;B = First trimester NT and PAPP‐A , second trimester free ßhCG, uE3, AFP and inhibin A; C = First trimester PAPP‐A , second trimester total hCG, uE3, AFP and inhibin A; D = First trimester NT, second trimester total hCG, uE3 and AFP; E = First trimester NT, second trimester total hCG and AFP; and F = First trimester PAPP‐A , second trimester total hCG, uE3 and AFP.

All test combinations include maternal age. Each circle represents the summary sensitivity for a test strategy at a 5% false positive rate. The size of each circle is proportional to the number of Down's cases. The estimates are shown with 95% confidence intervals. The test strategies are ordered on the plot according to decreasing detection rate. The number of studies, cases and women included for each test strategy are shown on the horizontal axis.

The strength of evidence for differences in the diagnostic performance of the six test strategies relied on evidence from both direct and indirect comparisons. Table 1 shows pair‐wise direct comparisons (head‐to‐head) where studies were available. Such comparisons are regarded as providing the strongest evidence as differences between tests are unconfounded by study characteristics. The table shows the number of studies (K), the ratios of diagnostic odds ratios (DORs) with 95% CI and P values for each test comparison. There were no statistically significant differences in accuracy between any pair of tests. However, all comparisons in this table were based on one or two studies and so are unlikely to be powered to detect differences in accuracy.

Open in table viewer
Table 1. Direct comparisons of the diagnostic accuracy of the six most evaluated test strategies

Ratio of DORs

(95% CI); P value

(Studies)

1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP

1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

1T NT, 2T total hCG and 2T AFP

1T NT, 2T total hCG, 2T uE3 and 2T AFP

1T NT, 1T PAPP‐A, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A

1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

1.43 (0.39, 5.25); P = 0.49

(K = 1)

1T NT, 2T total hCG and 2T AFP

0.86 (0.25, 2.96); P = 0.75

(K = 1)

0.60 (0.16, 2.22); P = 0.34

(K = 1)

1T NT, 2T total hCG, 2T uE3 and 2T AFP

1.23 (0.33, 4.57); P = 0.68

(K = 1)

0.86 (0.22, 3.43); P = 0.78

(K = 1)

1.44 (0.38, 5.41);

P = 0.49

(K = 1)

1T NT, 1T PAPP‐A, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A

2.97 (0.53, 16.6); P = 0.15

(K = 1)

2.08 (0.35, 12.3); P = 0.32

(K = 1)

3.48 (0.62,19.6);

P = 0.12

(K = 1)

2.41 (41, 14.3);

P = 0.24

(K = 1)

1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

2.41 (0.53, 11.0); P = 0.18

(K = 1)

1.69 (0.35, 8.16); P = 0.41

(K = 2)

2.82 (0.61, 13.0);

P = 0.13

(K = 1)

1.96 (0.40, 9.53);

P = 0.30

(K = 1)

1.87 (0.57, 6.06);

P = 0.26

(K = 2)

Direct comparisons were made using only data from studies that compared each pair of tests in the same population. Ratio of diagnostic odds ratios (DORs) were computed by division of the DOR for the test in the row by the DOR for the test in the column. If the ratio of DORs is greater than one, then the diagnostic accuracy of the test in the row is higher than that of the test in the column; if the ratio is less than one, the diagnostic accuracy of the test in the column is higher than that of the test in the row. All test combinations include maternal age. All test comparisons that were evaluated by only one study were from Wald 2003b.

1T = first trimester; 2T = second trimester; K = number of studies; CI = confidence interval

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate; hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol.

Table 2 shows the same comparisons made using all available data. Results are generally in agreement with the direct comparisons, and in addition, showed some statistically significance differences (P < 0.05) suggesting that the six marker maternal age‐adjusted combination of first trimester NT and PAPP‐A with second trimester total hCG, uE3, AFP and inhibin A outperformed all the other test strategies except when compared with a similar strategy that included free ßhCG instead total hCG.

Open in table viewer
Table 2. Indirect comparisons of the diagnostic accuracy of the six most evaluated test strategies

Ratio of DORs

(95% CI); P value

1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP

1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

1T NT, 2T total hCG and 2T AFP

1T NT, 2T total hCG, 2T uE3 and 2T AFP

1T NT, 1T PAPP‐A, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A

DOR (95% CI)

Studies

96 (48, 190)

K =4

114 (62, 210)

K = 3

103 (49, 215)

K = 4

109 (51, 233)

K = 4

214 (125, 367)

K = 4

1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

114 (62, 210)

K = 3

1.19 (0.61, 2.32); P = 0.58

1T NT, 2T total hCG and 2T AFP

103 (49, 215)

K = 4

1.08 (0.51, 2.36); P = 0.83

0.91 (0.43, 1.90); P = 0.78

1T NT, 2T total hCG, 2T uE3 and 2T AFP

109 (51, 233)

K = 4

1.14 (0.54, 2.42); P = 0.71

0.96 (0.45, 2.03); P = 0.90

1.06 (0.47, 2.41);

P = 0.88

1T NT, 1T PAPP‐A, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A

214 (125, 367)

K = 4

2.24 (1.00, 5.00); P = 0.049

1.88 (0.88, 3.99); P = 0.094

2.08 (0.89, 4.87);

P = 0.09

1.96 (0.82, 4.67);

P = 0.12

1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

339 (163, 705)

K = 3

3.55 (1.28, 9.89); P = 0.019

2.98 (1.14; 7.80); P = 0.029

3.29 (1.15, 9.47);

P = 0.030

3.11 (1.07, 9.07);

P = 0.039

1.58 (0.64, 3.95); P = 0.30

Indirect comparisons were made using all available data. Ratio of diagnostic odds ratios (DORs) were computed by division of the DOR for the test in the row by the DOR for the test in the column. If the ratio of DORs is greater than one, then the diagnostic accuracy of the test in the row is higher than that of the test in the column; if the ratio is less than one, the diagnostic accuracy of the test in the column is higher than that of the test in the row. All test combinations include maternal age.

1T = first trimester; 2T = second trimester; K = number of studies; CI ‐ confidence interval.

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate; hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol.

Comparison of integrated, contingent and stepwise strategy for a septuple combination of serum tests and first trimester nuchal translucency

Table 3 shows the results of two studies that assessed integrated, contingent or stepwise strategies. Integrated testing involves performing first trimester NT, PAPP‐A and free ßhCG, and second trimester uE3, AFP, total hCG and inhibin A, without disclosure of the first trimester result. The strategy was evaluated in one study (Malone 2005) that reported a 94% sensitivity (95% CI 87 to 98) and 89% specificity (95% CI 89 to 89) for a cut‐point of 1:150.

Open in table viewer
Table 3. Comparison of integrated, contingent and stepwise strategy for a septuple combination of serum tests and first trimester nuchal translucency

Test combination

Screening policy

Study

Women (cases)

Sensitivity (95% CI)

Specificity (95% CI)

Threshold

First trimester NT, PAPP‐A and free ßhCG, and second trimester uE3, AFP, total hCG and inhibin A

Integrated

Malone 2005

33,546 (87)

94 (87, 98)

89 (89, 89)

1:150 risk

First trimester NT, PAPP‐A and free ßhCG, if risk <1:30 invasive testing is offered, if risk 1:30‐1:1500, second trimester total hCG, uE3, AFP and inhibin A is performed

Contingent

Cuckle 2008

32,355 (86)

91 (82, 96)

95 (95, 96)

1:270 risk

First trimester NT, PAPP‐A and free ßhCG, if risk <1:30 invasive testing is offered, if ≥ 1:30 second trimester total hCG, uE3, AFP and inhibin A is performed

Stepwise

Cuckle 2008

32,355 (86)

92 (84, 97)

95 (95, 95)

1:270 risk

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate; hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol.

CI ‐ confidence interval.

In one study (Cuckle 2008), stepwise and contingent tests were compared in the same patient population, with similar detection rates (stepwise 91% (95% CI 84 to 97); contingent 92% (95% CI 82 to 96)) and identical false positive rates of 5% at cut‐points of 1:270.

The perceived advantages of the stepwise and contingent methods are that women deemed to be very high risk are offered invasive testing in the first trimester, allowing for earlier detection of Down's syndrome and subsequent management. Termination of pregnancy in the first trimester of pregnancy is safer than at later gestations. With contingent screening, where women are deemed to be low risk with a numerical risk of < 1:1500, no further testing is offered, and it does not appear to adversely affect the detection rate. In those women who are considered to be intermediate risk, additional second trimester serum tests may detect cases of Down's syndrome that would have been missed. Of note, in the study evaluated, all of the women found to have a risk of > 1:30 on first trimester screening were found to be high risk upon completion of the full contingent screening package. This type of screening may facilitate patient decision making, however further evaluation needs to be carried out.

It is difficult to make a comparison between the integrated method and the stepwise and contingent methods in practical terms, as the non‐disclosure of the first trimester result means that women would not be offered earlier diagnostic testing. More information is required about all three methods of testing in order to make a recommendation, as not all methods will be acceptable to women.

Investigation of heterogeneity and sensitivity analyses

The key characteristics of the 22 included studies is summarised in Table 4 with further details available in the Characteristics of included studies table. None of the tests was evaluated by 10 or more studies and so we were unable to investigate the effect of any potential source of heterogeneity. The planned sensitivity analyses were also not possible.

Open in table viewer
Table 4. Maternal age, reference standard and study design characteristics of included studies

Study

Maternal age (years)*

Reference standard†

Withdrawals explained?

Study design

Aagaard‐Tillery 2009

30.6 (SD 6.1)

Karyotyping or follow‐up to birth

Of 33,546 trial participants only 7842 women with complete information for all screening tests and genetic sonography were included in the study.

Prospective cohort

Audibert 2001

30.1, all < 38, 86% < 35, 14% ≥35

Prenatal karyotype conducted (in 7.6% of patients) depending on presence of risk >1/125, high maternal age, parental anxiety, history of chromosomal defects or parental translocation or abnormal second trimester scan. Cytogenetic testing of newborns with suspected abnormalities. Postmortum on terminations of pregnancy or miscarriages. Follow‐up to neonatal examination in newborns.

35 women were lost to follow‐up (they had all had normal NT results). 340 women who did not want second trimester serum screening withdrew from that part of the study. Women lost to follow‐up were excluded in the final analysis. All detected cases were terminated.

Prospective consecutive series

Babbur 2005

Median 37 (range 19 to 46)

Invasive testing offered to women with NT > 3 mm or risk > 1:250 as defined by combined NT and serum results CVS from 11 weeks, amniocentesis from 15 weeks). Rapid in situ hybridisation test in patients with risk > 1:30. No details given of any follow‐up to birth

463 patients having NT did not go on to have second trimester serum testing. Women with miscarriages excluded.

Prospective cohort

Baviera 2010

35.3 for Down's cases, 30.4 for controls

Amniocentesis or follow‐up to birth

No details of withdrawals given.

Case control

Benattar 1999

32 (16 to 46), 8.3% > 35

Amniocentesis due to maternal age > 38 years (6.1% or women). Karyotyping encouraged for women with positive result on one or more index test. No details of reference standard for index test negative women.

No details of withdrawals given. 12 patients were lost to follow‐up due to miscarriages

Prospective cohort

Bestwick 2010

Median 39 for Down's cases, 34 for non‐Down's cases

Karyotyping or follow‐up to birth

No details of withdrawals given.

Retrospective cohort

Cuckle 2008

Not reported

Karyotyping or follow‐up to birth

No details of withdrawals given.

Prospective cohort

Goh 1996

33

Karyotyping or follow‐up to birth

No details of withdrawals given.

Cohort

Guanciali‐Franchi 2010

31.8

Karyotyping or follow‐up to birth

No details of withdrawals given.

Prospective cohort

Habayeb 2010

Median 35.4 (range 18 to 49)

Karyotyping or follow‐up to birth

No details of withdrawals given.

Cohort

Herman 2002

Not reported

Karyotyping or follow‐up to birth

No details of withdrawals given.

Case control

Lam 2002

30.5 (19% ≥35) (unaffected pregnancies)

Women considered high risk offered CVS (0.7%) or amniocentesis (11.8%). Follow‐up to birth

Details given for patients excluded and those without follow‐up data.

Prospective cohort

Malone 2005

21.6% aged 35 and above

Amniocentesis (offered to women with positive results from any screening test) or follow‐up to birth.

Details given for patients who did not undergo different index tests. Unclear which patients did not have follow‐up data. Appears that aborted/miscarried foetuses did not have follow‐up.

Prospective cohort

Okun 2008 Integrated

32

Karyotyping or follow‐up to birth

2614 (8%) of women undergoing integrated screening did not return for the second trimester part of the test.

Prospective cohort

Palomaki 2006

33.9 (SD 4.4) for Down's cases, 35.9 (SD 3.6) for controls

Karyotyping or follow‐up to birth

No details of withdrawals given.

Case control

Rodrigues 2009

30.6 for integrated screening, 30.9 for serum integrated screening

Karyotyping or follow‐up to birth

No details of withdrawals given.

Retrospective cohort

Rozenberg 2002

30.5 (18 to 37)

Amniocentesis offered to patients with NT > 3 mm or serum marker risk was > 1:250. Follow‐up to birth.

No details of withdrawals given. 3.4% of patients were lost to follow‐up and were excluded from the study. This included 113 women (1.2%) with miscarriages.

Prospective cohort

Schuchter 2001

28 (range 15 to 46), 10.7% aged 35 and above

CVS (offered to patients with first trimester NT > 3.5 mm), amniocentesis (offered to patients with first trimester NT 2.5 to 3.4, high risk on second trimester serum testing (> 1:250) and those > 35 years) or follow‐up to birth.

No details of withdrawals given. Women having miscarriages were excluded from the study.

Retrospective cohort

Wald 2003b

Not reported

Invasive testing (following second trimester screening) or follow‐up to birth.

No details of withdrawals given.

Case control

Wald 2009

Median 33 (range 15 to 51), 20% aged 37 and above

Karyotyping or follow‐up to birth

No details of withdrawals given.

Retrospective cohort

Wright 2010 FASTER trial

Not reported

Karyotyping or follow‐up to birth

No details of withdrawals given.

Case control

Wright 2010 North York

Not reported

Karyotyping or follow‐up to birth

No details of withdrawals given.

Case control

CVS = chorionic villus sampling; NT = nuchal translucency; SD = standard deviation

*Mean maternal age presented unless otherwise indicated.

†In all studies the choice of reference standard was dependent on the results of the index test.

Discussion

Summary of main results

We found 22 studies evaluating first and second trimester Down’s syndrome serum screening tests, with or without first trimester nuchal translucency (NT). Few studies provided unconfounded comparisons of test strategies by applying and comparing several strategies using the same serum sample, the majority of studies only evaluating a single test combination. A summary of results for the six most commonly evaluated test strategies is given in summary of findings Table 1, and the remaining 26 test strategies are given in summary of findings Table 2.

Three key findings were noted.

  1. The combined test comprised of first trimester NT and PAPP‐A, and second trimester total hCG, uE3, AFP and Inhibin A, and maternal age evaluated in three studies (Malone 2005; Wald 2003b; Wald 2009) estimated a sensitivity of 95% (confidence interval (CI) 90 to 97) at a cut‐point of 5% FPR. In indirect comparisons this test combination significantly outperformed all others, except the test combination of first trimester NT, first trimester PAPP‐A, second trimester free ßhCG, second trimester uE3, second trimester AFP, second trimester Inhibin A, and maternal age with a sensitivity of 92% (95% CI 88 to 95) for a fixed 5% FPR.

  2. In direct comparisons of tests in the same population of women, no test was found to be significantly better. These comparisons were based on one or two studies, and are therefore unlikely to be powered to detect differences.

  3. Stepwise and contingent screening strategies show promising detection rates for fixed FPRs, however due to the nature of the test strategies it is not appropriate to make comparisons between these tests and those that do not involved stratification or risk at several different points in the screening journey. These test strategies warrant further study.

Strengths and weaknesses of the review

This review is the first comprehensive review of first and second trimester serum and ultrasound screening. We examined papers from around the world, covering a wide cross‐section of women in varying populations. We contacted authors to verify data where necessary to give as complete a picture as possible while trying to avoid replication of data.

There were a number of factors that made meta‐analysis of the data difficult, which we tried to adapt for in order to allow for comparability of data presented in different studies.

  1. There were many different cut‐points used to define pregnancies as high or low risk for Down's syndrome. This means that direct comparison is more difficult than if all studies used the same cut‐point to dichotomise their populations.

  2. There were many different risk equations and software applications in use for combination of multiple markers, which were often not described in the papers. This means that risks may be calculated by different formulae and they may not be directly comparable for this reason. It is possible that this is responsible for unexplained heterogeneity in results.

  3. Different laboratories and clinics run different assays and use different machines and methods. This may influence raw results and subsequent risk calculations. Many laboratories have a quality assessment or audit trail, however, this may not necessarily be standard across the board. For example, how many assays are run, how often medians are calculated and adjusted for a given population and how quickly samples are tested from initially being taken.

  4. Few studies made direct comparisons between tests, making it difficult to detect if a real difference exists between tests (i.e. how different tests perform in the same population). There were differences in populations, with assay medians being affected, for example, by race. It is not certain whether it is appropriate to make comparisons between populations which are inherently different.

  5. We were unable to perform the investigations of heterogeneity that we had originally intended to because the data simply were not available. The vast majority of papers looking at pregnancies conceived by IVF, affected by diabetes, multiple gestation or a family history of Down's syndrome involved unaffected pregnancies only.

In addition, the search for this review was last updated in August 2011, and it is possible that new studies may have been published which have not been included. Since the search was completed we have kept a watching brief on outputs and are not aware of any studies with large sample sizes which could substantially affect the findings.

Applicability of findings to the review question

Potentially, when planning screening policy or a clinical screening programme, clinicians and policy makers need to make decisions about a finite number of tests or type of tests that can be offered. These policies are often driven by both the needs of a specific population and by financial resources. Economic analysis was considered to be outside of the scope of this review. Many of the tests examined as part of this review are already commercially available and in use in the clinical setting. The studies were carried out on populations of typical pregnant women and therefore, the results should be considered comparable with most pregnant populations encountered in every day clinical practice.

We were unable to extract information about harms of testing, information about miscarriage rates and uptake of definitive testing as the data were not available the majority of the time. While it is unlikely that major differences between the tests evaluated here exist in terms of direct harms of testing, as they are all based on ultrasound, with or without a blood sample, differences in accuracy may lead to differences in the use of definitive testing and its consequent adverse outcomes.

In some countries with a defined screening policy (i.e. the UK), first trimester screening plays a major role, usually in combination with first trimester ultrasound scanning, and second trimester serum screening is also readily available. In other countries however, there may only be a limited range of tests or markers available—often second trimester markers, rather than first trimester markers. The results of this review should be interpreted and applied in the context of test availability and local restrictions, populations or policies.

Methodological quality graph: review authors' judgements about each methodological quality item presented as percentages across all included studies.
Figuras y tablas -
Figure 1

Methodological quality graph: review authors' judgements about each methodological quality item presented as percentages across all included studies.

Detection rates (% sensitivity) at a 5% false positive rate for the six most evaluated test strategies (estimates from summary ROC curves).A = First trimester NT and PAPP‐A , second trimester total hCG, uE3, AFP and inhibin A;B = First trimester NT and PAPP‐A , second trimester free ßhCG, uE3, AFP and inhibin A; C = First trimester PAPP‐A , second trimester total hCG, uE3, AFP and inhibin A; D = First trimester NT, second trimester total hCG, uE3 and AFP; E = First trimester NT, second trimester total hCG and AFP; and F = First trimester PAPP‐A , second trimester total hCG, uE3 and AFP.All test combinations include maternal age. Each circle represents the summary sensitivity for a test strategy at a 5% false positive rate. The size of each circle is proportional to the number of Down's cases. The estimates are shown with 95% confidence intervals. The test strategies are ordered on the plot according to decreasing detection rate. The number of studies, cases and women included for each test strategy are shown on the horizontal axis.
Figuras y tablas -
Figure 2

Detection rates (% sensitivity) at a 5% false positive rate for the six most evaluated test strategies (estimates from summary ROC curves).

A = First trimester NT and PAPP‐A , second trimester total hCG, uE3, AFP and inhibin A;B = First trimester NT and PAPP‐A , second trimester free ßhCG, uE3, AFP and inhibin A; C = First trimester PAPP‐A , second trimester total hCG, uE3, AFP and inhibin A; D = First trimester NT, second trimester total hCG, uE3 and AFP; E = First trimester NT, second trimester total hCG and AFP; and F = First trimester PAPP‐A , second trimester total hCG, uE3 and AFP.

All test combinations include maternal age. Each circle represents the summary sensitivity for a test strategy at a 5% false positive rate. The size of each circle is proportional to the number of Down's cases. The estimates are shown with 95% confidence intervals. The test strategies are ordered on the plot according to decreasing detection rate. The number of studies, cases and women included for each test strategy are shown on the horizontal axis.

Age, 1T PAPP‐A , 2T free ßhCG and 2T AFP at 5% FPR.
Figuras y tablas -
Test 1

Age, 1T PAPP‐A , 2T free ßhCG and 2T AFP at 5% FPR.

Age, 1T PAPP‐A , 2T free ßhCG and 2T AFP, risk 1:300.
Figuras y tablas -
Test 2

Age, 1T PAPP‐A , 2T free ßhCG and 2T AFP, risk 1:300.

Age, 1T PAPP‐A , 2T total hCG, and 2T AFP at 5% FPR.
Figuras y tablas -
Test 3

Age, 1T PAPP‐A , 2T total hCG, and 2T AFP at 5% FPR.

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3 and 2T AFP at 5% FPR.
Figuras y tablas -
Test 4

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3 and 2T AFP at 5% FPR.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP at 2% FPR.
Figuras y tablas -
Test 5

Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP at 2% FPR.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP at 5% FPR.
Figuras y tablas -
Test 6

Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP at 5% FPR.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP at risk 1:200.
Figuras y tablas -
Test 7

Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP at risk 1:200.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP, mixed cutpoints.
Figuras y tablas -
Test 8

Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP, mixed cutpoints.

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at 5% FPR.
Figuras y tablas -
Test 9

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at 5% FPR.

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:50.
Figuras y tablas -
Test 10

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:50.

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:100.
Figuras y tablas -
Test 11

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:100.

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:150.
Figuras y tablas -
Test 12

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:150.

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:200.
Figuras y tablas -
Test 13

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:200.

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:250.
Figuras y tablas -
Test 14

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:250.

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:300.
Figuras y tablas -
Test 15

Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:300.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at 5% FPR.
Figuras y tablas -
Test 16

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at 5% FPR.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:100.
Figuras y tablas -
Test 17

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:100.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:150.
Figuras y tablas -
Test 18

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:150.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:200.
Figuras y tablas -
Test 19

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:200.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:250.
Figuras y tablas -
Test 20

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:250.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, mixed cutpoints.
Figuras y tablas -
Test 21

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, mixed cutpoints.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at 2% FPR.
Figuras y tablas -
Test 22

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at 2% FPR.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at risk 1:200.
Figuras y tablas -
Test 23

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at risk 1:200.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:100.
Figuras y tablas -
Test 24

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:100.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:150.
Figuras y tablas -
Test 25

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:150.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:200.
Figuras y tablas -
Test 26

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:200.

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:250.
Figuras y tablas -
Test 27

Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:250.

Age, 1T PAPP‐A , 1T total hCG, 2T total hCG, 2T uE3 and 2T AFP at 2% FPR.
Figuras y tablas -
Test 28

Age, 1T PAPP‐A , 1T total hCG, 2T total hCG, 2T uE3 and 2T AFP at 2% FPR.

Age, 1T PAPP‐A , 1T total hCG, 2T total hCG, 2T uE3 and 2T AFP at risk 1:200.
Figuras y tablas -
Test 29

Age, 1T PAPP‐A , 1T total hCG, 2T total hCG, 2T uE3 and 2T AFP at risk 1:200.

Age, 1T PAPP‐A , 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at 2% FPR.
Figuras y tablas -
Test 30

Age, 1T PAPP‐A , 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at 2% FPR.

Age, 1T PAPP‐A , 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at risk 1:200.
Figuras y tablas -
Test 31

Age, 1T PAPP‐A , 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at risk 1:200.

Age, 1T PAPP‐A , 1T total hCG, 1T uE3, 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at 2% FPR.
Figuras y tablas -
Test 32

Age, 1T PAPP‐A , 1T total hCG, 1T uE3, 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at 2% FPR.

Age, 1T PAPP‐A , 1T total hCG, 1T uE3, 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at risk 1:200.
Figuras y tablas -
Test 33

Age, 1T PAPP‐A , 1T total hCG, 1T uE3, 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at risk 1:200.

Age, 1T AFP, 1T free ßhCG, 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at risk 1:250.
Figuras y tablas -
Test 34

Age, 1T AFP, 1T free ßhCG, 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at risk 1:250.

Age, 1T AFP, 1T free ßhCG, 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at risk 1:384.
Figuras y tablas -
Test 35

Age, 1T AFP, 1T free ßhCG, 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at risk 1:384.

Age, 1T NT, 2T total hCG and 2T AFP, 5FPR.
Figuras y tablas -
Test 36

Age, 1T NT, 2T total hCG and 2T AFP, 5FPR.

Age, 1T NT, 2T total hCG and 2T AFP, risk 1:250.
Figuras y tablas -
Test 37

Age, 1T NT, 2T total hCG and 2T AFP, risk 1:250.

Age, 1T NT, 2T total hCG and 2T AFP, mixture cutpoint.
Figuras y tablas -
Test 38

Age, 1T NT, 2T total hCG and 2T AFP, mixture cutpoint.

Age, 1T NT, 2T free ßhCG and 2T AFP, 5FPR.
Figuras y tablas -
Test 39

Age, 1T NT, 2T free ßhCG and 2T AFP, 5FPR.

Age, 1T NT, 2T free ßhCG and 2T AFP, mixture cutpoint.
Figuras y tablas -
Test 40

Age, 1T NT, 2T free ßhCG and 2T AFP, mixture cutpoint.

Age, 1T NT, 2T free ßhCG, 2T uE3 and 2T AFP, 5FPR.
Figuras y tablas -
Test 41

Age, 1T NT, 2T free ßhCG, 2T uE3 and 2T AFP, 5FPR.

Age, 1T NT, 2T total hCG, 2T uE3 and 2T AFP, 5FPR.
Figuras y tablas -
Test 42

Age, 1T NT, 2T total hCG, 2T uE3 and 2T AFP, 5FPR.

Age, 1T NT, 2T total hCG, 2T uE3 and 2T AFP, risk 1:250.
Figuras y tablas -
Test 43

Age, 1T NT, 2T total hCG, 2T uE3 and 2T AFP, risk 1:250.

Age, 1T NT, 2T total hCG, 2T uE3 and 2T AFP, mixture cutpoint.
Figuras y tablas -
Test 44

Age, 1T NT, 2T total hCG, 2T uE3 and 2T AFP, mixture cutpoint.

Age, 1T NT, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR.
Figuras y tablas -
Test 45

Age, 1T NT, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR.

Age, 1T NT, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR.
Figuras y tablas -
Test 46

Age, 1T NT, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR.

Age, 1T NT, 2T free ßhCG, 2T uE3, 2T AFP and 1T PAPP‐A , 5FPR.
Figuras y tablas -
Test 47

Age, 1T NT, 2T free ßhCG, 2T uE3, 2T AFP and 1T PAPP‐A , 5FPR.

Age, 1T NT, 2T free ßhCG, 2T uE3, 2T AFP and 1T PAPP‐A , risk 1:250.
Figuras y tablas -
Test 48

Age, 1T NT, 2T free ßhCG, 2T uE3, 2T AFP and 1T PAPP‐A , risk 1:250.

Age, 1T NT, 1T PAPP‐A , 2T total hCG and 2T AFP, 5FPR.
Figuras y tablas -
Test 49

Age, 1T NT, 1T PAPP‐A , 2T total hCG and 2T AFP, 5FPR.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG and 2T AFP, 5FPR.
Figuras y tablas -
Test 50

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG and 2T AFP, 5FPR.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG and 2T AFP,risk 1:250.
Figuras y tablas -
Test 51

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG and 2T AFP,risk 1:250.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG and 2T AFP, risk 1:300.
Figuras y tablas -
Test 52

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG and 2T AFP, risk 1:300.

Age, 1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP 5FPR.
Figuras y tablas -
Test 53

Age, 1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP 5FPR.

Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP, risk 1:200.
Figuras y tablas -
Test 54

Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP, risk 1:200.

Age, 1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP, mixed cutpoints.
Figuras y tablas -
Test 55

Age, 1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP, mixed cutpoints.

Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR.
Figuras y tablas -
Test 56

Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR.

Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, risk 1:150.
Figuras y tablas -
Test 57

Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, risk 1:150.

Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, mixed cutpoints.
Figuras y tablas -
Test 58

Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, mixed cutpoints.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:300.
Figuras y tablas -
Test 59

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:300.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 1:270.
Figuras y tablas -
Test 60

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 1:270.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:250.
Figuras y tablas -
Test 61

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:250.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:200.
Figuras y tablas -
Test 62

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:200.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:150.
Figuras y tablas -
Test 63

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:150.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:100.
Figuras y tablas -
Test 64

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:100.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:50.
Figuras y tablas -
Test 65

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:50.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR.
Figuras y tablas -
Test 66

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 3FPR.
Figuras y tablas -
Test 67

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 3FPR.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 1FPR.
Figuras y tablas -
Test 68

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 1FPR.

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, mixed cutpoints.
Figuras y tablas -
Test 69

Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, mixed cutpoints.

Age, 1T NT, 1T PAPP‐A, 1T free ßhCG, 2T total hCG, 2T uE3 and 2T AFP, risk 1:250.
Figuras y tablas -
Test 70

Age, 1T NT, 1T PAPP‐A, 1T free ßhCG, 2T total hCG, 2T uE3 and 2T AFP, risk 1:250.

Age, 1T NT, 1T PAPP‐A, 1T free ßhCG, 2T uE3, 2T AFP, 2T total hCG and 2T Inhibin A, risk 1:150.
Figuras y tablas -
Test 71

Age, 1T NT, 1T PAPP‐A, 1T free ßhCG, 2T uE3, 2T AFP, 2T total hCG and 2T Inhibin A, risk 1:150.

ADAM 12 2T TO 1T RATIO.
Figuras y tablas -
Test 72

ADAM 12 2T TO 1T RATIO.

Stepwise: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk <1/30, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270.
Figuras y tablas -
Test 73

Stepwise: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk <1/30, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270.

Stepwise: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk <1/30, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270.
Figuras y tablas -
Test 74

Stepwise: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk <1/30, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270.

Stepwise: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk <1/30, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A 5% FPR.
Figuras y tablas -
Test 75

Stepwise: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk <1/30, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A 5% FPR.

Stepwise: Age, 1T NT, 1T PAPP‐A , if risk <1:100, 2T free ßhCG, 2T uE3, 2T AFP, risk 1:250.
Figuras y tablas -
Test 76

Stepwise: Age, 1T NT, 1T PAPP‐A , if risk <1:100, 2T free ßhCG, 2T uE3, 2T AFP, risk 1:250.

Contingent: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk 1/30‐1/1500, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270.
Figuras y tablas -
Test 77

Contingent: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk 1/30‐1/1500, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270.

Contingent: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk 1/30‐1/1500, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270.
Figuras y tablas -
Test 78

Contingent: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk 1/30‐1/1500, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270.

Contingent: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk 1/30‐1/1500, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A 5%FPR.
Figuras y tablas -
Test 79

Contingent: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk 1/30‐1/1500, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A 5%FPR.

Summary of findings 1. Performance of the six most evaluated first and second trimester serum test strategies with or without ultrasound

Test strategy (with maternal age)

Studies

Women (cases)

Sensitivity (95% CI) at a 5% FPR

Test*

First trimester PAPP‐A and second trimester total hCG, uE3 and AFP

4

2474 (236)

85 (78, 89)

P = 0.014

First trimester PAPP‐A and second trimester total hCG, uE3, AFP and inhibin A

3

35,361 (217)

87 (81, 91)

First trimester NT and second trimester total hCG and AFP

4

22,793 (135)

85 (77, 91)

First trimester NT and second trimester total hCG, uE3 and AFP

4

13,708 (136)

86 (78, 92)

First trimester NT and PAPP‐A, and second trimester total hCG, uE3, AFP and inhibin A

3

39,670 (184)

95 (90, 97)

First trimester NT and PAPP‐A, and second trimester free ßhCG, uE3, AFP and inhibin A

4

40,348 (266)

92 (88, 95)

*Likelihood ratio test for the difference in accuracy between the six test strategies compared in a single meta‐analytic model

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate;hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol

CI = confidence interval

Figuras y tablas -
Summary of findings 1. Performance of the six most evaluated first and second trimester serum test strategies with or without ultrasound
Summary of findings 2. Performance of other first and second trimester serum strategies with or without ultrasound

Test

Studies

Women (cases)

Sensitivity* (95% CI)

Specificity* (95% CI)

Threshold

Without maternal age and ultrasound

Single tests

ADAM 12 second trimester to first trimester ratio

1

579 (17)

53 (28, 77)

95 (93, 97)

5% FPR

With maternal age and without ultrasound

Triple tests

First trimester PAPP‐A and second trimester total hCG and AFP

1

1188 (98)

83 (74, 90)

95 (93, 96)

5% FPR

First trimester PAPP‐A and second trimester free ßhCG and AFP

2

2197 (94)

83 to 85

94 to 95

5% FPR, 1:300 risk

Quadruple tests

First trimester PAPP‐A and second trimester free ßhCG, uE3 and AFP

1

1188 (98)

86 (77, 92)

95 (93, 96)

5% FPR

Quintuple tests

First trimester PAPP‐A and second trimester free ßhCG, uE3, AFP and inhibin A

1

1188 (98)

90 (82, 95)

95 (93, 96)

5% FPR

First trimester PAPP‐A and second trimester total hCG, uE3, AFP and PAPP‐A

2

707 (121)

78 (66, 86)

98 (96, 99)

1:200 risk

First trimester PAPP‐A and total hCG, and second trimester total hCG, uE3 and AFP

2

707 (121)

80 (68, 88)

97 (94, 98)

1:200 risk

First trimester PAPP‐A and uE3, and second trimester total hCG, uE3 and AFP

2

707 (121)

80 (68, 88)

96 (93, 98)

1:200 risk

Sextuple tests

First trimester AFP, free ßhCG and uE3, and second trimester total hCG, uE3 and AFP

1

12,339 (34)

82 (65, 93)

94 (93, 94)

1:250 risk

First trimester PAPP‐A and second trimester total hCG, uE3, AFP, inhibin A and PAPP‐A

1

540 (32)

84 (67, 95)

96 (94, 98)

1:250 risk

Septuple tests

First trimester PAPP‐A, total hCG and uE3, and second trimester total hCG, uE3, AFP and PAPP‐A

2

707 (121)

49 (36, 61)

98 (96, 99)

1:200 risk

With maternal age and ultrasound

Triple tests

First trimester NT and second trimester free ßhCG and AFP

2

6616 (105)

83 (70, 91)

95

5% FPR

Quadruple tests

First trimester NT and second trimester free ßhCG, uE3 and AFP

1

1110 (85)

88 (79, 94)

95 (94, 96)

5% FPR

First trimester NT and PAPP‐A, and second trimester total hCG and AFP

1

1110 (85)

91 (82, 96)

95 (94, 96)

5% FPR

First trimester NT and PAPP‐A, and second trimester free ßhCG and AFP

2

3400 (93)

88 to 91

95 to 98

5% FPR, 1:300 risk

Quintuple tests

First trimester NT and second trimester total hCG, uE3, AFP and inhibin A

1

1110 (85)

91 (82, 96)

95 (94, 96)

5% FPR

First trimester NT and second trimester free ßhCG, uE3, AFP and inhibin A

1

1110 (85)

91 (82, 96)

95 (94, 96)

5% FPR

First trimester NT and PAPP‐A, and second trimester free ßhCG, uE3 and AFP

1

1100 (85)

92 (84, 97)

95 (94, 96)

5% FPR

First trimester NT and PAPP‐A, and second trimester total hCG, uE3 and AFP

2

33,337 (171)

88 to 92

95 to 97

5% FPR, 1:200 risk

Sextuple tests

First trimester NT, PAPP‐A and free ßhCG, and second trimester total hCG, uE3 and AFP

1

5060 (13)

100 (75, 100)

97 (96, 97)

1:250 risk

Septuple tests

First trimester NT, PAPP‐A and free ßhCG, and second trimester uE3, AFP, total hCG and inhibin A

1

33,546 (87)

94 (87, 98)

89 (89, 89)

1:150 risk

Contingent tests

First trimester NT, PAPP‐A and free ßhCG, if risk 1:30‐1:1500, second trimester total hCG, uE3, AFP and inhibin A

1

32,355 (86)

91 (82, 96)

95 (95, 96)

1:270 risk

First trimester NT, PAPP‐A and free ßhCG, if risk 1:30‐1:1500, second trimester free ßhCG, uE3, AFP and inhibin A

1

7842 (59)

95 (86, 99)

95 (94, 95)

5% FPR

Stepwise tests

First trimester NT and PAPP‐A, if risk < 1:100, second trimester free ßhCG, uE3 and AFP

1

1507 (12)

92 (62, 100)

97 [(96, 98)

1:250 risk

First trimester NT, PAPP‐A and free ßhCG, if risk < 1:30, second trimester total hCG, uE3, AFP and inhibin A

1

32,355 (86)

92 (84, 97)

95 (95, 95)

1:270 risk

First trimester NT, PAPP‐A and free ßhCG, if risk < 1:30, second trimester free ßhCG, uE3, AFP and 2T inhibin A

1

7842 (59)

97 (88, 100)

95 (94, 95)

5% FPR

*Tests evaluated by at least one study are presented in the table. Where there were two studies at the same threshold, estimates of summary sensitivity and summary specificity were obtained by using univariate fixed‐effect logistic regression models to pool sensitivities and specificities separately. if the threshold used was a 5% FPR, then only the sensitivities were pooled. The range of sensitivities and specificities are presented where there were two studies and the thresholds used were different.

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate; hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol

CI = confidence interval

Figuras y tablas -
Summary of findings 2. Performance of other first and second trimester serum strategies with or without ultrasound
Table 1. Direct comparisons of the diagnostic accuracy of the six most evaluated test strategies

Ratio of DORs

(95% CI); P value

(Studies)

1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP

1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

1T NT, 2T total hCG and 2T AFP

1T NT, 2T total hCG, 2T uE3 and 2T AFP

1T NT, 1T PAPP‐A, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A

1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

1.43 (0.39, 5.25); P = 0.49

(K = 1)

1T NT, 2T total hCG and 2T AFP

0.86 (0.25, 2.96); P = 0.75

(K = 1)

0.60 (0.16, 2.22); P = 0.34

(K = 1)

1T NT, 2T total hCG, 2T uE3 and 2T AFP

1.23 (0.33, 4.57); P = 0.68

(K = 1)

0.86 (0.22, 3.43); P = 0.78

(K = 1)

1.44 (0.38, 5.41);

P = 0.49

(K = 1)

1T NT, 1T PAPP‐A, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A

2.97 (0.53, 16.6); P = 0.15

(K = 1)

2.08 (0.35, 12.3); P = 0.32

(K = 1)

3.48 (0.62,19.6);

P = 0.12

(K = 1)

2.41 (41, 14.3);

P = 0.24

(K = 1)

1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

2.41 (0.53, 11.0); P = 0.18

(K = 1)

1.69 (0.35, 8.16); P = 0.41

(K = 2)

2.82 (0.61, 13.0);

P = 0.13

(K = 1)

1.96 (0.40, 9.53);

P = 0.30

(K = 1)

1.87 (0.57, 6.06);

P = 0.26

(K = 2)

Direct comparisons were made using only data from studies that compared each pair of tests in the same population. Ratio of diagnostic odds ratios (DORs) were computed by division of the DOR for the test in the row by the DOR for the test in the column. If the ratio of DORs is greater than one, then the diagnostic accuracy of the test in the row is higher than that of the test in the column; if the ratio is less than one, the diagnostic accuracy of the test in the column is higher than that of the test in the row. All test combinations include maternal age. All test comparisons that were evaluated by only one study were from Wald 2003b.

1T = first trimester; 2T = second trimester; K = number of studies; CI = confidence interval

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate; hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol.

Figuras y tablas -
Table 1. Direct comparisons of the diagnostic accuracy of the six most evaluated test strategies
Table 2. Indirect comparisons of the diagnostic accuracy of the six most evaluated test strategies

Ratio of DORs

(95% CI); P value

1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP

1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

1T NT, 2T total hCG and 2T AFP

1T NT, 2T total hCG, 2T uE3 and 2T AFP

1T NT, 1T PAPP‐A, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A

DOR (95% CI)

Studies

96 (48, 190)

K =4

114 (62, 210)

K = 3

103 (49, 215)

K = 4

109 (51, 233)

K = 4

214 (125, 367)

K = 4

1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

114 (62, 210)

K = 3

1.19 (0.61, 2.32); P = 0.58

1T NT, 2T total hCG and 2T AFP

103 (49, 215)

K = 4

1.08 (0.51, 2.36); P = 0.83

0.91 (0.43, 1.90); P = 0.78

1T NT, 2T total hCG, 2T uE3 and 2T AFP

109 (51, 233)

K = 4

1.14 (0.54, 2.42); P = 0.71

0.96 (0.45, 2.03); P = 0.90

1.06 (0.47, 2.41);

P = 0.88

1T NT, 1T PAPP‐A, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A

214 (125, 367)

K = 4

2.24 (1.00, 5.00); P = 0.049

1.88 (0.88, 3.99); P = 0.094

2.08 (0.89, 4.87);

P = 0.09

1.96 (0.82, 4.67);

P = 0.12

1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A

339 (163, 705)

K = 3

3.55 (1.28, 9.89); P = 0.019

2.98 (1.14; 7.80); P = 0.029

3.29 (1.15, 9.47);

P = 0.030

3.11 (1.07, 9.07);

P = 0.039

1.58 (0.64, 3.95); P = 0.30

Indirect comparisons were made using all available data. Ratio of diagnostic odds ratios (DORs) were computed by division of the DOR for the test in the row by the DOR for the test in the column. If the ratio of DORs is greater than one, then the diagnostic accuracy of the test in the row is higher than that of the test in the column; if the ratio is less than one, the diagnostic accuracy of the test in the column is higher than that of the test in the row. All test combinations include maternal age.

1T = first trimester; 2T = second trimester; K = number of studies; CI ‐ confidence interval.

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate; hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol.

Figuras y tablas -
Table 2. Indirect comparisons of the diagnostic accuracy of the six most evaluated test strategies
Table 3. Comparison of integrated, contingent and stepwise strategy for a septuple combination of serum tests and first trimester nuchal translucency

Test combination

Screening policy

Study

Women (cases)

Sensitivity (95% CI)

Specificity (95% CI)

Threshold

First trimester NT, PAPP‐A and free ßhCG, and second trimester uE3, AFP, total hCG and inhibin A

Integrated

Malone 2005

33,546 (87)

94 (87, 98)

89 (89, 89)

1:150 risk

First trimester NT, PAPP‐A and free ßhCG, if risk <1:30 invasive testing is offered, if risk 1:30‐1:1500, second trimester total hCG, uE3, AFP and inhibin A is performed

Contingent

Cuckle 2008

32,355 (86)

91 (82, 96)

95 (95, 96)

1:270 risk

First trimester NT, PAPP‐A and free ßhCG, if risk <1:30 invasive testing is offered, if ≥ 1:30 second trimester total hCG, uE3, AFP and inhibin A is performed

Stepwise

Cuckle 2008

32,355 (86)

92 (84, 97)

95 (95, 95)

1:270 risk

AFP = alpha‐fetoprotein; ßhCG = beta human chorionic gonadotrophin; FPR = false positive rate; hCG = human chorionic gonadotrophin; NT = nuchal translucency; PAPP‐A = pregnancy‐associated plasma protein‐A; uE3 = unconjugated oestriol.

CI ‐ confidence interval.

Figuras y tablas -
Table 3. Comparison of integrated, contingent and stepwise strategy for a septuple combination of serum tests and first trimester nuchal translucency
Table 4. Maternal age, reference standard and study design characteristics of included studies

Study

Maternal age (years)*

Reference standard†

Withdrawals explained?

Study design

Aagaard‐Tillery 2009

30.6 (SD 6.1)

Karyotyping or follow‐up to birth

Of 33,546 trial participants only 7842 women with complete information for all screening tests and genetic sonography were included in the study.

Prospective cohort

Audibert 2001

30.1, all < 38, 86% < 35, 14% ≥35

Prenatal karyotype conducted (in 7.6% of patients) depending on presence of risk >1/125, high maternal age, parental anxiety, history of chromosomal defects or parental translocation or abnormal second trimester scan. Cytogenetic testing of newborns with suspected abnormalities. Postmortum on terminations of pregnancy or miscarriages. Follow‐up to neonatal examination in newborns.

35 women were lost to follow‐up (they had all had normal NT results). 340 women who did not want second trimester serum screening withdrew from that part of the study. Women lost to follow‐up were excluded in the final analysis. All detected cases were terminated.

Prospective consecutive series

Babbur 2005

Median 37 (range 19 to 46)

Invasive testing offered to women with NT > 3 mm or risk > 1:250 as defined by combined NT and serum results CVS from 11 weeks, amniocentesis from 15 weeks). Rapid in situ hybridisation test in patients with risk > 1:30. No details given of any follow‐up to birth

463 patients having NT did not go on to have second trimester serum testing. Women with miscarriages excluded.

Prospective cohort

Baviera 2010

35.3 for Down's cases, 30.4 for controls

Amniocentesis or follow‐up to birth

No details of withdrawals given.

Case control

Benattar 1999

32 (16 to 46), 8.3% > 35

Amniocentesis due to maternal age > 38 years (6.1% or women). Karyotyping encouraged for women with positive result on one or more index test. No details of reference standard for index test negative women.

No details of withdrawals given. 12 patients were lost to follow‐up due to miscarriages

Prospective cohort

Bestwick 2010

Median 39 for Down's cases, 34 for non‐Down's cases

Karyotyping or follow‐up to birth

No details of withdrawals given.

Retrospective cohort

Cuckle 2008

Not reported

Karyotyping or follow‐up to birth

No details of withdrawals given.

Prospective cohort

Goh 1996

33

Karyotyping or follow‐up to birth

No details of withdrawals given.

Cohort

Guanciali‐Franchi 2010

31.8

Karyotyping or follow‐up to birth

No details of withdrawals given.

Prospective cohort

Habayeb 2010

Median 35.4 (range 18 to 49)

Karyotyping or follow‐up to birth

No details of withdrawals given.

Cohort

Herman 2002

Not reported

Karyotyping or follow‐up to birth

No details of withdrawals given.

Case control

Lam 2002

30.5 (19% ≥35) (unaffected pregnancies)

Women considered high risk offered CVS (0.7%) or amniocentesis (11.8%). Follow‐up to birth

Details given for patients excluded and those without follow‐up data.

Prospective cohort

Malone 2005

21.6% aged 35 and above

Amniocentesis (offered to women with positive results from any screening test) or follow‐up to birth.

Details given for patients who did not undergo different index tests. Unclear which patients did not have follow‐up data. Appears that aborted/miscarried foetuses did not have follow‐up.

Prospective cohort

Okun 2008 Integrated

32

Karyotyping or follow‐up to birth

2614 (8%) of women undergoing integrated screening did not return for the second trimester part of the test.

Prospective cohort

Palomaki 2006

33.9 (SD 4.4) for Down's cases, 35.9 (SD 3.6) for controls

Karyotyping or follow‐up to birth

No details of withdrawals given.

Case control

Rodrigues 2009

30.6 for integrated screening, 30.9 for serum integrated screening

Karyotyping or follow‐up to birth

No details of withdrawals given.

Retrospective cohort

Rozenberg 2002

30.5 (18 to 37)

Amniocentesis offered to patients with NT > 3 mm or serum marker risk was > 1:250. Follow‐up to birth.

No details of withdrawals given. 3.4% of patients were lost to follow‐up and were excluded from the study. This included 113 women (1.2%) with miscarriages.

Prospective cohort

Schuchter 2001

28 (range 15 to 46), 10.7% aged 35 and above

CVS (offered to patients with first trimester NT > 3.5 mm), amniocentesis (offered to patients with first trimester NT 2.5 to 3.4, high risk on second trimester serum testing (> 1:250) and those > 35 years) or follow‐up to birth.

No details of withdrawals given. Women having miscarriages were excluded from the study.

Retrospective cohort

Wald 2003b

Not reported

Invasive testing (following second trimester screening) or follow‐up to birth.

No details of withdrawals given.

Case control

Wald 2009

Median 33 (range 15 to 51), 20% aged 37 and above

Karyotyping or follow‐up to birth

No details of withdrawals given.

Retrospective cohort

Wright 2010 FASTER trial

Not reported

Karyotyping or follow‐up to birth

No details of withdrawals given.

Case control

Wright 2010 North York

Not reported

Karyotyping or follow‐up to birth

No details of withdrawals given.

Case control

CVS = chorionic villus sampling; NT = nuchal translucency; SD = standard deviation

*Mean maternal age presented unless otherwise indicated.

†In all studies the choice of reference standard was dependent on the results of the index test.

Figuras y tablas -
Table 4. Maternal age, reference standard and study design characteristics of included studies
Table Tests. Data tables by test

Test

No. of studies

No. of participants

1 Age, 1T PAPP‐A , 2T free ßhCG and 2T AFP at 5% FPR Show forest plot

1

1188

2 Age, 1T PAPP‐A , 2T free ßhCG and 2T AFP, risk 1:300 Show forest plot

1

1009

3 Age, 1T PAPP‐A , 2T total hCG, and 2T AFP at 5% FPR Show forest plot

1

1188

4 Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3 and 2T AFP at 5% FPR Show forest plot

1

1188

5 Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP at 2% FPR Show forest plot

2

707

6 Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP at 5% FPR Show forest plot

2

1767

7 Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP at risk 1:200 Show forest plot

2

707

8 Age, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP, mixed cutpoints Show forest plot

4

2474

9 Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at 5% FPR Show forest plot

1

1188

10 Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:50 Show forest plot

1

1188

11 Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:100 Show forest plot

1

1188

12 Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:150 Show forest plot

1

1188

13 Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:200 Show forest plot

1

1188

14 Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:250 Show forest plot

1

1188

15 Age, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:300 Show forest plot

1

1188

16 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at 5% FPR Show forest plot

2

34821

17 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:100 Show forest plot

1

540

18 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:150 Show forest plot

1

540

19 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:200 Show forest plot

1

540

20 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A at risk 1:250 Show forest plot

1

540

21 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, mixed cutpoints Show forest plot

3

35361

22 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at 2% FPR Show forest plot

2

707

23 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at risk 1:200 Show forest plot

2

707

24 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:100 Show forest plot

1

540

25 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:150 Show forest plot

1

540

26 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:200 Show forest plot

1

540

27 Age, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP, 2T Inhibin A and 2T PAPP‐A at risk 1:250 Show forest plot

1

540

28 Age, 1T PAPP‐A , 1T total hCG, 2T total hCG, 2T uE3 and 2T AFP at 2% FPR Show forest plot

2

707

29 Age, 1T PAPP‐A , 1T total hCG, 2T total hCG, 2T uE3 and 2T AFP at risk 1:200 Show forest plot

2

707

30 Age, 1T PAPP‐A , 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at 2% FPR Show forest plot

2

707

31 Age, 1T PAPP‐A , 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at risk 1:200 Show forest plot

2

707

32 Age, 1T PAPP‐A , 1T total hCG, 1T uE3, 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at 2% FPR Show forest plot

2

707

33 Age, 1T PAPP‐A , 1T total hCG, 1T uE3, 2T total hCG, 2T uE3, 2T AFP and 2T PAPP‐A at risk 1:200 Show forest plot

2

707

34 Age, 1T AFP, 1T free ßhCG, 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at risk 1:250 Show forest plot

1

12339

35 Age, 1T AFP, 1T free ßhCG, 1T uE3, 2T total hCG, 2T uE3 and 2T AFP at risk 1:384 Show forest plot

1

12339

36 Age, 1T NT, 2T total hCG and 2T AFP, 5FPR Show forest plot

2

17347

37 Age, 1T NT, 2T total hCG and 2T AFP, risk 1:250 Show forest plot

2

5446

38 Age, 1T NT, 2T total hCG and 2T AFP, mixture cutpoint Show forest plot

4

22793

39 Age, 1T NT, 2T free ßhCG and 2T AFP, 5FPR Show forest plot

2

6616

40 Age, 1T NT, 2T free ßhCG and 2T AFP, mixture cutpoint Show forest plot

2

6616

41 Age, 1T NT, 2T free ßhCG, 2T uE3 and 2T AFP, 5FPR Show forest plot

1

1110

42 Age, 1T NT, 2T total hCG, 2T uE3 and 2T AFP, 5FPR Show forest plot

1

1110

43 Age, 1T NT, 2T total hCG, 2T uE3 and 2T AFP, risk 1:250 Show forest plot

2

3256

44 Age, 1T NT, 2T total hCG, 2T uE3 and 2T AFP, mixture cutpoint Show forest plot

4

13708

45 Age, 1T NT, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR Show forest plot

1

1110

46 Age, 1T NT, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR Show forest plot

1

1110

47 Age, 1T NT, 2T free ßhCG, 2T uE3, 2T AFP and 1T PAPP‐A , 5FPR Show forest plot

1

1110

48 Age, 1T NT, 2T free ßhCG, 2T uE3, 2T AFP and 1T PAPP‐A , risk 1:250 Show forest plot

1

390

49 Age, 1T NT, 1T PAPP‐A , 2T total hCG and 2T AFP, 5FPR Show forest plot

1

1110

50 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG and 2T AFP, 5FPR Show forest plot

1

1110

51 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG and 2T AFP,risk 1:250 Show forest plot

1

390

52 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG and 2T AFP, risk 1:300 Show forest plot

1

2290

53 Age, 1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP 5FPR Show forest plot

1

1110

54 Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3 and 2T AFP, risk 1:200 Show forest plot

1

32227

55 Age, 1T NT, 1T PAPP‐A, 2T total hCG, 2T uE3 and 2T AFP, mixed cutpoints Show forest plot

2

33337

56 Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR Show forest plot

2

34743

57 Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, risk 1:150 Show forest plot

1

4927

58 Age, 1T NT, 1T PAPP‐A , 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A, mixed cutpoints Show forest plot

3

39670

59 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:300 Show forest plot

1

390

60 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 1:270 Show forest plot

1

7842

61 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:250 Show forest plot

1

390

62 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:200 Show forest plot

1

390

63 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:150 Show forest plot

2

9759

64 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:100 Show forest plot

1

390

65 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A,risk 1:50 Show forest plot

1

390

66 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 5FPR Show forest plot

3

31698

67 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 3FPR Show forest plot

1

22746

68 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, 1FPR Show forest plot

1

22746

69 Age, 1T NT, 1T PAPP‐A , 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A, mixed cutpoints Show forest plot

4

40348

70 Age, 1T NT, 1T PAPP‐A, 1T free ßhCG, 2T total hCG, 2T uE3 and 2T AFP, risk 1:250 Show forest plot

1

5060

71 Age, 1T NT, 1T PAPP‐A, 1T free ßhCG, 2T uE3, 2T AFP, 2T total hCG and 2T Inhibin A, risk 1:150 Show forest plot

1

33546

72 ADAM 12 2T TO 1T RATIO Show forest plot

1

579

73 Stepwise: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk <1/30, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270 Show forest plot

1

32355

74 Stepwise: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk <1/30, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270 Show forest plot

1

7842

75 Stepwise: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk <1/30, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A 5% FPR Show forest plot

1

7842

76 Stepwise: Age, 1T NT, 1T PAPP‐A , if risk <1:100, 2T free ßhCG, 2T uE3, 2T AFP, risk 1:250 Show forest plot

1

1507

77 Contingent: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk 1/30‐1/1500, 2T total hCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270 Show forest plot

1

32355

78 Contingent: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk 1/30‐1/1500, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A risk 1:270 Show forest plot

1

7842

79 Contingent: Age, 1T NT, 1T PAPP‐A , 1T free ßhCG, if risk 1/30‐1/1500, 2T free ßhCG, 2T uE3, 2T AFP and 2T Inhibin A 5%FPR Show forest plot

1

7842

Figuras y tablas -
Table Tests. Data tables by test