'Scared Straight' and other juvenile awareness programs for preventing juvenile delinquency

Anthony Petrosino; Carolyn Turpin‐Petrosino; Meghan E Hollis‐Peel; Julia G Lavenberg

doi:10.1002/14651858.CD002796.pub2

"Scared Straight" y otros programas juveniles de concienciación para la prevención de la delincuencia juvenil

Declaraciones de intereses de los autores

Versión publicada: 30 abril 2013 Historial de versiones

https://doi.org/10.1002/14651858.CD002796.pub2

Contraer todo Desplegar todo

Resumen

disponible en

Antecedentes

"Scared Straight" y otros programas similares incluyen visitas organizadas a la prisión de delincuentes juveniles o niños en riesgo de conducta criminal. Los programas están diseñados para disuadir a los participantes de delitos futuros mediante la observación directa de la vida en prisión y la interacción con presidiarios adultos. Estos programas permanecen en uso a pesar de los estudios de investigación que cuestionan su efectividad. Ésta es una actualización de una revisión de 2002.

Objetivos

Evaluar los efectos de los programas que comprenden visitas organizadas a las prisiones de delincuentes juveniles (oficialmente sentenciados, es decir, declarados culpables por un juzgado de menores) o predelincuentes (niños problemáticos pero no sentenciados oficialmente como delincuentes), con el objetivo de disuadirlos de la delincuencia.

Métodos de búsqueda

Para actualizar esta revisión se hicieron búsquedas en 22 bases de datos electrónicas que incluyeron CENTRAL, MEDLINE, PsycINFO y Criminal Justice Abstracts, en diciembre de 2011. Además, se realizaron búsquedas en registros de ensayos clínicos, se consultó a expertos, se realizaron búsquedas en Google Scholar, y se hizo un seguimiento de todas las citas relevantes.

Criterios de selección

Se incluyeron los estudios que probaron programas de visitas organizadas a instituciones penales como prisiones o reformatorios de delincuentes o niños en riesgo de delinquir. Se incluyeron los estudios que abarcaron muestras superpuestas de jóvenes y adultos jóvenes (por ejemplo, 14 a 20 años de edad). Solamente se consideraron los estudios que asignaron a los participantes a las condiciones de forma aleatoria o cuasialeatoria (o sea, asignación impar / par a las condiciones). Cada estudio tuvo que tener una condición control sin tratamiento y al menos una medida de resultado de la conducta criminal "posterior a la visita".

Obtención y análisis de los datos

Los métodos de búsqueda de la revisión original generaron 487 citas, la mayoría de las cuales tenía resúmenes. El revisor principal examinó estas citas y determinó que 30 fueron informes de evaluación. Dos revisores examinaron de forma independiente estas citas y estuvieron de acuerdo en que 11 eran ensayos aleatorios potenciales. Se obtuvieron todos los informes. Al inspeccionar el texto completo de los informes dos revisores coincidieron de forma independiente en excluir dos estudios, lo que dejó nueve ensayos aleatorios. El revisor principal extrajo los datos de cada uno de los nueve informes de estudio mediante un instrumento especialmente diseñado. En los casos en que faltó información de resultado en los informes originales se trató de establecer contacto con los investigadores originales mediante correspondencia para recuperar los datos para el análisis. Un segundo revisor (CTP) verificó de forma independiente los datos de resultado.

En esta revisión se presentan de forma narrativa los resultados de cada uno de los nueve ensayos. Se realizaron dos metanálisis de siete estudios que proporcionaron las tasas de delitos posteriores a la intervención mediante datos oficiales. La información de otras fuentes (por ejemplo, autoinformada) se perdió de algunos estudios o se omitió la información crítica (por ejemplo, desviaciones estándar). Se examinaron los efectos inmediatos después del tratamiento (es decir, "primeros efectos") al computar los odds ratios (OR) de los datos sobre las proporciones de cada grupo con condenas reiteradas y en los análisis se utilizaron modelos de efectos fijos y aleatorios.

Resultados principales

En esta revisión se incluyeron nueve estudios. Todos formaron parte de la revisión sistemática original; mediante las búsquedas actualizadas no se identificaron ensayos nuevos que cumplieran con los criterios de elegibilidad. Los estudios se realizaron en ocho estados diferentes de los EE.UU. durante los años 1967 a 1992. Participaron casi 1000 (946) jóvenes o adultos jóvenes de razas diferentes, varones en su mayoría. La edad promedio de los participantes en cada estudio varió entre 15 y 17 años.

Los metanálisis de siete estudios muestran que la intervención fue más perjudicial que no hacer nada. Los OR (efectos fijos) de los efectos sobre el primer efecto después del tratamiento sobre la conducta criminal medida de forma oficial indicó un efecto negativo del programa (OR 1,68; intervalo de confianza [IC] del 95%: 1,20 a 2,36) y casi idéntico independientemente de la estrategia metanalítica (OR efectos aleatorios 1,72; IC del 95%: 1,13 a 2,62). Los análisis de sensibilidad (efectos aleatorios) demostraron que los resultados fueron consistentes incluso cuando se extrajo un estudio con una estrategia de asignación al azar inadecuada (OR 1,47; IC del 95%: 1,03 a 2,11), o cuando se extrajo un estudio con desgaste alto (OR 1,96; IC del 95%: 1,25 a 3,08), o ambos (OR 1,68; IC del 95%: 1,10 a 2,58).

Conclusiones de los autores

Se concluye que los programas como "Scared Straight" aumentan la delincuencia con respecto a no hacer nada en jóvenes similares. Debido a estos resultados, no se puede recomendar este programa como una estrategia de prevención de la criminalidad. Por lo tanto, los organismos que permiten tales programas deben evaluarlos rigurosamente para asegurar que no causen más efectos perjudiciales que efectos beneficiosos a los mismos ciudadanos que se comprometen a proteger.

PICO

Population

Intervention

Comparison

Outcome

El uso y la enseñanza del modelo PICO están muy extendidos en el ámbito de la atención sanitaria basada en la evidencia para formular preguntas y estrategias de búsqueda y para caracterizar estudios o metanálisis clínicos. PICO son las siglas en inglés de cuatro posibles componentes de una pregunta de investigación: paciente, población o problema; intervención; comparación; desenlace (outcome).

Para saber más sobre el uso del modelo PICO, puede consultar el Manual Cochrane.

Resumen en términos sencillos

disponible en

"Scared Straight" y otros programas juveniles de concienciación para la prevención de la delincuencia juvenil

Programas como "Scared Straight" incluyen visitas organizadas a la prisión de delincuentes juveniles o niños en riesgo de conducta criminal. Los programas están diseñados para disuadir a los participantes de delitos futuros mediante la observación directa de la vida en prisión y la interacción con presidiarios adultos. Esta revisión, que es una actualización de la publicada en 2002, incluye nueve estudios con 946 adolescentes, casi todos varones. Los estudios se realizaron en diferentes partes de los EE.UU. y participaron jóvenes de diferentes razas cuya edad promedio varió entre 15 y 17 años. Los resultados indican que realizar estos programas no solo no logra impedir la violencia, si no que en realidad dan lugar a una conducta más delictiva. La intervención aumenta las probabilidades de delinquir entre 1,6 a 1 y 1,7 a 1. Los funcionarios del gobierno que permiten este programa deben realizar esfuerzos dirigidos a una evaluación rigurosa para asegurar que no causen más efectos perjudiciales a los mismos ciudadanos que desean proteger.

Authors' conclusions

Implications for practice

The strong indication here is that these programs have a harmful effect. This raises a dilemma for policymakers. Criminological interventions, when they cause harm, are not just toxic to the participants. They cause more harm to citizens who were not part of the experiment because of the increase in criminal victimization. Policymakers should take steps to build the kind of research infrastructure within their jurisdiction that could rigorously evaluate criminological interventions to ensure they are not harmful to the very citizens they aim to help. We believe that our updated review places the onus on every jurisdiction to show how their current or proposed program is different than the ones studied here. Given that, they should then put in place rigorous evaluation to ensure that no harm is caused by the intervention.

Some literature indicates the program can have a positive effect on the inmates involved in the prison visits and that argument is sometimes used to legitimize use of the program. These arguments are undoubtedly used under the assumption that the program does no harm. In light of the findings of this review, assertions that Scared Straight and similar programs ought to be used because they have other positive effects raises ethical questions about potentially harming children (and others in the community who may be victimized) in order to accomplish other important, but latent, goals.

The authors have received communications from different prison facilities that are using a juvenile awareness program. One argument used to sustain such programs is that the research reported here does not apply to their particular program. Our recommendation is that correctional research units, either at the facility or at a regional or national government level, collaborate with program staff to conduct a rigorous evaluation. If such units do not exist or cannot conduct their own study, we suggest they collaborate with a local university, college or research firm that could undertake this work to ensure that the program is working as planned and not unintentionally causing more harm than good.

Correctional administrators sometimes ask whether our results are relevant to their particular program. For example, inmates running the program may go outside the prison to speak at schools about their life experiences. Our review only looked at programs involving visits of young people to prisons, and, as far as we know, no review has examined juvenile awareness interventions that involve offenders leaving prison grounds to speak to children at school. We are not aware of any controlled studies testing it.

We receive periodic correspondence from concerned citizens about how to get a juvenile who is in trouble with the law into a Scared Straight program. We cannot, in good conscience, recommend this program. Our response to these well‐meaning citizens is to refer them to national, regional or local centers that specialize in youth crime prevention services.

Implications for research

One question that continues to arise about these findings is why Scared Straight and similar programs seem to lead to more crime rather than less in participants. What is the critical mechanism? Although there were many good post‐hoc theories about this, none of the evaluations were structured to provide the kind of mediating variables necessary to respond to this in the context of a systematic review (Petrosino 2000). One explanation may be 'peer contagion' (Dishion 1999). According to this theory, any positive impact by an intervention for youth might be offset by processes of peer influence that occur when deviant youths are allowed to interact with each other in groups, such as what occurs in Scared Straight and similar programs. This would need to be explicitly tested in careful evaluation studies to confirm as a potential mechanism for harmful effects.

We plan to update this review again within 36 months to incorporate any new studies or respond to cogent criticisms. Given that we found only nine studies (and only seven were used in the meta‐analysis), we were cautious not to propose the use of moderating variables in subsequent analyses. Initially we wondered if one program factor might have particular salience, which was the degree of harshness in the inmate presentations. It may be that the more brutal and vulgar the presentation, the more that it causes a type of 'backfire' effect, producing in the juveniles the very behavior it seeks to deter. However, when looking at this more closely, we discovered that one trial involving a tour of a reformatory with no presentation reported one of the largest negative effects (Michigan D.O.C. 1967).

This review has led us to consider two others, contingent on future funding. 'Shock value'‐type interventions are tried across many fields. For example, high school students are sometimes shown horrific footage of car accidents in order to deter them from drinking and driving. In industrial arts classes, students are shown films of what occurs when safety glasses are not worn; this is often graphic and is designed to increase compliance with such regulations. There are many other examples across fields. But is there any evidence that any of these 'shock value' interventions work? Or do they produce disappointing, or even toxic, results as we have reported here? The early evidence is not promising, as fear appeals in reducing drug and alcohol among young people have been described in at least one review as 'disappointing' (Prevention First 2008).

It may be true that Scared Straight and similar programs do not work because they only convey a threat that juveniles do not think will be carried out. What about the evidence for deterrence if it is not a third‐party threat but actual involvement in the juvenile justice system? There has been a wide range of randomized trials that test for the effects of official processing in juvenile courts with some other intervention (such as diverting the child from such processing). Is there evidence that the delivery of a threat ‐ official system processing ‐ deters future criminal behavior? Petrosino 2010 examined 29 randomized trials that evaluated the effects of some diversionary alternative (services or outright release) and compared it to official processing or progression deeper into the juvenile justice system. That review, published by the Campbell Collaboration, also indicated that formal system processing or progression had no crime deterrent effect, and, in some instances, increased crime in contrast to diversionary alternatives.

Background

Description of the condition

Juvenile delinquency, also known as juvenile offending or youth crime, is illegal behavior committed by someone before becoming an adult. The second United Nations Congress on the Prevention of Crime and Treatment of the Offender recommended that the meaning of the term juvenile delinquency should be restricted as far as possible to violations of the criminal law (Kvaraceus 1964). Juveniles are considered to be those persons who have yet to reach age 18 years. Although laws vary across nations, juvenile delinquents, therefore, would be those who have been found guilty (adjudicated) of committing a law violation before they are 18 years of age. A significant percentage of violent and nonviolent offenses are committed by juveniles. For example, in the USA, 15% of all persons arrested by the police for illegal behavior in 2008 were juveniles (US Census 2012). Besides the problem of youth crime, offending as a juvenile is a risk factor for later involvement with the criminal justice system as an adult (McCord 2001). Thus, governments everywhere are looking for effective interventions to address juvenile delinquency. 'Scared Straight' and similar type programs have been used in various places in the world, and offer a low‐cost and easy to implement strategy to prevent juvenile delinquency.

Description of the intervention

The basic component of programs such as Scared Straight is organized visits to prison facilities by juvenile delinquents or children at risk for becoming delinquent. Nearly all of these interventions have the juveniles interact with inmates confined in the facility. The most famous of these, 'Scared Straight' in New Jersey (USA), included confrontational 'rap' sessions in which adult inmates shared graphic stories about prison life with the juveniles. Other programs have included less confrontational and more educational sessions, in which inmates shared their life stories and described the choices they made that ultimately led to imprisonment. In the Texas Face‐to‐Face program, juveniles spent one day living as an adult prisoner and the intervention also included a counseling component.

The most well‐known version of the Scared Straight type programs was initiated in the 1970s, as inmates serving life sentences at a New Jersey prison began a program to 'scare' or deter at‐risk or delinquent children from a future life of crime. It featured as its main component an aggressive presentation by inmates to juveniles visiting the prison facility. The presentation depicted life in adult prisons, and often included exaggerated stories of rape and murder (Finckenauer 1982). A television documentary on the program aired in 1979 provided evidence that 16 of the 17 delinquents remained law‐abiding for three months after attending Scared Straight, and claimed a 94% success rate (Finckenauer 1982). Other data provided in the film indicated success rates that varied between 80% and 90% (Finckenauer 1982). The program received considerable and favorable media attention and was soon replicated in over 30 states nationwide, resulting in special Congressional hearings on the program and the film by the US House Subcommittee on Human Resources (US HCEL 1979).

Scared Straight and other 'kids visit prison' programs are also used in other nations. For example, the 'day in prison' or 'day in gaol' in Australia (O'Malley 1993), 'day visits' in the UK (Lloyd 1995) and the 'Ullersmo Project' in Norway (Storvoll 1998). Hall 1999 reports positively on a program in Germany designed to deter young offenders with ties to Neo‐Nazi and other organized hate groups. Scared Straight has been also tried in Canada (O'Malley 1993). In 1999, 'Scared Straight: 20 Years Later' (UPN 1999; 'Kids and Crooks') was shown on US television and claimed similar results to the 1979 film. In this version, the film reports that 10 of the 12 juveniles attending the program remained offense‐free in the three months' follow‐up (Muhammed 1999). As in the 1979 television program, no data on a control or comparison group of young people were presented. Positive reports and descriptions of Scared Straight‐type programs have also been reported in Germany (Hall 1999) and in Florida (USA) (Rasmussen 1996). Sometimes the program is embedded as one component in a multicomponent juvenile intervention package (Trusty 1995; Rasmussen 1996).

How the intervention might work

The underlying theory of programs such as Scared Straight is deterrence. Program advocates and others believe that realistic depictions of life in prison and presentations by inmates will deter juvenile offenders or children at risk for becoming delinquent from further involvement with crime. Although the harsh and sometimes vulgar presentation in the earlier New Jersey version is the most well known, inmate presentations are now sometimes designed to be more educational than confrontational but with a similar crime prevention goal (Lundman 1993; Finckenauer 1999). Some of these programs feature discussions in which the adult inmates confront and challenge the juveniles about their behavior, also referred to as 'rap sessions'. Programs featuring inmates as speakers who describe their life experiences and the current reality of prison life have a rather long history, in the USA at least (Michigan D.O.C. 1967; Brodsky 1970).

Why it is important to do this review

In 1982, a randomized controlled trial testing the New Jersey program was published, reporting no effect on the criminal behavior of participants in comparison with a no‐treatment control group (Finckenauer 1982). In fact, Finckenauer reported that participants in the experimental program were more likely to be arrested. Other randomized trials reported in the USA also questioned the effectiveness of Scared Straight‐type programs in reducing subsequent criminality (GERP&DC 1979; Lewis 1983).

Despite the convergence of evidence from these studies, Scared Straight‐type programs remained popular and continued to be used in the USA through the 1990s (Finckenauer 1999). For example, a program in Carson City, Nevada (USA) took juvenile delinquents on a tour of an adult Nevada State Prison (Scripps 1999). One youngster claimed that the part of the tour that made the most impact on him was, "all the inmates calling us for sex and fighting for our belongings" (Scripps 1999). The United Community Action Network has its own program called 'Wisetalk' in which at‐risk youth are locked in a jail cell for over one hour with four or five parolees. They claim that only 10 of 300 youngsters exposed to this intervention were re‐arrested (U‐CAN 2001). In 2001, a group of guards ‐ apparently without the knowledge of administrators ‐ strip‐searched Washington DC students during their tours of a local jail under the guise that they were using "a sound strategy to turn around the lives of wayward kids" ‐ claiming the prior success of Scared Straight (Blum 2001). It is not surprising that such programs are popular: they fit with some commonly held notions about how to prevent or reduce crime (by 'getting tough'); they are very inexpensive (a Maryland program was estimated to cost less than USD1 US per participant); and they provide one way for incarcerated offenders to contribute productively to society by preventing youngsters from following the same path (Finckenauer 1982).

In 2000, Petrosino and his colleagues reported on a preliminary systematic review of nine randomized field trials, drawing on the raw percentage differences in each study (Petrosino 2000). They found that programs such as Scared Straight generally increased crime between 1% and 28% in the experimental group when compared to a no‐treatment control group. In 2002, our formal Cochrane review was published (Petrosino 2002) (simultaneously as a pilot Campbell Collaboration review), which updated the 2000 work and used more sophisticated meta‐analytic techniques. We reported similarly negative findings for Scared Straight and juvenile‐awareness programs.

Still, Scared Straight type programs continue. In 2003, then‐Governor of Illinois, Rod Blagojevich, signed a bill into law that mandated the Chicago Public School system set up a program called 'Choices' (Swanson 2003). The program would identify students at risk for committing future crime and set up a program to give them 'tours of state prison' to discourage any future criminal conduct (Swanson 2003). More recently, the Arts and Entertainment (A&E) station has been running a weekly series entitled 'Beyond Scared Straight'. Created by the producer of the original Scared Straight program (Arnold Shapiro), the program is now the highest rated in A&E's history (Denhart 2011). The success of the television show has renewed interest in Scared Straight and similar programs as a crime prevention strategy (for example, Denhart 2011), but has also resulted in criticism that it ignores a long history of scientific evidence (for example, Robinson 2011).

The question about whether Scared Straight and similar programs have a crime deterrent effect is best answered by continued examination of the existing scientific evidence. The current review updates the version published in 2002 and includes new and extended searches to December 2011, as well as additional analyses.

Objectives

To assess the effects of programs comprising organized visits to prisons of juvenile delinquents (officially adjudicated or convicted by a juvenile court) or predelinquents (children in trouble but not officially adjudicated as delinquents), aimed at deterring them from criminal activity.

Methods

Criteria for considering studies for this review

Types of studies

Only studies that used randomization or quasi‐random procedures (that is, alternate assignment such as all odd numbered cases to treatment and even numbered cases to control) to assign participants, with or without blinding, were included, provided they had a no‐treatment control group.

Types of participants

Only studies involving juveniles, that is children 17 years of age or younger, were included. Participants were delinquents or predelinquents. Studies that contain overlapping samples of juveniles and young adults (for example, ages 13 to 21 years) were also included.

Types of interventions

Only studies that featured as their main component a visit by program participants to a prison facility were included. Programs may include a presentation by the inmates, ranging from graphic (Finckenauer 1982) to educational (Cook 1992). Additionally, programs may feature an orientation session (for example, living as a prisoner for eight hours) or a tour of the facility.

Types of outcome measures

Primary outcomes

The interest of citizens, policy and practice decision‐makers, media, and the research community is in whether Scared Straight and its variations have any crime deterrent effect, therefore crime measures are our primary outcomes. Studies had to report at least one outcome of subsequent offending behavior, as measured by such indices as arrests, convictions, contacts with police or self‐reported offenses.

Secondary outcomes

We had no secondary outcomes in our analysis, although 'non‐crime' measures (for example, attitudinal, educational) reported by the primary investigators are included in Table 1 to enable review authors in the Cochrane and Campbell Collaborations to identify potentially eligible studies for their systematic reviews.

Open in table viewer

Table 1. Crime outcome data reported in original studies

Study Reference	At 3 months	At 6 months	At 9 months	At 12 months	Beyond 12 months
Michigan D.O.C. 1967		Percentage with new offense or new violation of probation
GERP&DC 1979		Percentage subsequently contacted by police
Yarborough 1979	Percentage with new offenses, type of offenses, percentage with new petitions, average offense rate and standard deviations, average weeks to new offense and standard deviations, number of days in detention and standard deviations	Percentage with new offenses, type of offenses, percentage with new petitions, average offense rate and standard deviations, average weeks to new offense and standard deviations, average days in detention and standard deviations
Orchowsky 1981		Percentage with new intakes, average intakes (no standard deviations but test statistic), average severity score (no standard deviations but test statistic)	Percentage with new intakes, average intakes (with no standard deviations but test statistic) and average severity score (no standard deviations but test statistic)	Percentage with new intakes, average intakes (no standard deviations but test statistic), average severity score (no standard deviations but test statistic)
Vreeland 1981		Percentage with new offenses (official measures), percentage with new offenses (self‐reported data)
Finckenauer 1982		Percentage new complaints, contacts or court appearances, average severity score (no standard deviation, but test statistic)
Lewis 1983				Percentage arrested, percentage charged, average arrests (no standard deviation), average charges (no standard deviation), average time to first arrest (no standard deviation)
Locke 1986		Only test statistic reported
Cook 1992				Average offenses (no standard deviations), average severity score (no standard deviations)	Average offenses (no standard deviations), average severity score (no standard deviations)

Search methods for identification of studies

To minimize publication bias, we conducted a search strategy designed to identify published and unpublished studies. We also conducted a comprehensive search strategy to minimize discipline bias, that is, that evaluations reported in criminological journals or indexed in field‐specific abstracting databases might differ from those reported in psychological, sociological, social service, public health or educational sources. The search methods for the original review are described in detail in Appendix 1.

In December 2011 we searched 11 of the 16 previously searched databases, and expanded our searches to include an additional nine bibliographic sources. We searched all available years of the additional sources, and limited the search of the databases used previously to 2001 onwards. The five databases not searched for this update included one that was no longer accessible (C2‐Spectr), and four that produced zero yield in the previous searches (Current Contents, GPO Monthly, National Clearinghouse of Child Abuse and Neglect (NCCAN) abstracts, and Political Science Abstracts). In November 2012 we also searched two trials registers. The 22 databases searched during the update were:

Cochrane Central Register of Controlled Trials (CENTRAL), 2011(4), searched December 2011
Academic Search Premier, all available dates to December 2011
Ovid MEDLINE, 2001 to December 2011
Clinical Trials.Gov, all available dates, searched November 2012
Criminal Justice Abstracts, 2001 to December 2011
Directory of Open Access Journals, all available dates to December 2011
Dissertations and Theses (ProQuest), which covers Dissertation Abstracts, 2001 to December 2011
Education FullText, 2001 to December 2011
ERIC (Proquest), 2001 to December 2011
Google Scholar, all available dates, searched December 2011
HeinOnline, all dates to December 2011
Illinois Researcher Information Service (IRIS), all dates to December 2011
International Bibliography of the Social Sciences, 2001 to December 2011
National Criminal Justice Reference Service Abstracts Database (NCJRS), 2001 to December 2011
Public Affairs Information Service (PAIS), 2001 to December 2011
PsycArticles, all dates to December 2011
PsycINFO, 2001 to December 2011
SCOPUS Science Direct, all dates to December 2011
Scandinavian Research Council for Criminology, all dates to December 2011
Sociofile, including Sociological Abstracts and Social Planning and Development Abstracts, 2001 to December 2011
SSCI (Web of Science), which includes the Social Science Citation Index (SSCI), 2001 to December 2011
World Health Organization International Clinical Trials Registry Platform (ICTRP), searched November 2012

Our keywords were similar to those used in the previous two searches. A list of search terms is provided in Appendix 1.

We also contacted an informal list of researchers in the field, and examined citations in relevant literature, including previous systematic and narrative reviews. We did not limit our results to English language journals, and did retrieve some abstracts in Spanish (but none to empirical studies), but one limitation is that our search terms were entered in English. Our next update will include a wider range of terms and translation of these terms into Spanish and French languages.

Data collection and analysis

Selection of studies

AP screened citations generated for the original review. AP and CTP independently examined these citations. Full reports were obtained for 11 potential randomized trials. Both review authors agreed that two of these should be excluded. Arbitration was not required as the two review authors agreed. For this update, two review authors (MHP and JL) scanned each citation and determined that there were no trials suitable for inclusion in this review. Details of six new 'excluded studies' with reasons for exclusion are provided in Excluded studies.

Data extraction and management

AP extracted data from each of the nine main study reports using a specially designed instrument adapted from his earlier study (Petrosino 1997), and included items are listed in the 'Characteristics of included studies'. Where outcome information was missing from the original reports, we made attempts via email and regular mail correspondence to retrieve the data for the analysis from the original investigators. Investigators were helpful but unable to locate additional data. In two cases we retrieved unpublished Masters' theses from university libraries to see if they contained this information (Locke 1984; Cook 1990). They did not. Another review author (CTP) double checked all extracted data on outcomes to ensure they were correct.

Assessment of risk of bias in included studies

For each study, we assessed methodological quality using the Cochrane 'Risk of bias' tool. The study reports generally lacked explicit details about randomization and concealment, and the 'Risk of bias' ratings reflect the uncertainty stemming from this lack of description. The Cochrane 'Risk of bias' tool asks review authors to rate each of the following areas of risk:

random sequence generation;
allocation concealment;
blinding of participants and personnel;
blinding of outcome assessment;
incomplete outcome data (attrition);
selective reporting;
other sources of bias. Here we rated whether the implementation of the program rendered a fair test. This is a very low cost and easy to implement program, and no reports included details of program implementation problems.

Measures of treatment effect

Studies had to include at least one outcome of subsequent offending behavior, as measured by such indices as arrests, convictions, contacts with police or self‐reported offences. The interest of citizens, policy and practice decision‐makers, media and the research community is in whether Scared Straight and other kids visit prison programs have any effect on these measures. Although we do not analyze them, we list other 'noncrime measures' and their effects (for example, attitudinal, educational) reported by evaluators in case subsequent review authors in the Cochrane or Campbell Collaborations require them.

Unit of analysis issues

All of the included studies involved randomization of individuals to conditions. No cluster‐randomized trials were located. Most studies involved a single treatment and a single control group; in one instance in which multiple groups were involved (Vreeland 1981), we only included data from the strongest contrast (the most intensive treatment versus control).

Dealing with missing data

As mentioned earlier, we made unsuccessful attempts to acquire missing outcome data for two studies (Locke 1986; Cook 1992). Due to the lack of subsequent follow‐up intervals for outcome measurement in the included studies, we focused exclusively on first treatment effects. This likely limited missing outcome data problems as only one study experienced postrandomization attrition (Yarborough 1979). We examined the impact of excluding this study in a sensitivity analysis, discussed below.

Assessment of heterogeneity

The included studies represent some variation in geographic locations, specific types of interventions implemented, and juvenile treatment populations. Thus, heterogeneity should be examined, although the small number of included studies makes interpretation risky. The Chi² and I² statistics for heterogeneity are reported.

Assessment of reporting biases

Seven studies were included in the meta‐analyses, and just two were published in academic peer‐reviewed publications (Finckenauer 1982; Lewis 1983). Therefore, we do not believe publication bias is a threat to the results. In the future, if additional studies are located, we will include Egger's regression test for funnel plot asymmetry (Egger 1997).

Data synthesis

Using Review Manager software (RevMan 2011), we expressed dichotomous outcome measures of crime as odds ratios (OR). We reported the 95% confidence intervals (CI). Both fixed‐effect and random‐effects models were assumed across the randomized trials and compared to assess the impact of statistical heterogeneity, and both were reported. We examined OR at first follow‐up interval, that is, first post‐treatment effect.

Subgroup analysis and investigation of heterogeneity

No subgroup analyses were determined a priori at the protocol stage. We did not change our plans, given that only seven studies that included outcome data for analysis. Thus, we did not explore heterogeneity by conducting analyses of subgroups or moderators.

Sensitivity analysis

We conducted two sensitivity analyses that examined the impact on the results of excluding studies with significant methodological issues. The first analysis involved dropping a study that experienced randomization problems (Finckenauer 1982). The second sensitivity analysis involved dropping a study that involved substantial postrandomization attrition (Yarborough 1979).

Results

Description of studies

Whether relying on the actual data reported or measures of statistical significance, the nine trials do not yield evidence for the effectiveness of 'Scared Straight' and other juvenile awareness programs on subsequent delinquency.

Michigan Department of Corrections (1967)

In an internal, unpublished government document, the Michigan Department of Corrections reported a trial testing a program that involved taking adjudicated juvenile boys on a tour of a state reformatory (Michigan D.O.C. 1967). Unfortunately, the report is remarkably brief. Sixty juvenile delinquent boys were randomly assigned to attend two tours of a state reformatory or to a no‐treatment control group. Tours included 15 juveniles at a time. No other part of the program is described. Recidivism was measured as a petition in juvenile court for either a new offense or a violation of existing probation order. The Michigan Department of Corrections found that 43% of the experimental group reoffended, compared to only 17% of the control group. This large negative result curiously receives little attention in the original document.

The Greater Egypt Planning and Development Commission, Illinois, USA (1979)

This program at the Menard Correctional Facility started in 1978 and is described as a frank and realistic portrayal of adult prison life. The researchers randomly assigned 161 youths aged 13 to 18 years to attend the program or a no‐treatment control. The participants were a mix of delinquents or children at risk of becoming delinquent. Participants were compared on their subsequent contact with police, on two personality inventories (Piers‐Berne and Jesness) and used surveys of parents, teachers, inmates and young people. The outcomes are also negative in direction but not statistically significant, with 17% of the experimental participants being recontacted by police in contrast to 12% of the controls (GERP&DC 1979). The authors concluded that, "Based on all available findings one would be ill advised to recommend continuation or expansion of the juvenile prison tours. All empirical findings indicate little positive outcome, indeed, they may actually indicate negative effects" (p. 19). Researchers report no effect for the program on two attitude tests (Jesness Inventory, Piers Harris Self‐Concept Scale). In contrast, interview and mail surveys of participants and their parents and teachers indicated unanimous support for the program (p. 12). Researchers also note how positive and enthusiastic inmates were about their efforts.

Michigan JOLT Study, USA (Yarborough 1979)

In the Juvenile Offenders Learn Truth (JOLT) program, juvenile delinquents in contact with one of four Michigan county courts participated. Each juvenile spent five total hours in the facility. Half of this time was spent in a confrontational 'rap' session. This followed a tour of the facility, during which participants were escorted to a cell and exposed to interaction with inmates (for example, taunting). In the evaluation, 227 youngsters were randomly assigned to JOLT or to a no‐treatment control. Participants were compared on a variety of crime outcomes collected from participating courts at three and six months' follow‐up. This second Michigan study reported very little difference between the intervention and control group (Yarborough 1979). The average offense rate for program participants, however, was 0.69 compared to 0.47 for the control group. Yarborough (p. 14) concluded that, "…the inescapable conclusion was that youngsters who participated in the program, undergoing the JOLT experience, did no better than their control counterparts."

Virginia Insiders Program, USA (Orchowsky and Taylor 1981)

The Insiders Program was described as an inmate‐run, confrontational intervention with verbal intimidation and graphic descriptions of adult prison life. Juveniles were locked in a cell 15 at a time and told about the daily routine by a guard. They then participated in a two‐hour confrontational rap session with inmates. Juvenile delinquents from three court service units in Virginia participated in the study. The investigators randomly assigned 80 juveniles ages 13 to 20 years with two or more prior adjudications for delinquency to the Insiders program or a no‐treatment control group. Orchowsky and Taylor report on a variety of crime outcome measures at six‐, nine‐, and 12‐month intervals. The only positive findings, though not statistically significant, were reported in Virginia (Orchowsky 1981). Although the difference at six months was not statistically significant (39% of controls had new court intakes versus 41% of experimental participants), they favor the experimental participants at nine and 12 months. The investigators noted, however, that the attrition rates in their experiment were dramatic. At nine months, 42% of the original sample dropped out, and at 12 months, 55% dropped out. The investigators conducted analyses that seemed to indicate that the constituted groups were still comparable on selected factors.

Texas Face‐to‐Face Program, USA (Vreeland 1981)

The Face‐to‐Face program included a 13‐hour orientation session in which the juvenile lived as an inmate followed by counseling. Participants were 15 to 17 years of age and on probation from Dallas County Juvenile Court; most averaged two or three offenses before the study. A total of 160 boys were randomly assigned to four conditions: prison orientation and counseling, orientation only, counseling only or a no‐treatment control group. Vreeland examined official court records and self‐reported delinquency at six months. This evaluation also reported little effect for the intervention (Vreeland 1981). Vreeland reported that the control participants outperformed the three treatment groups on official delinquency (28% delinquent for control versus 39% for prison orientation plus counseling versus 36% for prison onlyversus 39% for counseling only). This more robust measure contradicts data from the self‐report measures used, which suggest that all three treatment groups did better than the no‐treatment controls. None of these findings reached a level of statistical significance. Viewing all the data, Vreeland concluded that there was no evidence that Face‐to‐Face was an effective delinquency prevention program. He finds no effect for Face‐to‐Face on several attitudinal measures, including the 'Attitudes Toward Obeying Law Scale.'

New Jersey 'Scared Straight' Program, USA (Finckenauer 1982)

The New Jersey Lifers' Program began in 1975 and stressed confrontation with groups of juveniles ages 11 to 18 years who participated in a rap session. Finckenauer randomly assigned 82 juveniles, some of whom were not delinquents, to the program or to a no‐treatment control group. He then followed them for six months in the community, using official court records to assess their behavior. Finckenauer reported that 41% of the children and young people who attended the 'Scared Straight' program in New Jersey committed new offenses, while only 11% of the controls did, a difference that was statistically significant (Finckenauer 1982). He also reported that the program participants committed more serious offenses and that the program had no impact on nine attitude measures with the exception of a measure called 'attitudes toward crime.' On this measure experimental participants did much worse than controls. We deal with Finckenauer's own concerns about randomization integrity in a sensitivity analysis that is reported later.

California SQUIRES Program, USA (Lewis 1983)

This is supposedly the oldest such program in the USA beginning in 1964 (Lewis 1983). The San Quentin Utilization of Inmate Resources, Experience and Studies (SQUIRES) program included male juvenile delinquents from two California counties between the ages of 14 and 18 years, most with multiple prior arrests. The intervention included confrontational rap sessions with rough language, guided tours of prison with personal interaction with prisoners, and a review of pictures depicting prison violence. The intervention took place one day per week over three weeks. The rap session was three hours long, and normally included 20 youngsters at a time. In the study, 108 participants were randomly assigned to treatment or to a no‐treatment control group. Lewis compared participants on seven crime outcomes at 12 months. Lewis reported that 81% of the program participants were arrested compared to 67% of the controls. He also found that the program did worse with seriously delinquent youths, leading him to conclude that such children and young people could not be "turned around by short‐term programs such as SQUIRES…a pattern for higher risk youth suggested that the SQUIRES program may have been detrimental" (p. 222). The only deterrent effect for the program was the average length of time it took to be rearrested: 4.1 months for experimental participants and 3.3 months for controls. Data were reported on eight attitudinal measures, and Lewis reported that the program favored the experimental group on all of them, again underscoring the difficulty of achieving behavioral change even when positively affecting the attitudes of juvenile delinquents.

Kansas Juvenile Education Program, USA (Locke et al. 1986)

Kansas Juvenile Education Program (KEP) was designed to educate children about the law and the consequences of violating it (Locke 1986). The program also tried to match juveniles with inmates based on personality types. Fifty‐two juvenile delinquents aged 14 to 19 years from three Kansas counties were randomly assigned while on probation to KEP or a no‐treatment control. The investigators examined official (from police and court sources) and self‐report crime outcomes at six months. Locke and his colleagues reported little effect of the KEP program. Both groups improved from pretest to post‐test but the investigators concluded that there were no differences between experimental and control groups on any of the crime outcomes measured. Investigators also reported no effect for the program on the Jesness and Cerkovich attitude tests.

Mississippi Project Aware, USA (Cooke and Spirrison 1992)

Project Aware was a nonconfrontational, educational program comprising one five‐hour session run by prisoners (Cook 1992). The intervention was delivered to juveniles in groups of six to 30. In the study, 176 juveniles (ages 12 to 16 years) under the jurisdiction of the county youth court were randomly assigned to the program or to a no‐treatment control. The experimental and control groups were compared on a variety of crime outcomes retrieved from court records at 12 and 24 months. Little difference was found between experimental and control participants in the study. For example, the mean offending rate for controls at 12 months was 1.25 for control cases versus 1.32 for Project Aware participants. Both groups improved from 12 to 24 months, but the control mean offending rate was still lower than the experimental group. The investigators concluded that, "attending the treatment program had no significant effect on the frequency or severity of subsequent offenses" (p. 97). The investigators also reported on two educational measures: school attendance and dropout. Curiously, they report an effect for the program on school dropout data, but not that "…it is not clear how the program succeeded in reducing dropout rates…" (p. 97).

Results of the search

The search methods for the original review generated 487 citations, most of which had abstracts. AP screened these citations, determining that 30 were evaluation reports. AP and CTP independently examined these citations and agreed that 11 were potential randomized trials. All reports were obtained. Upon inspection of the full‐text reports, we excluded two studies. One study was excluded because it did not include any post program measure of offending. This was 'Project Aware', which had been conducted in a Wisconsin prison (Dean 1982). Attempts to contact the study author or retrieve these data from any other reports by the Wisconsin Department of Corrections have been unsuccessful. A second study of 'Stay Straight', conducted in Hawaii, was also excluded, due to the absence of random assignment (Chesney‐Lind 1981). After the two exclusions, we were left with nine randomized trials.

Our updated searches yielded no new eligible studies or reports of any ongoing trials. Two review authors (MHP and JL) scanned each citation and identified five potentially relevant reports. One, an evaluation of a Scared Straight program for truants, was excluded because it did not involve randomization (Bazemore 2004). Another study was excluded because it did not include eligible outcome measures; it measured change in attitudes toward jail or prison (Feinstein 2005). Two articles discussed a related 'experiment' (Blunkett 2008; Wilson 2010), but upon further examination we discovered these studies did not use experimental methods or eligible outcomes. Another positive descriptive report was identified of a juvenile awareness program involving 'fear appeal messages' (Windell 2005), but no evaluative data were provided. A systematic review (Klenowski 2010) was identified that included narrative descriptions of 10 studies, but it contained no new studies eligible for inclusion in our review.Thus, information contained in this update is based on studies located for the previous review.

Included studies

Collectively, the nine studies were conducted in eight different states of the USA, with Michigan the site for two studies (Michigan D.O.C. 1967; Yarborough 1979). No set of researchers conducted more than one experiment. The studies span the years 1967 to 1992. The first five studies located were unpublished and were disseminated in government documents or dissertations; the remaining four were found in academic journal or book publications. The average age of the juvenile participants in each study ranged from 15 to 17 years. Only the New Jersey study included girls (Finckenauer 1982). Racial composition across the nine studies was diverse, ranging from 36% to 84% white people. Most of the studies dealt with delinquent youths already in contact with the juvenile justice system. All of the experiments were simple two‐group experiments except Vreeland's evaluation of the Texas Face‐to‐Face program (Vreeland 1981). Only one study used quasi‐random alternation techniques to assign participants (Cook 1992); the remaining studies claimed to use randomization although not all were explicit about how such assignment was conducted. Only the Texas study (Vreeland 1981) included data from self‐report measures. In two studies (Locke 1986; Cook 1992), no postintervention offending rates were reported. Some of the studies that included average or mean rates did not include standard deviations to make it possible to compute the weighted mean effect sizes. Also, the follow‐up periods were diverse and included measurements at three, six, nine, 12 and 24 months.

Excluded studies

There were six studies that were excluded during this update. These are often included in other review authors' samples. We describe these in more detail below, along with their reason for exclusion.

Bazemore 2004 evaluated a 'Scared Straight' program for truants, however their study did not involve randomization. This program involved a collaborative intervention administered by a local sheriff's department. It followed 550 youth (350 'treatment' and 200 'control'). Three outcome measures were used: (1) whether or not youths returned to school the next day or were stopped by an officer (different measures for treatment and control youths), (2) comparison of the number of unexcused absences 30 days pre‐intervention and postintervention and (3) total number of days of school missed following the intervention. Delinquent involvement was also measured. This study provided mixed results regarding program effectiveness.

Berry 1985 evaluated the 'Shape Up' program carried out in Colorado. The experimental group consisted of 30 males ages 14 to 18 years, and the control group consisted of 27 males of the same age. The study used a matched comparison group design and did not use randomization. The study assessed perception of certainty, severity and seriousness of punishment; delinquency proneness; intelligence quotient (IQ); family dynamics as measured by Family Adaptability and Cohesion Evaluation Scale (FACES) II, and recidivism rates. No difference was found between the two groups on attitude change, re‐arrest, conviction, and weighted seriousness of crime after program involvement.

Buckner 1983 evaluated a program, 'Stay Straight', which was carried out in Hawaii. This study did not randomize participants and instead used a matched comparison group design. They assessed rearrest rates, finding that there was no effect on female participants. Male participants had higher rearrest rates than nonparticipants following the intervention.

Chesney‐Lind 1981 evaluated a program, 'Stay Straight', which was carried out in Hawaii. This study was excluded due to a lack of random assignment of participants. An after‐the‐fact matched group design was used in this study. The frequency and severity of police arrests in the year following program exposure was used as an outcome measure.

Dean 1982 evaluated a two‐session juvenile awareness program in Wisconsin. This study used a small sample of boys who were involved in a residential treatment program for delinquents. The study assessed 13 traits thought to be associated with a delinquent personality finding internal locus of control had increased significantly, while chance expectation and social self concept had decreased significantly. A pretest‐post‐test design with randomization was used, but no data on delinquency outcomes were collected.

Langer 1980 evaluated the Juvenile Awareness Program of the Lifers' Group at the Rahway State Prison in New Jersey. This study used a matched comparison group design. The study assessed delinquent involvement, finding that at the 10‐month follow‐up there was no significant difference between treatment and control groups. At long‐term (average of 22 months) follow‐up, the control group had significantly higher delinquency rates than the treatment group.

Risk of bias in included studies

Review authors AP and MHP rated quality of included studies using The Cochrane Collaboration's 'Risk of bias' tool (Higgins 2011). Unfortunately, clear data on all seven items in the 'Risk of bias' tool was not included in study reports. Figure 1 provides summary results, and we discuss each of the rating areas below.

Figure 1

Risk of bias summary: review authors' judgments about each risk of bias item for each included study

Green circle: low risk of bias
Question mark: unclear risk of bias
Red circle: high risk of bias

Allocation

Random sequence generation

Finckenauer reported violations of randomization (Finckenauer 1982). Only eight of the 11 participating agencies that referred troubled or delinquent boys to the program correctly assigned their cases. Finckenauer did conduct additional analyses in an attempt to compensate for violation of randomization. We agreed that a sensitivity analysis should be done to determine the influence of this evaluation on the pooled analysis (Analysis 1.3). Another study was rated as at high risk of bias because alternation was used (Cook 1992). This latter study was not included in the meta‐analysis because it did not include data on postintervention offending. Two other studies did not provide any further information on randomization and their risk of bias was rated as 'unclear' (GERP&DC 1979; Michigan D.O.C. 1967).

Allocation concealment

All of the studies are rated as presenting 'unclear' risk as there is no information on how randomization was performed.

Blinding

Blinding of participants and personnel (performance bias)

Blinding was not possible in these studies, and all are rated as presenting ''high risk'.

Blinding of outcome assessment (detection bias)

We should note that only one study author reported that steps were taken to 'blind' those responsible for collecting the outcome data to treatment assignment (Michigan D.O.C. 1967) and is rated as presenting 'low risk'. All others are rated as presenting 'unclear risk'.

Incomplete outcome data

Six studies experienced little or no attrition and are rated as presenting 'low risk' of bias. Two studies appeared to report significant attrition (defined as 10% or more from the originally randomized sample). The Virginia Insiders study reported a major loss of participants from the initial randomization sample (Orchowsky 1981). They reported this, however, at the second and third follow‐up intervals (not the first, at six months). Because there was a paucity of data beyond the immediate follow‐up interval across studies, we only conducted a pooled analysis using data at that time interval. Therefore a sensitivity analysis of the impact of this later attrition was not performed. The Cook study is also rated as presenting a 'high risk' due to attrition, but the study did not include data for the first follow‐up and was not included in any meta‐analyses (Cook 1992).

The Michigan JOLT study reported a large number of no‐shows but they were deleted from the analysis (Yarborough 1979). The problem is that we do not know how many participants were initially assigned and no data were reported that the remaining sample was similar to the initial sample. We also conducted a sensitivity analysis to determine the influence of this study on the pooled analysis.

Selective reporting

We rated this as presenting a 'low risk' of bias across the studies. In several cases, the program was a government intervention and the researchers were employed by the same agency; nonetheless, the negative or null findings were clearly presented (Michigan D.O.C. 1967; GERP&DC 1979; Yarborough 1979; Orchowsky 1981; Lewis 1983). In three instances, the authors were students and a number of outcomes were presented (Vreeland 1981; Locke 1986; Cook 1992). In another instance, the author was an academic researcher who presented a number of findings in an academic book (Finckenauer 1982).

Other potential sources of bias

In terms of 'other bias' as rated on the tool, a major threat to study results is if the program is so poorly implemented that it does not represent a true test of the treatment. Scared Straight programs appear to be relatively simple and short‐term and pose few problems for implementation. No investigator reported implementation problems, and we rated these as 'low risk' of bias. We should note that not one of the nine included studies provided data on monitoring of the control group to determine if compensation was an issue. It is probably very unlikely that control group participants received anything like Scared Straight but it was not specifically addressed by authors of the reports.

Effects of interventions

Findings from the individual studies

Whether relying on the actual data reported or measures of statistical significance, the nine trials do not yield evidence for the effectiveness of Scared Straight and other juvenile awareness programs on subsequent delinquency. In the first such study, the Michigan Department of Corrections found that 43% of the experimental group reoffended, compared to only 17% of the control group (Michigan D.O.C. 1967). No test of statistical significance was reported by the trialists. We performed a Chi² test, which indicated no statistical significance for this outcome, likely due to the low statistical power of the sample. The original document does not comment on this large percentage difference.

In Illinois, the outcomes were also negative in direction but not statistically significant, with 17% of the experimental participants being recontacted by police in contrast to 12% of the controls (GERP&DC 1979). The authors concluded that "based on all available findings one would be ill‐advised to recommend continuation or expansion of the juvenile prison tours. All empirical findings indicate little positive outcome, indeed, they may actually indicate negative effects" (p. 19). Researchers reported no effect for the program on two attitude tests (Jesness Inventory, Piers Harris Self‐Concept Scale). In contrast, interview and mail surveys of participants and their parents and teachers indicated unanimous support for the program (p. 12). Researchers also note how positive and enthusiastic inmates were about their efforts.

The second Michigan study also reported very little difference between the intervention and control group (Yarborough 1979). The average offense rate for program participants, however, was 0.69 compared to 0.47 for the control group. As Yarborough (p. 14) pointed out, "…the inescapable conclusion was that youngsters who participated in the program, undergoing the JOLT experience, did no better than their control counterparts."

The only positive findings, though not statistically significant, were reported in Virginia (Orchowsky 1981). Although the difference at six months was not statistically significant (39% of controls had new court intakes versus 41% of experimental participants), they favor the experimental participants at nine and 12 months. The investigators noted, however, that the attrition rates in their experiment were dramatic. At nine months, 42% of the original sample dropped out, and at 12 months, 55% dropped out. The investigators conducted analyses that seemed to indicate that the constituted groups were still comparable on selected factors such as race and age.

A study of the Face‐to‐Face program in Texas also reported little effect for these interventions (Vreeland 1981). Vreeland 1981 reported that the control participants outperformed the three treatment groups on official delinquency (28% delinquent for control versus 39% for prison orientation plus counseling versus 36% for prison only versus 39% for counseling only). This more robust measure contradicts data from the self‐report measures used, which suggest that all three treatment groups did better than the no‐treatment controls. None of these findings reached a level of statistical significance. Viewing all the data, Vreeland 1981 concluded that there was no evidence that Face‐to‐Face was an effective delinquency prevention program. He finds no effect for Face‐to‐Face on several attitudinal measures, including the Attitudes Toward Obeying Law Scale.

Finckenauer 1982 reported that 41% of the children and young people who attended the Scared Straight program in New Jersey committed new offenses, while only 11% of controls did, a difference that was statistically significant. He also reported that the program participants committed more serious offenses and that the program had no impact on nine attitude measures with the exception of a measure called 'attitudes toward crime.' On this measure experimental participants did much worse than control participants. We deal with Finckenauer's own concerns about randomization integrity in this study in a sensitivity analysis.

Additional evidence of a possible harmful effect can be found in the evaluation of the California SQUIRES program (Lewis 1983). Lewis 1983 reported that 81% of the program participants were arrested compared to 67% of the controls. He also found that the program did worse with seriously delinquent youths, leading him to conclude that such children and young people could not be "turned around by short‐term programs such as SQUIRES…a pattern for higher risk youth suggested that the SQUIRES program may have been detrimental" (p. 222). The only deterrent effect for the program was the average length of time it took to be rearrested: 4.1 months for experimental participants and 3.3 months for control participants. Data were reported on eight attitudinal measures, and Lewis reported that the program favored the experimental group on all of them, again underscoring the difficulty of achieving behavioral change even when positively affecting the attitudes of juvenile delinquents.

Locke and his colleagues reported little effect of the Juvenile Education Program in the Kansas State Prison (Locke 1986). Both groups improved from pretest to post‐test but the investigators concluded that there were no differences between experimental and control groups on any of the crime outcomes measured. Investigators also reported no effect for the program on the Jesness and Cerkovich attitude tests.

Finally, little difference was found between experimental and control participants in the Mississippi Project Aware study (Cook 1992). For example, the mean offending rate for control participants at 12 months was 1.25 versus 1.32 for Project Aware participants. Both groups improved from 12 to 24 months, but the control mean offending rate was still lower than the experimental group. The investigators concluded that, "attending the treatment program had no significant effect on the frequency or severity of subsequent offenses" (p. 97). The investigators also reported on two educational measures: school attendance and dropout. Curiously, they report an effect for the program on school dropout data, but note that "...it is not clear how the program succeeded in reducing dropout rates..." (p. 97).

Meta‐analysis

For each study, we extracted all of the relevant crime outcome data. Our protocol included an organization of analyses by examining official reports (from government administrative records) distinct from self‐reported criminality (obtained from investigator‐administered survey questionnaires). Given that we expected a diverse number of measures of crime to be reported, the protocol called for us to organize it into four indexes that would be most relevant to policy and practice. These included prevalence rates (what percentage of each group reoffended or did not?), average incidence rates (what was the average number of offenses or other incidents per individual in each group?), offense severity rates (what was the average severity of offenses per individual in each group?) and latency (how long was the average return to crime or failure delayed per individual in each group?). As Table 1 shows, however, few measures except for prevalence were reported.

Given the limitation of the data, we conducted one meta‐analysis. We report the crime outcomes for official measures at the first‐effect or first (and usually the only) follow‐up interval period reported. Each analysis focused on proportion data (that is, the proportion of each group reoffending), as the outcomes reporting means or averages were sparse and often did not include the standard deviations. Thus, because the data relied on dichotomous outcomes, both analyses report ORs and 95% CIs for each study. As a sensitivity analysis, we assume both random‐effects and fixed‐effect models for treatment effects across the studies.

Immediate post‐treatment effects for reoffending rates: official measures

The analysis of the data in comparison Table 1 from the seven studies reporting reoffending rates shows that intervention increases the crime or delinquency outcomes at the first follow‐up period. Assuming either a fixed‐effect or random‐effects model does not change its overall negative impact. Using a fixed‐effect model, the OR was 1.68 (95% CI 1.20 to 2.36). Heterogeneity statistics should be interpreted with caution given that only seven studies were included in the meta‐analysis (Chi² = 8.49, P value = 0.20, I² = 29%) (Analysis 1.1). The mean OR assuming a random‐effects model was similar at 1.72 (95% CI 1.13 to 2.62); heterogeneity statistics were nearly identical (Chi² = 8.50, P value = 0.20, I² = 29%) (Analysis 1.2). Both fixed‐effect OR and random‐effects OR are statistically significant; the intervention increases the odds of offending by between 1.6 to 1 and 1.7 to 1.

Sensitivity analysis 1. Excluding Finckenauer study

We excluded the Finckenauer study from the analysis because of its randomization problems. Finckenauer reported that only eight of the 11 referring agencies correctly followed the randomization procedures. His reanalyses taking these randomization problems into account still indicated a negative impact. Nonetheless, we determined to examine the impact of this study on the meta‐analytic findings. Given the little difference in OR whether assuming a fixed‐effect or random‐effects model, we conducted a meta‐analysis assuming a random‐effects model. Given that the Finckenauer study reported the largest negative effects for the program, it is not surprising that the OR decreased. However, it is still negative in direction at 1.47, and statistically significant (95% CI 1.03 to 2.11). Heterogeneity statistics should be interpreted with caution given the small number of studies (Tau² = 0.00; Chi² = 4.25, degrees of freedom (df) = 5, P value = 0.51; I² = 0%) (Analysis 1.3).

Sensitivity analysis 2. Excluding Yarborough study

We excluded the Yarborough study because of its deletion of no‐shows postrandomization from analysis of the results, indicating a potential for high attrition bias. Yarborough did not report any analyses to indicate how this affected the remaining sample. We again assumed a random‐effects model. The deletion of this study did not alter the overall negative impact of these programs, as the OR was 1.96. This is statistically significant (95% CI 1.25 to 3.08). Heterogeneity statistics should be interpreted with caution given the small number of studies (Tau² = 0.06; Chi² = 6.25, df = 5, P value = 0.28; I² = 20%) (Analysis 1.4).

Although the methodological limitations of the studies warrant our sensitivity analyses, their exclusion did not alter the main conclusion of the meta‐analyses: a significant negative impact of the program.

Sensitivity analysis 3. Excluding both Finckenauer and Yarborough studies

We excluded both the Finckenauer and Yarborough studies to see how this affected the overall meta‐analysis. As Analysis 1.5 shows, even with two studies removed for sensitivity analysis, the overall effect of the intervention in the five remaining studies shows a 'criminogenic' effect that is statistically significant, that is, favors the control group not Scared Straight (OR 1.69, 95% CI 1.10 to 2.58). Heterogeneity statistics should be interpreted with caution given only five studies are in the analysis (Tau² = 0.00; Chi² = 2.94, df = 4, P value = 0.57, I² = 0%).

Discussion

Summary of main results

These randomized trials, conducted over a 25‐year period in eight different US states, provide evidence that Scared Straight and other 'juvenile awareness' programs are not effective as a stand‐alone crime prevention strategy. More importantly, they provide empirical evidence ‐ under experimental conditions ‐ that these programs likely increase the odds that children exposed to them will commit offenses in future. Despite the variability in the type of intervention used, ranging from harsh, confrontational interactions to tours of the facility, they converge on the same result: an increase in criminality in the experimental group when compared to a no‐treatment control. Doing nothing would have been better than exposing juveniles to the program.

We noted that the other two trials that did not report prevalence data for the meta‐analysis also reported no effect for the intervention (Locke 1986; Cook 1992). Indeed, the mean data from the Mississippi study was also negative in direction, and the Kansas investigators reported that the self‐reported data showed a negative impact.

Overall completeness and applicability of evidence

Given that the seven trials used in the meta‐analysis were conducted in six states using different conceptions of the intervention underscore the high external validity of these findings. However, note that all trials were of US programs, and no trial was reported after 1992. Indeed, no trial included in the meta‐analysis was reported since 1983.

Quality of the evidence

Nine randomized trials were included in the review; only randomized trials, if implemented with good fidelity, produce statistically unbiased effects. However, the nine studies were not exemplars of trial quality. These were small studies, with very few providing convincing evidence that they reduced bias threats as measured by the Cochrane 'Risk of bias' tool (Figure 1). In fact, for some of the bias threats, the trials were rated with a great deal of uncertainty due to the lack of descriptive data in the report. However, three sensitivity analyses were conducted, the first dropping the study that experienced the greatest threat of bias due to randomization compromise (Finckenauer 1982), the second study that lost a considerable number of participants postrandomization (Yarborough 1979), and the third dropping them both. The effect sizes remained stable in all three analyses, indicating that the negative effect for Scared Straight and other juvenile awareness findings is robust.

Potential biases in the review process

Although we believe we have identified all relevant RCTs, it is possible that studies in languages other than English and not indexed in English language databases could have been missed. In addition, it is possible that the sensitivity of our search could have been increased; for example, by using additional indexing terms specific to the databases we searched and using truncation to ensure we searched for word variations. A revised search strategy will be developed for the next update.

Agreements and disagreements with other studies or reviews

The results of this review converge with the findings from many other narrative or quantitative reviews. This is expected as the reviews generally consider the same studies. For example, reviewers of research on the effects of crime prevention programs have not found deterrence‐oriented programs, such as Scared Straight, effective (Lipsey 1992; Lundman 1993; Sherman 1997). In fact, the University of Maryland's well‐publicised review of over 500 crime prevention evaluations listed Scared Straight as one program that 'doesn't work' (Sherman 1997). These findings also mirror a meta‐analysis of juvenile prevention and treatment programs by Lipsey 1992, who indicated that the effect size for 11 "shock incarceration and 'Scared Straight' programs" was ‐0.14 (or produced about 7% higher recidivism rates in experimental participants than control participants assuming a 50% baseline).

The one disagreement, in terms of syntheses of evidence, is with the US Department of Justice's CrimeSolutions.Gov registry of effects on crime policies and programs (US Department of Justice 2012). The Crime Solutions project has rated the evidence as inconclusive. There are two reasons for the discrepancy. The first is that the Crime Solutions rating scheme relies on statistical significance to determine whether there is evidence of effect; indeed, some of the program evaluations included here were underpowered due to small sample size and did not report a statistically significant finding. Second, the Crime Solutions project is defining Scared Straight narrowly, as its initial iteration in New Jersey defined it, in contrast with the broader definition of Scared Straight and similar "kids visit prison" programs used here. Thus, while Crime Solutions is only considering a small set of studies that examined a narrowly defined intervention (known as Scared Straight), this review includes nine program evaluations that would fall under a broader heading of juvenile awareness programs.

Figure 1

Risk of bias summary: review authors' judgments about each risk of bias item for each included study

Green circle: low risk of bias
Question mark: unclear risk of bias
Red circle: high risk of bias

Ir a la figura de la revisiónAbrir en una pestaña nueva

Analysis 1.1

Comparison 1 Intervention versus control, crime outcome, Outcome 1 Postintervention ‐ group recidivism rates ‐ official measures only (fixed‐effect).

Ir a la figura de la revisiónAbrir en una pestaña nueva

Analysis 1.2

Comparison 1 Intervention versus control, crime outcome, Outcome 2 Postintervention ‐ group recidivism rates ‐ official measures only (random‐effects).

Ir a la figura de la revisiónAbrir en una pestaña nueva

Analysis 1.3

Comparison 1 Intervention versus control, crime outcome, Outcome 3 Sensitivity analysis ‐ excluding Finckenauer study.

Ir a la figura de la revisiónAbrir en una pestaña nueva

Analysis 1.4

Comparison 1 Intervention versus control, crime outcome, Outcome 4 Sensitivity analysis ‐ excluding Yarborough study.

Ir a la figura de la revisiónAbrir en una pestaña nueva

Analysis 1.5

Comparison 1 Intervention versus control, crime outcome, Outcome 5 Sensitivity analysis ‐ excluding both Finckenauer and Yarborough studies.

Ir a la figura de la revisiónAbrir en una pestaña nueva

Table 1. Crime outcome data reported in original studies

Study Reference	At 3 months	At 6 months	At 9 months	At 12 months	Beyond 12 months
Michigan D.O.C. 1967		Percentage with new offense or new violation of probation
GERP&DC 1979		Percentage subsequently contacted by police
Yarborough 1979	Percentage with new offenses, type of offenses, percentage with new petitions, average offense rate and standard deviations, average weeks to new offense and standard deviations, number of days in detention and standard deviations	Percentage with new offenses, type of offenses, percentage with new petitions, average offense rate and standard deviations, average weeks to new offense and standard deviations, average days in detention and standard deviations
Orchowsky 1981		Percentage with new intakes, average intakes (no standard deviations but test statistic), average severity score (no standard deviations but test statistic)	Percentage with new intakes, average intakes (with no standard deviations but test statistic) and average severity score (no standard deviations but test statistic)	Percentage with new intakes, average intakes (no standard deviations but test statistic), average severity score (no standard deviations but test statistic)
Vreeland 1981		Percentage with new offenses (official measures), percentage with new offenses (self‐reported data)
Finckenauer 1982		Percentage new complaints, contacts or court appearances, average severity score (no standard deviation, but test statistic)
Lewis 1983				Percentage arrested, percentage charged, average arrests (no standard deviation), average charges (no standard deviation), average time to first arrest (no standard deviation)
Locke 1986		Only test statistic reported
Cook 1992				Average offenses (no standard deviations), average severity score (no standard deviations)	Average offenses (no standard deviations), average severity score (no standard deviations)

Table 1. Crime outcome data reported in original studies

Ir a la tabla de la revisión

Comparison 1. Intervention versus control, crime outcome

Outcome or subgroup title	No. of studies	No. of participants	Statistical method	Effect size
1 Postintervention ‐ group recidivism rates ‐ official measures only (fixed‐effect) Show forest plot	7	794	Odds Ratio (M‐H, Fixed, 95% CI)	1.68 [1.20, 2.36]

2 Postintervention ‐ group recidivism rates ‐ official measures only (random‐effects) Show forest plot	7	794	Odds Ratio (M‐H, Random, 95% CI)	1.72 [1.13, 2.62]

3 Sensitivity analysis ‐ excluding Finckenauer study Show forest plot	6	713	Odds Ratio (M‐H, Random, 95% CI)	1.47 [1.03, 2.11]

4 Sensitivity analysis ‐ excluding Yarborough study Show forest plot	6	567	Odds Ratio (M‐H, Random, 95% CI)	1.96 [1.25, 3.08]

5 Sensitivity analysis ‐ excluding both Finckenauer and Yarborough studies Show forest plot	5	486	Odds Ratio (M‐H, Random, 95% CI)	1.68 [1.10, 2.58]

Comparison 1. Intervention versus control, crime outcome

Ir a la tabla de la revisión

	Idioma de la Revisión Cochrane Escoja su idioma de preferencia para las revisiones Cochrane y otros contenidos. Las secciones sin una traducción aparecerán en inglés.

	Idioma de la web Escoja su idioma de preferencia para la web de la Biblioteca Cochrane.

Idioma de la Revisión Cochrane

Idioma de la web

Resumen

Antecedentes

Objetivos

Métodos de búsqueda

Criterios de selección

Obtención y análisis de los datos

Resultados principales

Conclusiones de los autores

PICO

PICO

Population

Intervention

Comparison

Outcome

Resumen en términos sencillos

"Scared Straight" y otros programas juveniles de concienciación para la prevención de la delincuencia juvenil

Resumen visual

Authors' conclusions

Implications for practice

Implications for research

Background

Description of the condition

Description of the intervention

How the intervention might work

Why it is important to do this review

Objectives

Methods

Criteria for considering studies for this review

Types of studies

Types of participants

Types of interventions

Types of outcome measures

Primary outcomes

Secondary outcomes

Search methods for identification of studies

Data collection and analysis

Selection of studies

Data extraction and management

Assessment of risk of bias in included studies

Measures of treatment effect

Unit of analysis issues

Dealing with missing data

Assessment of heterogeneity

Assessment of reporting biases

Data synthesis

Subgroup analysis and investigation of heterogeneity

Sensitivity analysis

Results

Description of studies

Michigan Department of Corrections (1967)

The Greater Egypt Planning and Development Commission, Illinois, USA (1979)

Michigan JOLT Study, USA (Yarborough 1979)

Virginia Insiders Program, USA (Orchowsky and Taylor 1981)

Texas Face‐to‐Face Program, USA (Vreeland 1981)

New Jersey 'Scared Straight' Program, USA (Finckenauer 1982)

California SQUIRES Program, USA (Lewis 1983)

Kansas Juvenile Education Program, USA (Locke et al. 1986)

Mississippi Project Aware, USA (Cooke and Spirrison 1992)

Results of the search

Included studies

Excluded studies

Risk of bias in included studies

Allocation

Random sequence generation

Allocation concealment

Blinding

Blinding of participants and personnel (performance bias)

Blinding of outcome assessment (detection bias)

Incomplete outcome data

Selective reporting

Other potential sources of bias

Effects of interventions

Findings from the individual studies

Meta‐analysis

Immediate post‐treatment effects for reoffending rates: official measures

Sensitivity analysis 1. Excluding Finckenauer study

Sensitivity analysis 2. Excluding Yarborough study

Sensitivity analysis 3. Excluding both Finckenauer and Yarborough studies