Scolaris Content Display Scolaris Content Display

Cochrane Database of Systematic Reviews Protocol - Intervention

Video feedback for improving parental sensitivity and attachment

This is not the most recent version

Collapse all Expand all

Abstract

This is a protocol for a Cochrane Review (Intervention). The objectives are as follows:

To evaluate the effectiveness of video feedback for improving parental sensitivity and promoting attachment security.

Background

Description of the condition

Attachment

An infant’s relationship with his or her mother is the first, and arguably the most important, relationship formed following birth. This emotional bond is known as a 'selective attachment' relationship. Attachment is a biobehavioural system that is a product of human evolution, and is designed to trigger protection in the face of perceived danger and its associated response, fear. When an infant is distressed, he or she is programmed to seek and secure proximity to, and contact with, the primary caregiver (Bowlby 1969). Attachment behaviour may be activated by circumstances internal to the child, such as illness, hunger or pain; by separation from the primary caregiver, such as when a mother leaves the room or discourages proximity; or by external events that cause distress, such as frightening events or rejection by others (Bowlby 1969). Depending on the intensity of the threat, the attachment behaviour may be terminated by the appearance of the caregiver or physical contact with them. The younger the child, or the more serious the threat, the more likely that only physical reassurance and containment will provide comfort. The attachment relationship is, therefore, dynamic in which the infant plays an active part (see Shin 2008).

Zeanah and colleagues describe the parent‐child attachment relationship as a reciprocal relationship of seeker (infant) and provider (parent), the purpose of which is to provide the following (Zeanah 1993).

  1. Comfort for distress.

  2. Warmth, empathy and nurturance.

  3. Emotional availability and regulation of emotions.

  4. Physical and psychological protection.

Children translate their early relationship experiences into internal working models of themselves, other people and what they can expect from other people in response to their attachment needs. These working models are stable over time but can be revised in light of new information or experiences (see Goldberg 2000).

Infants whose caregivers provide sensitive and responsive care develop secure attachments to those carers. Children who experience insensitive, unpredictable or intrusive parenting develop attachments that are insecure, with adverse consequences for a range of aspects of their psychosocial development, including being more reliant on teachers, showing less positive affective expression and social problem‐solving skills, more frustration and less persistence, more negative responses to others and less overall social competence (Sroufe 2005). In addition to children being classified as either secure or insecure, they may also be classified as either organised (i.e. secure) or disorganised (i.e. insecure); the latter refers to evidence of an approach and avoidance conflict in relation to the caregiver when the attachment system is activated (Main 1990a). Disorganised attachment occurs when children are frightened of the caregiver and have been exposed to a range of anomalous, atypical parent‐infant interactions (Madigan 2006); and disorganisation is associated with predictors of later psychopathology, including externalising (Fearon 2010), and personality disorders (Steele 2010). Approximately 60% of children are securely attached, and the remainder (40%) are insecure (Moullin 2014). For insecurely attached children, 25% learn to avoid their parent when they are distressed (avoidant attachment) and 15% learn to resist the parent, often because the parent responds unpredictably or amplifies their distress (disorganised or resistant attachment) (Moullin 2014). In addition, around 40% of disadvantaged children are classified as disorganised (Weinfield 2004), and as many as 80% of abused children receive this classification (Cyr 2010).

Although children usually have a particularly strong bond with one primary caregiver, most have more than one attachment relationship, often with fathers, siblings and grandparents (see, for example, Hallers‐Haalboom 2014), and, as such, children can be securely attached to one parent but insecurely attached to another. The role of early relationship experiences and the development of infant self‐regulatory skills have been linked to the child’s ability to control behavioural and physiological responses such as, for example, anger (Gilliom 2002), aggression (Alink 2009), and anxiety (Hannesdottir 2007). It has also been found to be associated with increased executive functioning, which, in turn, is associated with proficiency and ability in core curriculum subjects such as English and Mathematics (Bernier 2010).

One key predictor of child attachment status is the parents' attachment status (Van Ijzendoorn 1995), but the impact of the parents' attachment status on the child's attachment appears to be mediated by parental sensitivity to infant cues; a systematic review of the antecedents of attachment security showed that maternal sensitivity was an important predictor, which explained around one third of the variance (De Wolff 1997).

Caregiver sensitivity

Ainsworth and colleagues defined sensitivity as a mother’s ability to attend and respond to her infant in ways that accurately ‘match’ her infant's needs (Ainsworth 1978). Sensitive and responsive parents do each of the following (Ainsworth 1974).

  1. Notice a child’s signals.

  2. Interpret these signals correctly.

  3. Respond to signals in a timely and appropriate manner.

The concept of sensitivity, therefore, refers not to a set of maternal behaviours but to something much more dynamic and relational.

Parental sensitivity can be compromised by a variety of factors. These include social influences such as social isolation (Belsky 2002; Kivijärvi 2004); psychological factors such as maternal depression (Dannemiller 1999; Karl 1995; Murray 1997), or personality disorder (Laulik 2013); maternal history of maltreatment (Pereria 2012), substance dependency (Eiden 2014), domestic violence (Levendosky 2006), or low self‐esteem (Leerkes 2002; Shin 2008); or cognitive factors such as maternal preconceptions about parenting (Kiang 2004; Leerkes 2010). Child characteristics can also impact negatively on parental sensitivity, including infant prematurity (Singer 1999); the presence of excessive negative infant behaviour, for example, general distress (Leerkes 2002); and the child's proneness to anger (Ciciolla 2013), and irritability (Van den Boom 1991). Some studies have examined father involvement as a mediator of maternal sensitivity (see, for example, Stolk 2008), whilst others have examined the role of the father as caregiver (see, for example, Pelchat 2003). Comparative research on the relative sensitivity of mothers and fathers is scarce and therefore the findings are somewhat inconclusive; some studies report fathers as less sensitive than mothers (see Hallers‐Haalboom 2014; Heerman 1994; Lovas 2005), while others have found no difference in parental sensitivity in terms of the gender of the caregiver (Pelchat 2003).

Although parental sensitivity has been found to be an important predictor of infant attachment security, it has also been found to explain around only one third of the variance (De Wolff 1997). Recent research has also highlighted the importance of mid‐range contingency (Beebe 2010), and maternal reflective functioning (RF) (Slade 2005), or mind‐mindedness (Meins 2001). Mid‐range contingency refers to the ability of the parent to flexibly regulate both their own internal emotional states and the interaction with the baby, and is characterised by moments of synchrony or attunement, followed by rupture and then repair. Recent research found that interaction that occurred outside this mid‐range, resulting from the parent’s preoccupation either with self‐regulation (e.g. depressed parents) or interactive regulation (e.g. anxious parents), was associated with insecure or disorganised attachment (Beebe 2010).

RF refers to the parent's capacity to understand their child's behaviour in terms of internal mental states (e.g. intentions, feelings, desires, etc.) (Slade 2005). High RF is associated with positive maternal parenting behaviours, such as flexibility and responsiveness, and use of the mother as a secure base on the part of the infant, while low RF is associated with emotionally‐unresponsive maternal behaviours such as withdrawal, hostility and intrusiveness (Kelly 2005; Slade 2005). ‘Mind‐mindedness’ has similarly been found to be correlated with behavioural sensitivity and interactive synchrony (Meins 2001), and to be a better predictor of attachment security of the child at one year of age than maternal sensitivity (Lundy 2003; Meins 2001).

Other studies have identified a range of ‘atypical’ or ‘anomalous’ parent‐infant interactions characterised as ‘Fr‐behaviours’ (i.e. the behaviours of parents who are either frightened or frightening, or both (Jacobvitz 1997; Main 1990b), or who are hostile and helpless (Lyons‐Ruth 2005)). ‘Fr‐behaviours’ have been described as being subtle (e.g. periods of being dazed and unresponsive) or more overt (deliberately frightening children) (ibid), and such behaviours are strongly associated with a disorganised attachment (Madigan 2006).

Description of the intervention

Video feedback is a generic term that refers to the use of videotaped interactions of the parent and child to promote parental sensitivity; it is variously known as Video Interaction Guidance (VIG), Interaction Guidance (IG), Video Home Training (VHT) and Video Feedback Intervention to Promote Positive Parenting (VIPP). Developed by Harrie Biemans and colleagues in the 1980s, video feedback is a relationship‐based parenting intervention that aims to enhance maternal sensitivity at the behavioural level (Kennedy 2010).

The core aspects of interventions based on video feedback are as follows.

  1. Video‐recording the parent‐child interaction during play or aspects of daily caregiving.

  2. Editing the recording to select micro‐moments of interaction that demonstrate the child's contact initiatives and examples of the parent's attuned response to these signals.

  3. Parent and 'guider' (the person responsible for the therapy) jointly reviewing the recordings, with the guider providing praise to the parent, not for the attunement per se, but for engaging in the evaluation of the interactions being viewed.

The intervention model is underpinned by two core concepts: intersubjectivity and mediated learning. Intersubjectivity, or 'shared moments of attunement', is modelled by the therapist (or 'guider') in their relationship and interactions with the parent, as well as being identified in the video recordings of the parent‐child interaction. Mediated learning, or 'scaffolding', refers to the role played by adults in helping children learn how to do things that they might not otherwise manage alone. Mediated learning is also modelled by the guider in his or her relationships with the parent, as the guider helps the parent to describe what is happening in the clips being viewed, and what the parent and child in the video might be thinking or feeling, and to identify the consequences for the parent and the child (Kennedy 2011).

Video feedback may be delivered on both a one‐to‐one (e.g. VIPP, VIG) or group basis (e.g. Circle of Security (CS)), and has been used with first‐time mothers (Velderman 2006); hard‐to‐reach families (Kennedy 2010); parents of premature infants (Hoffenkamp 2015), or who have mental health problems, including postpartum depression (Vik 2006); middle‐class, insensitive mothers (Kalinauskiene 2009); parents of autistic (Poslawsky 2015), maltreated (Moss 2011), and adopted children (Juffer 1997; Juffer 2005); parents of premature children and children with atopic dermatitis (Cassibba 2015); ethnic‐minority parents (Yagmur 2014), and parents with an eating disorder (Stein 2006). Although video feedback is usually delivered in the home environment, it has also been used in clinical settings such as, for example, hospital environments with mothers of preterm babies (Hoffenkamp 2015), and residential treatment centres (Kennedy 2010). It is now used in over 15 countries by practitioners that work in a range of helping professions (e.g. social work, education, health) (Kennedy 2010).

How the intervention might work

Interventions designed to enhance parental sensitivity include three types of support (Berlin 2005), which are described as follows.

  1. Interventions focused explicitly on enhancing sensitivity at the behavioural level.

  2. Interventions to promote positive parental representations of their children.

  3. Interventions that provide the parent with social support.

These are not necessarily exclusive and some interventions involve all three types of activity. In a meta‐analysis of interventions designed to promote maternal sensitivity and child attachment in early childhood, Bakermans‐Kranenburg 2003 concluded that interventions were more effective when they included a component of video feedback with families.

In terms of the underpinning theoretical model, most forms of video feedback are attachment‐based in the sense that they are aimed at enhancing maternal sensitivity, in order to promote secure attachment and reduce insecure/disorganised attachment. However, the presumed mechanisms by which this is achieved vary across various models of video feedback. All video feedback interventions primarily target the behavioural level using video‐recorded episodes of the parent‐child interaction. The attuned and mentalising stance of the guider provides an opportunity for the caregiver to experience attuned interactions with an adult who is able to think about the caregiver's internal states and the ways in which these might be informing their interactions with the baby. These attuned, mentalising interactions have the potential to improve a number of aspects of the caregiver's functioning, including their RF and their representations regarding their own attachment. In addition, the opportunity to view themselves in interaction with their baby and to observe positive responses from the infant, together with the guider using the video to prompt the parent to think about what the baby might be feeling, can bring about a range of meta‐cognitive changes that result from the discrepancy between their own beliefs about their ability to parent and what they can see on the video, in addition to an increase in feelings of empowerment and self‐efficacy, and their ability for RF (Kennedy 2011).

Some models of video feedback intervention include additional components that may provide a more explicit focus on representational issues. For example, Video Feedback Intervention to Promote Positive Parenting with Discussions on the Representational Level (VIPP‐R) (Juffer 2008) involves the therapist addressing the mother's representations and attachment using discussions that may, for example, focus explicitly on the mother's own experiences of separation in early childhood, and those experienced with her own child (Klein Velderman 2006).

Other models involve the inclusion of teaching about sensitive discipline techniques: Video Feedback Intervention to Promote Positive Parenting ‐ Sensitive Discipline (VIPP‐SD). There is evidence that suggests the effectiveness of video feedback may vary with factors at both the level of the parent and of the child. For example, Bakermans‐Kranenburg 1998 reported that amongst mothers with insecure attachments, those classed as 'insecure dismissing' (who idealise their own parents or minimise the importance of attachment relationships in their own lives) benefited most from video feedback, whilst those classed as 'insecure preoccupied' benefited most from video feedback with additional discussions about their childhood attachment experiences. Other studies have found evidence of differential susceptibility in the child, with highly reactive infants (Velderman 2006), and children with the dopamine receptor D4 (DRD4) 7‐repeat allele (Bakermans‐Kranenburg 2008), being more susceptible to change.

Why it is important to do this review

Improvment of the health and well‐being of children is part of a global agenda. While the basic needs of children (e.g. food, sanitation, health care) are paramount to survival and development, living with an adult who is responsive to their needs is also important (Jones 2003). UNICEF 2008 highlights that a loving, stable and stimulating relationship with caregivers in the earliest months and years are critical for every aspect of a child’s development. Research suggests that early targeted interventions aimed at increasing parental sensitivity and promoting attachment may be effective in promoting healthy child development (see, for example, Lieberman 1999; Schore 2001; Van Ijzendoorn 1995), and in preventing emotional maltreatment (Barlow 2010). Daly 2015 highlights the possible outcomes of providing family and parenting support based on various strategies implemented globally. The only quantitative review of the effectiveness of video feedback to date included both non‐randomised between‐ and within‐group designs (Fukkink 2008).

An early review by Fukkink 2008 concluded that video feedback was an effective means of improving parenting behaviour and attitudes, and child development. However, the paper does not provide the literature search dates for the review, which was submitted in June 2008; the review was very broad in its scope and includes all uses of video feedback with no age limits on the children (who ranged in age from birth to seven years, with an average age of 2.4 years (standard deviation (SD) = 2.7 years). More importantly, the authors paid little attention to the quality of included studies (that is, there were no 'Risk of bias' assessments) and included quasi‐experimental studies (no random assignment). A systematic review of current best evidence that addresses the methodological weakness of Fukkink 2008 will be of interest to policy makers and practitioners internationally, both in terms of evidence‐based parenting interventions, and the promotion of infant and child mental health.

Objectives

To evaluate the effectiveness of video feedback for improving parental sensitivity and promoting attachment security.

Methods

Criteria for considering studies for this review

Types of studies

Randomised controlled trials (RCTs) and quasi‐RCTs (in which the allocation to study arms is not truly random, for example, a form of alternation such as days of the week or by date of birth). We will exclude studies that have an alternative treatment but no control group.

Types of participants

Parent‐infant dyads (including foster or adoptive carers), where the infant is aged between birth and four years 11 months, and where problems have been identified regarding the parent (e.g. bonding, depression, eating disorders, maltreatment) or the child (e.g. attachment or behaviour problems, challenging temperament, preterm birth).

If studies include a proportion of participants above four years 11 months, we will endeavour to obtain data on the sample aged up to four years 11 months.

As this Cochrane review focuses on secondary prevention or 'at risk' parents/carers, we will exclude studies that use the intervention preventatively with a population group.

Types of interventions

Video feedback delivered in any setting compared with no treatment or treatment as usual. We will include studies in which the parent and child are filmed and then feedback is provided to the parent, either on a one‐to‐one basis or in groups, with the aim of improving the sensitivity of their interactions with the child or the mental representations of the parent.

We will include interventions that primarily involve the delivery of video feedback but that also provide one to two additional sessions related to the primary aim of the intervention (e.g. Video Feedback Intervention to Promote Positive Parenting (VIPP) with Discussions on the Representational Level (VIPP‐R); VIPP ‐ Sensitive Discipline (VIPP‐SD)).

We will exclude interventions in which video feedback is used as part of a wider set of methods of working with the family and in which we cannot differentiate the effect of video feedback. We will also exclude programmes that use videotape modelling or videotape vignettes (e.g. Webster‐Stratton 2013).

Types of outcome measures

Primary outcomes

  1. Parental sensitivity (as measured by, for example, the Ainsworth Sensitivity Scale (ASS) (Ainsworth 1969), Child‐Adult Relationship Experimental Index (CARE‐Index) (Crittenden 2001), Parental Sensitivity Assessment Scale (PSAS) (Hoff 2004), Coding Interactive Behaviour (CIB) (Feldman 1998), Emotional Availability (EA) Scales (Biringen 2000), Global Ratings Scales of Mother‐Infant Interaction (GRS) (Murray 1996), Maternal Behaviour Q‐sort (MBQS) (Pederson 1999) or Nursing Child Assessment Teaching Scale (NCATS) (Sumner 1994)).

  2. Parental reflective functioning (RF) (as measured by, for example, the Parent Development Interview (PDI) (unpublished manuscript by Aber 1985), PDI‐Revised (PDI‐R) (unpublished manuscript by Slade 2004)).

  3. Attachment security (as measured by, for example, Attachment Q‐set (AQS) (Waters 1987), or the Strange Situation Procedure (SSP) (Ainsworth 1978)).

  4. Adverse effects: parent anxiety or stress (as measured by, for example, the Parenting Stress Index (PSI) (Abidin 1997) or the Parenting Stress Scale (PSS) (Berry 1995)).

Secondary outcomes

  1. Infant mental health (as measured by behavioural assessments of emotional disorders, hyperactivity and conduct disorders).

  2. Child physical and socioemotional development (as measured through, for example, the Bayley Scales of Infant and Toddler Development, third edition (Bayley‐III) (Bayley 2005), Strengths and Difficulties Questionnaire (SDQ) (Goodman 1997)); child behaviour (as measured by, for example, the Child Behaviour Assessment Instrument (CBAI) (Samarakkody 2010)).

Timing of outcome assessment

We will collect outcome data at time points provided within the included studies and group these as postintervention (immediately upon completion), short term (up to six months), medium term (up to one year) and long term (over one year). However, we do not anticipate that we will identify many studies that report long‐term outcomes.

Search methods for identification of studies

Electronic searches

We will search the electronic databases and trial registers listed below. We will not apply any date or language restrictions to the electronic searches and will secure translations where necessary.

  1. The Cochrane Central Register of Controlled Trials (CENTRAL; current issue; part of the Cochrane Library), and which includes the Cochrane Developmental, Psychosocial and Learning Problems Group Specialised Register.

  2. Ovid MEDLINE (1946 to current).

  3. Embase (1974 to current; Ovid).

  4. CINAHL Plus (1937 to current; EBSCOhost).

  5. PsycINFO (1806 to current; Ovid).

  6. Sociological Abstracts (1952 to current; ProQuest).

  7. Social Sciences Citation Index (SSCI; 1970 to current; Web of Science).

  8. Social Services Abstracts (1979 to current; ProQuest).

  9. Conference Proceedings Citation Index ‐ Social Science & Humanities (CPCI‐SS&H; 1990 to current; Web of Science).

  10. LILACS (1985 to current; lilacs.bvsalud.org/en).

  11. Cochrane Database of Systematic Reviews (CDSR; current issue; part of the Cochrane Library).

  12. Database of Abstracts of Reviews of Effects (DARE; current issue; part of the Cochrane Library).

  13. Networked Digital Library of Theses and Dissertations (NDLTD; all available years; ndltd.org).

  14. WorldCat (limited to dissertations and theses; all available years; Worldcat.org).

  15. Clinicaltrials.gov (all available years; Clinicaltrials.gov).

  16. World Health Organization International Clinical Trials Registry Platform (WHO ICTRP; all available years; who.int/ictrp/en).

We will search Ovid MEDLINE using the strategy in Appendix 1. We will adapt it, as appropriate, for other databases.

Searching other resources

We will scrutinise bibliographies of included studies and relevant reviews to identify any additional relevant publications. We will draft a list of these studies to send to experts in the field and ask them to forward to us any published or unpublished work that we may have missed. We will also search other online sources, including the websites of relevant organisations and government departments (e.g. Association of Infant Mental Health (AIMH), UK (aimh.org.uk); United Nations International Children's Emergency Fund (UNICEF) Global Evaluation Database (unicef.org/evaldatabase); National Society for the Prevention of Cruelty to Children (NSPCC) Impact and Evidence Hub (nspcc.org.uk/services‐and‐resources/impact‐evidence‐evaluation‐child‐protection); and the Association for Video Interaction Guidance UK (AVigUK) (videointeractionguidance.net)). We will also use Google and Google Scholar to search the internet for unpublished work.

Data collection and analysis

Selection of studies

Two review authors (LOH and JB) will independently screen titles and abstracts yielded by the searches against the inclusion criteria for the review. The review authors will then retrieve the full‐text reports of all studies selected for potential inclusion, or those where there may be some uncertainty, and assess the texts for eligibility. Where the two review authors cannot make a final decision on the basis of the published information, we will write to the study authors to seek clarification. We will resolve any differences between the review authors (LOH and JB) through discussion and, where necessary, we will consult the other review authors (GM, NL). We will record and document the excluded studies in the 'Characteristics of excluded studies' tables and will note the reason(s) for their exclusion. We will report the flow of studies using a PRISMA diagram (Liberati 2009).

Data extraction and management

Two review authors (LOH, NL) will independently extract data from each included study and will record the following information on a prepiloted data extraction form.

  1. Participant characteristics (age, gender, ethnicity, location).

  2. Intervention characteristics (including delivery, duration, outcomes and measures, and within‐intervention variability).

  3. Comparison characteristics (including whether the study used an active or inactive comparison).

  4. Study characteristics (study design, sample size, length of follow‐up, attrition or dropout, handling of missing data, methods of analysis).

  5. Outcome data (relevant details on all primary and secondary outcome measures used, and summary data, including means, standard deviations (SDs), confidence intervals (CIs) and significance levels for continuous data and proportions for dichotomous data).

Assessment of risk of bias in included studies

Two review authors (LOH and NL) will independently assess the risk of bias within each included study using the Cochrane 'Risk of bias' assessment tool (Higgins 2011a). We will classify judgements as either ‘low risk of bias’ ‘high risk of bias’ or 'unclear risk of bias' across the following seven domains for each included study (see Appendix 2) and record the judgements in a 'Risk of bias' table.

1. Sequence generation

Description: we will describe the method used to generate the allocation sequence in detail to assess whether it should have produced comparable groups.

Review authors' judgement: was the allocation concealment sequence adequately generated?

2. Allocation concealment

Description: we will describe the method used to conceal allocation sequence in sufficient detail to assess whether intervention schedules could have been foreseen in advance of, or during, recruitment.

Review authors' judgement: was allocation adequately concealed?

3. Blinding of participants and personnel

Description: we will describe any measures used to blind participants and personnel to assess their knowledge of which intervention a given participant might have received.

Review authors' judgement: was knowledge of the allocated intervention by participants and personnel adequately prevented during the study?

4. Blinding of outcome assessors

Description: we will describe any measures used to blind outcome assessors to assess their knowledge of which intervention a given participant might have received.

Review authors' judgement: was knowledge of the allocated intervention by outcome assessors adequately prevented during the study?

5. Incomplete outcome data

Description: we will extract and report data on attrition and exclusions, as well the number of participants involved (compared with total randomised), reasons for attrition/exclusion (where reported or obtained from investigators) and any re‐inclusions performed by the review authors if they retrieve missing data.

Review authors' judgement: were incomplete outcome data adequately addressed?

6. Selective outcome reporting

Description: we will attempt to assess the possibility of selective outcome reporting by investigators.

Review authors' judgement: are reports of the study free of suggestion of selective outcome reporting?

7. Other sources of bias

Description: we will describe any important concerns about bias not addressed in the other domains in the 'Risk of bias' tool.

Review authors' judgement: was the study was free of other problems that could put it at a high risk of bias (e.g. parent recruitment to the study)?

We will resolve any disagreements through discussion and, if necessary, will consult the second (JB) or fourth (GM) review author, or both.

Measures of treatment effect

We will calculate unadjusted treatment effects using Review Manager (RevMan) (RevMan 2014).

Dichotomous outcome data

We will calculate the odds ratios (OR) with a 95% CI for dichotomous outcomes. For meta‐analyses of dichotomous outcomes that we include in the 'Summary of findings' tables, we will express the results as absolute risks and will use high and low observed risks among the control groups as reference points.

Continuous outcome data

We will calculate mean difference (MD) values if all included studies use the same measurement scale, or standardised mean differences (SMDs) if studies use different measurements scales, and 95% CIs for continuous outcome measures. If necessary, we will compute effect estimates from P values, t statistics, analysis of variance (ANOVA) tables or other statistics, as appropriate. We will calculate SMDs using Hedges g.

Multiple outcomes

When a study provides multiple, interchangeable measures of the same construct at the same point in time (e.g. multiple measures of maternal sensitivity), we will calculate the average SMD across these outcomes and the average of their estimated variances. This strategy aims to avoid the need to select a single measure and to avoid inflated precision in meta‐analyses (i.e. preventing studies that report on more outcome measures receiving more weight in the analysis than comparable studies that report on a single outcome measure).

Economics issues

We will summarise available data on the costs of programmes within the included studies.

Unit of analysis issues

Cluster‐RCTs

In the event that we identify relevant cluster‐RCTs that meet the inclusion criteria of the review, we will deploy appropriate statistical methods based on the guidance provided in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011b). Where study authors have dealt appropriately with the clustered design in their analyses, we will try to obtain direct estimates of the effect (e.g. an OR with its CI). Where study authors have not dealt appropriately with the cluster design in their analyses, we will extract or calculate effect estimates and their standard errors (SEs) as for a parallel group trial, and adjust the SEs to account for the clustering (Donner 1980). To do this, we will need to identify an appropriate intraclass correlation coefficient (ICC), which describes the relative variability in outcome within and between clusters (Donner 1980). Where available, we will look for this information in the reports of relevant trials. If this is unavailable, we will try to obtain the information from the study authors. If this proves unsuccessful, we will use external estimates obtained from similar studies. We will find closest‐matching scenarios (regarding both outcome measures and types of clusters) from existing databases of ICCs. If we are unable to identify any matches, we will perform sensitivity analyses using a high ICC of 0.1, a moderate ICC of 0.01 and a small ICC or 0.001, to cover a broader range of plausible values while still allowing for strong design effects for smaller studies (see Sensitivity analysis). Furthermore, we will combine these estimates and their corrected SEs from the cluster‐RCTs with those from parallel designs using the generic inverse variance method in RevMan (RevMan 2014).

Studies with multiple treatment groups

In the primary analysis, we will combine results across all eligible intervention (Video Interaction Guidance (VIG)) groups and compare them with the combined results across all eligible control groups, and will make single pair‐wise comparisons. Where studies compare more than one form of VIG with a control group(s) such that combining them prevents investigation of potential sources of heterogeneity, we will analyse each VIG group separately (against a common control group), but divide the sample size for common comparator groups proportionately across each comparison (Higgins 2011b, Section 16.5.5). This simple approach allows the use of standard software and prevents inappropriate double‐counting of individuals.

Dealing with missing data

Where necessary, one review author (LOH) will contact the authors of included studies to supply any unreported data, such as group means and SDs, details of dropouts and descriptive data regarding the intervention. If this fails or is not possible, we will follow the recommendations in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011b, Section 16.1) and we will do one of the following.

  1. Only analyse the data available (if data is assumed to be missing at random).

  2. Where appropriate, develop a strategy for data imputation (if we assume that data is not missing at random).

In the case of data imputation, we will specify the methods used in the 'Characteristics of included studies' tables. We will describe other missing data and dropouts/attrition for each included study in the ‘Risk of bias’ table, and we will discuss the extent to which these missing data could alter the results or conclusions of the review. We will assess the sensitivity of any primary meta‐analyses to missing data using meta‐regression to test for any effect of missingness on the summary estimates (Higgins 2011b, Section 16.1.2).

Assessment of heterogeneity

We will assess clinical heterogeneity across studies by examining the distribution of important participant factors (e.g. age, socioeconomic status, maternal maltreatment history, caregiver depression) and intervention characteristics (e.g. style, setting, personnel, context of delivery) among trials.

We will assess methodological heterogeneity across studies by comparing the distribution of study factors (e.g. allocation concealment, blinding of outcome assessment, losses to follow‐up, treatment type, cointerventions).

We will describe statistical heterogeneity by computing the I² statistic (Higgins 2002), which describes approximately the proportion of variation in point estimates that is due to heterogeneity rather than sampling error. In addition, we will use the Chi² test (P < 0.10) of homogeneity to detect the strength of evidence that heterogeneity is genuine.

Assessment of reporting biases

We will draw funnel plots (estimated differences in treatment effects against their SE) if we identify 10 or more studies that provide data on an outcome. Asymmetry might be due to publication bias, but might also reflect a relationship between trial size and effect size such as when larger trials have lower compliance, and compliance is positively related to effect size. If we find such a relationship, we will examine the clinical variation of the studies (Sterne 2011, Section 10.4). As a direct test for publication bias, we will compare results extracted from published journal reports with results obtained from other sources.

Data synthesis

Where interventions are similar in (1) the age of the child(ren), (2) parent gender and (3) intensity, frequency and duration of VIG, we plan to synthesise results in a meta‐analysis. We will use both fixed‐effect and random‐effects models and compare the results to assess the impact of statistical heterogeneity. When we report the results of the random‐effects model, we will include an estimate of the between study variance (Tau²). Unless the model is contraindicated (e.g. if there is funnel plot asymmetry), we plan to present the results from the random‐effects model. In the occurrence of severe funnel plot asymmetry, we will present both fixed‐effect and random‐effects analyses, under the assumption that asymmetry suggests that neither model is appropriate. If both indicate a presence (or absence) of effect we will be reassured; if they do not agree we will report this. We will calculate all overall effects using inverse variance methods. If some primary studies report an outcome as a dichotomous measure and others use a continuous measure of the same construct, we will convert results for the former from an OR to a SMD, provided that we can assume the underlying continuous measure has approximately a normal or logistic distribution (otherwise we will perform two separate analyses).

'Summary of findings' table

We will create a 'Summary of findings' table per comparison. The comparisons will include the following.

  1. Video feedback versus no intervention.

  2. Video feedback versus alternative intervention.

We will follow the guidelines in the Cochrane Handbook for Systematic Reviews of Interventions (Schünemann 2011a, Section 11.5.3), and include the following six elements in these tables.

  1. A list of all outcomes.

  2. A measure of the typical burden of these outcomes.

  3. Absolute and relative magnitude of effect.

  4. Numbers of participants and studies that address these outcomes.

  5. A rating of the overall quality of evidence for each outcome.

  6. Additional comments.

Two review authors (LOH, NL) will independently assess the quality of the evidence. They will use the following five Grading of Recommendations, Assessment, Development and Evaluation (GRADE) considerations to assess the quality of the evidence.

  1. Limitations in study design and implementation: for RCTs, for example, these will include lack of allocation concealment, lack of blinding and large loss to follow‐up.

  2. Indirectness of evidence: for example, when findings are restricted to indirect comparisons between two interventions. RCTs that meet the eligibility criteria but which address a restricted version of the main review questions in terms of population, intervention, comparator or outcomes are another example of this and will also be downgraded.

  3. Unexplained heterogeneity or inconsistency of results: we will look for robust explanations for heterogeneity in studies that yield widely differing estimates of effect.

  4. Imprecision of results: we will downgrade the quality of evidence for those studies that include few participants and few events and thus have wide CIs.

  5. Publication bias: we will downgrade the quality of evidence level if investigators fail to report studies or outcomes on the basis of results (Schünemann 2011b, Section 12.2.2).

We will use the GRADEprofiler Guideline Development Tool (GRADEpro GDT) to prepare the 'Summary of findings' tables, and specifically to enable us to produce relative effects and absolute risks associated with the interventions (GRADEpro GDT 2015). We will use all primary and secondary outcomes of interest to populate the ‘Summary of findings’ table.

Subgroup analysis and investigation of heterogeneity

We will investigate heterogeneity using subgroup analyses or meta‐regression, if appropriate. We will group the included studies and analyse them according to the intervention approach, including the following.

  1. Intervention dose (e.g. intensity; duration).

  2. Intervention type, including the use of additional components (e.g. VIG, VIPP‐R, VIPP‐SD).

  3. Delivery method (i.e. group based versus individual delivery).

  4. Participating carer (e.g. interventions that involve mothers and infants only; interventions that involve fathers and infants only; interventions that involve foster parents); and infant (e.g. prebirth or highly temperamental babies).

Sensitivity analysis

We will assess the robustness of findings to decisions made in obtaining them by conducting sensitivity analyses. We will perform sensitivity analyses by conducting the following reanalyses.

  1. Reanalysis excluding studies at high risk of bias.

  2. Reanalysis excluding studies with imputed data.

  3. Reanalysis using different statistical approaches (Deeks 2011, Section 9.7).