Scolaris Content Display Scolaris Content Display

Liderazgo colectivo para mejorar el ejercicio profesional, los desenlaces sanitarios y el bienestar del personal

Contraer todo Desplegar todo

Antecedentes

Las partes interesadas internacionales abogan fuertemente por el liderazgo colectivo como un enfoque clave para la prestación de servicios sanitarios en respuesta a formas de organización cada vez más complejas definidas por los rápidos cambios que experimentan la tecnología, la profesionalización y la creciente especialización sanitarias. Un liderazgo insuficiente debilita los sistemas sanitarios y puede contribuir a generar eventos adversos, como la negativa a priorizar y aplicar las recomendaciones de seguridad de forma coherente, y la reticencia a abordar el desgaste profesional («burnout»). El aumento de la esperanza de vida y el número de personas que viven con varias enfermedades crónicas contribuyen a una mayor complejidad de los sistemas sanitarios en todo el mundo. Un entorno tan complejo requiere de la colaboración y el liderazgo de varios profesionales que compartan puntos de vista y conocimientos.

Objetivos

Evaluar los efectos del liderazgo colectivo para el personal sanitario en el ejercicio de su profesión, los desenlaces de la atención sanitaria y el bienestar del personal, en comparación con los enfoques habituales de liderazgo centralizado.

Métodos de búsqueda

El 5 de enero de 2021 se realizaron búsquedas en CENTRAL, MEDLINE, Embase, otras cinco bases de datos y dos registros de ensayos. También se buscó en la literatura gris, se verificaron las referencias para obtener citas adicionales y se estableció contacto con los autores de los estudios para identificar estudios adicionales. No se aplicaron límites en el idioma.

Criterios de selección

Dos grupos de dos autores revisaron, cribaron y seleccionaron de forma independiente los estudios para su inclusión; el autor principal formó parte de ambos grupos para garantizar la coherencia. Se incluyeron ensayos controlados aleatorizados (ECA) que compararon las intervenciones de liderazgo colectivo con el liderazgo centralizado habitual o ninguna intervención.

Obtención y análisis de los datos

Tres grupos de dos autores extrajeron de forma independiente los datos de los estudios incluidos y evaluaron la calidad de los mismos; el autor principal participó en todos los grupos. Se siguieron los procedimientos metodológicos estándar previstos por Cochrane y el Grupo Cochrane para una Práctica y organización sanitaria efectivas (Effective Practice and Organisation of Care Group; EPOC). Para evaluar la certeza de la evidencia se utilizó el método GRADE.

Resultados principales

Se identificaron tres ensayos aleatorizados para su inclusión en la síntesis. Todos los estudios se llevaron a cabo en un contexto de hospitalización de cuidados intensivos; los estudios se realizaron en Canadá, Irán y EE.UU. Se incluyeron un total de 955 pacientes entre todos los estudios. Hubo una considerable variación en los participantes, las intervenciones y las medidas para cuantificar los desenlaces. Solo fue posible completar un metanálisis para un desenlace (liderazgo) y se elaboró un resumen narrativo para otros desenlaces. Se consideró que todos los estudios presentaban un riesgo de sesgo general incierto.

Las intervenciones de liderazgo colectivo probablemente mejoran el liderazgo (tres ECA, 955 participantes). El liderazgo colectivo podría mejorar el rendimiento del equipo (un ECA, 164 participantes). Se desconoce el efecto del liderazgo colectivo en el rendimiento clínico (un ECA, 60 participantes). Se desconoce el efecto de la intervención sobre los desenlaces de la atención sanitaria, incluido el estado de salud (mortalidad de los pacientes hospitalizados) (un ECA, 60 participantes). El liderazgo colectivo podría mejorar ligeramente el bienestar del personal al reducir el estrés laboral (un ECA, 164 participantes). No se identificó evidencia directa sobre el desgaste profesional ni los síntomas psicológicos. Se desconocen los efectos de la intervención sobre las consecuencias no deseadas, concretamente sobre el ausentismo laboral (un ECA, 60 participantes).

Conclusiones de los autores

El liderazgo colectivo implica que múltiples profesionales compartan puntos de vista y conocimientos con el potencial de influir positivamente en la calidad de la atención y el bienestar del personal. La confianza en los efectos de las intervenciones de liderazgo colectivo sobre la práctica profesional, los desenlaces de la atención sanitaria y el bienestar del personal es moderada en los desenlaces de liderazgo, baja en el rendimiento del equipo y el estrés laboral, y muy baja en los desenlaces de rendimiento clínico, mortalidad de los pacientes hospitalizados y ausentismo del personal. La evidencia fue de certeza moderada, baja y muy baja debido al riesgo de sesgo y la imprecisión, lo que significa que la evidencia futura podría cambiar la interpretación de los resultados. Se necesitan más estudios de alta calidad en este ámbito, con informes coherentes sobre el liderazgo, el rendimiento del equipo, el rendimiento clínico, el estado de salud y los desenlaces de bienestar del personal.

PICO

Population
Intervention
Comparison
Outcome

El uso y la enseñanza del modelo PICO están muy extendidos en el ámbito de la atención sanitaria basada en la evidencia para formular preguntas y estrategias de búsqueda y para caracterizar estudios o metanálisis clínicos. PICO son las siglas en inglés de cuatro posibles componentes de una pregunta de investigación: paciente, población o problema; intervención; comparación; desenlace (outcome).

Para saber más sobre el uso del modelo PICO, puede consultar el Manual Cochrane.

¿El liderazgo colectivo mejora la actuación de los profesionales sanitarios, la atención sanitaria de los pacientes y el bienestar del personal?

Mensajes clave

El liderazgo colectivo implica que varios profesionales compartan puntos de vista y conocimientos. De acuerdo con las pruebas disponibles, no es posible afirmar con seguridad que suponga una gran diferencia para las acciones profesionales, la atención sanitaria de los pacientes o el bienestar del personal. La confianza en estos resultados varía de moderada a muy baja y está limitada en gran medida por la calidad baja y el escaso número de estudios incluidos.

¿Qué se quería averiguar?

El objetivo fue comprobar si las experiencias de liderazgo colectivo (en contraposición a los estilos de liderazgo más centralizados y jerárquicos) mejoran las acciones profesionales, la atención sanitaria de los pacientes y el bienestar del personal. Se buscaron estudios en los que los investigadores compararan el liderazgo colectivo con el centralizado.

¿Qué se hizo?

Se recopilaron y analizaron todos los estudios pertinentes con intervenciones de liderazgo colectivo caracterizadas por decisiones e interacciones compartidas entre las profesiones sanitarias.

¿Qué se encontró?

Se encontraron tres estudios relevantes (955 participantes). Los estudios se realizaron en hospitales de Canadá, Irán y EE.UU. Las intervenciones de liderazgo colectivo probablemente mejoran el liderazgo (tres estudios, 955 participantes), podrían mejorar el trabajo en equipo (un estudio, 164 participantes) y podrían disminuir ligeramente el estrés laboral (un estudio, 164 participantes). Se desconoce si el liderazgo colectivo tiene un efecto sobre estos desenlaces: rendimiento clínico (un estudio, 60 participantes), mortalidad en pacientes hospitalizados (un estudio, 60 participantes) y ausentismo del personal (un estudio, 60 participantes).

¿Cuáles son las limitaciones de la evidencia?

Existe una seguridad moderada de que el liderazgo colectivo mejora el liderazgo en los entornos sanitarios. Las pruebas mostraron que el liderazgo colectivo tuvo un gran efecto en las estrategias de liderazgo. Se tiene menos certeza en los resultados sobre el trabajo en equipo y el estrés laboral. No se tiene confianza en las pruebas relacionadas con el rendimiento clínico, la mortalidad en pacientes hospitalizados ni las ausencias del personal. También es posible que los participantes de los estudios fueran conscientes del tratamiento que estaban recibiendo. No todos los estudios proporcionaron datos sobre todos los aspectos que interesaban en esta revisión. La evidencia se basa en pocos casos.

¿Qué grado de actualización tiene esta revisión?

Se buscaron estudios publicados hasta enero de 2021.

Authors' conclusions

Implications for practice

Collective leadership involves multiple professionals sharing viewpoints and knowledge, with the theoretical potential to influence the quality of care and staff well‐being. However, in the studies analysed, we noticed difficulties to advance collective leadership in practice. This review provided limited evidence about the effects of collective leadership interventions on professional practice, healthcare outcomes and staff well‐being. Our confidence in the effects of collective leadership interventions on professional practice, healthcare outcomes and staff well‐being is moderate in leadership outcomes, low in team performance and work‐related stress, and very low for clinical performance, inpatient mortality and staff absence. The evidence was of moderate, low and very low certainty due to risk of bias and imprecision, meaning that future evidence may change our interpretation of the results. 

Implications for research

The notion of collective leadership is gaining traction in health service delivery debates. However, there is little consensus about how to conceptualise, define or measure collective leadership. More high‐quality studies could shed light on the impact of varied interventions based on different theoretical underpinnings. Further development and evaluation of team‐based training interventions would strengthen the evidence base for collective leadership. Further work is required to assess the consequences of collective leadership interventions on: health status, measured as mortality within 30 days of discharge; staff well‐being, measured as burnout and psychological symptoms; and unintended consequences, including safety of care events such as errors and healthcare‐associated complications. A qualitative evidence synthesis about the factors that influence adoption and sustainability of collective leadership in health care could help inform future intervention development and implementation strategies.

Summary of findings

Open in table viewer
Summary of findings 1. Collective leadership to improve professional practice, healthcare outcomes and staff well‐being compared with usual centralised and hierarchical leadership

Effect of collective leadership to improve professional practice, healthcare outcomes and staff well‐being compared with usual centralised and hierarchical leadership

Patients or population: healthcare professionals

Settings: tertiary care (hospitals: trauma centre and clinical units)

Intervention: collective leadership interventions to improve professional practice, healthcare outcomes and staff well‐being

Comparison: usual approaches to leadership identified as centralised and hierarchical

Outcomes

Impact

Number of 
participants
(studies)

Certainty of the evidence (GRADE)* 

Professional practice

Leadership

Average difference (SMD): 0.78 higher (95% CI 0.19 to 1.38 higher) 

 

Collective leadership probably improves leadershipa,b,c.

955 participants (3 RCTs)

⊕⊕⊕⊖
Moderate. Downgraded due to unclear risk of biasa,b,c and serious imprecision.
Upgraded due to large magnitude of effect.

Team performance

Average difference (SMD): 0.47 higher (95% CI 0.16 to 0.78 higher) 

 

Collective leadership may improve team performancea.

164 participants (1 RCT)

⊕⊕⊖⊖

Low. Downgraded due to unclear risk of biasa and serious imprecision.

Clinical performance

Average difference (SMD): 0.21 higher (95% CI ‐0.30 to 0.72)

 

It is uncertain whether collective leadership improves clinical performancec.

60 participants (1 RCT)

⊕⊖⊖⊖
Very low. Downgraded due to unclear risk of biasc and very serious imprecision.

Health care outcomes

Inpatient mortality

Relative effect (RR): 0.86 lower (95% CI 0.42 to 1.77)

 

Absolute difference: 17 fewer deaths per 1000 patients (95% CI 73 fewer to 96 more per 1000 patients)

 

It is uncertain whether collective leadership decreases inpatient mortalityc

60 participants (1 RCT)

⊕⊖⊖⊖
Very low. Downgraded due to unclear risk of biasc and very serious imprecision.

Staff well‐being

Work‐related stress

 

Average difference (SMD): 0.17 lower (95% CI ‐0.48 to 0.13)

 

Collective leadership may slightly decrease work‐related stressa.

164 participants (1 RCT)

⊕⊕⊖⊖
Low. Downgraded due to unclear risk of biasa and serious imprecision.

Unintended consequences

Staff absence 

Average difference (SMD): 0.2 higher (95% CI ‐0.11 to 0.51) 

 

It is uncertain whether collective leadership prevents staff absencea.

 60 participants (1 RCT)

⊕⊖⊖⊖
Very low. Downgraded due to unclear risk of biasa and very serious imprecision.

GRADE Working Group grades of evidence
High: this research provides a very good indication of the likely effect. The likelihood that the effect will be substantially different is low.
Moderate: this research provides a good indication of the likely effect. The likelihood that the effect will be substantially different is moderate.
Low: this research provides some indication of the likely effect. However, the likelihood that it will be substantially different is high.
Very low: this research does not provide a reliable indication of the likely effect. The likelihood that the effect will be substantially different is very high.

aWeir 1997, which employed the Work Environment Scale, Peer Cohesion Scale, Work Pressure Scale, and measured paid staff absence hours.
bShirazi 2016, which employed the Supportive Leadership Behaviour scale.
cFernandez 2020, which employed the Leadership Taxonomy and Total Patient Care score.

Background

Collective leadership is leadership that involves all staff members and patients, where everyone shares responsibility for the success of patient care and the healthcare service. This contrasts with traditional hierarchical leadership approaches, which are focused on individual capabilities. Collective leadership requires leaders to adopt strategies for collaborative working and collective openness. Leaders that work as a collective group may provide improved patient experience and safety, continuing quality improvement in health care, and create a culture of engagement that promotes staff autonomy, accountability and well‐being (Eckert 2014West 2014a).

Collective leadership is not a new concept, but it is increasingly and strongly advocated by international stakeholders as a key approach for health service delivery, as a response to increasingly complex forms of organisation defined by rapid changes in health technology, professionalisation and growing specialisation. Such a complex environment requires the contribution and leadership of multiple professionals sharing viewpoints and knowledge (Raelin 2011Raelin 2018West 2014a).

Globally, increases in life expectancy and the number of people living with multiple long‐term conditions contribute to greater complexity of healthcare systems. This context requires a collective leadership culture that values and promotes care that is high quality, compassionate (West 2014aWest 2014b), and interprofessional (Folkman 2019Reeves 2010Reeves 2018). This is evidenced by a recent Cochrane Review (Reeves 2017). The case for collective leadership is justified by the need to employ professional skills more effectively, and to bring social and healthcare professionals together to improve quality of care (West 2014aWest 2014b). This is an interactive and interdependent process (Orchard 2019).

Description of the condition

Collective leadership is a way of characterising the engagement of multiple healthcare team members to come together to make decisions and strengthen health service and system performance, towards quality of care enhancement. Leadership challenges in health and social care teams arise partly as a consequence of the siloed, mono‐professional education of health and social care workers, and of restrictive professional regulation that seeks to define the scope of practice for members of some professional groups while restricting activities to others (Reeves 2010). Moreover, historically, the autonomy and work model that has dominated in medicine generates a hierarchical relationship by subordination, limitation and exclusion, giving rise to leadership challenges. In this context, leaders require a change in posture to move team members away from professional competition and towards a patient‐centred and collaborative approach.

Description of the intervention

The review sought to establish the evidence base for collective leadership interventions developed for health and social care professionals, with or without patient involvement, at any level of the healthcare system, including primary, secondary and tertiary care.

Collective leadership is a polysemic concept and the recent literature uses different terms interchangeably, describing leadership as: 'shared' (Aube 2018Aufegger 2019D'Innocenzo 2016Fischer 2018Forsyth 2017Idelji‐Tehrani 2019Jackson 2000Janssens 2021Kelley‐Patterson 2012Merkens 1998Nicolaides 2014Wang 2014); 'collective' (Denis 2001Friedrich 2009McAuliffe 2017Raelin 2011Raelin 2018Ward 2018West 2014aYammarino 2012); 'collectivistic' (De Brún 2019Yammarino 2012); 'distributed' (Gronn 2002); 'collaborative' (Iachini 2019Markle‐Reid 2017Okpala 2018Orchard 2019Sonnino 2016VanVactor 2012Wang 2018); 'team' (Crowe 2017Farh 2018Smith 2018); or 'interprofessional' (McGrath 2019Smith 2018).

For the purposes of the current review, we define collective leadership interventions as those which explicitly aim to improve professional practice, healthcare outcomes, or both, through fostering mechanisms to: 

  • share decisions in an interactive, interdependent process;

  • clarify roles and sharing responsibilities across team members; and

  • empower and motivate staff.

Collective leadership can be established in two different ways: either through a non‐hierarchical leadership with spontaneous leaders (informal/influential) emerging naturally; or through a hierarchical leadership, with leaders formally appointed by the organisation (D'Innocenzo 2016De Brún 2019Nicolaides 2014Wang 2014). The path to collective leadership normally begins with exchanging information for collective situation analysis, sharing meanings, and setting common goals and understanding among team members. It requires partnership between leaders and team members to solve problems through delegation of responsibilities, empowerment and decision‐sharing, to make best use of relevant knowledge and expertise.

In this review, we were concerned with collective leadership interventions that focused on either patient case management, professional management or organisational strategies. Patient case management refers to interventions which involve patients, families, caregivers and professionals. Professional management considers all initiatives that promote professional development through short‐ or long‐term training in general, or specific formal education. This kind of intervention can be organised inside or outside of healthcare facilities in workshops, e‐learning, short courses, co‐design, educational modules and programmes. Organisational strategies include practices and routines to introduce or improve resources, equipment, work dynamics and new clinical protocols pertaining to collective leadership.

Some examples of collective leadership interventions include:

  • a multi‐component leadership intervention that involves having team leaders attend a workshop focused on responsive leadership; implementing care‐team huddles into daily practice; and implementing a team leaders’ support system (Caspar 2017);

  • a supportive leadership workshop designed for head nurses (Shirazi 2016);

  • a simulation with nurses and doctors sharing leadership during cardiopulmonary resuscitation (Armstrong 2020); and

  • a co‐designed collective leadership intervention to improve team performance and safety culture (McAuliffe 2017).

How the intervention might work

We followed the initial programme theory for collective leadership, which depicts context‐mechanism‐outcome configurations (De Brún 2020). De Brún and collaborators described the contextual conditions for collective leadership as team training, co‐design strategies, dedicated time for team reflection, inclusive communication, shared decision‐making processes and strong interpersonal relationships with teams (De Brún 2020). We used this to inform identification of relevant interventions and outcomes.

Collective leadership interventions may result in improved staff engagement, satisfaction, empowerment, collaborative decision‐making, communication, role and goal clarity, understanding about teamwork, mutual respect, trust, self‐confidence in contributing to work (Aufegger 2019De Brún 2019), proactive helping behaviours, patient satisfaction, and improvements in quality and safety care (De Brún 2020). It may also lead to cost reduction (Okpala 2018). Studies have repeatedly suggested a positive relationship between collective leadership approaches and team effectiveness or team performance (Aufegger 2019D'Innocenzo 2016De Brún 2019Nicolaides 2014Wang 2014).

The dynamics of collective leadership are team‐based, involving formal, informal and hierarchical analysis (Yammarino 2012). A key feature of the collective leadership process is having the leader as a facilitator of interactions between team members. This facilitation role aims to generate a common understanding and agreement around team goals (Friedrich 2009Smith 2018), and identify potential areas of contribution by team members. This collective approach to leadership addresses a limitation identified in traditional hierarchical methods, in which leaders are responsible for taking decisions that impact multiple individuals (Raelin 2018Yammarino 2012).

The intervention can be applied equally to health settings across lower‐, middle‐, and high‐income countries. We acknowledge that research to date has focused on the latter. There is no evidence that the intervention can disproportionally affect people from a lower‐ or middle‐income country setting. However, the review was sensitive to such issues and we sought to capture any outcomes or implications that may widen health inequity.

Why it is important to do this review

Inadequate leadership weakens health systems and can contribute to adverse events, including refusal to prioritise and implement safety recommendations consistently, and resistance to addressing staff burnout (Joint Comission 2017). Quality of leadership is consistently identified as a factor that contributes to safety and quality failings, but is also central to solutions that minimise preventable health system issues and enhance staff well‐being (Eckert 2014West 2014aWest 2014b). Contemporary health systems operate with multiple leaders throughout every level; thus, the notion of collective leadership holds critical importance for health system enhancement. Yet we lack evidence of the effects of collective leadership interventions in health and social care teams. Therefore, the current review is both timely and important.

Collective leadership is an important component in developing and sustaining a culture of high‐quality care, which requires a leadership commitment to:

  • translate the collective vision into actions;

  • set clear objectives aligned with organisational priorities;

  • feed back evidence of positive patient outcomes to the team to inform future actions;

  • promote a culture of engagement related to decision‐making; and

  • foster ongoing learning and quality improvement by listening to patients, caregivers and team members (West 2014a).

Previous attempts to summarise the literature on collective leadership in healthcare settings share some limitations, but have suggested promising outcomes for team engagement (Aufegger 2019De Brún 2019), and organisational performance (De Brún 2019). For example, Janssens and collaborators completed a review on shared leadership in healthcare action teams, which focused on emergency care teams (Janssens 2021). They considered different forms of shared leadership: institutionalised practices, sharing‐to‐mentor to develop team member leaders, and spontaneous collaboration to improve ineffective leadership. De Brún and collaborators analysed interventions classified as co‐design, co‐leadership, service improvement, team training and individual team training in health care. They took out MeSH headings to narrow their search and restricted them to publications since 2000, so they may have missed some studies (De Brún 2019). Moreover, a review of acute healthcare teams suggested that shared leadership enables understanding of roles and tasks, awareness, social support, environmental safety and team satisfaction (Aufegger 2019). However, in these reviews, the authors expressed caution about the limitations of the evidence and their approaches.

We are still lacking a high‐quality systematic review that specifically evaluates the effects of collective leadership interventions to improve professional practice, healthcare outcomes and staff well‐being. This hinders health professionals, policy makers and researchers’ ability to plan effective collective leadership interventions.

Objectives

To assess the effects of collective leadership for healthcare providers on professional practice, healthcare outcomes and staff well‐being, when compared with usual centralised leadership approaches.

Methods

Criteria for considering studies for this review

Types of studies

We considered any study meeting the study design criteria from the Cochrane Effective Practice and Organisation of Care Group (EPOC 2017a): randomised trials; non‐randomised trials; controlled before‐and‐after studies with contemporaneous data collection and with two or more control and intervention sites; repeated measures studies and interrupted time series (ITS) studies with a clearly‐defined point in time when the intervention occurred, and at least three data points before and after implementation of the intervention.

We included studies that were available as a full text, regardless of publication status or language. We included abstracts and associated these with full‐text studies. For abstracts of studies that were potentially eligible, we searched for a corresponding publication and also contacted study authors to obtain any unpublished full study report, if available.

Types of participants

We considered study interventions that targeted any type of health and social care team ‐ uniprofessional, multiprofessional or interprofessional ‐ from different professional areas (e.g. chiropodists or podiatrists, complementary therapists, dentists, dieticians, doctors or physicians, hygienists, midwives, nurses, occupational therapists, pharmacists, physiotherapists, psychologists, psychotherapists, radiographers, social workers or speech therapists), with or without the inclusion of patients. We did not exclude studies on the basis of demographic factors or healthcare setting.

Types of interventions

We considered all collective leadership interventions that:

  • had an explicit aim to improve professional practice and healthcare outcomes;

  • included a focus on sharing decisions in an interactive and interdependent process;

  • fostered role clarification and sharing of responsibilities; and

  • aimed to enable staff empowerment and motivation.

Interventions could include workshops, coaching (e.g. Caspar 2017), simulation, protocols (e.g. Armstrong 2020), and role‐playing (e.g. Shirazi 2016). 

We were interested in professional‐directed collective leadership interventions, which may also be referred to as collectivistic leadership, distributed leadership, shared leadership, collaborative leadership, participatory leadership, inclusive leadership, democratic leadership, plural leadership, dispersed leadership, empowering leadership, compassionate leadership, informal leadership, peer leadership and team leadership. We were concerned with collective leadership interventions with a focus on patient case management, professional management or organisational strategies.

For this review, we planned the following comparisons in primary, secondary or tertiary healthcare settings:

  • collective leadership versus usual approaches to leadership identified as centralised and hierarchical;

  • collective leadership educational interventions with or without organisational restructuring;

  • collective leadership interventions targeting frontline teams versus management teams.

We excluded interventions that focused exclusively on strengthening professional collaboration without explicit reference to collective leadership; this allowed us to complement and not overlap with the Cochrane Review on interprofessional collaboration by Reeves 2017.

Types of outcome measures

Primary outcomes

We defined the following primary outcomes.

  • Professional practice

  • Healthcare outcomes

    • Health status (inpatient mortality, mortality within 30 days of discharge)

  • Staff well‐being

    • Burnout, psychological symptoms (anxiety, depression) and work‐related stress, e.g. Professional Quality of Life (ProQOL) (Stamm 2010)

  • Unintended consequences

    • Any unintended safety of care events (including errors, healthcare‐associated complications), e.g. Patient Measure of Safety (PMOS) (McEachan 2014), or professional, organisational and staff consequences

Secondary outcomes

We included these other patient and organisational outcomes as secondary outcomes:

  • Satisfaction

    • Patient satisfaction and experience, e.g. Patient Satisfaction Questionnaire (PSQ) (Ware 1976)

  • Organisational

    • Staff turnover (including intention to quit and absenteeism)

We included studies in the review even if they only reported secondary outcomes.

Search methods for identification of studies

We used a combination of search methods for the identification of studies, including electronic database searches, trial registries and grey literature searching.

Electronic searches

The review authors developed the database search strategy in consultation with the EPOC Information Specialist. We searched the Cochrane Database of Systematic Reviews (CDSR) and the Database of Abstracts of Reviews of Effects (DARE) for related systematic reviews.

We searched the following databases for primary studies on 5 January 2021:

  • Cochrane Central Register of Controlled Trials (CENTRAL; 2021, issue 1), in the Cochrane Library;

  • MEDLINE Ovid (1946 to date of search);

  • Embase Ovid (1974 to date of search);

  • CINAHL EBSCO (Cumulative Index to Nursing and Allied Health Literature; 1982 to date of search);

  • LILACS (Latin American and Caribbean Health Sciences Information database; 1982 to date of search); and

  • Web of Science, Conference Proceedings Citation Index ‐ Science (1990 to date of search).

Search strategies were comprised of natural language and controlled vocabulary terms. We applied no limits on language or publication date. In databases where it was possible and appropriate, we used study design filters to limit to the study designs of interest. For randomised trials in MEDLINE, we used a modified version of the Cochrane Highly Sensitive Search Strategy for identifying randomised trials in MEDLINE: sensitivity‐ and precision‐maximising version (2008 revision) (Lefebvre 2021), with additional terms for other relevant study designs. The MEDLINE strategy was peer reviewed by a second Cochrane Information Specialist. We used limits in CINAHL and Embase to remove MEDLINE records in order to avoid duplication in downloaded results. We de‐duplicated the remaining results in Endnote against each other. Please see all search strategies used in Appendix 1.

Searching other resources

Trial registries

We searched the following trials registries on 5 January 2021:

Grey literature

We conducted a grey literature search to identify studies not indexed in the databases listed above by searching:

We also:

  • reviewed reference lists of all included studies and relevant systematic reviews for additional potentially eligible primary studies;

  • contacted authors of included studies/reviews to clarify reported published information and to seek unpublished results/data;

  • contacted researchers with expertise relevant to the review topic/EPOC interventions;

  • conducted cited reference searches for all included studies in ISI Web of Science, Clarivate and screened individual journals and conference proceedings (handsearch);

  • provided appendices for all strategies used, including a list of sources screened and relevant reviews/primary studies reviewed.

We contacted experts in the field and authors of included studies for advice on other relevant studies. We conducted handsearches in key e‐journals, including the Journal of Leadership and Organizational Studies, BMJ Leader, Leadership Quarterly and Journal of Management.

Data collection and analysis

Selection of studies

We downloaded all titles and abstracts retrieved by electronic searching to a reference management database and removed duplicates. Two review author pairs (JAMS with MP, JAMS with VAM) independently screened titles and abstracts for inclusion. We retrieved the full‐text study reports/publications and two review author pairs (JAMS with MP, JAMS with VAM) independently applied the eligibility criteria to the full texts, identified studies for inclusion, and identified and recorded reasons for exclusion of ineligible studies. We resolved any disagreement through discussion or, when required, through consulting a third review author (AX or RH).

We listed studies that initially appeared to meet the inclusion criteria but that we later excluded in the 'Characteristics of excluded studies' table, with the reasons for their exclusion. We collated multiple reports of the same study so that each study, rather than each report, is the unit of interest in the review. We also provided any information we could obtain about ongoing studies. We recorded the selection process in sufficient detail to complete a PRISMA flow diagram (Liberati 2009).

Data extraction and management

We used the EPOC standard data collection form and adapted it for study characteristics and outcome data (EPOC 2017b); we piloted the form on at least one study in the review. Two review author pairs (JAMS with MP, JAMS with VAM) independently extracted the following study characteristics from the included studies and enter the data into RevMan Web (RevMan Web 2022).

  • Methods: study design, number of study centres and location, study setting, withdrawals, date of study, follow‐up.

  • Participants: number, mean age, age range, gender, eligibility criteria, other relevant characteristics.

  • Interventions: intervention components, comparison, fidelity assessment.

  • Outcomes: main and other outcomes specified and collected, time points reported.

  • Notes: funding for trial, notable conflicts of interest of trial authors, ethical approval.

Two review author pairs (JAMS with MP, JAMS with VAM) independently extracted outcome data from included studies. We noted in the Characteristics of included studies table if a trial reported outcome data in an unusable way. We resolved disagreements by consensus or by involving a third review author (AX or RH).

Assessment of risk of bias in included studies

Two review author pairs (JAMS with MP, JAMS with VAM, or JAMS with HFA) independently assessed the risk of bias for each study, using the criteria outlined in the Cochrane Handbook for Systematic Reviews of Interventions Section 7.1.2 and Chapter 8 (Higgins 2021a), and the guidance from the EPOC Group (EPOC 2017c). We resolved any disagreements through discussion or by involving a third review author (AX or RH). We assessed the risk of bias in randomised trials according to the following domains:

  • random sequence generation;

  • allocation concealment;

  • blinding of participants and personnel;

  • blinding of outcome assessment;

  • incomplete outcome data;

  • selective outcome reporting;

  • baseline outcomes measurement;

  • baseline characteristics;

  • other bias

To analyse the risk of bias in non‐randomised studies, we had planned to use the 'Risk of Bias in Non‐randomized Studies of Interventions' (ROBINS‐I) tool (Sterne 2016). For interrupted time series studies, we had planned to assess the risk of bias according to the following domains (EPOC 2017c):

  • independence from other changes;

  • specification of the shape of effects before analysis;

  • independence from data collection;

  • knowledge of intervention allocation;

  • incomplete outcome data;

  • selective outcome reporting;

  • other bias.

We judged each potential source of bias to be high, low or unclear, and where appropriate, provided a quote from the study report with a justification for our judgement in the risk of bias table. We summarised the risk of bias judgements across different studies for each of the domains listed. We assigned an overall risk of bias assessment (high, moderate or low) to each of the included studies, using the approach suggested in Chapter 8 of the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2021a). We considered studies with a low risk of bias for all key domains, or where it seemed unlikely that bias would have seriously altered the results, to have a low risk of bias. We considered studies where risk of bias in at least one domain was unclear, or where we judged it to have some bias that could plausibly raise doubts about the conclusions, to have an unclear risk of bias. We considered studies to have a high risk of bias if they had a high risk of bias in at least one domain, or if we judged them to have serious bias that decreased the certainty of the conclusions.

We considered blinding separately for different key outcomes where necessary. Where information on risk of bias related to unpublished data or correspondence with a trialist, we noted this in the risk of bias table. We did not exclude studies on the grounds of their risk of bias, but clearly reported the risk of bias when presenting the results of the studies. When considering specific effects, we took into account the risk of bias for the studies that contributed to that outcome.

We conducted the review according to our published protocol and report any deviations from it in the 'Differences between protocol and review' section.

Measures of treatment effect

Where possible, we estimated the effect of the intervention using the risk ratio (RR) and risk difference (RD) for dichotomous data, with the appropriate associated 95% confidence interval (CI); and mean difference (MD) or standardised mean difference (SMD) for continuous data, with their 95% CIs (Higgins 2021a). We ensured that an increase in scores for continuous outcomes was interpreted in the same way for each outcome and explained the direction to the reader. For controlled before‐and‐after, ITS and repeated measures studies, we planned to report measures before and after the intervention, as well as the difference of the periods for specific time points.

Unit of analysis issues

We did not have issues with cluster‐randomised trials that did not account adequately for the effects of the clustering on the effect estimate, so adjustments of the analyses to avoid unit of analysis errors was not required (EPOC 2017d).

Dealing with missing data

We attempted to contact the study authors, asking them to provide missing outcome data. Where this was not possible, we concluded that the missing data introduced serious bias impacting on the overall assessment of results (Sterne 2011).

Assessment of heterogeneity

We planned to assess clinical and methodological heterogeneity by considering study design, participants and how the collective leadership intervention was applied. We were only able to identify one outcome (leadership) for which we judged measures, participants, intervention approach and study designs of the included studies were sufficiently similar to conduct a meta‐analysis and assess statistical heterogeneity (Borenstein 2009). For this outcome, we assessed statistical heterogeneity using the Chi2 statistic and related P value, alongside the I2 statistic with associated percentage values (Higgins 2021a).

Assessment of reporting biases

We were not able to pool more than 10 trials so did not create and examine a funnel plot to explore possible publication biases (Sterne 2011).

Data synthesis

We could only undertake a meta‐analysis for one outcome, for which pooling made sense. For the remaining outcomes, we provided a narrative description of the results. For the meta‐analysis, we combined continuous data from leadership scales that were sufficiently similar (and where direction of effect was the same) using generic inverse variance and standardised mean difference to account for differences in the scales (Higgins 2021a). We used a random‐effects model for the meta‐analysis to account for possible differences among studies in which conditions of the healthcare setting and approach to collective leadership may have varied. We completed the meta‐analysis using the Review Manager (RevMan Web) calculator.

In order to synthesise the remaining outcomes without meta‐analyses, we considered the nine items suggested by Campbell 2020 to guide our reporting:

  • group studies for synthesis;

  • describe the standardised metric and transformation method used;

  • describe the synthesis methods;

  • describe the criteria used to prioritise results for summary and synthesis;

  • investigate heterogeneity in reported effects;

  • assess the certainty of the evidence;

  • describe the data presentation methods;

  • report the results; and

  • report the limitations of the synthesis.

We noted the scales used to measure continuous outcomes, including the range of possible scores and direction of the effect, alongside the mean difference and 95% confidence interval, to enable more meaningful interpretation of results. In the summary of findings table, we present the standardised mean difference, calculated using the RevMan Web calculator, to allow easier comparison of the effect size of collective leadership on different outcomes. For one dichotomous outcome (death), we reported the relative risk next to the 95% confidence interval.

Subgroup analysis and investigation of heterogeneity

We  included all kinds of health professionals involved across all types of clinical settings, but noted length of the intervention, implementation approach, use of explicit intervention protocols, organisational support and frequency of meetings. We planned for but were unable to carry out any of the following subgroup analyses:

  • collective leadership hierarchical interventions subgrouped by setting (primary care, secondary care or tertiary care);

  • collective leadership non‐hierarchical interventions subgrouped by setting (primary care, secondary care or tertiary care);

  • subgroup of collective leadership hierarchical interventions in high‐income versus low‐income setting;

  • subgroup of collective leadership non‐hierarchical interventions in high‐income versus low‐income setting.

To conduct the subgroup analysis, we planned to use all outcomes that were common to at least three studies, performing a separate analysis for each outcome. However, this was not possible with the included studies.

Sensitivity analysis

We planned to perform prespecified sensitivity analyses to assess the robustness of our conclusions and explore their impact on effect sizes. This would have involved the following:

  • restricting the analysis to published studies;

  • restricting the analysis to studies with a low risk of bias; and

  • imputing missing data.

Our included studies and available data did not allow for sensitivity analysis to take place.

Summary of findings and assessment of the certainty of the evidence

Two review authors (JAMS and HFA) independently assessed the certainty of the evidence (high, moderate, low and very low) using the five GRADE considerations (risk of bias, consistency of effect, imprecision, indirectness and publication bias) (Guyatt 2008). We used methods and recommendations described in Chapter 8 and 15 of the Cochrane Handbook for Systematic Reviews of interventions (Higgins 2021a), and the EPOC worksheets (EPOC 2017e), and we used GRADEpro software (GRADEpro GDT). We resolved disagreements on certainty ratings through discussion, provided justification for decisions to downgrade or upgrade the ratings using footnotes in the table, and made comments to aid readers' understanding of the review where necessary. We used plain language statements to report these findings in the review (EPOC 2017e).

We summarised our results in 'summary of findings Table 1' for the main intervention against each comparison described in the Types of interventions. We included the most important outcomes in order to draw conclusions about the certainty of the evidence within the text of the review.

  • Professional practice

    • Leadership

    • Team performance

    • Clinical performance

  • Healthcare outcomes

    • Health status (inpatient mortality, mortality within 30 days of discharge)

  • Staff well‐being

    • Burnout, psychological symptoms (anxiety, depression) and work‐related stress

  • Unintended consequences

    • Any unintended safety of care events (including errors, healthcare‐associated complications) or professional, organisational and staff consequences

We synthesised data on prespecified outcomes of collective leadership along with any other unintended consequences found in the included studies. We used the same eligibility criteria to assess intended (beneficial) and unintended (adverse) effects, in terms of types of studies, types of participants and types of interventions (Peryer 2019).

We considered whether there was any additional outcome information that we could not incorporate into the meta‐analysis to note in the 'Comments' section, and stated if it supported or contradicted the information from the meta‐analysis.

Results

Description of studies

We retrieved 4449 records for title and abstract screening after removing duplicates (n = 809). We shortlisted 102 records for independent full‐text screening by two review author pairs. 

Results of the search

From the full texts, we identified three randomised trials eligible for inclusion. Handsearching did not produce any additional articles. The study selection is summarised in Figure 1


PRISMA study flow diagram

PRISMA study flow diagram

Included studies

We identified three randomised trials that met the inclusion criteria (Fernandez 2020Shirazi 2016Weir 1997). Follow‐up measurements varied, while the level of analysis was on individuals rather than on the team or organisational level (Weir 1997Shirazi 2016Fernandez 2020).

Locations and participants

Studies originated from three different higher‐ and upper‐middle income countries: Canada (Weir 1997), Iran (Shirazi 2016), and the USA (Fernandez 2020). All studies were conducted in tertiary care, including inpatient clinical units (Weir 1997), an academic hospital (Shirazi 2016), and a trauma centre (Fernandez 2020).

There was a total of 955 participants across all the studies. The number of professionals ranged from 60 (Fernandez 2020) to 731 (Shirazi 2016). The mean age varied between 20 (Fernandez 2020) and 44 years old (Weir 1997). Participants included nurse managers (Shirazi 2016Weir 1997), nurses (registered nurses and nurse aids) (Shirazi 2016), general surgery and emergency medicine residents (Fernandez 2020), physical therapists, occupational therapists, social workers, clinical dieticians, respiratory technicians, and laboratory technicians (Weir 1997). Two studies included participants from uniprofessional areas such as medicine (Fernandez 2020) and nursing (Shirazi 2016). The remaining study included participants from different professional areas, including nursing, physical therapists, occupational therapists, social workers, clinical dieticians, respiratory technicians and laboratory technicians (Weir 1997).

Interventions

Two studies included professional management by leadership training called supportive leadership (Shirazi 2016) and team leadership (Fernandez 2020). For an organisational strategy, one study aimed to facilitate a decentralised and participatory style of problem‐solving management (Weir 1997). The three studies adopted varied intervention strategies, including workshops (Shirazi 2016Weir 1997), simulation (Fernandez 2020), and problem‐solving sessions (Weir 1997). 

In Weir 1997, the intervention was to facilitate a decentralised and participatory style of problem‐solving management meeting (participative decision‐making), with unit staff members and their nurse managers versus the more traditional centralised approach to decision‐making. The intervention duration was 12 months, delivered by external consultants with experience in resolving problem situations and developing a helping relationship. All consultants had expertise in mental health consultation and small group work, as well as problem‐based learning in professional, undergraduate and graduate programs. 

Shirazi 2016 developed an interactive workshop on supportive leadership behaviour for head nurses working in academic hospitals; the workshop included supervisors and nursing personnel. One‐day, 8‐hour workshops were conducted by experts in the leadership field, utilising teaching methods, role‐playing, mini‐lectures and the modified 'goldfish bowl technique' (a typical small‐group technique used in medical education).

Fernandez 2020 used simulation‐based training on team leadership and patient care during trauma resuscitation in a medical trauma centre with emergency medicine residents. Attending physicians and senior residents in both emergency medicine and trauma surgery supervised a 4‐hour leadership curriculum focused on assuming a leadership role, sharing information and interpreting data, planning and prioritising tasks, assigning roles and assessing team members' skills, and seeking input and identifying tasks barriers. Sessions began with a 1‐hour didactic session, followed by a series of simulations in which one participant served as the team leader while the second participant was observed. An instructor‐led debrief occurred after each simulation.

Each included study adopted a slightly different conception of collective leadership, though all were characterised by sharing decisions and interactions among health professions. Fernandez 2020 referred to team leadership focused on team performance by team‐oriented goals, facilitating coordination and improving the working environment. Shirazi 2016 applied supportive leadership, as defined by Cummings 2010, as a process in which leaders consider staff needs for motivation, based on intellectual stimulation, individual consideration and effective communication. Weir 1997 adopted a decentralised leadership approach, through participative decision‐making towards effective problem‐solving and relationship skills.

Excluded studies

We excluded 93 studies (see Characteristics of included studies). The most common reasons for exclusion were: ineligible study design (n = 45); ineligible intervention (n = 11); ineligible indication (n = 31), when the study was not about collective leadership as a topic and did not mention any aspects of collective leadership as a concept; ineligible participants (n = 3); ineligible setting (n = 3). We also identified three ongoing studies.

Risk of bias in included studies

We used Cochrane's Risk of Bias 2 (RoB2) tool for assessing the risk of bias in each study. We present a summary of our judgements in Figure 2 and Figure 3. We applied RoB2 for the three included RCTs (Fernandez 2020Shirazi 2016Weir 1997), following the guidance for the revised Cochrane risk of bias tool for randomised trials (Higgins 2021a). The risk of bias for each randomised trial included in this review is available in the Characteristics of included studies, and summarised below.


Risk of bias summary: review authors' judgements about each risk of bias item for each included study

Risk of bias summary: review authors' judgements about each risk of bias item for each included study


Risk of bias graph: review authors' judgements about each risk of bias item

Risk of bias graph: review authors' judgements about each risk of bias item

Allocation

Randomisation process

All studies included presented some concerns with unclear randomisation processes, because of insufficient information provided by the authors (Fernandez 2020; Shirazi 2016; Weir 1997). None of the studies gave information concerning a random component being used in the sequence generation process; the only information about randomisation methods were statements that the studies were randomised. None of the studies gave information about whether the allocation sequence was concealed until participants were enrolled and assigned to the intervention. No baseline imbalances were apparent between groups to suggest a problem with the randomisation process.

Blinding

Deviations from intended interventions

We assessed all three studies as having an unclear risk of bias in this domain because they provided no information about deviations from the intended intervention due to the trial context. Additionally, none of the studies provided information about whether participants were aware of their assigned intervention during the trial. In all three studies, the people delivering the intervention were probably aware of participants' assigned intervention during the trial (Fernandez 2020Shirazi 2016Weir 1997). In Shirazi 2016, allocation was concealed from participants; however, the nature of the intervention gives strong reason to believe participants could become aware of their assigned intervention during the trial. 

Incomplete outcome data

Missing outcome data

We judged two studies to have a low risk of bias in this domain (Fernandez 2020Shirazi 2016). Outcome data were available for nearly all randomised participants. These studies gave no information about the reasons for withdrawals, which were balanced between the groups. For all three included studies, there is no evidence that the result was not biased by missing outcome data. It is not probable that missingness in the outcomes could depend on true value. We judged Weir 1997 as having an unclear risk of bias because outcome data were not available for all randomised participants. Missingness in the outcome could depend on its true value, but this is unlikely.

Selective reporting

Measurement of the outcome

In one study (Fernandez 2020), we assessed the risk of bias as low because the methods of measuring the outcomes were appropriate. The measurement of the outcomes did not differ between groups. The raters were blind to study hypotheses and experimental conditions of participants (Fernandez 2020). We assessed the remaining two studies as having an unclear risk of bias because their methods of measuring the outcomes were probably appropriate (Shirazi 2016Weir 1997). These studies used a validated tool, though details of its validity were not provided. The measurement of the outcome did not differ between groups. Study participants were the outcome assessors, and given the self‐reported outcome, were probably aware of intervention status. Assessment of the outcome could have been influenced by knowledge of the intervention received, but there is no strong reason to believe that it did.

Selection of the reported result

We judged all three studies to have an unclear risk of bias because they did not analyse data in accordance with a prespecified plan finalised before unblinded outcome data were available for analysis (Fernandez 2020Shirazi 2016Weir 1997). Authors' intentions for their analysis were unavailable, or their intentions were not reported in sufficient detail to enable an assessment. The results reported were not likely to have been selected, on the basis of the results, from multiple eligible outcome measurements (e.g. scales, definitions, time points) within the outcome domain.  

Other potential sources of bias

We did not identify other potential sources of bias.

Effects of interventions

See: Summary of findings 1 Collective leadership to improve professional practice, healthcare outcomes and staff well‐being compared with usual centralised and hierarchical leadership

We were only able to complete a meta‐analysis for one outcome (leadership) and completed a narrative synthesis for other outcomes. A meta‐analysis of effect estimates was not possible for all outcomes because of incompletely‐reported outcome/effect estimates, and different effect measures across studies alongside risk of bias in the evidence (McKenzie 2021). 

See summary of findings Table 1.

Professional practice: leadership, team performance and clinical performance

Collective leadership interventions probably improve leadership (3 RCTs, 955 participants), and may improve team performance; however, we are uncertain about their effect on clinical performance. The studies included in this review adopted different scales. To facilitate interpretation of the results, we describe these scales below, together with significant results. 

Leadership

Three studies reported on leadership outcomes using different scales. Weir 1997 applied a composite measurement with outcome variables from the Work Environment Scale (WES) (Moos 1981). The WES is a 90‐item, true‐false, self‐reported questionnaire consisting of ten subscales; scores on each subscale can range from 0 to 9. The subscale on 'supervisory support' offers an indicator for leadership, for which no statistically significant differences were found (mean difference (MD) 2.73 favouring the intervention, 95% confidence interval (CI) ‐1.98 to 7.44).   

Shirazi 2016 developed a Supportive Leadership Behaviour (SLB) questionnaire, using items from three different leadership scales: Ohio State (OSQ) (Larsson 2006); Developmental leadership (DL) (Hersey 1979), and Hersey and Blanchard's situational theory (HBS) (Halpin 1962). Some items related to SLB were extracted from these questionnaires and some new questions were developed and validated by the authors. The SLB final version is an instrument with 40 items organised in a 5‐point Likert scale with four dimensions: support for development, integrity, sincerity and recognition. Each item is scored between 1 and 5, yielding an overall score ranging from 40 to 200. Statistically significant results were found for overall leadership at three months' follow‐up, favouring the intervention (MD 21.41, 95% CI 18.16 to 24.61).

Fernandez 2020 adapted a leadership taxonomy from Rosenman 2016, designed to capture a team leader's best effort when targeting behaviours and communication events. It consists of 10 items grouped under five dimensions, with an overall score ranging from zero to 38. There was a statistically significant improvement in overall leadership score, favouring the intervention (MD 4.06, 95% CI 2.45 to 5.67).

We combined data from the three studies in a meta‐analysis using a random‐effects model and reporting standardised mean difference, taking into account the differences in the leadership scales used. We found an effect size of 0.78 (95% CI 0.19 to 1.38; P = 0.01), which, following the standard rule of thumb, we interpret as moderate difference (Analysis 1.1).

Team performance

One study provided outcomes indicative of team performance as peer cohesion, relationship and task orientation using the WES (Weir 1997). A statistically significant result was only found in the peer cohesion subscale, favouring the intervention (MD 7.08, 95% CI 2.45 to 11.71). 

Clinical performance 

One study reported on a clinical performance outcome as patient care overall score (maximum scores ranging from 20 to 38), but without a statistically significant result (MD 2, 95% CI ‐2.78 to 6.78) (Fernandez 2020).

Healthcare outcomes: health status (inpatient mortality, mortality within 30 days of discharge)

We are uncertain of the collective leadership interventions' effect on health care outcomes. Only one study reported patient deaths, but not specifically at 30 days from discharge, with no statistically significant results (relative risk 0.86, 95% CI 0.42 to 1.77) (Fernandez 2020).

Staff well‐being: burnout, psychological symptoms (anxiety, depression) and work‐related stress

We are uncertain of the collective leadership interventions' effect on staff well‐being. One study described the effects of work‐related stress as 'work pressure' using the WES, but without any statistically significant results (MD ‐2.24, 95% CI ‐6.18 to 1.70) (Weir 1997).

Unintended consequences: any unintended safety of care events (including errors, healthcare‐associated complications) or professional, organisational and staff consequences

We are uncertain of the effect collective leadership interventions have on unintended consequences. One study reported on staff absences (paid hours) as organisational and staff unintended consequences, without a statistically significant effect (MD 20.35, 95% CI ‐10.65 to 51.35) (Fernandez 2020).

Discussion

Summary of main results

Three RCTs met our inclusion criteria. Overall, 955 participants were involved in these studies. Collective leadership interventions in these studies were conducted in Canada, Iran and the USA in tertiary care, including a trauma centre and clinical units. There was variation in participants, interventions and measures for quantifying outcomes. We were only able to complete a meta‐analysis for one outcome (leadership); we completed a narrative synthesis for other outcomes.

Collective leadership interventions probably improve leadership (3 RCTs, 955 participants), and may improve team performance (1 RCT, 164 participants). We are uncertain about the collective leadership effect on clinical performance (1 RCT, 60 participants). We are uncertain about the effect on healthcare outcomes, including health status (inpatient mortality) (1 RCT, 60 participants). We identified no direct evidence related to mortality within 30 days of discharge. Collective leadership interventions may slightly improve staff well‐being by reducing work‐related stress (1 RCT, 164 participants). We are uncertain of the intervention effects on unintended consequences, specifically staff absence (1 RCT, 60 participants). Most of this evidence comes from studies with small numbers of participants. 

The majority of interventions were uniprofessional and focused on professional management. There was little consensus about the operationalisation of collective leadership, although the studies shared commonalities in their definition of collective leadership as sharing decisions and interactions among professionals.

Overall completeness and applicability of evidence

Many of the relevant outcomes in our protocol were not reported in the included studies. None of the included studies evaluated the impact of collective leadership interventions on staff well‐being (specifically, burnout or psychological symptoms (anxiety, depression)) or unintended consequences in professional or organisational outcomes. We found no studies involving unintended safety of care events (including errors and healthcare‐associated complications).

The interventions were heterogeneous in nature. Two studies were conducted in higher‐income countries, and one from an upper‐middle‐income country, which makes inference on the generalisability of the observed effects of the interventions to lower‐ and middle‐income countries difficult.

Quality of the evidence

All included studies were randomised controlled trials (RCTs). Our confidence in the effects of collective leadership interventions on professional practice, healthcare outcomes and staff well‐being is moderate in leadership outcomes, low in team performance and work‐related stress, and very low for clinical performance, inpatient mortality and staff absence outcomes. The evidence was of moderate, low and very low certainty, meaning future evidence may change our interpretation of the results. Reasons for downgrading the certainty of evidence included risk of bias when all studies were judged at unclear risk of bias; and imprecision, when the absolute number of events was low, confidence intervals were wide and included no difference, or both.

Potential biases in the review process

We carried out the review in accordance with EPOC guidelines and using the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2021aHiggins 2021b). The COVID‐19 pandemic led to significant and unexpected workload increases for all review team members. The scarcity of intervention studies in the collective leadership field led to the inclusion of only three small studies with a negative impact on the certainty of the evidence produced. We could not conduct subgroup analysis and investigation of heterogeneity as planned, because the presentation of the outcomes was diverse among the included studies. We were also unable to retrieve some missing data from study authors. It is possible that non‐reporting of information in the published articles may have influenced the risk of bias assessments.

Agreements and disagreements with other studies or reviews

We did not identify any reviews that address the same objectives as this review. Related reviews have pointed to similar results with regard to team engagement and performance outcomes, though our confidence about this evidence is low (Aufegger 2019De Brún 2019Janssens 2021). This study substantially expands the scope and comprehensiveness of previous reviews. In our review, we identified different effects of collective leadership interventions, including: professional practice – leadership and clinical performance; healthcare outcomes – inpatient mortality; staff well‐being – work‐related stress; and unintended consequences – staff absence. Like this review, other reviews were limited by methodological aspects of the included studies, heterogeneous interventions, comparison groups and multiple outcomes.

While collective leadership has the potential for positive influences on professional practice, healthcare outcomes and staff well‐being, only three studies reported outcomes potentially attributable to collective leadership and the quality of evidence was moderate, low and very low, respectively.

The low number of included studies is indicative of the limited literature examining the effects of collective leadership models in health care. This paucity could potentially be explained by the challenges inherent in implementing collective leadership approaches in health care due to the established hierarchical leadership models and barriers to implementing change in real‐life healthcare contexts (Aufegger 2019De Brún 2019Janssens 2021). Further studies on collective leadership are required to develop an understanding of how collective leadership relates to professional practice, healthcare outcomes and staff well‐being.

PRISMA study flow diagram

Figuras y tablas -
Figure 1

PRISMA study flow diagram

Risk of bias summary: review authors' judgements about each risk of bias item for each included study

Figuras y tablas -
Figure 2

Risk of bias summary: review authors' judgements about each risk of bias item for each included study

Risk of bias graph: review authors' judgements about each risk of bias item

Figuras y tablas -
Figure 3

Risk of bias graph: review authors' judgements about each risk of bias item

Comparison 1: Professional practice, Outcome 1: Leadership

Figuras y tablas -
Analysis 1.1

Comparison 1: Professional practice, Outcome 1: Leadership

Summary of findings 1. Collective leadership to improve professional practice, healthcare outcomes and staff well‐being compared with usual centralised and hierarchical leadership

Effect of collective leadership to improve professional practice, healthcare outcomes and staff well‐being compared with usual centralised and hierarchical leadership

Patients or population: healthcare professionals

Settings: tertiary care (hospitals: trauma centre and clinical units)

Intervention: collective leadership interventions to improve professional practice, healthcare outcomes and staff well‐being

Comparison: usual approaches to leadership identified as centralised and hierarchical

Outcomes

Impact

Number of 
participants
(studies)

Certainty of the evidence (GRADE)* 

Professional practice

Leadership

Average difference (SMD): 0.78 higher (95% CI 0.19 to 1.38 higher) 

 

Collective leadership probably improves leadershipa,b,c.

955 participants (3 RCTs)

⊕⊕⊕⊖
Moderate. Downgraded due to unclear risk of biasa,b,c and serious imprecision.
Upgraded due to large magnitude of effect.

Team performance

Average difference (SMD): 0.47 higher (95% CI 0.16 to 0.78 higher) 

 

Collective leadership may improve team performancea.

164 participants (1 RCT)

⊕⊕⊖⊖

Low. Downgraded due to unclear risk of biasa and serious imprecision.

Clinical performance

Average difference (SMD): 0.21 higher (95% CI ‐0.30 to 0.72)

 

It is uncertain whether collective leadership improves clinical performancec.

60 participants (1 RCT)

⊕⊖⊖⊖
Very low. Downgraded due to unclear risk of biasc and very serious imprecision.

Health care outcomes

Inpatient mortality

Relative effect (RR): 0.86 lower (95% CI 0.42 to 1.77)

 

Absolute difference: 17 fewer deaths per 1000 patients (95% CI 73 fewer to 96 more per 1000 patients)

 

It is uncertain whether collective leadership decreases inpatient mortalityc

60 participants (1 RCT)

⊕⊖⊖⊖
Very low. Downgraded due to unclear risk of biasc and very serious imprecision.

Staff well‐being

Work‐related stress

 

Average difference (SMD): 0.17 lower (95% CI ‐0.48 to 0.13)

 

Collective leadership may slightly decrease work‐related stressa.

164 participants (1 RCT)

⊕⊕⊖⊖
Low. Downgraded due to unclear risk of biasa and serious imprecision.

Unintended consequences

Staff absence 

Average difference (SMD): 0.2 higher (95% CI ‐0.11 to 0.51) 

 

It is uncertain whether collective leadership prevents staff absencea.

 60 participants (1 RCT)

⊕⊖⊖⊖
Very low. Downgraded due to unclear risk of biasa and very serious imprecision.

GRADE Working Group grades of evidence
High: this research provides a very good indication of the likely effect. The likelihood that the effect will be substantially different is low.
Moderate: this research provides a good indication of the likely effect. The likelihood that the effect will be substantially different is moderate.
Low: this research provides some indication of the likely effect. However, the likelihood that it will be substantially different is high.
Very low: this research does not provide a reliable indication of the likely effect. The likelihood that the effect will be substantially different is very high.

aWeir 1997, which employed the Work Environment Scale, Peer Cohesion Scale, Work Pressure Scale, and measured paid staff absence hours.
bShirazi 2016, which employed the Supportive Leadership Behaviour scale.
cFernandez 2020, which employed the Leadership Taxonomy and Total Patient Care score.

Figuras y tablas -
Summary of findings 1. Collective leadership to improve professional practice, healthcare outcomes and staff well‐being compared with usual centralised and hierarchical leadership
Comparison 1. Professional practice

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1.1 Leadership Show forest plot

3

955

Std. Mean Difference (IV, Random, 95% CI)

0.78 [0.19, 1.38]

Figuras y tablas -
Comparison 1. Professional practice