Current guidelines recommend noninvasive ventilation (NIV) over high-flow nasal cannula (HFNC) in patients at high risk of extubation failure. This systematic review and meta-analysis aimed to compare the efficacy of HFNC vs NIV in post-extubation patients at high risk of reintubation.
MethodsA systematic search was conducted in PubMed, Embase, Cochrane Library, and ClinicalTrials.gov without language restrictions. Randomized controlled trials (RCTs) evaluating HFNC vs NIV in post-extubation patients were included. The primary outcome was reintubation within 72h. Effect estimates were pooled as risk ratios (RR) with 95%CI. Certainty of evidence was assessed using the GRADE (Grading of Recommendations Assessment, Development, and Evaluation) framework and categorized as high, moderate, low, or very low.
ResultsEleven RCTs (n=2765 patients) met inclusion criteria. No statistically significant differences were observed between HFNC and NIV for reintubation within 72h (RR, 1.22; 95%CI, 0.83–1.80; moderate), reintubation within 7 days (RR, 1.23; 95%CI, 0.90–1.69; low), post-extubation respiratory failure (RR, 0.82; 95%CI, 0.66–1.02; moderate), intensive care unit (ICU) mortality (RR, 0.90; 95%CI, 0.63–1.29; very low), in-hospital mortality (RR, 0.96; 95%CI, 0.74–1.25; moderate), or 28-day mortality (RR, 0.99; 95%CI, 0.59–1.65; moderate).
ConclusionsCompared with NIV, HFNC was not associated with increased reintubation or mortality. However, variability in study definitions may limit direct applicability to bedside decision-making in individual high-risk patients.
Noninvasive ventilation (NIV) has been proposed as a strategy to prevent post-extubation respiratory failure and the need for reintubation, particularly in patients at high risk of extubation failure [1].
There is no universally accepted definition of “patients at high risk of extubation failure.” In general, this population includes patients who develop hypercapnia during the spontaneous breathing trial (SBT), those with chronic cardiac or respiratory disease, older adults, and individuals with impaired airway patency [2].
Guidelines from the European Respiratory Society (ERS) and the American Thoracic Society (ATS) recommend NIV over high-flow nasal cannula (HFNC) in patients at high risk of extubation failure, unless contraindications to NIV are present [1,3].
Compared with NIV, HFNC offers improved patient comfort, reduces adverse events related to interface intolerance, and may represent a more tolerable alternative [4]. Its ease of use and better tolerability have led to increasing adoption in post-extubation care [5]. However, its role in patients at high risk of reintubation remains uncertain.
Therefore, the aim of this study was to perform a systematic review and meta-analysis of RCTs to determine whether HFNC is noninferior to NIV in post-extubation patients at high risk of reintubation.
MethodsData sources and search strategyThis systematic review and meta-analysis was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement [6]. The protocol was registered in April 2025 in the PROSPERO database (ID: 1036660; registration pending confirmation).
PubMed, Embase, Cochrane Library, and ClinicalTrials.gov were searched for relevant studies published up to April 27, 2025.
Study selectionGiven heterogeneity in definitions and inclusion criteria across studies, which may introduce variability in study populations, we selected the relatively objective outcome of tracheal reintubation rate as the primary inclusion criterion.
Studies were eligible if they met the following criteria: (a) included adult post-extubation patients at high risk of reintubation (defined as a reintubation rate ≥10% at 72h or 7 days in either HFNC or NIV groups); (b) used a randomized controlled design comparing HFNC with NIV (including bilevel positive airway pressure [BiPAP]); and (c) reported outcomes including reintubation or mortality.
The PubMed search strategy included the following terms: (“high flow” OR “HFNC” OR “HFNO” OR “HFNT” OR “HFNP”) AND (“NIV” OR “non-invasive” OR “noninvasive” OR “BiPAP” OR “NIPPV”) AND (“high risk” OR “reintubation” OR “extubation” OR “extubating”).
Reference lists of relevant review articles were also screened to identify additional eligible studies. No language restrictions were applied. Full search strategies for all databases are provided in Table S1.
Data extraction and quality assessmentTwo reviewers independently screened articles according to the predefined inclusion criteria. Discrepancies were resolved by consensus. Standardized data extraction forms were used to collect relevant information from the text, tables, and figures of each included trial, including study characteristics (author, year of publication or last update), patient demographics (sample size, age), and baseline clinical variables (body mass index [BMI], arterial pH, arterial carbon dioxide partial pressure [PaCO2], arterial oxygen partial pressure [PaO2], and PaO2/fraction of inspired oxygen [FiO2]).
Outcomes extracted included reintubation within 72h, reintubation within 7 days, intensive care unit (ICU) mortality, in-hospital mortality, 28-day mortality, post-extubation respiratory failure, ventilator-associated pneumonia (VAP), and treatment switching (HFNC to NIV or NIV to HFNC).
Risk of bias assessmentTwo reviewers independently assessed the risk of bias using the Cochrane Risk of Bias 2 (RoB 2) tool for randomized controlled trials [7]. Five domains were evaluated: randomization process, deviations from intended interventions, missing outcome data, outcome measurement, and selection of the reported results.
EndpointsThe primary endpoint was reintubation within 72h. Secondary endpoints included reintubation within 7 days, post-extubation respiratory failure, treatment switching, ICU mortality, in-hospital mortality, 28-day mortality, and ventilator-associated pneumonia (VAP).
Data synthesis and statistical analysisMeta-analyses were performed when appropriate; otherwise, results were summarized narratively. Statistical analyses were conducted using Review Manager (RevMan), version 5.1 (The Cochrane Collaboration).
For dichotomous outcomes, pooled estimates were expressed as risk ratios (RR) with 95%CI. Heterogeneity was assessed using the χ2 test and the I2 statistic. A chi-square test with P≤.10 or I2≥50% was considered indicative of significant heterogeneity. A random-effects model was applied.
Prespecified subgroup analyses were conducted according to PaCO2 levels. Statistical significance was defined as P<.05. Sensitivity analyses included restriction to trials at low risk of bias and leave-one-out analyses.
Publication bias was assessed using Egger's linear regression test (Stata version 12.0), with P<.05 indicating potential bias.
Trial sequential analysis (TSA)Trial sequential analysis (TSA) was performed using TSA software (version 0.9.5.10) to evaluate the robustness of the evidence with respect to type I and type II errors and to estimate the required information size (RIS), defined as the cumulative sample size needed to detect a statistically significant difference between treatments [8].
Trial sequential monitoring boundaries (TSMBs) were calculated to determine statistical significance adjusted for accrued sample size. Evidence was considered conclusive when the Z-curve crossed the TSMB for benefit or harm before reaching the RIS.
Certainty of evidenceThe certainty of evidence for each outcome was assessed using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) framework [9].
ResultsStudy selection and characteristicsOf 1901 records identified in the initial search, 37 articles were retrieved for full-text assessment, and 11 RCTs [10–20] met the inclusion criteria (Fig. 1). Baseline characteristics of the included studies are presented in Table 1.
Baseline characteristics of trials included in the meta-analysis.
| Study (Ref. no.) | Year | Timing of intervention | Device/mode | n | Age, y(SD) | Male, % | BMI, kg/m2(SD) | PaCO2, mmHg(SD) | PaO2, mmHg(SD) | Arterial pH(SD) | PaO2/FiO2(SD) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| De Jong [10] | 2023 | Within 30min after extubation | HFNC/BiPAP | 246/245 | 61 (14)/61 (14) | 58/63 | 35.5 (5.3)/35.4 (4.9) | 39 (8)/39 (8) | 90 (33)/91 (30) | 7.42 (0.07)/7.42 (0.07) | 246 (98)/255 (95) |
| Fang [11] | 2021 | After extubation | HFNC (Fisher & Paykel)/BiPAP (V60, Philips) | 20/24 | 67.9 (6.9)/72.3 (7.8) | 70/46 | 24.5 (3.5)/26.3 (10.5) | 39.4 (6.5)/40.0 (8.9) | 106.4 (22.2)/109.9 (21.3) | 7.40 (0.06)/7.41 (0.07) | NR |
| Hernández [12] | 2016 | Immediately after extubation | HFNC (Optiflow, Fisher & Paykel)/BiPAP (Vision, Respironics) | 290/314 | 64.6 (15.4)/64.4 (15.8) | 64/64 | NR | 41 (2.2)/39 (3.2) | NR | 7.39 (0.3)/7.40 (0.2) | 191 (34)/194 (37) |
| Hernández [13] | 2022 | Immediately after extubation | HFNC (Optiflow)/BiPAP (V60/V60 Plus, Philips) | 90/92 | 59.9 (15.4)/60.9 (14.3) | 56/73 | NR | 41.6 (6.9)/44.2 (8.6) | 102.7 (34.8)/116.2 (41.1) | 7.42 (0.1)/7.39 (0.2) | NR |
| Hernández [14] | 2025 | Immediately after extubation | HFNC (Airvo-2)/BiPAP (V60/V60 Plus) | 72/72 | 60 (49–67)*/62 (54–67)* | 56/54 | 33 (32–35)*/33 (32–36)* | 40 (38–42)*/39 (35–42)* | 104 (89–127)*/103 (86–126)* | 7.41 (7.36–7.46)*/7.45 (7.42–7.49)* | NR |
| Ketan [15] | 2023 | Immediately after extubation | HFNC/BiPAP | 30/32 | 65.3 (7.8)/65.4 (9.8) | 67/69 | NR | 48 (7.9) 47 (4.8) | 70.4 (11.8)/65.7 (11.3) | 7.42 (0.04)/7.42 (0.03) | 244.7 (72)/267.8 (48) |
| Kumari [16] | 2024 | After extubation | HFNC/BiPAP | 30/30 | 53 (16.7)/54 (18.7) | 70/67 | 27.2 (2.7)/28.1 (3.1) | 41.7 (6.6)/42.5 (6.6) | NR | 7.40 (0.06)/7.40 (0.07) | 253 (70)/268 (71) |
| Stéphan [15] | 2015 | After extubation | HFNC (Optiflow)/BiPAP (Vision, Evita XL/4, Monnal T75) | 414/416 | 63.8 (NR)/63.9 (NR) | 66/67 | 28.3 (NR)/28.2 (NR) | 38.7 (NR)/39.1 (NR) | NR | 7.39 (NR)/7.39 (NR) | 196 (NR)/203 (NR) |
| Tan [18] | 2020 | Immediately after extubation | HFNC (Airvo-2)/BiPAP (V60, Vision) | 44/42 | 68.4 (9.3)/71.4 (7.8) | 64/56 | NR | 51 (48–58)*/53 (49–61)* | NR | 7.48 (7.42–7.51)*/7.45 (7.40–7.49)* | 239 (47)/229 (42) |
| Tongyoo [19] | 2021 | Immediately after extubation | HFNC (Optiflow)/BiPAP (Carina, Dräger) | 112/110 | 62.6 (18.0)/63.0 (17.5) | 54/56 | NR | 34.2 (6.8)/35.3 (6.6) | 139.5 (41.2)/138.4 (37.2) | 7.45 (0.05)/7.45 (0.06) | 351 (101)/348 (89) |
| Tseng [20] | 2023 | After extubation | HFNC (Optiflow)/BiPAP (Trilogy 202) | 20/20 | 74 (11)/73 (13) | 50/65 | NR | 39 (8.3)/38 (6.4) | NR | 7.50 (0.04)/7.50 (0.05) | 316 (139)/373 (150) |
BiPAP, bi-level positive airway pressure; BMI, body mass index; FiO2, fraction of inspired oxygen; HFNC, high-flow nasal cannula; NR, not reported; PaCO2, arterial carbon dioxide partial pressure; PaO2, arterial oxygen partial pressure; SD, standard deviation.
This meta-analysis focused on the preventive use of HFNC and NIV following extubation. Among the 11 included RCTs, 6 were multicenter [10,12–14,17,18] and 5 were single-center [11,15,16,19,20] studies.
The study populations varied: one trial included patients with obesity (BMI≥30) [10]; three included patients with acute exacerbation of COPD (AECOPD) [11,15,18]; two included patients with at least one risk factor for reintubation [12,16]; one included patients with ≥4 risk factors [13]; one included patients with BMI >30 and ≤3 risk factors [14]; one included patients after cardiothoracic surgery [17]; one included patients with sepsis [19]; and one included patients receiving prolonged mechanical ventilation (≥14 days) [20].
All included trials were published within the past 10 years. Sample sizes ranged from 40 to 830 participants. A total of 2765 patients were included, with 1368 assigned to HFNC and 1397 to NIV.
Details of HFNC and NIV settings are summarized in Table S2.
Risk of bias assessments are presented in Fig. S1. Four studies (36.4%) were judged to be at low risk of bias, whereas seven studies (63.6%) raised some concerns. No study was classified as high risk of bias.
Reintubation within 72hData on reintubation within 72h were extracted from 6 RCTs (1479 patients). There was no statistically significant difference between groups (RR, 1.22; 95%CI, 0.83–1.80; P=.32; moderate-certainty evidence [Fig. 2A]), with rates of 18.41% in the HFNC group and 15.31% in the NIV group.
Moderate heterogeneity was observed (I2=47%; P=.09). Egger's test (P=.80) did not indicate publication bias. Trial sequential analysis (TSA) showed that neither conventional thresholds for benefit or harm nor trial sequential monitoring boundaries (TSMBs) for benefit, harm, or futility were reached (Fig. S2). Leave-one-out sensitivity analysis confirmed the robustness of the pooled estimate (Table S3).
Reintubation within 7 daysData from 7 RCTs (1952 patients) were available. There was no statistically significant difference between groups (RR, 1.23; 95%CI, 0.90–1.69; P=.20; low-certainty evidence [Fig. 2B]), with rates of 19.32% vs 16.24%.
Moderate heterogeneity was observed (I2=53%; P=.05). Egger's test (P=.06) did not demonstrate significant publication bias. TSA indicated that neither conventional thresholds nor TSMBs were reached (Fig. S3). Leave-one-out sensitivity analysis confirmed the robustness of the pooled estimate (Table S3).
Post-extubation respiratory failureFour RCTs (1420 patients) reported this outcome. Although minor variations existed, definitions were broadly consistent across studies and included respiratory acidosis, severe hypoxemia, increased respiratory rate, decreased level of consciousness, respiratory muscle fatigue, hypotension, and ineffective cough (Table S4).
There was no statistically significant difference between groups (RR, 0.82; 95%CI, 0.66–1.02; P=.07; moderate-certainty evidence [Fig. 2C]). The rate was 23.96% in the HFNC group and 30.57% in the NIV group.
Low heterogeneity was observed (I2=33%; P=.22). Egger's test (P=.35) did not indicate publication bias. TSA showed that no thresholds or TSMBs were reached (Fig. S4).
However, leave-one-out sensitivity analysis indicated instability of the pooled estimate, as statistical significance changed after exclusion of the study by Hernández et al [13] (Table S3).
Treatment switchFour RCTs (1200 patients) reported treatment switching. There was no statistically significant difference between groups (RR, 1.05; 95%CI, 0.43–2.57; P=.91; low-certainty evidence [Fig. 2D]), with rates of 11.00% vs 8.83%.
Substantial heterogeneity was observed (I2=68%; P=.02). Egger's test (P=.996) did not indicate publication bias. TSA showed that no thresholds or TSMBs were reached (Fig. S5). Leave-one-out sensitivity analysis confirmed the robustness of the pooled estimate (Table S3).
ICU mortalitySix RCTs (2471 patients) reported ICU mortality. There was no statistically significant difference between groups (RR, 0.90; 95%CI, 0.63–1.29; P=.56; very low-certainty evidence [Fig. 3A]). Mortality rates were 6.54% in the HFNC group and 7.05% in the NIV group.
Low heterogeneity was observed (I2=27%; P=.23). Egger's test (P=.007) suggested the presence of publication bias. TSA indicated that no thresholds or TSMBs were reached (Fig. S6). Leave-one-out sensitivity analysis confirmed the robustness of the pooled estimate (Table S3).
In-hospital mortalityFive RCTs (1196 patients) reported in-hospital mortality. There was no statistically significant difference between groups (RR, 0.96; 95%CI, 0.74–1.25; P=.77; moderate-certainty evidence [Fig. 3B]). Mortality rates were 16.78% vs 17.48%.
No heterogeneity was observed (I2=4%; P=.39). Egger's test (P=.32) did not indicate publication bias. TSA showed that no thresholds or TSMBs were reached (Fig. S7). Leave-one-out sensitivity analysis confirmed the robustness of the pooled estimate (Table S3).
28-Day mortalityData on 28-day mortality were extracted from 3 RCTs (797 patients). There was no statistically significant difference between groups (RR, 0.99; 95%CI, 0.59–1.65; P=.96; moderate-certainty evidence [Fig. 3C]).
No heterogeneity was observed (I2=0%; P=.76). The 28-day mortality was 6.73% in the HFNC group and 6.82% in the NIV group. Egger's test (P=.64) did not indicate publication bias. TSA showed that neither conventional thresholds for benefit or harm nor trial sequential monitoring boundaries (TSMBs) for benefit, harm, or futility were reached (Fig. S8). Leave-one-out sensitivity analysis confirmed the robustness of the pooled estimate (Table S3).
Ventilator-associated pneumoniaThree RCTs (930 patients) reported ventilator-associated pneumonia (VAP) occurring after reintubation. In the included studies, VAP was defined consistently as fever (temperature >38°C) or abnormal leukocyte count (>12,000/mL or <4000/mL), plus new purulent endotracheal secretions or change in sputum, in combination with new, progressive, or persistent infiltrate, consolidation, or cavitation, and significant pathogen culture (>105cfu/mL in semiquantitative endotracheal aspirate, >104cfu/mL in bronchoalveolar lavage fluid, or >103cfu/mL in protected brush specimens).21
There was no statistically significant difference between groups (RR, 0.97; 95%CI, 0.56–1.68; P=.91; moderate-certainty evidence [Fig. 3D]), with rates of 5.31% vs 5.44%.
No heterogeneity was observed (I2=0%; P=.49). Egger's test (P=.38) did not indicate publication bias. TSA showed that no thresholds or TSMBs were reached (Fig. S9). Leave-one-out sensitivity analysis confirmed the robustness of the pooled estimate (Table S3).
Certainty of evidenceThe GRADE assessment is summarized in Table S5. Certainty of evidence was rated as moderate for reintubation within 72h, post-extubation respiratory failure, 28-day mortality, in-hospital mortality, and ventilator-associated pneumonia; low for reintubation within 7 days and treatment switch; and very low for ICU mortality.
The most common reason for downgrading was inconsistency across studies.
Subgroup analysisGiven that NIV may be more effective in patients with hypercapnia, subgroup analyses stratified by baseline PaCO2 (>45mmHg vs ≤45mmHg) were performed. Results are presented in Table S6.
Restriction to low-risk-of-bias trialsTo address heterogeneity, analyses restricted to trials at low risk of bias were conducted. Results are summarized in Table S7.
DiscussionThe aim of this meta-analysis was to compare the efficacy of HFNC and NIV in post-extubation patients at high risk of reintubation. Based on current evidence, no statistically significant differences were observed between groups in terms of reintubation or mortality outcomes.
However, several pooled estimates were imprecise, with wide confidence intervals compatible with both clinically meaningful benefit and harm. These findings likely reflect limited statistical power and between-study heterogeneity rather than true equivalence between treatments. Moreover, variability in mortality endpoints across trials and the overall low incidence of events further limit interpretability. Therefore, the absence of statistically significant differences should not be interpreted as evidence of no clinically relevant effect.
Leave-one-out sensitivity analysis identified the study by Hernández et al. [13] as influential, as its exclusion altered the pooled estimate for post-extubation respiratory failure. This study included patients with a higher burden of risk factors (≥4) and reported the highest reintubation rate among included trials, which may explain its disproportionate impact.
Notably, a post hoc analysis of the Hernández et al., 2016 study [22] suggested that patients with ≥4 risk factors for reintubation may derive greater benefit from prophylactic NIV. This finding highlights the potential importance of patient selection and risk stratification when comparing HFNC and NIV in clinical practice.
Weaning is defined as the gradual reduction of ventilatory support in patients whose underlying cause of respiratory failure is improving [23]. In most studies, weaning failure is defined as the inability to successfully complete a spontaneous breathing trial (SBT) or the need for reintubation within 48h after extubation [24]. However, more recent studies [25] have adopted a more conservative definition of weaning success – namely, the absence of reintubation within 7 days of extubation – thereby capturing events over a longer post-extubation period. This approach may be more rigorous than earlier definitions using shorter timeframes [26], as it reduces the risk of misclassifying patients who require delayed reintubation as successful weaning cases. Accordingly, in the present meta-analysis, both 72-h and 7-day reintubation windows were used to provide a more comprehensive evaluation of the effects of HFNC and NIV.
Because the use of NIV as rescue therapy for established post-extubation respiratory failure has not been shown to improve outcomes [1], clinical focus has shifted toward prevention of respiratory failure and reintubation. Current European Respiratory Society (ERS) recommendations favor the use of NIV over HFNC in patients at high risk of extubation failure, unless contraindications are present [1].
Recent expert consensus suggests that post-extubation respiratory support should be individualized according to patient risk profiles. For example, humidified NIV may be preferred in patients at very high risk or in obese patients with intermediate risk of hypoxemic failure. More broadly, NIV-based strategies may be appropriate for high-risk patients, including those with obesity or hypercapnia at the end of the SBT. Patients at intermediate risk may receive either HFNC or NIV depending on individualized clinical factors, whereas HFNC alone may be sufficient for low-risk patients [27].
Although both HFNC and NIV have demonstrated superiority over conventional oxygen therapy, the optimal noninvasive respiratory support strategy for individual patients remains uncertain [28]. Several predictive models for extubation failure have been proposed; however, their clinical applicability remains limited [25,29]. Furthermore, the commonly used definition of “high risk” based on the presence of ≥1 risk factor may introduce substantial heterogeneity across study populations [29,30].
Certain subgroups of high-risk patients may derive greater benefit from NIV. These include patients identified by Thille et al. [31] (e.g., elderly individuals, obese patients, those receiving prolonged mechanical ventilation ≥7 days, patients with ineffective cough, or those with underlying chronic cardiac or pulmonary disease) and those described by Ferrer et al. [32] (patients who develop hypercapnia at the end of the SBT).
Despite these insights, important knowledge gaps remain. First, it is unclear whether specific risk factors confer differential benefit from distinct noninvasive strategies. Second, although prior studies have suggested synergistic interactions among risk factors for reintubation (e.g., advanced age combined with cardiac or respiratory disease, or secretion burden combined with impaired cough and neurological status) [33], the potential additive effects of multiple risk factors – and their influence on treatment response – remain insufficiently characterized [22].
A recent post hoc analysis [34] evaluated the diagnostic accuracy of several predictive models in 2341 patients enrolled in 5 multicenter randomized trials conducted in Spain [12–14,35,36]. Three models were assessed: a 3-factor model (age >65 years, chronic heart or pulmonary disease), a 4-factor model (adding prolonged mechanical ventilation), and an 11-factor model. Predictive performance was poor across all models, with the 3-factor model showing very low accuracy (Youden index, 0.08; κ=0.04) and the 4-factor and 11-factor models demonstrating only modest improvements (Youden index, 0.12 and 0.16; κ=0.06 and 0.07, respectively).
These findings indicate that simply counting risk factors is insufficient to accurately identify patients at high risk of reintubation. Accordingly, in this meta-analysis, we adopted an alternative definition of high risk – based on an observed reintubation rate ≥10% in either the HFNC or NIV group – rather than relying solely on predefined clinical risk factors. This approach aimed to reduce heterogeneity and more accurately reflect real-world risk when comparing the preventive effects of HFNC and NIV in post-extubation patients.
HFNC offers both clinical advantages (including improved patient comfort and ease of use) and physiological benefits, such as enhanced oxygenation, alveolar recruitment, humidification and heating, improved secretion clearance, and reduction of anatomical dead space [37]. These effects may help prevent deterioration in lung function and reduce the need for tracheal intubation [38]. Compared with conventional oxygen therapy (COT) and NIV, HFNC is generally better tolerated and associated with greater patient comfort. However, its capacity to reduce respiratory muscle workload in acute respiratory failure (ARF) may be inferior to that of NIV [1].
In patients who respond poorly to HFNC or NIV, prolongation of noninvasive respiratory support may delay intubation and increase in-hospital mortality [39]. In such cases, a combined strategy – such as NIV alternating with HFNC – may be more appropriate. Indeed, in mechanically ventilated patients at high risk of extubation failure, the combined use of HFNC and NIV after extubation has been shown to significantly reduce the risk of reintubation compared with HFNC alone [31]. However, the role of combined HFNC+NIV strategies in post-extubation management requires further investigation.
This study fulfills most methodological criteria recommended for systematic reviews and meta-analyses [40]. Nevertheless, several limitations should be acknowledged. First, HFNC and NIV settings varied across studies, potentially influencing outcomes. Second, all included trials were unblinded due to the nature of the interventions, introducing potential performance bias. Third, baseline PaCO2 and pH values differed between studies. Fourth, neither the included trials nor this meta-analysis differentiated between hypercapnic and hypoxemic post-extubation patients. As a result, a potential inferiority of HFNC in hypercapnic patients may have been diluted by apparent equivalence in hypoxemic populations.
Fifth, some studies focused on specific patient subgroups (eg, obese patients or those undergoing cardiothoracic surgery), which may limit external validity. Sixth, the use of a ≥10% reintubation rate threshold to define “high risk” may be arbitrary and insufficiently discriminatory, potentially introducing selection bias and limiting clinical applicability. Seventh, substantial heterogeneity and evidence of publication bias were observed for some outcomes, reducing overall certainty.
Eighth, defining “high risk of reintubation” based on observed event rates in study arms introduces conceptual limitations, as this approach relies on post-randomization outcomes rather than baseline patient characteristics. This may preferentially select studies with poorer outcomes rather than truly high-risk populations at the time of extubation. Ninth, considerable clinical heterogeneity exists across trials with respect to patient populations, indications for mechanical ventilation, underlying respiratory physiology, and extubation contexts. Subgroup analysis based solely on baseline PaCO2 is unlikely to capture this complexity, suggesting that treatment effects may be context-dependent rather than uniform.
ConclusionsCompared with NIV, HFNC was not associated with an increased risk of reintubation or mortality in post-extubation patients at high risk. However, important uncertainties remain regarding the relative effectiveness of HFNC and NIV in different physiological subgroups, particularly hypercapnic vs hypoxemic respiratory failure.
Given variability in definitions and study populations, these findings cannot be directly translated into bedside decision-making for individual patients. Future studies should focus on refining patient selection criteria and evaluating combined strategies, such as HFNC plus NIV, to optimize post-extubation respiratory support.
FundingNo funding was received for this work.
Conflict of interestsNone of the authors has a conflict of interest to declare.
The following are the supplementary data to this article:












