INTRODUCTION

Pulse oximeters are used in many clinical settings and provide a noninvasive means of monitoring oxygen levels. Pulse oximeters also allow for quick decisions at the bedside in critical situations—within ambulances, clinics, and hospitals—where patients and their clinicians depend on timely and accurate information.1 During the COVID-19 pandemic, pulse oximeters became essential tools for monitoring patients at home, informing clinical decision-making about oxygen supplementation, hospitalization, or intensive care.

Despite their utility, pulse oximeters may over- or underestimate a patient’s arterial oxygen saturation (SaO2). Pulse oximeter inaccuracy is especially concerning for patient safety when pulse oximeter readings of peripheral oxygen saturation (SpO2) indicate a normal blood oxygen level while a patient is actually in a hypoxemic state—a situation known as occult hypoxemia. Potential clinical impacts of occult or undetected hypoxemia include delayed or inadequate treatment, premature treatment de-escalation or discharge, and ultimately, greater morbidity and mortality.2,3,4

Inaccurate pulse oximeter readings in patients with darker skin pigmentation have been observed in clinical settings for several decades. The COVID-19 pandemic heightened concern that such inaccuracies may be widespread, routinely placing patients with darker skin pigmentation at risk of occult hypoxemia and adverse clinical outcomes. Recent studies in patients with COVID-19, for example, have shown that disparities between Black and White patients in the receipt of supplemental oxygen support and potentially life-saving treatments such as dexamethasone or remdesivir were associated with differential pulse oximeter performance.2,5 Other studies have found that Black patients with occult hypoxemia were more likely to experience organ failure and had higher odds of mortality compared with White patients with occult hypoxemia.3,6

A widely discussed retrospective study7 published in late 2020 analyzed nearly 50,000 paired SpO2–SaO2 measurements from adults in inpatient or intensive care settings and found that the prevalence of occult hypoxemia was over three times greater among Black patients compared with White patients (11.4% versus 3.4%). Since the publication of this analysis, additional studies have examined whether occult hypoxemia prevalence differs by patient race or ethnicity in large oximetry datasets from contemporary hospital and health system settings. Several studies have also investigated whether racial or ethnic minority patients at greater risk of occult hypoxemia, in turn, have poorer treatment and clinical outcomes than patients whose hypoxemic state was detected more quickly. The aim of this review was to synthesize this evidence to clarify the extent and magnitude of racial and ethnic disparities in the prevalence of occult hypoxemia and its clinical impacts.

METHODS

Reporting of this systematic review and meta-analysis follows 2020 PRISMA guidance.8 A preregistered protocol for the review can be found on the PROSPERO registry (CRD42023402152). This protocol corresponds to a parent review9 requested by health system and clinical leadership of the US Department of Veterans Affairs (VA) and conducted by the VA Evidence Synthesis Program Coordinating Center. The methods and results presented here focus on occult hypoxemia prevalence and related clinical outcomes, as defined in further detail below. Given the use of published aggregate data, ethics approval or oversight of this review were not required.

Data Sources and Searches

A research librarian searched Ovid MEDLINE, Embase, and CINAHL databases for relevant studies published through January 2024. Search strategies were reviewed by the lead author prior to searching. Complete search terms and reference counts for each database are provided in the supplemental materials.

Study Selection

Eligible studies must have been conducted among adults in inpatient or outpatient settings and report prevalence or risk of occult hypoxemia or hypoxemia-related clinical outcomes by patient race, ethnicity, or skin pigmentation level. Studies reporting occult hypoxemia prevalence were required to define occult hypoxemia as, at minimum, arterial oxygen saturation ≤ 88% despite a paired pulse oximeter reading > 88%. Stricter criteria (e.g., pulse oximeter reading > 92%) were permitted. Pulse oximeter readings must have been taken ± 10 min of arterial oxygen saturation to be considered paired. Studies of clinical outcomes associated with occult hypoxemia were eligible only when relationships between occult hypoxemia and eligible clinical outcomes were reported by both patient race or ethnicity and occult hypoxemia status. Studies were not required to test the significance of these associations across race or ethnicity groups (e.g., by including an interaction term in analytic models). Clinical outcomes of interest were (1) timing of treatment eligibility recognition, treatment discontinuation, or discharge; (2) treatment dosage or duration; or (3) in-hospital mortality. We did not include studies that investigated whether these clinical outcomes differed by patient race or ethnicity in general, and studies that induced hypoxemia in a controlled setting were also ineligible.

Titles, abstracts, and full-text articles were independently reviewed and disagreements were resolved by consensus. References of studies excluded during full-text review (with reasons for exclusion) are listed in the supplemental materials.

Data Extraction and Quality Assessment

Patient characteristics, study methodological details, and occult hypoxemia prevalence and clinical outcome data (when reported) were abstracted from all included studies. To facilitate synthesis, reported patient race or ethnicity was coded using one of the following terms: Asian, Black, Indigenous, Latinx, White, multiracial, or other race or ethnicity. When discussing results of individual studies, however, we used the race or ethnicity descriptors employed by the original study authors (e.g., Hispanic rather than Latinx).

Study internal validity (risk of bias) was assessed using the QUIPS tool,10 which is designed for studies of risk or prognostic factors. Potential biases in occult hypoxemia prevalence estimates were captured in the prognostic factor measurement domain of the tool. We supplemented criteria for this domain with considerations specific to prevalence assessment, including the representativeness of the sampling frame and the potential of selection biases arising from the case definition, data collection, or other aspects of the sampling methodology.11 Risk of bias assessment and data abstraction were first completed by one investigator then checked by another. Disagreements were resolved by consensus. Complete risk of bias ratings are provided in the supplemental materials.

Data Synthesis and Analysis

Studies varied in whether occult hypoxemia prevalence was reported in patients or in paired observations. Each type of prevalence estimate was synthesized separately. Prevalence estimates were transformed using the standard logit transformation for analysis, and back-transformed for interpretation and reporting. To facilitate comparison of occult hypoxemia prevalence between race or ethnicity groups, we also calculated and synthesized prevalence ratios.

Prevalence estimates were synthesized using meta-analytic generalized random-effects logistic models. These models were extended to multilevel models when multiple estimates from the same study were pooled. A correlation of 0.9 was assumed for nested estimates. Conventional random-effects models were used to synthesize prevalence ratios as well as adjusted odds ratios of occult hypoxemia that were reported in some studies. All meta-analyses incorporated the Knapp-Hartung method or comparable adjustment to standard errors.12,13 Cluster-robust confidence intervals and degrees of freedom calculated using the Satterwaithe approximation were used in multilevel meta-analysis models.

Heterogeneity in occult hypoxemia prevalence was estimated using (restricted) maximum-likelihood estimation and is presented as 95% prediction intervals. Prediction intervals approximate the range of true effects (e.g., true occult hypoxemia prevalence) across studies, providing an estimate of the magnitude and direction of effects that are likely in future studies similar to those included in a synthesis.14 A prediction interval encompassing values similar to the overall estimate suggests limited heterogeneity, whereas an interval that includes estimates in the same direction as the overall estimate but that vary widely in magnitude (low to high prevalence) suggests moderate heterogeneity. If a prediction interval encompasses estimates that range widely in both magnitude and direction, then substantial heterogeneity is likely present. Prediction intervals were evaluated alongside forest plots (presented in the supplemental materials) to gauge whether estimates included in a given analysis were consistent, moderately inconsistent, or highly inconsistent. Finally, because studies of occult hypoxemia prevalence were fairly homogeneous in methodological and setting characteristics and risk of bias, we did not conduct moderation analyses based on these factors. Sample characteristics (e.g., median patient age) were not examined as potential moderators because of the risk of aggregation bias.14 All meta-analyses were conducted using the metafor15 package for R (R Foundation for Statistical Computing, Vienna, Austria).

After synthesizing available evidence, we rated the certainty of evidence for each outcome based on the methodology and risk of bias of available studies, the consistency and precision of results, and the directness of outcomes (whether reported outcomes are relevant to patients and providers).16,17 We used the following general algorithm: high certainty evidence consisted of multiple, large studies with consistent findings at low risk of bias, and clinically relevant outcomes; moderate certainty evidence consisted of multiple studies with consistent findings at low to moderate risk of bias, and clinically relevant outcomes; low certainty evidence consisted of a single study, or multiple small studies, with moderate to high risk of bias, inconsistent findings, and/or outcomes with limited clinical relevance; and insufficient evidence consisted of a single study with moderate or high risk of bias, or no available studies. Conclusions using likely (e.g., “Occult hypoxemia is likely more common among Black patients compared with White patients”) are based on moderate certainty evidence, while those using may are based on low certainty evidence. When studies were judged to be too disparate in methodological or participant characteristics—or fewer than three comparable studies were available for a given outcome—we described evidence narratively.

Role of the Funding Source

All authors are scientific staff of the US Department of Veterans Affairs, which supported this review. The authors were fully independent and individuals with responsibility for funding decisions had no role in review design; data collection, analysis, or interpretation; or reporting of findings.

RESULTS

Publications included at each stage of screening are shown in Fig. 1. Database searches resulted in 243 potentially relevant articles after deduplication, and of these, 15 studies2,3,6,7,18,19,20,21,22,23,24,25,26,27,28 met eligibility criteria. In total, studies included 732,505 paired oximetry measurements from 207,464 patients and all studies reported prevalence of occult hypoxemia stratified by self-reported race or ethnicity. Four studies2,3,21,26 also examined associations between occult hypoxemia and clinical outcomes by patient race or ethnicity. Characteristics of included studies are summarized in Table 1. Most newly identified studies used data from patients receiving intensive or acute care in hospitals or health systems in the USA. One study19 was limited to patients undergoing anesthesia, two studies3,28 included surgical inpatients, and two studies2,21 were exclusively in patients with COVID-19. No studies were conducted in an outpatient setting.

Figure 1
figure 1

Literature flow diagram. References of studies excluded during full-text review with exclude reasons are provided in the supplemental materials. CINAHL, Cumulative Index to Nursing and Allied Health Literature.

Table 1 Characteristics of Included Studies

Pooled occult hypoxemia prevalence estimates by race or ethnicity groups are presented in Table 2, and corresponding prevalence ratios are shown in Fig. 2. The prevalence of occult hypoxemia among Black patients was 9.5% (95% CI 4.7 to 18.3; N = 40,751), or 67% greater than the prevalence of occult hypoxemia among White patients (pooled prevalence ratio = 1.67, 95% CI 1.47 to 1.90). A larger disparity between Black and White patients was apparent at the observation level (pooled prevalence ratio = 2.31, 95% CI 1.70 to 3.14). Among patients identifying as Asian, Latinx, Indigenous, multiracial, or other race or ethnicity, the prevalence of occult hypoxemia was 8.9% (95% CI 4.0 to 18.4; N = 39,336), corresponding to a 39% greater prevalence of occult hypoxemia compared with White patients (pooled prevalence ratio = 1.39, 95% CI 1.19 to 1.64). Two studies24,27 of occult hypoxemia prevalence at the patient level did not clearly report the duration between paired SpO2 and SaO2 measurements, and one of these studies was also a preprint.24 Results of sensitivity analyses excluding these studies were similar to results of the full analyses (Table 2).

Table 2 Occult Hypoxemia Prevalence by Patient Race or Ethnicity
Figure 2
figure 2

Occult hypoxemia prevalence ratios by patient race or ethnicity. Dashed line corresponds to no difference in prevalence of occult hypoxemia compared with White patients. Sample size (N) does not include White patients (reference group). Black patient-level prevalence ratio excludes data from one small study 22 for which a prevalence ratio could not be calculated (study did not report occult hypoxemia prevalence among White patients). Asian, Latinx, Indig., or Other = Asian, Latinx, Indigenous, multiracial, or other race or ethnicity.

Five studies3,19,20,21,27 of occult hypoxemia prevalence also reported odds of occult hypoxemia adjusted for potential confounders (in most cases, patient demographics, comorbidities, and treatment characteristics such as use of vasopressors) (Table 2). When pooled, Black patients had 84% greater odds of experiencing occult hypoxemia compared with White patients (pooled adjusted odds ratio = 1.84, 95% CI 1.23 to 2.75; N = 12,332). Reported odds ratios from all studies were similar in magnitude and consistent in direction. Four3,19,20,21 studies also included patients identifying as Asian, Latinx, Indigenous, multiracial, or other race or ethnicity, whose odds of occult hypoxemia were also significantly greater than those of White patients (pooled adjusted odds ratio = 1.30, 95% CI 1.02 to 1.66; N = 30,985).

All studies providing data on occult hypoxemia prevalence were rated at moderate risk of biased findings. The most common concerns were unclear detail about the pulse oximeter or CO-oximeter devices used and limitations arising from use of retrospective health record data, including unclear detail about patient characteristics and lack of control over oximetry data collection. One study27 that reported odds of occult hypoxemia adjusted only for patient sex and measured SpO2, in contrast to other studies that accounted for a more comprehensive set of potential confounders. Overall, prevalence estimates were judged to be moderately inconsistent, in that they differ in magnitude across studies but are generally consistent in direction (i.e., higher prevalences among racial or ethnic minority patient groups than among White patients). Considering both prevalence and associational findings together, evidence supporting the conclusion that occult hypoxemia is likely more common among Black patients than among White patients was rated moderate certainty. Evidence supporting the conclusion that patients identifying as Asian, Latinx, Indigenous, multiracial, or other race or ethnicity likely experience occult hypoxemia more frequently than White patients (but not as frequently as Black patients) was also considered moderate certainty.

Four observational studies2,3,6,21 were found that examined whether disparities in occult hypoxemia were associated with elevated risk of adverse clinical or healthcare utilization outcomes. An additional study,26 not described below, reported a causal inference analysis of clinical impacts of differential pulse oximeter measurement error (rather than occult hypoxemia per se) between Black and White patients.

At present, the largest available study of racial and ethnic disparities in clinical outcomes of occult hypoxemia is a US study6 that examined health record data from Black, Hispanic, Asian, and White acute care patients (N = 87,971) seen in 215 hospitals and 382 intensive care units. The prevalence of occult hypoxemia (defined as SaO2 < 88% despite a matched SpO2 > 88%) significantly differed across race or ethnicity groups (p < 0.001) and was highest among Black patients (6.9%; N = 26,032). After adjusting for patient age, sex, and Sequential Organ Failure Assessment (SOFA) score, Black patients with occult hypoxemia experienced significantly shorter average length of stay compared with Black patients without occult hypoxemia (− 3.0 days, p < 0.01). Length of stay for White patients (N = 57,623) with occult hypoxemia was also significantly shorter than for White patients without occult hypoxemia, but by only 0.5 days on average (p < 0.01). Length of stay among Asian (N = 1919) and Hispanic (N = 2397) patients with and without occult hypoxemia also differed by less than 1 day on average. In-hospital mortality was more common in patients experiencing occult hypoxemia regardless of race or ethnicity, with the largest difference among White patients (11.1% greater than White patients without occult hypoxemia, p < 0.001). The study did not test whether associations between occult hypoxemia and clinical outcomes significantly differed across race or ethnicity groups.

A second study3 used health record data from 26,603 patients in intensive care and surgery settings at three US academic medical centers, finding that Black patients had 65% greater odds of experiencing occult hypoxemia than White patients after adjusting for patient sex, treatment characteristics, comorbidities, and SpO2 (adjusted odds ratio = 1.65, 95% CI 1.28 to 2.14). In turn, patients with occult hypoxemia experienced significantly greater in-hospital mortality in both surgical settings (adjusted odds ratio = 2.96, 95% CI 1.20 to 7.28) and intensive care units (adjusted odds ratio = 1.36, 95% CI 1.03 to 1.80). Among surgery patients, occult hypoxemia was also significantly associated with fewer hospital-free days (− 2.5 days, 95% CI − 3.9 to − 1.2). Analyses of in-hospital mortality and hospital-free days were adjusted for patient age, sex, comorbidities, acuity, and setting. An interaction of occult hypoxemia status and patient race or ethnicity was nonsignificant (p-value not reported). Black, Asian, or American Indian patients together made up a relatively small proportion of the patient sample (2110 versus 24,493 White patients).

Another study21 examined clinical outcomes of unrecognized treatment need (defined as an admission SaO2 < 94% despite a matched SpO2 > 94%) among patients hospitalized for COVID-19 at 186 US acute care facilities (N = 8,635 in clinical outcomes analysis). After adjusting for demographic characteristics and potential clinical confounders, Black and Hispanic patients had significantly greater odds of unrecognized need for COVID-19 therapy compared with White patients (Black patients: adjusted odds ratio = 1.46, 95% CI 1.23 to 1.72; Hispanic patients: adjusted odds ratio = 1.18, 95% CI 1.01 to 1.39). Patients with unrecognized treatment need were somewhat less likely to receive COVID-19 therapy than patients with initially recognized treatment need (adjusted hazard ratio = 0.90, 95% CI 0.83 to 0.97) and had significantly greater odds of 30-day hospital readmission (adjusted odds ratio = 2.41, 95% CI 1.39 to 4.18). Average length of stay appeared to be shorter for patients with unrecognized treatment need compared with patients with recognized treatment need (− 1.4 days, 95% CI, − 3.1 to 0.2), but this difference was nonsignificant. In-hospital mortality was similar regardless of treatment recognition (adjusted odds ratio = 0.84, 95% CI 0.71 to 1.01). As in the previous study, an interaction with patient race or ethnicity was nonsignificant (p-values = 0.45 and 0.14, respectively), though median time to receipt of therapy was 2.1 to 4.5 h longer among Black patients with unrecognized treatment need compared with patients of any other race or ethnicity (regardless of whether they had unrecognized or recognized treatment need). Unrecognized treatment need was not significantly associated with in-hospital mortality or length of stay in this study.

Finally, a comparatively small study2 in patients evaluated in the emergency department or hospitalized for COVID-19 in a US health system (N = 1903 in clinical outcomes analysis) examined whether patients predicted to have an SaO2 of 94% or less prior to a measured SpO2 of 94% or less (i.e., an unrecognized hypoxemic state) experienced delayed recognition of treatment eligibility or delayed treatment initiation. Occult hypoxemia (defined as SaO2 < 88% despite a concurrent SpO2 between 92 and 96%) was most prevalent among Black, Hispanic, and Asian patients (28.5–30.2%) compared with White patients (17.2%) (p-values not reported). Failure to recognize eligibility or delayed recognition of eligibility was significantly more likely among both Black patients (adjusted hazard ratio = 0.71, 95% CI 0.63 to 0.80) and Hispanic patients (adjusted hazard ratio = 0.77, 95% CI 0.66 to 0.89) with unrecognized hypoxemia compared with White patients with unrecognized hypoxemia. Analyses were adjusted for demographics, comorbidities, acuity, and laboratory values (e.g., hemoglobin). In the subset of patients eventually recognized as treatment eligible, the median delay to eligibility recognition was about 2 h longer for Black patients and Asian patients compared with Hispanic and White patients (p = 0.01 for Black versus White patients).

Available evidence was considered insufficient to make firm conclusions about disparities in clinical impacts of occult hypoxemia, given that few studies were available and those that were identified differed in outcome definitions, analytic approaches, and setting and sample characteristics. Additionally, studies generally assessed clinical impacts of occult hypoxemia indirectly, by first examining occult hypoxemia prevalences then modeling associations between occult hypoxemia status and clinical outcomes separately. Nonetheless, these studies are fairly consistent in showing that patients with occult hypoxemia have worse outcomes than patients without occult hypoxemia, and they provide suggestive evidence that Black patients with undetected hypoxemia may experience poorer treatment delivery outcomes than White patients with undetected hypoxemia.

DISCUSSION

Occult hypoxemia is likely more common among Black patients compared with White patients. Patients identifying as Asian, Latinx, Indigenous, multiracial, or other race or ethnicity also likely experience occult hypoxemia more frequently than White patients, but less frequently than Black patients. Studies examining differences in clinical outcomes associated with occult hypoxemia are methodologically inconsistent but provide suggestive evidence that Black patients with undetected hypoxemia could experience poorer treatment delivery outcomes than White patients with undetected hypoxemia. Available evidence on disparities in occult hypoxemia prevalence and outcomes is based on oximetry data from pulse oximeters currently in wide use, so it is conceivable that differential pulse oximeter inaccuracy due to patient skin pigmentation level is commonplace in modern healthcare delivery. At the same time, even larger disparities in occult hypoxemia might be expected if pulse oximeters overestimate oxygen saturation among racial and ethnic minority patients by a consistently large magnitude. This suggests the presence of considerable variability in the amount of bias in pulse oximeter readings between (and potentially within) individual patients.

A recent systematic review and meta-analysis29 synthesized pulse oximeter accuracy data from 1990 to the early COVID-19 pandemic and found that the average amount of bias in pulse oximeter readings was somewhat higher for Black patients than for White patients (1.52 versus 0.55 from a total of 18,623 paired observations; see Table 3). Precision, or the variability in the amount of bias across patients, was similar between groups and ranged from 1.55 to 1.68. To compare these findings with the much larger amount of pulse oximeter accuracy data generated in contemporary hospital settings during and after the COVID-19 pandemic, we pooled mean bias and precision data from the 11 included studies that reported these data.6,18,19,20,22,23,24,25,26,27,28 These studies contribute 455,867 new paired oximetry observations from 214,715 patients. Details of our analytic approach to pooling accuracy data are provided in the supplemental materials.

Table 3 Pooled Estimates of Pulse Oximeter Mean Bias and Precision by Patient Race or Ethnicity

As shown in Table 3, although modern pulse oximeters appear to overestimate blood oxygen saturation in Black patients compared with White patients, the magnitude of bias is on average fairly small across race or ethnicity groups. This observation is consistent with findings from the earlier review of pre-COVID-19 evidence.29 However, in contrast with older studies that generally found modest variability is the amount of bias across patients (precision), evidence from more recent and much larger studies indicates that the accuracy of pulse oximeter readings could vary substantially from patient to patient regardless of their race or ethnicity. Moreover, findings from a recent analysis28 of 30,000 paired oximetry measurements from the US Veterans Health Administration suggest there may also be considerable variability in pulse oximeter bias within individual patients. Investigators found that even when Black and White patients were not experiencing occult hypoxemia at their first oximetry reading of a given day, Black patients were more likely to be experiencing occult hypoxemia at a second oximetry reading on the same day (i.e., Black patients’ probability of occult hypoxemia was more variable among same-day oximetry readings than that of White patients).

When pulse oximeters provide faulty data to clinicians, patients are at increased risk of harm—particularly when measurements are falsely reassuring as they are with occult hypoxemia. At present, the US Food and Drug Administration (FDA) recommends pre-market studies used for approval of new pulse oximeter devices include only two darkly pigmented patients (or 15% of the participant sample, whichever is larger).30,31 Moreover, as stated by the FDA, “these clinical studies are neither recommended to enroll demographic groups nor be statistically powered to detect differences in performance between such cohorts”.31 In response to ongoing concerns about disparities in pulse oximeter accuracy and regulatory requirements for pulse oximeter approval, the FDA has convened two public meetings of its Anesthesiology and Respiratory Therapy Devices Panel since late 2022.31,32,33 The Panel released a discussion paper in November 2023 that proposes several steps to improve pre-market pulse oximeter studies.31 These include enrollment of a larger number of participants; ensuring that a minimum number of participants are enrolled across all values of a validated skin tone scale, corroborated by an objective skin pigmentation measurement at the pulse oximeter sensor site; and use of a more sophisticated analytic approach to evaluate oximeter performance data.31 Other recent recommendations have included testing oximeters under real-world healthcare conditions and incorporating perfusion into validation requirements.20,34

In the nearer term, reassessing oxygen saturation in arterial blood more routinely—particularly for patients who show signs or symptoms of arterial hypoxemia—has been proposed as one step to reduce the risk of undetected hypoxemia.28 Another approach suggested to mitigate this risk is to raise the target oxygen saturation range for all patients from 92–96% to 94–98%, though this may increase risk of hyperoxemia and use of supplemental oxygen among patients who may not require it.20 Applying a fixed skin tone-based correction factor to readings from currently available pulse oximeters has also been discussed, but such adjustments have a controversial history35,36,37 and may have limited efficacy because the accuracy and reliability of pulse oximetry readings are influenced by multiple factors that cannot be accounted for in a single correction factor.36,38

Although changes to clinical practice may help to offset the impact of pulse oximeter inaccuracies on hypoxemia detection, it is clear that significant advancements in noninvasive oximeter technology are needed. Improving pulse oximeter technology is an active research area. A recent validation study,39 for example, tested an investigational noninvasive oximeter that uses green rather than the conventional red light, targets superficial skin layers to increase sensitivity to tissue hypoxia, and implements patient-specific skin tone calibration (rather than a fixed correction factor). The study enrolled equal proportions of patients with fair, brown, and dark skin tones based on the Von Luschan Chromatic Scale. Oxygen saturation readings from the novel oximeter were more highly correlated with blood-based oximetry (r = 0.76) than pulse oximeter readings (r = 0.47), and the device was also able to accurately assess oxygen levels in cases in which the pulse oximeter failed, including a patient with very dark skin tone.

As noted above, recent studies that contribute most available occult hypoxemia data report their results by patient-identified race or ethnicity (likely due to the use of patient health record data). A concern with the use of self-identified race or ethnicity—versus objective skin pigmentation level—is that it may introduce spurious variation across studies and lead to unexpected or clinically counterintuitive findings, given that individuals with a wide range of skin pigmentation levels could identify with the same race or ethnicity. The present review shares this limitation, in that categorizing individuals according to self-identified race or ethnicity may obscure variation in occult hypoxemia risk. This limitation is potentially exacerbated by aggregating results for patients identifying as Asian, Latinx, Indigenous, multiracial, or other race or ethnicity for syntheses, which was necessary because of the more limited representation of individuals with these races or ethnicities in available data. Additionally, limits to the extent of oximetry data captured in electronic health records meant that it was not feasible for us to examine whether pulse oximeter device type impacted occult hypoxemia risk, and it is also possible that time-stamps used to define paired observations may be inaccurately recorded in some cases. Despite these limitations, findings of this review underscore that clinicians should be aware of the risk of occult hypoxemia in patients with darker skin pigmentation and the potential of clinically important variability in pulse oximeter accuracy across and within patients.