Abstract
Inequities in pain assessment are well-documented; however, the psychological mechanisms underlying such biases are poorly understood. We investigated potential perceptual biases in the judgments of faces displaying pain-related movements. Across five online studies, 956 adult participants viewed images of computer-generated faces (“targets”) that varied in features related to race (Black and White) and gender (women and men). Target identity was manipulated across participants, and each target had equivalent facial movements that displayed varying intensities of movement in facial action-units related to pain (Studies 1–4) or pain and emotion (Study 5). On each trial, participants provided categorical judgments as to whether a target was in pain (Studies 1–4) or which expression the target displayed (Study 5) and then rated the perceived intensity of the expression. Meta-analyses of Studies 1–4 revealed that movement intensity was positively associated with both categorizing a trial as painful and perceived pain intensity. Target race and gender did not consistently affect pain-related judgments, contrary to well-documented clinical inequities. In Study 5, in which pain was equally likely relative to other emotions, pain was the least frequently selected emotion (5%). Our results suggest that perceivers can utilize facial movements to evaluate pain in other individuals, but perceiving pain may depend on contextual factors. Furthermore, assessments of computer-generated, pain-related facial movements online do not replicate sociocultural biases observed in the clinic. These findings provide a foundation for future studies comparing CGI and real images of pain and emphasize the need for further work on the relationship between pain and emotion.
In the USA, medical providers underassess pain in socially marginalized patients (e.g., women and racial and ethnic minorities; Anderson et al., 2009; Green et al., 2003; LeResche, 2011; Todd et al., 1993). Clinicians and researchers alike typically assess pain using unidimensional self-report measures (e.g., the 0–10 Numerical Rating Scale), which should lead to standardized treatment (Vila et al., 2005). Yet, despite Black and women patients experiencing more pain in the lab and clinic, respectively (Kim et al., 2017; Mogil, 2012), they are nonetheless less likely to receive treatment for pain and receive less potent treatments when they do (Cintron & Morrison, 2006; Mossey, 2011). This indicates that standardized pain measures do not lead to standardized pain treatments.
Researchers have begun to identify potential psychological mechanisms underlying non-standardized pain treatment and assessment (Drwecki et al., 2011; Hoffman et al., 2016). In particular, recent work indicates that sociocultural biases can influence the assessment of nonverbal pain behavior, i.e., pain-related facial movements (Hirsh et al., 2008; Mende-Siedlecki et al., 2019; Zhang et al., 2021) and that multiple sociodemographic factors (e.g., socioeconomic status and race; Summers et al., 2021) can impact the same pain assessment judgment. Although guidelines for nonverbal pain assessment exist for nonverbal patient populations (Payen et al., 2001), there is a lack of standard guidelines to assess nonverbal pain behaviors in verbal patient populations. This is particularly troubling as providers report looking at and integrating patient facial expressions during assessments (Johnson, 1977) and they believe facial expressions are more informative of their patients’ internal state compared to verbal self-reports (Poole & Craig, 1992). Thus, pain-related facial expression assessment is fundamental to pain evaluation and might provide insights on inequities in pain assessment.
Interestingly, although clinical pain assessments are biased against Black and women patients, findings from experimental assessments of facial expressions of pain have been mixed. Because most databases of pain-related facial expressions in actual patients are homogeneous (Dildine & Atlas, 2019), researchers use images of actors or computer-generated faces to manipulate identity (i.e., race and gender) and test whether sociodemographic factors impact pain assessment. While some studies observe biases that are consistent with disparities in pain assessment seen in clinical samples (Mende-Siedlecki et al., 2019; Zhang et al., 2021), others observe more pain attributed to faces depicting Black (Hirsh et al., 2008, 2009) or women patients (Hirsh et al., 2008, 2009; Martel et al., 2011; Robinson & Wise, 2003) compared to White and men patients, respectively.
One important question is whether the expressions displayed in prior studies are ecologically valid and displayed by individuals experiencing actual pain. A recent systematic review of clinical and laboratory studies (Kunz et al., 2019) examined whether acute and chronic pain was associated with consistent movements in specific facial regions, or “action units” (AUs), based on the Facial Action Coding System (FACS; Ekman & Friesen, 1978). Pain was most often associated with brow lowering (AU4), nose wrinkling (AU9, 10), orbital tightening (AU6, 7), and jaw dropping (AU25, 26, 27). While a subset of these units overlap with those that were manipulated in previous studies (Hirsh et al., 2008), no studies have measured how coordinated movements of these regions are interpreted by perceivers and whether sociodemographic features impact assessment. We created computer-generated faces that display coordinated movements across these facial muscles and vary in apparent race and gender. Our goal was to test whether perceivers exhibit sociocultural biases in pain assessment based solely on facial expressions and cues related to sociodemographic features, without further contextual information (i.e., vignettes).
We evaluated whether the intensity of facial movement impacts pain assessments. Manipulating facial expression activation provides insight into the granularity of information with which perceivers can categorize and estimate pain and into whether biases in nonverbal pain assessment are stable or depend on the level of pain expression. In addition, we used a between-groups manipulation to avoid social desirability biases (i.e., response monitoring and demand characteristics). Previous studies have observed effects of social desirability on performance in within-subjects designs when including more than one controversial factor (e.g., Black and White faces; Muslim and Christian vignettes; Walzenbach, 2019) compared to between-subjects designs that only show one factor. Furthermore, decision-making tasks that involve faces from multiple races can engage conflict monitoring (i.e., hyper-awareness of errors) and elicit differential neural activity to Black compared to White faces with the same trial design (Amodio, 2014; Amodio et al., 2008). As previous studies used within-subjects designs to explore the impact of social factors on pain assessment (Hirsh et al., 2008, 2009; Mende-Siedlecki et al., 2019), our goal was to use a between-subjects design to minimize social desirability and conflict monitoring and test whether clinical inequities in pain assessment replicate in experimental contexts.
A second goal of the present study was to measure how pain evaluation relates to emotion recognition. Pain is defined as having both sensory and emotional components and overlaps with emotion in many ways (Gilam et al., 2020). Notably, action units associated with pain (Kunz et al., 2019) overlap with other emotions (see Table 1). For example, AU4 is associated with pain, anger, fear, and sadness, and AU26 is associated with disgust, happiness, pain, and surprise. Although studies have evaluated whether perceivers could distinguish between pain and other basic emotions (Kappesser & Williams, 2002; Roy et al., 2015; Simon et al., 2008) or between pain and other negative affective states (e.g., pain and disgust; Antico et al., 2019; Dirupo et al., 2020), participants were presented with only one exemplar or with exemplars from the same race and/or gender; therefore, we do not know if perceivers bias their assessments by target race or the intersection between target race and gender. We aimed to assess not only the relationship between pain and emotion, but also whether the assessment of pain or emotion differs by target race, target gender, or their intersection. Furthermore, studies of emotion recognition indicate that basic emotions are recognized more accurately when assessed by perceivers who share an identity with the target (e.g., same race; Elfenbein & Ambady, 2002), often referred to as an in-group advantage. Perceivers also tend to misattribute stereotyped emotions to outgroup faces (e.g., seeing anger in neutral Black faces; Hugenberg & Bodenhausen, 2003) and are less confident and slower when rating outgroup faces (Beaupre & Hess, 2006). Previous studies also indicate that patients who feel similar to their providers experience greater pain relief and hold lower pain expectations in simulated (Losin et al., 2017; Necka et al., 2021) and real clinical environments (Nazione et al., 2019; Street et al., 2008).
However, we do not know how similarity impacts pain assessments from the perceiver’s position. Therefore, a third aim of our study was to measure how well perceivers can distinguish pain from other emotions on the basis of facial expressions and whether pain recognition and confidence in decisions varies as a function of our sociocultural factors: target sociodemographic features, group status, or similarity.
Our goals were to examine how participants assess facial expressions of pain and whether pain assessment is biased by sociocultural factors. We also explored how pain assessment relates to emotion recognition. In four studies, participants viewed faces that varied in the magnitude of pain-related facial movement based on empirical studies (Kunz et al., 2019), and in a fifth study participants viewed pain-related expressions as well as basic emotions. In each study, we manipulated the apparent sociodemographic background of the face in a between-subjects design. We controlled the movement of each face, such that movements were matched across sociodemographic subgroups and were restricted to the action units established in Kunz et al. (2019). We used a single combination of these action units (see Table 1) and made collective changes in their activation (i.e., there was no variability of expression activation within the set of action units) to assess the effect of facial expression activation on pain assessment outcomes. We hypothesized that increased pain-related facial movement activation would be associated with higher likelihoods of rating pain and higher estimated pain intensity. We also expected these effects to vary by facial identity, such that participants would attribute pain more often and with higher intensity in men and White targets relative to women and Black targets, respectively. We anticipated that these differences would be most pronounced in participants who did not share group membership or similarity with the face target. Finally, we explored the relationship between pain assessment and the assessment of other emotions.
Method
Participants
We conducted five online studies between January 2020 and February 2022. Nine hundred sixty-five participants in total (see Table 2 for sample demographics) provided consent as approved under the National Institutes of Health’s Office of Human Subjects Research Protection (18-NCCIH-0622). Enrollment was limited to participants at least 18 years of age who were currently living in the USA, to ensure participants shared similar societal and cultural contexts. All participants were reached through CloudResearch (formerly TurkPrime; Litman et al., 2017). The task (see Fig. 1 for task schematic) was completed online to increase the diversity and generalizability of our sample (Huff & Tingley, 2015), and recent work has shown CloudResearch’s participant pools exhibit stable demographics, even during the COVID-19 pandemic (Moss et al., 2020). Participants were paid at a rate based on federal minimum wage (i.e., $7.25 per hour). We also implemented the following checks to decrease the potential for non-human (i.e., bot) or poor-quality responders: participants needed at least a 90% approval rating across a minimum of 100 previously completed online studies, participants could not be on CloudResearch’s “suspicious geolocation” or “bad participants” list (e.g., someone who changes their sociodemographic features across tasks is flagged as a bad participant), and participants could not complete our task more than once.
Stimuli
All participants completed the experiment via Pavlovia (Peirce & MacAskill, 2018) on either Chrome, Firefox, or Opera web browsers. Participants (“perceivers”) viewed computer-generated photos of human faces (“targets”). Previous studies have created pain-related, computer-generated faces using FACEGen (Hirsh et al., 2008) or FACSGen (Antico et al., 2019) software. We created our images using FaceGen Modeller Pro v3.2 (Singular Inversions, Toronto, CA). Similar to Hirsh and colleagues (2008), we utilized Ekman’s Facial Action Coding System (FACS; Ekman & Friesen, 1978) to define our faces. However, we updated which facial movements to manipulate based on a recent systematic review (Kunz et al., 2019). The targets displayed different magnitudes of facial movement in regions commonly associated with painful stimulation based on FACS, including brow lowering (AU4), jaw dropping (AU25, 26, 27), nose wrinkling (AU9, 10), and orbital tightening (AU6, 7). To increase the ecological validity of the facial movements, we incorporated two additional facial regions, the inner and outer brow (AU1/2) and corner lip pull (AU12), that have been shown to co-occur naturally with the pain-related facial regions of interest (Fabian Benitez-Quiroz et al., 2016; Zhao et al., 2016). Furthermore, because the intensity of an emotional facial expression can impact how well it is recognized (Hess et al., 1997) and because spontaneous facial expressions of emotions can be less prototypical and recognized less accurately than posed emotions (Krumhuber et al., 2021; Tcherkassof et al., 2007), we varied the intensity of facial movements in our stimuli rather than presenting only maximal expressions. This allowed us to evaluate the association between variations in target facial movement and perceived pain in each analysis.
In Studies 1–5, expressions were mapped onto four composite faces: White man, White woman, Black man, and Black woman (see Fig. 2). Faces in each study showed equivalent movements across sociodemographic features and took up approximately 30% of the height and 20% of the width of each participant’s screen resolution. The studies were completed online with unfiltered images. This prevented us from computing the visual angle of the stimuli for each of our online participants and from parsing out potential influences of spatial frequency in our task. However, similar to most studies of pain expression assessment, we presented all stimuli upright and in a front-facing orientation (i.e., we did not apply any vertical or horizontal rotation to the stimuli).
In Studies 1–3, the set of action-unit regions was increased collectively from zero (i.e., neutral position) to full activation (99%; i.e., full muscle activation) in 9% increments per image, for a total of 12 images per face (i.e., 0%, 9%, 18%, …, 99%; see Fig. 2).
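As a quick illustrative sketch (not the authors' stimulus-generation code), the 12 activation levels just described can be enumerated as:

```python
# Activation levels for Studies 1-3: 0% (neutral position) to 99%
# (full muscle activation) in 9% increments, 12 images per face.
levels = list(range(0, 100, 9))  # 0, 9, 18, ..., 99
print(levels)
```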
In Study 4, expressions were mapped onto three exemplars for each of our four sociodemographic groups (i.e., 12 total exemplars). Each exemplar was generated from a photo of a unique individual who participated in an in-person study of pain at NCCIH (ClinicalTrials.gov Identifier: NCT03258580), self-identified with the sociodemographic sub-group and consented to sharing their facial data for use in future studies and for sharing with participants outside of NIH. We used FaceGen Modeller Pro to generate computerized versions of each individual and manipulate the same AUs and intensities as Studies 1–3. Example images are displayed in Fig. 2.
Study 5 incorporated the same four composite faces as Studies 1–3, but in addition to pain-related movements, faces were manipulated to exhibit movements associated with each of the “basic” emotions (“anger,” “disgust,” “fear,” “happiness,” “sadness,” and “surprise”). Action units for the basic emotions were chosen based on the canonical presentations from Keltner et al. (2019). Each expression was presented at 20%, 50%, and 80% activation for a total of 21 images; we also included one neutral image, which was presented three times to match the number of presentations of the other expressions.
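The resulting Study 5 trial structure (seven expressions crossed with three activation levels, plus three presentations of the neutral image) can be sketched as follows; this is an illustrative enumeration, not the authors' experiment code:

```python
from itertools import product

# Seven expressions (six basic emotions plus pain) at three activation
# levels, plus three presentations of a neutral (0%) image.
expressions = ["anger", "disgust", "fear", "happiness", "sadness", "surprise", "pain"]
activations = [20, 50, 80]
trials = list(product(expressions, activations)) + [("neutral", 0)] * 3
print(len(trials))  # 24 trials in total
```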
General Procedures
In each study, participants provided consent and then received task instructions (see Supplementary Methods, Participant Instructions, for exact language). Each participant was randomly assigned to view only one of the sociodemographic face types (e.g., Black female face) in a between-groups design to decrease the likelihood that participants would think the task was about race or gender bias, which can alter behavior (Amodio et al., 2008, 2006; Devine et al., 2002). Participants performed practice trials with images of their assigned sociodemographic face type to confirm their understanding of the procedures before starting the task.
Specific Procedures
Studies 1–4: Following the practice, participants completed 36 trials of a pain assessment task in which twelve facial configurations (see Fig. 2) were each presented three times across the task. In Studies 1–3, participants saw a single identity 36 times, whereas in Study 4, participants saw three exemplars from the same sociodemographic group 12 times each (36 total trials). On each trial, the participant saw a target face and made a binary choice to report whether the target they viewed was in pain or not (see Fig. 1a). The target face was shown simultaneously with the options of “pain” and “no pain,” and perceivers had 5 s to make their rating. If the perceiver rated the target face as displaying “pain,” they then had 5 s to rate the target’s pain intensity. Pain intensity was rated on a continuous scale anchored with “No pain at all” at the lower end and “Extreme pain” at the upper end (see Fig. 1a). For convenience, the scale was treated as a 0–7 continuous visual analogue scale in analyses, but there were no numbers displayed during the task (see Fig. 1a). The target face remained on the screen while the perceiver made their judgment. If a participant did not rate in the allotted time (5 s), the trial was not included in analyses. If the perceiver selected “no pain,” they then reported which emotion the target expressed: options included “anger,” “disgust,” “fear,” “happy,” “sad,” “surprise,” “other,” and “neutral” (see Fig. 1a). The emotion categories were presented in the same order on each trial in Studies 1–3 and randomized in Study 4. Participants were allotted a few extra seconds to respond in Study 4 (8 s total) to account for the randomization.
Following 36 trials of the pain assessment task, the perceiver answered a free-response question probing what parts of the face they used to make their pain rating decisions. This question was used to probe for bots in Study 2 and Study 3; however, most participants left this question blank (Study 2 = 58% of participants; Study 3 = 63% of participants). Participants who left this question blank were still included in analyses. To improve our ability to identify suspicious responses, we used a multiple-choice option for Study 4, which asked which type of stimuli subjects were being asked to rate. All participants answered this question correctly.
We also collected ratings of perceived similarity. Perceivers rated how similar they believed the target face they viewed was to themselves on a continuous scale anchored from “Not at all similar” to “Extremely similar.” Participants were not primed to rate similarity based on any particular criteria. In Study 4, participants provided similarity ratings for each of the three facial identities that they viewed. Finally, participants self-reported their age, gender, race, and ethnicity.
Study 5
Study 5 had a similar structure to Studies 1–4 but included six emotions (anger, disgust, fear, happy, sad, and surprise) in addition to pain. Each expression type was presented at three activation levels (20%, 50%, and 80% activation) across the task. We also included three neutral expressions (0% activation). We again used a single target face for each sociodemographic group (i.e., the same exemplars as Studies 1–3).
Following the practice trials, participants completed 24 trials in which participants first categorized the target’s expression and then provided ratings of intensity and confidence (see Fig. 1c). The emotion categories were randomized each trial, and participants had 8 s to make each decision. As in Studies 1–4, intensity ratings were collected using a continuous visual analogue scale anchored with “Not at all intense” at the lower end and “Extremely intense” at the upper end, which was treated as a continuous 0–7 scale in analyses, although there were no numbers displayed during the task. The target face remained on the screen while the perceiver made their intensity judgment (5 s). Following the intensity rating, participants were shown the emotion category they had chosen and rated confidence in their categorization on a continuous scale anchored with “Not at all confident” at the lower end and “Extremely confident” at the upper end (5 s). On trials in which participants selected “other” rather than a specific expression, they did not provide intensity or confidence ratings and instead had 10 s to provide a free text response to the prompt, “Please type 1–2 words of what you believe this face is expressing.” Trials on which participants failed to respond were not included in analyses. Similar to Study 4, we included a multiple-choice attention check, which asked which type of stimuli subjects were being asked to rate. All participants answered this question correctly.
Statistics and Analyses
Preregistration and Power Analyses
Sample size for Study 1 was selected by running a power analysis in G*Power (Erdfelder et al., 1996) to estimate the number of participants needed for a fully powered study to observe bias in pain categorization. We used the effect size reported for the experiments with computer-generated faces by Mende-Siedlecki and colleagues (2019; f = 0.8) with alpha set at 0.05 and power set at 80%. Studies 2 through 4 were preregistered at AsPredicted (Study 2, https://aspredicted.org/q22q7.pdf; Study 3, https://aspredicted.org/za5t5.pdf; Study 4, https://aspredicted.org/PF1_GQN) and powered based on the effects from the preceding studies with the same alpha and power parameters. We powered Study 5 based on a significant Target Gender (between) by Emotion (within) interaction, with a weighted effect size f(V) of 0.3246, from Studies 1–3 (Study 4 and Study 5 occurred simultaneously). We preregistered and included our power analysis for the study at AsPredicted (https://aspredicted.org/VPK_ZWK). We did not remove outliers from analyses in any study. To take advantage of our multiple studies and to streamline our results, we deviated from our preregistration by using meta-analyses to assess the reliability and consistency of our effects (described below) rather than our initial plan to use Bayesian statistics within each study. Preregistered secondary analyses (e.g., reaction time, post-task qualitative questions) may be analyzed separately in future work.
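As a rough cross-check of this kind of power computation, the required per-group sample size for a two-group contrast can be approximated in a few lines. This is a hedged, normal-approximation sketch (not G*Power's exact noncentral-F solution, which gives slightly different numbers); treating the reported effect size as Cohen's f = 0.8 and converting it to d = 2f = 1.6 for two equal groups are assumptions of the example.

```python
from math import erf, sqrt

def phi(x):
    """Standard normal CDF."""
    return 0.5 * (1 + erf(x / sqrt(2)))

def power_two_group(n_per_group, d, z_alpha=1.959964):
    # Normal-approximation power for a two-sided two-sample comparison
    # at alpha = .05 (z_alpha is the two-sided 5% critical value).
    se = sqrt(2 / n_per_group)
    return phi(d / se - z_alpha)

d = 1.6  # assumed Cohen's d equivalent of f = 0.8 for two equal groups
n = 2
while power_two_group(n, d) < 0.80:  # target 80% power
    n += 1
print(n)  # smallest per-group n under these assumptions
```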
Data Processing
We downloaded each perceiver’s data as CSV files from Pavlovia and imported the data to R v3.6.1 (R Core, Vienna, Austria). We evaluated catch questions to identify suspicious responses or bots. No participants were excluded from any study. Trials on which subjects failed to respond (MStudy1 = 0.13 trials; MStudy2 = 0.54 trials; MStudy3 = 0.86 trials; MStudy4 = 0.37 trials; MStudy5 = 0.24 trials) were excluded from analyses. Based on the correspondence between a perceiver’s self-identified race and gender and the sociodemographic category of the target they viewed, we computed two group membership measures. We created a group membership by gender variable, in which an individual was designated as either ingroup (perceiver shared the same gender as the target) or outgroup (perceiver did not share the same gender as the target). Similarly, we created a group membership by race variable with the same designations of ingroup or outgroup status.
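The group-membership coding just described can be illustrated with a minimal sketch; the field names and dictionary representation are hypothetical, not the authors' data format:

```python
# Hypothetical sketch: derive ingroup/outgroup status from the match
# between a perceiver's self-identified features and the target's category.
def group_membership(perceiver, target):
    return {
        "race": "ingroup" if perceiver["race"] == target["race"] else "outgroup",
        "gender": "ingroup" if perceiver["gender"] == target["gender"] else "outgroup",
    }

m = group_membership({"race": "Black", "gender": "woman"},
                     {"race": "Black", "gender": "man"})
print(m)  # {'race': 'ingroup', 'gender': 'outgroup'}
```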
Statistical Analyses: Studies 1–4
Studies 1–4 were analyzed using the same procedures. Because each study used the same basic design with subtle variations (e.g., stimuli, sample size, period of data collection), we used meta-analyses to make inferences across the studies. Complete results from individual studies are reported in Supplementary Tables (S1–S13).
Multilevel Models
We used multilevel models to assess the effect of facial expression activation on pain categorization and pain intensity estimates and whether target race, gender, group membership, or similarity had an impact on these effects. We evaluated maximal models (Barr et al., 2013) using both the lmer and glmer functions in the lme4 package in R (Bates et al., 2015). Although our similarity measure allowed for more nuance compared to group membership (i.e., a perceiver can have varying levels of similarity towards targets that may be equivalent by group membership), our factors for similarity and group membership had high multicollinearity; therefore, we evaluated each factor in separate models.
We used logistic multilevel models in each study to assess the odds that a trial was rated as painful or not painful based on expression intensity (i.e., whether greater facial expression activation increases the odds of rating a trial as painful) and whether any of our sociocultural measures impacted this relation. If a participant did not have at least three trials on which they observed pain and three trials on which they observed no pain, they were excluded from the multilevel logistic models (final nStudy1 = 87, nStudy2 = 160, nStudy3 = 257, nStudy4 = 226). We also used multilevel linear models to assess the effect of facial muscle movement (“ExpressionActivation”) on intensity ratings on trials rated as painful in each study and to evaluate whether sociocultural measures impacted this relationship. For each multilevel model, we verified the homoscedasticity of our residuals. We also report the full model formulas in the Supplementary Methods and model betas and significance values for each of our predictors in the Supplementary Tables.
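The core within-subject question in these logistic models (does greater activation increase the odds of a "pain" response?) can be illustrated with a simplified sketch. The paper's actual analyses used multilevel models via glmer in R; the example below is a plain single-level logistic regression fit by gradient ascent on simulated trials, with all values invented for illustration.

```python
import math
import random

random.seed(1)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Simulate 500 trials where the probability of a "pain" categorization
# rises with facial expression activation (0.00-0.99, as in Studies 1-3).
levels = [i * 0.09 for i in range(12)]
x = [random.choice(levels) for _ in range(500)]
y = [1 if random.random() < sigmoid(-2 + 6 * xi) else 0 for xi in x]

# Fit intercept (a) and slope (b) by gradient ascent on the logistic
# log-likelihood; a positive fitted slope mirrors the positive
# activation effect described in the text.
a, b = 0.0, 0.0
for _ in range(2000):
    resid = [yi - sigmoid(a + b * xi) for xi, yi in zip(x, y)]
    a += 0.01 * sum(resid)
    b += 0.01 * sum(r * xi for r, xi in zip(resid, x))
```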
Meta-analyses
We originally preregistered Bayesian analyses for our studies; however, as Studies 1–4 shared the same overall structure and asked the same question, we used meta-analyses to assess the reliability of our effects and focus on inferences that were consistent across samples and subtle task variations. Meta-analyses were implemented using the “metagen” function within the “meta” package for R (Balduzzi et al., 2019). Standardized betas and standard errors were taken from each multilevel model to compute meta-analyses across Studies 1–4 in R for each factor. Betas, 95% confidence intervals, and associated p values computed across Studies 1–4 are reported for each factor and represented in forest plots created with the forest function within the meta package in R.
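The pooling step behind this kind of meta-analysis can be sketched as a fixed-effect inverse-variance average (metagen additionally provides random-effects estimates and heterogeneity statistics); the betas and standard errors below are invented for the example, not the paper's values.

```python
import math

# Hypothetical per-study standardized betas and their standard errors.
betas = [0.30, 0.41, 0.28, 0.35]
ses = [0.10, 0.12, 0.09, 0.11]

# Fixed-effect inverse-variance pooling: weight each study by 1/SE^2.
weights = [1 / se**2 for se in ses]
pooled = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
pooled_se = math.sqrt(1 / sum(weights))
ci = (pooled - 1.96 * pooled_se, pooled + 1.96 * pooled_se)
print(round(pooled, 3), tuple(round(v, 3) for v in ci))
```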
Statistical Analyses: Study 5
We used repeated measures ANOVAs to estimate perceived intensity and reported confidence in our fifth study. ANOVAs were implemented with the aov_car function within the afex package (Singmann et al., 2015) in R. As we had a limited number of trials, we ran separate models for our within-subjects factors (i.e., emotion type and facial activation). We ran a 2 (gender: Women, Men; between) × 2 (race: White, Black; between) × 8 (emotion: Anger, Disgust, Fear, Happiness, Neutral, Pain, Sadness, Surprise; within) repeated measures ANOVA and a 2 (gender: Women, Men; between) × 2 (race: White, Black; between) × 3 (facial activation: low [20%], medium [50%], high [80%] expression; within) repeated measures ANOVA. We used the emmeans package in R (Lenth et al., 2019) to run Tukey HSD tests on any factor that was significant in our model. All post hoc comparisons are included in Supplementary Tables (S14–S17). We analyzed separate models for our two dependent measures, perceived intensity and confidence.
We also computed a confusion matrix of emotion recognition as a function of category to assess how often individuals’ perceptions matched the putative emotion category of the expression. Finally, we computed hit rates following the frameworks of Wagner (1993) and Kappesser and Williams (2002). Simple hit rates (h1) were the proportion of trials on which the putative emotion category was correctly identified, sensitivity hit rates (h2) were the proportion of correct responses among trials on which the category was chosen, and unbiased hit rates (hu) were the product of these two hit rates.
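These three hit rates can be illustrated on a small hypothetical confusion matrix; the counts are invented for the example, with keys of the form (presented category, chosen category):

```python
# Hypothetical confusion counts: (presented, chosen) -> number of trials.
confusion = {
    ("pain", "pain"): 10, ("pain", "anger"): 25, ("pain", "disgust"): 15,
    ("anger", "anger"): 40, ("anger", "pain"): 5, ("anger", "disgust"): 5,
}

def hit_rates(confusion, category):
    presented = sum(n for (p, _), n in confusion.items() if p == category)
    chosen = sum(n for (_, c), n in confusion.items() if c == category)
    correct = confusion.get((category, category), 0)
    h1 = correct / presented  # simple hit rate (Wagner, 1993)
    h2 = correct / chosen     # sensitivity hit rate
    return h1, h2, h1 * h2    # unbiased hit rate hu = h1 * h2

h1, h2, hu = hit_rates(confusion, "pain")
print(h1, h2, hu)
```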
Results
Perceivers Attribute Pain to Faces Displaying Pain-Related Movements
We first examined the proportion of trials on which participants perceived the target to be in pain, relative to no pain. On average, perceivers rated pain on approximately half of the trials (total trials = 36) in each study (MStudy1 = 51%, SDStudy1 = 17%; MStudy2 = 54%, SDStudy2 = 21%; MStudy3 = 54%, SDStudy3 = 20%; MStudy4 = 48.8%, SDStudy4 = 17%). Analyses of responses on no-pain trials are reported below (“The relationship between pain and other emotions depends on experimental context”).
We used multilevel logistic regressions (implemented in “glmer”) to analyze whether the magnitude of pain-related facial expression activation impacted pain recognition (i.e., pain vs. no pain judgments). We first examined the first-level (i.e., within-subjects) associations between target facial movement and the likelihood of labeling a target as being in pain. Meta-analysis of intercepts from Studies 1–4 revealed a significant positive intercept (SMD = 0.33, 95% CI [0.12, 0.54], p = 0.02; Fig. 3), such that perceivers rated the average pain-related facial expression activation as painful. Furthermore, across Studies 1–4, we observed a significant positive slope for the association between facial expression activation and pain categorization (SMD = 7.93, 95% CI [6.68, 9.17], p < 0.001; Fig. 3), such that perceivers were more likely to categorize an image as painful at more active facial expressions in all studies. Full model results for the meta-analyses are reported in Supplementary Table S1 and each individual study’s outcomes are reported in Supplementary Tables (Study 1, S2–S4; Study 2, S5–S7; Study 3, S8–S10; Study 4, S11–S13).
Pain-Related Facial Muscle Activation Is Positively Associated with Perceived Pain Intensity
We used multilevel linear models to assess the association between the magnitude of pain-related facial muscle activation and estimated pain intensity on trials rated as painful. Meta-analysis of intercepts across Studies 1–4 revealed a significant positive intercept (SMD = 4.39, 95% CI [4.02, 4.77], p < 0.001; Fig. 3), such that perceivers rated the average pain-related facial expression activation at 63% between anchors of “no pain” and “extreme pain” (i.e., 4.39 on our 0–7 scale for scoring). Furthermore, across Studies 1–4, we observed a significant positive association between facial expression activation and perceived pain intensity (SMD = 4.44, 95% CI [3.11, 5.77], p = 0.002), such that larger facial movements were rated as more intense.
Sociocultural Factors Do Not Reliably Affect Pain Categorization or Perceived Pain Intensity
Our key questions were whether target gender, race, group membership, or perceived similarity affected the influence of facial movement activation on pain categorization or perceived pain. To evaluate these questions, we examined interactions between first-level (within-subjects) and second-level (i.e., between-subjects) factors in both logistic and linear models. Figure 4 depicts results of meta-analyses of logistic models evaluating the association between sociocultural factors and the likelihood of categorizing a trial as painful. Meta-analysis of intercepts across Studies 1–4 revealed no effects of sociodemographic factors on the likelihood of rating pain at the average facial expression (all p’s > 0.05). Furthermore, across Studies 1–4, we did not observe any effects of sociodemographic features on the association between facial muscle movement and pain categorization (all p’s > 0.05). Full model results are located in Supplementary Table S1.
Figure 5 depicts meta-analyses of the relationships between sociodemographic factors and the association between facial movements and perceived pain intensity on trials rated as painful (i.e., linear models). Meta-analyses across Studies 1–4 revealed no consistent relationship between sociodemographic factors and the intercept (i.e., pain intensity at the average facial expression; all p’s > 0.05) or the slope (i.e., the association between facial activation and perceived pain; all p’s > 0.05).
The Relationship Between Pain and Other Emotions Depends on Experimental Context
The results reported above focus on Studies 1–4, in which targets displayed varying intensities of movement in facial muscles previously associated with pain (Kunz et al., 2019). Importantly, on trials that were rated as not painful, perceivers categorized which other emotion they observed. In Studies 1–4, the emotions that comprised the greatest percentage of trials were “Neutral” (MStudy1 = 14.5%, SDStudy1 = 9.8%; MStudy2 = 12.4%, SDStudy2 = 9.7%; MStudy3 = 14.4%, SDStudy3 = 10.1%; MStudy4 = 16.3%, SDStudy4 = 7.9%), “Surprise” (MStudy1 = 11.9%, SDStudy1 = 9.6%; MStudy2 = 9.7%, SDStudy2 = 9.4%; MStudy3 = 8.9%, SDStudy3 = 8.7%; MStudy4 = 6.1%, SDStudy4 = 5.4%), and “Anger” (MStudy1 = 10.1%, SDStudy1 = 10.3%; MStudy2 = 8.7%, SDStudy2 = 11.4%; MStudy3 = 7.8%, SDStudy3 = 11.2%; MStudy4 = 12.1%, SDStudy4 = 12.3%). For complete results, see Fig. 6.
Because Studies 1–4 included only pain-related expressions and may have primed participants to judge pain, we conducted a fifth study in which pain was equally likely to be expressed as other emotions. We used repeated-measures ANOVAs to analyze effects of our experimental manipulations on both perceived intensity and perceiver confidence. Results are reported in Table 3. In a model that included target emotion, target race, and target gender, we observed main effects of target emotion, F(4, 783) = 134.5, p < 0.001, ηG² = 0.23, and target race, F(1, 185) = 4.89, p = 0.03, ηG² = 0.02, on intensity ratings (see Fig. 7a–c). Post hoc analyses with Tukey’s HSD tests indicated that participants attributed higher intensity ratings to stimuli that depicted pain (M = 5.45, SD = 1.62) compared to other emotion categories (Mrange = [3.6–4.84], SDrange = [1.43–1.73]), as depicted in Fig. 7a. For a full consideration of post hoc comparisons in perceived intensity by target emotion, see Supplemental Table S14. Perceivers also reported greater intensity in White targets (M = 4.50, SD = 1.65) compared to Black targets (M = 4.23, SD = 1.65; see Fig. 7b). There was no effect of target gender on perceived intensity (p > 0.6), and there were no significant interactions between any factors (all p’s > 0.1). When we included target facial expression activation (target activation) as a factor rather than target emotion, we observed a main effect of target activation, F(2, 308) = 260.0, p < 0.001, ηG² = 0.33, such that perceivers attributed higher intensity ratings to more expressive faces (M80% = 5.40, SD80% = 1.35) compared to less expressive faces (M20% = 3.50, SD20% = 1.43; see Fig. 7c). In the rmANOVA with target activation, the effect of target race was no longer significant (p = 0.06), and there were no other main effects or interactions (all p’s > 0.05). For complete results, see Table 3 and post hoc comparisons in Supplemental Table S15.
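The core of a repeated-measures ANOVA of the kind reported above is an F ratio that removes between-subject baseline differences before testing the condition effect. A minimal one-way sketch in numpy with fabricated ratings (20 hypothetical subjects × 3 conditions); a real analysis would add sphericity corrections and generalized eta-squared:

```python
import numpy as np

def rm_anova_oneway(data):
    """One-way repeated-measures ANOVA.

    data: (n_subjects, n_conditions) array of ratings.
    Returns (F, df_effect, df_error).
    """
    n, k = data.shape
    grand = data.mean()
    ss_cond = n * np.sum((data.mean(axis=0) - grand) ** 2)   # condition effect
    ss_subj = k * np.sum((data.mean(axis=1) - grand) ** 2)   # subject baselines
    ss_total = np.sum((data - grand) ** 2)
    ss_error = ss_total - ss_cond - ss_subj                  # residual
    df_cond, df_error = k - 1, (n - 1) * (k - 1)
    return (ss_cond / df_cond) / (ss_error / df_error), df_cond, df_error

rng = np.random.default_rng(1)
# 20 hypothetical subjects rate 3 conditions; condition 3 is rated higher.
base = rng.normal(4.0, 0.5, size=(20, 1))                    # subject baselines
ratings = base + rng.normal(0, 0.3, size=(20, 3)) + np.array([0.0, 0.0, 1.0])
F, df1, df2 = rm_anova_oneway(ratings)
```

Because subject baselines are partialled out, the error term reflects only within-subject variability, which is what gives the repeated-measures design its power.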
When we used similar repeated-measures ANOVAs to examine effects of sociodemographic factors and target emotion category on confidence ratings (see Fig. 7d–f), we observed main effects of target emotion, F(6, 1108) = 46.6, p < 0.001, ηG² = 0.09, and target gender, F(1, 185) = 4.0, p = 0.05, ηG² = 0.01. There were no effects of target race (p > 0.8) nor any interactions between factors (all p’s > 0.07). Post hoc comparisons indicated that perceivers were most confident when rating pain (M = 5.78, SD = 1.35), neutral (M = 5.71, SD = 1.26), and happiness (M = 5.86, SD = 1.14), relative to all other emotion categories (Mrange = [4.98–5.68], SDrange = [1.25–1.42]; see Fig. 7d). Perceivers also reported higher confidence in assessments of targets depicting men (M = 5.61, SD = 1.32) compared to targets depicting women (M = 5.39, SD = 1.30). For a full consideration of post hoc comparisons in perceived confidence by emotion category, see Supplemental Table S16. When we replaced target emotion category with target activation to measure the effects of facial expression intensity on confidence, we observed a main effect of target activation, F(2, 428) = 84.9, p < 0.001, ηG² = 0.11, such that confidence varied as a function of facial expression intensity (see Fig. 7f). The effect of target gender remained significant in this model (p = 0.03), and no other main effects or interactions were present (all p’s > 0.05). For a full comparison of confidence by target activation, see Table 3; for post hoc comparisons, see Supplemental Table S17.
Although pain expressions elicited high intensity ratings and reported confidence, a confusion matrix reveals that pain was among the most frequently confused emotion categories (see Fig. 8). Pain stimuli were perceived as pain expressions only 7.2% of the time and were confused for anger 70% of the time (for comparison, recognition rates for the other emotions on their own trials were: anger, 52%; disgust, 11%; fear, 20%; happiness, 87%; neutral, 89%; sadness, 68%; and surprise, 77%). Although we did not statistically evaluate the effect of facial expression activation on expression confusion due to power limitations, we have included confusion matrices limited to each of the activation levels in “Supplementary information” (see Supplementary Figure S1). Visual inspection suggests that pain-related movements are perceived as anger mainly at the 50% and 80% facial expression activations. Across all emotions, there were more misattributions at lower activation levels (see Supplementary Figure S1), but pain was never recognized as pain on more than 10% of trials, regardless of intensity. Similarly, pain expression stimuli showed the lowest accuracy (i.e., lowest unbiased hit rate) of all emotion categories (see Table 4).
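The unbiased hit rate used in Table 4 corrects raw accuracy for response bias: each category’s correct responses are squared and divided by the product of how often the category was presented and how often it was chosen. A small sketch with a made-up confusion matrix:

```python
import numpy as np

def unbiased_hit_rate(confusion):
    """Unbiased hit rate per category: hits^2 / (row_total * column_total).

    confusion[i, j] = number of trials on which category i was presented
    and category j was chosen.
    """
    confusion = np.asarray(confusion, float)
    hits = np.diag(confusion)
    row = confusion.sum(axis=1)       # times each category was presented
    col = confusion.sum(axis=0)       # times each category was chosen
    return hits**2 / (row * col)

# Hypothetical 3-category matrix: category 2 is chosen often for everything,
# so its raw hit rate (8/10) overstates how diagnostic those choices are.
conf = np.array([[7, 1, 2],
                 [1, 6, 3],
                 [1, 1, 8]])
hu = unbiased_hit_rate(conf)
```

In this toy matrix, category 2’s raw hit rate (8/10) exceeds category 0’s (7/10), but because category 2 is chosen so often overall, its unbiased hit rate is lower, illustrating how the metric penalizes indiscriminate responding.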
Discussion
We investigated how pain-related facial movements influence pain assessment and whether perceivers’ ratings depend on target or perceiver sociodemographic features. We observed that ecologically valid movements were recognized as pain, that sociocultural factors did not consistently affect perceivers’ recognition and estimation of pain, and that pain expressions were less recognizable than other emotion categories.
Pain-Related Facial Activations Are Positively Associated with Pain Recognition and Intensity Estimation
Stimuli in previous literature on the recognition of facial expressions of pain typically consist of cartoons (Wong et al., 2005), actors portraying pain expressions (Mende-Siedlecki et al., 2019), or facial images chosen from a large set by other perceivers (Chen et al., 2018). These faces are important for understanding what types of pain expressions people expect to see and portray, and although some studies have verified that these expectations match real expressions (e.g., Chen et al., 2018), it is unknown whether socially marginalized patients exhibit such expressions in the clinic. We instead based our stimulus expressions on a recent systematic review (Kunz et al., 2019) of pain-related facial movements (brow lowering, nose wrinkling, jaw dropping, and orbital tightening) and observed that our perceivers were nonetheless capable of recognizing and estimating pain from these movements.
It is important to note that in our first four studies, perceivers recognized pain in our faces on a majority of trials, but that recognition depended on facial movement activation. Images with less than 50% facial expression activation had lower odds of being perceived as in pain. This is consistent with studies on basic emotions, which find that emotions are more easily decoded from more intense facial expressions (Hess et al., 1997), and suggests that perceivers may struggle to recognize pain when a person exhibits subtle or suppressed pain-related facial movements. Once participants labeled a target as painful, we observed that facial activation was positively associated with estimated pain intensity. Furthermore, as in other modalities and pain decisions, in which more sensory information elicits higher confidence (Dildine et al., 2020), perceivers reported higher levels of confidence in their ratings on trials with more facial activation. Our results provide empirical evidence that individuals can rate pain-related nonverbal responses in a graded manner and can engage in metacognitive processes during these decisions.
Target Gender, Race, Group Status, and Perceived Similarity Had Inconsistent Effects on Pain Assessment but Influenced Intensity and Confidence in Emotion Judgments
Although we hypothesized differences in pain outcomes as a function of social factors, we did not observe consistent pain assessment biases as a function of sociocultural measures. These findings stand in contrast to the important clinical literature on pain disparities on which we based our hypotheses (i.e., that pain tends to be underassessed in Black patients in the clinic; Green et al., 2003), but we note that some clinical studies have also failed to observe sociocultural influences on pain. For instance, 43% of the studies in one review of pain and ethnicity (Cintron & Morrison, 2006) did not report disparities in pain assessment and management. We also note that data for at least one sub-study (Study 2) were collected during a period of social unrest in the United States (Buchanan et al., 2020), when Black pain and suffering were broadcast widely (Dave et al., 2020). This timing and social context might have primed participants differently than in studies conducted before the unrest.
Interestingly, although we did not observe consistent influences of race or gender on pain assessment, we did observe an influence of target race on the perceived intensity of emotion and an influence of target gender on confidence in emotion judgments. These effects were present across emotion categories and were not specific to any particular emotion. Prior studies indicate that sociodemographic features typically do not influence emotion categorization but do elicit biases in evaluations or behaviors associated with the categories (e.g., reaction time; Bijlstra et al., 2010). Sociodemographic features, and how they relate to the perceiver, can alter the interpretation of emotions and influence one’s responses to them (Lazerus et al., 2016; Weisbuch & Ambady, 2008). However, most prior research focuses on differences in implicit metrics (e.g., reaction time). Our results suggest that sociodemographic features can also alter explicit features of emotion recognition, namely perceived intensity and confidence. We build on previous work comparing pain and emotion decisions (Blais et al., 2019; Dirupo et al., 2020; Kappesser & Williams, 2002; Roy et al., 2015; Simon et al., 2008; Wang et al., 2021) by observing that intensity ratings were biased by target race and confidence ratings were influenced by target gender. Future studies of pain assessment should explore whether stereotypes have different impacts on these different types of judgments.
Perceivers Rarely Identify Pain and Confuse It for Other Emotions when Provided with Many Emotion Categories
Although pain expressions were associated with high levels of confidence and elicited intensity ratings that tracked facial movement, pain was consistently confused for other emotions in our final study. Each emotion, including pain, was presented on 12.5% of trials (3 trials total per emotion). On average, participants labeled putatively pain-related movements as pain only 7.2% of the time (Mtrials = 1.72), whereas movements related to happiness were recognized as happiness on 87% of presentations, anger was recognized on 52% of presentations, and surprise, a supposedly ambiguous emotion (Neta et al., 2020), was recognized on 77% of trials. Instead, pain-related movements were most likely to be perceived as anger (70% of trials). Interestingly, the facial action units most frequently associated with anger (AUs 4, 5, 17, 23, and 24) overlap less with pain (AUs 4, 6, 7, 9, 10, 25, 26, and 27; Kunz et al., 2019) than those of other emotions (e.g., AUs associated with disgust: 7, 9, 25, and 26). Indeed, in previous studies examining pain and emotion decisions, pain has most often been confused with disgust (Kappesser & Williams, 2002), as one would predict based on AU overlap; however, the same study also observed that anger expressions were the second most likely emotion (after pain) to be labeled as pain. Pain and anger are both high in negative valence and arousal and thus overlap in the affective circumplex (Russell, 1980). Furthermore, three of the five action units that commonly appear during anger are on the lower part of the face (AU17 chin raiser, AU23 lip tightener, and AU24 lip pressor). Although these specific action units were not part of our pain-related faces, we did include the movement of multiple action units around the mouth (AU25/26/27). This is dissimilar to previous studies of pain-related facial movement, which did not include these action units or any action units around the mouth (Prkachin, 1992).
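The action-unit overlap described above can be made concrete with set arithmetic over the AU lists given in the text:

```python
# Action units listed in the text for each expression category
pain = {4, 6, 7, 9, 10, 25, 26, 27}
anger = {4, 5, 17, 23, 24}
disgust = {7, 9, 25, 26}

overlap_anger = pain & anger        # only AU4 is shared
overlap_disgust = pain & disgust    # four units are shared

def jaccard(a, b):
    """Set similarity: shared units over total distinct units."""
    return len(a & b) / len(a | b)

# Pain-anger similarity is far lower than pain-disgust similarity,
# which is why the frequent pain-to-anger confusion is surprising.
low, high = jaccard(pain, anger), jaccard(pain, disgust)
```

This asymmetry is the quantitative version of the puzzle discussed above: on morphological grounds alone, disgust, not anger, should be the dominant confusion for pain.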
In the present study, we also may have observed lower levels of recognition because several negative emotion categories were presented (consistent with the lower recognition of negative affective states in previous studies of pain and emotion; Roy et al., 2015), whereas we presented only one positive emotion (happiness), which had the highest recognition rate. However, other negative emotions were recognized at higher rates than pain, so there may be something uniquely ambiguous about pain-related expressions, particularly when pain is presented alongside other emotions. These results suggest that, without priming (i.e., the “pain” or “no pain” question used in Studies 1–4), pain-related facial movements are difficult to perceive as pain, and that such movements may be hard to decode when the stimulus is devoid of other cues (e.g., body movement), as observed in previous studies (Aviezer et al., 2012). This stands in stark contrast to Studies 1–4, in which most trials were perceived as painful. In those initial studies, perceivers were primed with an initial pain or no pain categorization question; however, participants were still able to select other emotions. Thus, future studies should continue work on the relationship between pain and other emotional expressions (Chen et al., 2018), as well as on how context and priming may shape the assessment of pain and emotion.
Limitations
Our study adds to the literature by measuring the impact of facial movements that were associated with acute or chronic pain in previous literature (Kunz et al., 2019). Yet because most studies of facial responses to pain have been restricted to White and Western samples, and because our selection of facial movements was based on these studies, we do not know whether the facial movements that we presented generalize across sociocultural groups. Future research should measure pain assessment in images of diverse individuals experiencing actual pain, which would improve ecological validity and help determine whether the action units currently associated with pain generalize across diverse samples. However, studies of real individuals in pain must account for variations in pain experience and expression across individuals, which is why computer-generated stimuli offer some advantages. Our stimuli also lacked hair, clothing, and other potential social status indicators. Although this leads to a more controlled comparison of gender and has been incorporated in previous research (Oosterhof & Todorov, 2008), it may also decrease potential interactions between status or context and gender that may amplify bias (Freeman et al., 2011; Hirsh et al., 2008). Incorporating additional cues may be important in future work, as only a few studies have considered whether culture influences pain expressions (Chen et al., 2018). The interaction between culture, context, and pain-related expressions is important to understand, as expressiveness might vary across cultures and/or as a function of patient-provider interactions, and individuals may try to counteract negative stereotypes (e.g., of medication seeking) when seeking medical care. Future research should examine how sociocultural factors may affect a patient’s nonverbal pain behaviors and how these behaviors affect providers’ decisions; it is possible that biases in the clinic are driven in part by the healthcare context and social interaction.
Our stimuli presented uniform movement intensities across action units, which prevented us from isolating potential facial regions and movements that may be under- or overutilized in assessing pain. The eight action units used in our study, combined into four distinct regions by Kunz and colleagues, were the most common and the only regions consistently active above baseline in the review by Kunz et al. (2019); however, pain-related expressions are accompanied by individual differences in action-unit activation (Kunz et al., 2019), and these regions of action units are not always active during pain (Kunz et al., 2014). Future research should independently manipulate not only the four sub-regions but all eight action units, to measure each action unit’s unique contribution to pain assessment and to test for potential interactions.
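Crossing the four sub-regions independently, as proposed above, grows the stimulus set quickly. A sketch of the full factorial using the activation levels from the present studies; the region names are illustrative labels, not stimulus-generation parameters from the paper:

```python
from itertools import product

regions = ["brow_lowering", "orbital_tightening",
           "nose_wrinkling", "jaw_dropping"]
levels = [0.2, 0.5, 0.8]   # the 20%, 50%, and 80% activations reported above

# Every combination of per-region activation levels: 3^4 = 81 configurations,
# versus only 3 when all regions move together at the same intensity.
stimuli = list(product(levels, repeat=len(regions)))
n_configurations = len(stimuli)
```

Extending the crossing to all eight action units would yield 3^8 = 6,561 configurations, which is why such designs typically require either subsampling or adaptive stimulus selection.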
Finally, we ran our study online with a general sample population that acted as a proxy for medical providers. Future studies should investigate the role of nonverbal assessments and potential influences of sociocultural factors in medical providers, who have been shown to endorse false beliefs about Black patients’ pain (Hoffman et al., 2016). Nonetheless, laypeople also endorse stereotypes and false beliefs, and thus we believe our data provide important insights. It is important to note that our sample was skewed, with a majority of participants identifying as White. Similar skews also exist in the makeup of medical professionals (Ly & Jena, 2021). This leads to skew in our ingroup/outgroup race variable, which overrepresents White targets as ingroup race and Black targets as outgroup race. Although we did not observe different results when restricting ingroup and outgroup analyses to Black perceivers, future studies should use recruitment strategies designed to obtain a more balanced sample. In addition, perceivers who did not identify as either Black or White were always categorized as outgroup race. Including more exemplars and identities beyond the typically used Black and White categories would increase the generalizability of our studies. Furthermore, future studies would benefit from delineating factors like similarity that may arise from multiple and individualized criteria (e.g., similarity scores may be driven by a belief in shared personality and/or a belief in shared appearance). Finally, future studies should include potential moderators, like perceivers’ endorsement of racial bias, time spent in the USA, and community diversity, to help identify why sociocultural biases in pain assessment may arise.
Conclusions
Our results suggest that perceivers can recognize and assess nonverbal pain-related facial movements. Although perceivers require a certain level of facial movement to consistently recognize pain, they are able to utilize pain-related facial movements to recognize and estimate pain intensity. Furthermore, increased facial expression activation increased recognition of emotions and confidence in perceiver judgments. However, without priming or contextual cues, perceivers rarely identified pain-related facial expressions and confused them with other emotions. Finally, we observed inconsistent effects of target gender, race, group status, and perceived similarity on pain assessments, similar to prior experimental paradigms investigating pain assessment biases. Overall, more work is needed to understand how assessment biases in the clinic form in order to move toward interventions and more equitable healthcare, and future studies should further investigate the relationship between pain assessment, emotion recognition, and the psychosocial context surrounding pain and its treatment.
Data Availability
All data and scripts are available at https://osf.io/xjvae/?view_only=bb82d8d5ffb74f6fb216cdaba6530612.
References
Adams, R. B., Nelson, A. J., Soto, J. A., Hess, U., & Kleck, R. E. (2012). Emotion in the neutral face: A mechanism for impression formation? Cognition and Emotion, 26(3), 431–441. https://doi.org/10.1080/02699931.2012.666502
Alawadhi, S. A., Ohaeri, J. U., Alayarian, A., Anderson, S. R., Gianola, M., Perry, J. M., & Moore, D. (2019). A pilot study on perceived stress and PTSD symptomatology in relation to four dimensions of older women’s physical health. Quality of Life Research : An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 20(1), 214–221. https://doi.org/10.1111/j.1365-2982.2010.01516.x
Amodio, D. M., Devine, P. G., & Harmon-Jones, E. (2008). Individual differences in the regulation of intergroup bias: The role of conflict monitoring and neural signals for control. Journal of Personality and Social Psychology, 94(1), 60–74. https://doi.org/10.1037/0022-3514.94.1.60
Amodio, D. M., Kubota, J. T., Harmon-Jones, E., & Devine, P. G. (2006). Alternative mechanisms for regulating racial responses according to internal vs external cues. Social Cognitive and Affective Neuroscience, 1(1), 26–36.
Amodio, D. M. (2014). The neuroscience of prejudice and stereotyping. Nature Reviews Neuroscience, 15(10), 670–682.
Anderson, K. O., Green, C. R., & Payne, R. (2009). Racial and ethnic disparities in pain: Causes and consequences of unequal care. Journal of Pain, 10(12), 1187–1204. https://doi.org/10.1016/j.jpain.2009.10.002
Antico, L., Cataldo, E., & Corradi-Dell’Acqua, C. (2019). Does my pain affect your disgust? Cross-modal influence of first-hand aversive experiences in the appraisal of others’ facial expressions. European Journal of Pain, 23(7), 1283–1296.
Aviezer, H., Trope, Y., & Todorov, A. (2012). Body cues, not facial expressions, discriminate between intense positive and negative emotions. Science, 338(6111), 1225–1229.
Balduzzi, S., Rücker, G., & Schwarzer, G. (2019). How to perform a meta-analysis with R: A practical tutorial. Evidence-Based Mental Health, 22(4), 153–160.
Barr, D. J., Levy, R., Scheepers, C., & Tily, H. J. (2013). Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, 68(3), 255–278. https://doi.org/10.1016/j.jml.2012.11.001
Barrett, L. F., Adolphs, R., Marsella, S., Martinez, A. M., & Pollak, S. D. (2019). Emotional expressions reconsidered: Challenges to inferring emotion from human facial movements. Psychological Science in the Public Interest, 20(1), 1–68.
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using {lme4}. Journal of Statistical Software, 67(1), 1–48. https://doi.org/10.18637/jss.v067.i01
Beaupré, M. G., & Hess, U. (2006). An ingroup advantage for confidence in emotion recognition judgments: The moderating effect of familiarity with the expressions of outgroup members. Personality and Social Psychology Bulletin, 32(1), 16–26.
Berry, S. M., Carlin, B. P., Lee, J. J., & Muller, P. (2010). Bayesian adaptive methods for clinical trials. CRC Press.
Bijlstra, G., Holland, R. W., & Wigboldus, D. H. (2010). The social face of emotion recognition: Evaluations versus stereotypes. Journal of Experimental Social Psychology, 46(4), 657–663.
Blais, C., Fiset, D., Furumoto-Deshaies, H., Kunz, M., Seuss, D., & Cormier, S. (2019). Facial features underlying the decoding of pain expressions. The Journal of Pain, 20(6), 728–738.
Buchanan, L., Bui, Q., & Patel, J. K. (2020). Black Lives Matter may be the largest movement in US history. The New York Times, 3.
Bürkner, P.-C. (2017). brms: An R package for Bayesian multilevel models using Stan. Journal of Statistical Software, 80(1), 1–28.
Chen, C., Crivelli, C., Garrod, O. G. B., Schyns, P. G., Fernández-Dols, J.-M., & Jack, R. E. (2018). Distinct facial expressions represent pain and pleasure across cultures. Proceedings of the National Academy of Sciences, 115(43), E10013–E10021. https://doi.org/10.1073/pnas.1807862115
Cintron, A., & Morrison, R. S. (2006). Pain and ethnicity in the United States: A systematic review. Journal of Palliative Medicine, 9(6), 1454–1473. https://doi.org/10.1089/jpm.2006.9.1454
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Lawrence Erlbaum Associates.
Cooper, L. A., Roter, D. L., Johnson, R. L., Ford, D. E., Steinwachs, D. M., & Powe, N. R. (2003). Patient-centered communication, ratings of care, and concordance of patient and physician race. Annals of Internal Medicine, 139(11), 907–915.
Dave, D. M., Friedson, A. I., Matsuzawa, K., Sabia, J. J., & Safford, S. (2020). Black lives matter protests and risk avoidance: The case of civil unrest during a pandemic. National Bureau of Economic Research Working Paper Series.
Devine, P. G., Plant, E. A., Amodio, D. M., Harmon-Jones, E., & Vance, S. L. (2002). The regulation of explicit and implicit race bias: The role of motivations to respond without prejudice. Journal of Personality and Social Psychology, 82(5), 835.
Dildine, T. C., & Atlas, L. Y. (2019). The need for diversity in research on facial expressions of pain. Pain, 160(8). https://doi.org/10.1097/j.pain.0000000000001593
Dildine, T. C., Necka, E. A., & Atlas, L. Y. (2020). Confidence in subjective pain is predicted by reaction time during decision making. Scientific Reports, 10(1), 1–14.
Dirupo, G., Corradi-Dell’Acqua, C., Kashef, M., Debbané, M., & Badoud, D. (2020). The role of interoception in understanding others’ affect. Dissociation between superficial and detailed appraisal of facial expressions. Cortex, 130, 16–31.
Drwecki, B. B., Moore, C. F., Ward, S. E., & Prkachin, K. M. (2011). Reducing racial disparities in pain treatment: The role of empathy and perspective-taking. Pain, 152(5), 1001–1006. https://doi.org/10.1016/j.pain.2010.12.005
Ekman, P., & Friesen, W. V. (1978). Facial action coding system. Consulting Psychologists Press.
Elfenbein, H. A., & Ambady, N. (2002). Is there an in-group advantage in emotion recognition? Psychological Bulletin, 128(2), 243–249.
Erdfelder, E., Faul, F., & Buchner, A. (1996). GPOWER: A general power analysis program. Behavior Research Methods, Instruments, & Computers, 28(1), 1–11. https://doi.org/10.3758/BF03203630
Fabian Benitez-Quiroz, C., Srinivasan, R., & Martinez, A. M. (2016). Emotionet: An accurate, real-time algorithm for the automatic annotation of a million facial expressions in the wild. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5562–5570).
Freeman, J. B., Penner, A. M., Saperstein, A., Scheutz, M., & Ambady, N. (2011). Looking the part: Social status cues shape race perception. PLoS ONE, 6(9), e25107.
Gabry, J., & Goodrich, B. (2016). rstanarm: Bayesian applied regression modeling via Stan. R package version 2.10.0.
Gilam, G., Gross, J. J., Wager, T. D., Keefe, F. J., & Mackey, S. C. (2020). What is the relationship between pain and emotion? Bridging Constructs and Communities. Neuron, 107(1), 17–21.
Green, C. R., Anderson, K. O., Baker, T. A., Campbell, L. C., Decker, S., Fillingim, R. B., & Vallerand, A. H. (2003). The unequal burden of pain: confronting racial and ethnic disparities in pain. Pain Med, 4(3), 277–294.
Hess, U., Blairy, S., & Kleck, R. E. (1997). The intensity of emotional facial expressions and decoding accuracy. Journal of Nonverbal Behavior, 21(4), 241–257.
Hirsh, A. T., George, S. Z., & Robinson, M. E. (2009). Pain assessment and treatment disparities: A virtual human technology investigation. Pain, 143(1–2), 106–113. https://doi.org/10.1016/j.pain.2009.02.005
Hirsh, A. T., Alqudah, A. F., Stutts, L. A., & Robinson, M. E. (2008). Virtual human technology: Capturing sex, race, and age influences in individual pain decision policies. Pain, 140(1), 231–238. https://doi.org/10.1016/j.pain.2008.09.010
Hirsh, A. T., Hollingshead, N. A., Matthias, M. S., Bair, M. J., & Kroenke, K. (2014). The influence of patient sex, provider sex, and sexist attitudes on pain treatment decisions. The Journal of Pain, 15(5), 551–559. https://doi.org/10.1016/j.jpain.2014.02.003
Hoffman, K. M., Trawalter, S., Axt, J. R., & Oliver, M. N. (2016). Racial bias in pain assessment and treatment recommendations, and false beliefs about biological differences between blacks and whites. Proceedings of the National Academy of Sciences, 113(16), 4296–4301. https://doi.org/10.1073/pnas.1516047113
Huff, C., & Tingley, D. (2015). “Who are these people?” Evaluating the demographic characteristics and political preferences of MTurk survey respondents. Research & Politics, 2(3), 2053168015604648.
Hugenberg, K., & Bodenhausen, G. V. (2003). Facing prejudice: Implicit prejudice and the perception of facial threat. Psychological Science, 14(6), 640–643. https://doi.org/10.1046/j.0956-7976.2003.psci_1478.x
Johnson, M. (1977). Assessment of clinical pain. In Pain: A sourcebook for nurses and other health professionals (pp. 451–476).
Kappesser, J., & Williams, A. C. D. C. (2002). Pain and negative emotions in the face: Judgements by health care professionals. Pain, 99(1–2), 197–206. https://doi.org/10.1016/s0304-3959(02)00101-x
Kassambara, A. (2020). ggpubr: ‘ggplot2’ based publication ready plots. R package.
Keltner, D., Sauter, D., Tracy, J., & Cowen, A. (2019). Emotional expression: Advances in basic emotion theory. Journal of Nonverbal Behavior, 43(2), 133–160.
Kim, H. J., Yang, G. S., Greenspan, J. D., Downton, K. D., Griffith, K. A., Renn, C. L., & Dorsey, S. G. (2017). Racial and ethnic differences in experimental pain sensitivity: Systematic review and meta-analysis. Pain, 158(2), 194–211. https://doi.org/10.1097/j.pain.0000000000000731
Kruschke, J. K. (2011). Bayesian assessment of null values via parameter estimation and model comparison. Perspectives on Psychological Science, 6(3), 299–312.
Krumhuber, E. G., Küster, D., Namba, S., Shah, D., & Calvo, M. G. (2021). Emotion recognition from posed and spontaneous dynamic expressions: Human observers versus machine analysis. Emotion, 21(2), 447.
Kunz, M., Meixner, D., & Lautenbacher, S. (2019). Facial muscle movements encoding pain—a systematic review. Pain, 160(3), 535–549.
LaVeist, T. A., & Nuru-Jeter, A. (2002). Is doctor-patient race concordance associated with greater satisfaction with care? Journal of Health and Social Behavior, 43(3), 296–306.
Lazerus, T., Ingbretsen, Z. A., Stolier, R. M., Freeman, J. B., & Cikara, M. (2016). Positivity bias in judging ingroup members’ emotional expressions. Emotion, 16(8), 1117.
Lenth, R., Singmann, H., Love, J., Buerkner, P., & Herve, M. (2019). emmeans: Estimated marginal means, aka least-squares means. R package.
LeResche, L. (2011). Defining gender disparities in pain management. Clinical Orthopaedics and Related Research, 469(7), 1871–1877.
Litman, L., Robinson, J., & Abberbock, T. (2017). TurkPrime.com: A versatile crowdsourcing data acquisition platform for the behavioral sciences. Behavior Research Methods, 49(2), 433–442. https://doi.org/10.3758/s13428-016-0727-z
Losin, E. A. R., Anderson, S. R., & Wager, T. D. (2017). Feelings of clinician-patient similarity and trust influence pain: Evidence from simulated clinical interactions. The Journal of Pain, 18(7), 787–799.
Ly, D. P., & Jena, A. B. (2021). Trends in diversity and representativeness of health care workers in the United States, 2000 to 2019. JAMA Network Open, 4(7), e2117086–e2117086.
Makowski, D., Ben-Shachar, M. S., Chen, S. H., & Lüdecke, D. (2019). Indices of effect existence and significance in the Bayesian framework. Frontiers in Psychology, 10, 2767.
Martel, M. O., Thibault, P., & Sullivan, M. J. L. (2011). Judgments about pain intensity and pain genuineness: The role of pain behavior and judgmental heuristics. The Journal of Pain, 12(4), 468–475.
Mende-Siedlecki, P., Lin, J., Ferron, S., Gibbons, C., Drain, A., & Goharzad, A. (2021). Seeing no pain: Assessing the generalizability of racial bias in pain perception. Emotion, 21(5), 932–950.
Mende-Siedlecki, P., Qu-Lee, J., Backer, R., & Van Bavel, J. J. (2019). Perceptual contributions to racial bias in pain recognition. Journal of Experimental Psychology. General, 148(5), 863–889. https://doi.org/10.1037/xge0000600
Mogil, J. S. (2012). Sex differences in pain and pain inhibition: Multiple explanations of a controversial phenomenon. Nature Reviews Neuroscience, 13(12), 859–866. https://doi.org/10.1038/nrn3360
Moss, A. J., Rosenzweig, C., Robinson, J., & Litman, L. (2020). Demographic stability on Mechanical Turk despite COVID-19. Trends in Cognitive Sciences, 24(9), 678–680.
Mossey, J. M. (2011). Defining racial and ethnic disparities in pain management. Clinical Orthopaedics and Related Research, 469(7), 1859–1870. https://doi.org/10.1007/s11999-011-1770-9
Nazione, S., Perrault, E. K., & Keating, D. M. (2019). Finding common ground: Can provider-patient race concordance and self-disclosure bolster patient trust, perceptions, and intentions? Journal of Racial and Ethnic Health Disparities, 6(5), 962–972. https://doi.org/10.1007/s40615-019-00597-6
Necka, E. A., Amir, C., Dildine, T. C., & Atlas, L. Y. (2021). Expectations about pain and analgesic treatment are shaped by medical providers’ facial appearances: Evidence from five online clinical simulation experiments. Social Science & Medicine, 114091.
Neta, M., Berkebile, M. M., & Freeman, J. B. (2020). The dynamic process of ambiguous emotion perception. Cognition and Emotion, 1–8.
Oosterhof, N. N., & Todorov, A. (2008). The functional basis of face evaluation. Proceedings of the National Academy of Sciences, 105(32), 11087–11092.
Payen, J. F., Bru, O., Bosson, J. L., Lagrasta, A., Novel, E., Deschaux, I., & Jacquot, C. (2001). Assessing pain in critically ill sedated patients by using a behavioral pain scale. Critical Care Medicine, 29(12), 2258–2263. https://doi.org/10.1097/00003246-200112000-00004
Peirce, J., & MacAskill, M. (2018). Building experiments in PsychoPy. Sage.
Pitkin Derose, K., Hays, R. D., McCaffrey, D. F., & Baker, D. W. (2001). Does physician gender affect satisfaction of men and women visiting the emergency department? Journal of General Internal Medicine, 16(4), 218–226. https://doi.org/10.1046/j.1525-1497.2001.016004218.x
Poole, G. D., & Craig, K. D. (1992). Judgments of genuine, suppressed, and faked facial expressions of pain. Journal of Personality and Social Psychology, 63(5), 797.
Prkachin, K. M. (1992). The consistency of facial expressions of pain: A comparison across modalities. Pain, 51(3), 297–306.
Robinson, M. E., & Wise, E. A. (2003). Gender bias in the observation of experimental pain. Pain, 104(1–2), 259–264.
Roy, C., Blais, C., Fiset, D., Rainville, P., & Gosselin, F. (2015). Efficient information for recognizing pain in facial expressions. European Journal of Pain, 19(6), 852–860.
Rukavina, S., Sachsenweger, F., Jerg-Bretzke, L., Daucher, A. E., Traue, H. C., Walter, S., & Hoffmann, H. (2018). Testosterone and its influence on emotion recognition in young, healthy males. Psychology, 9(07), 1814.
Russell, J. A. (1980). A circumplex model of affect. Journal of Personality and Social Psychology, 39(6), 1161.
Samulowitz, A., Gremyr, I., Eriksson, E., & Hensing, G. (2018). “Brave men” and “emotional women”: A theory-guided literature review on gender bias in health care and gendered norms towards patients with chronic pain. Pain Research and Management, 2018, 6358624. https://doi.org/10.1155/2018/6358624
Simon, D., Craig, K. D., Gosselin, F., Belin, P., & Rainville, P. (2008). Recognition and discrimination of prototypical dynamic expressions of pain and emotions. Pain, 135(1–2), 55–64.
Singmann, H., Bolker, B., Westfall, J., Aust, F., & Ben-Shachar, M. S. (2015). afex: Analysis of factorial experiments. R package version 0.13-145.
Spiegelhalter, D. J., Freedman, L. S., & Parmar, M. K. B. (1994). Bayesian approaches to randomized trials. Journal of the Royal Statistical Society: Series A (statistics in Society), 157(3), 357–387.
Street, R. L., Jr., O’Malley, K. J., Cooper, L. A., & Haidet, P. (2008). Understanding concordance in patient-physician relationships: Personal and ethnic dimensions of shared identity. Annals of Family Medicine, 6(3), 198–205. https://doi.org/10.1370/afm.821
Summers, K. M., Deska, J. C., Almaraz, S. M., Hugenberg, K., & Lloyd, E. P. (2021). Poverty and pain: Low-ses people are believed to be insensitive to pain. Journal of Experimental Social Psychology, 95, 104116.
Tcherkassof, A., Bollon, T., Dubois, M., Pansu, P., & Adam, J. M. (2007). Facial expressions of emotions: A methodological contribution to the study of spontaneous and dynamic emotional faces. European Journal of Social Psychology, 37(6), 1325–1345.
Todd, K. H., Samaroo, N., Hoffman, J. R., Ng, B., Dimsdale, J. E., Rollnik, J. D., & Shapiro, H. (1993). Ethnicity as a risk factor for inadequate emergency department analgesia. JAMA, 269(12), 1537–1539.
Todorov, A., & Oosterhof, N. N. (2011). Modeling social perception of faces [social sciences]. IEEE Signal Processing Magazine, 28(2), 117–122.
Vila, H. J., Smith, R. A., Augustyniak, M. J., Nagi, P. A., Soto, R. G., Ross, T. W., & Miguel, R. V. (2005). The efficacy and safety of pain management before and after implementation of hospital-wide pain management standards: Is patient safety compromised by treatment based solely on numerical pain ratings? Anesthesia & Analgesia, 101(2). Retrieved from https://journals.lww.com/anesthesia-analgesia/Fulltext/2005/08000/The_Efficacy_and_Safety_of_Pain_Management_Before.32.aspx
Wagner, H. L. (1993). On measuring performance in category judgment studies of nonverbal behavior. Journal of Nonverbal Behavior, 17(1), 3–28.
Walzenbach, S. (2019). Hiding sensitive topics by design?: An experiment on the reduction of social desirability bias in factorial surveys. Survey Research Methods, 13(2019), 103–121.
Wang, S., Eccleston, C., & Keogh, E. (2021). The time course of facial expression recognition using spatial frequency information: Comparing pain and core emotions. The Journal of Pain, 22(2), 196–208.
Weisbuch, M., & Ambady, N. (2008). Affective divergence: Automatic responses to others’ emotions depend on group membership. Journal of Personality and Social Psychology, 95(5), 1063.
Wong, D. L., Hockenberry, M. J., Wilson, D., & Winkelstein, M. L. (2005). Wong’s essentials of pediatric nursing (Vol. 1). Mosby.
Zhang, L., Losin, E. A. R., Ashar, Y. K., Koban, L., & Wager, T. D. (2021). Gender biases in estimation of others’ pain. The Journal of Pain. https://doi.org/10.1016/j.jpain.2021.03.001
Zhao, K., Chu, W.-S., De la Torre, F., Cohn, J. F., & Zhang, H. (2016). Joint patch and multi-label learning for facial action unit and holistic expression recognition. IEEE Transactions on Image Processing, 25(8), 3931–3946.
Acknowledgements
We thank Peter Mende-Siedlecki for helpful discussions on our task design.
Funding
This research was supported by the Intramural Research program of NIH’s National Center for Complementary and Integrative Health (ZIA-AT000030, awarded to LYA).
Author information
Authors and Affiliations
Contributions
T.C.D. and L.Y.A. conceptualized the studies. T.C.D. and C.M.A. collected the data. T.C.D. and J.P. processed the data. T.C.D. and L.Y.A. analyzed the data, prepared figures, and wrote the manuscript. T.C.D., C.M.A., J.P., and L.Y.A. helped prepare and edit the manuscript.
Corresponding authors
Ethics declarations
Ethical Approval
This protocol, which collected no identifiers and involved only surveys and benign behavioral measures, was determined to be exempt from IRB review under the Common Rule by the National Institutes of Health’s Office of Human Subject Research Protection (18-NCCIH-0622).
Consent to Participate
Participants provided consent before completing the task, as approved by the National Institutes of Health’s Office of Human Subject Research Protection (18-NCCIH-0622).
Conflict of Interest
The authors declare no competing interests.
Additional information
Handling Editor: Disa Sauter
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Dildine, T.C., Amir, C.M., Parsons, J. et al. How Pain-Related Facial Expressions Are Evaluated in Relation to Gender, Race, and Emotion. Affec Sci 4, 350–369 (2023). https://doi.org/10.1007/s42761-023-00181-6