Natural language processing of multi-hospital electronic health records for public health surveillance of suicidality

Bey, Romain; Cohen, Ariel; Trebossen, Vincent; Dura, Basile; Geoffroy, Pierre-Alexis; Jean, Charline; Landman, Benjamin; Petit-Jean, Thomas; Chatellier, Gilles; Sallah, Kankoe; Tannier, Xavier; Bourmaud, Aurelie; Delorme, Richard

doi:10.1038/s44184-023-00046-7

Natural language processing of multi-hospital electronic health records for public health surveillance of suicidality

Article
Open access
Published: 14 February 2024

Volume 3, article number 6, (2024)
Cite this article

Download PDF

You have full access to this open access article

npj Mental Health Research

Natural language processing of multi-hospital electronic health records for public health surveillance of suicidality

Download PDF

Romain Bey¹,
Ariel Cohen¹,
Vincent Trebossen²,
Basile Dura¹,
Pierre-Alexis Geoffroy^3,4,5,6,
Charline Jean^1,7,8,
Benjamin Landman²,
Thomas Petit-Jean¹,
Gilles Chatellier^1,9,
Kankoe Sallah¹⁰,
Xavier Tannier¹¹,
Aurelie Bourmaud^9,12,13 &
…
Richard Delorme^2,14

2953 Accesses
1 Citation
48 Altmetric
6 Mentions
Explore all metrics

Abstract

There is an urgent need to monitor the mental health of large populations, especially during crises such as the COVID-19 pandemic, to timely identify the most at-risk subgroups and to design targeted prevention campaigns. We therefore developed and validated surveillance indicators related to suicidality: the monthly number of hospitalisations caused by suicide attempts and the prevalence among them of five known risks factors. They were automatically computed analysing the electronic health records of fifteen university hospitals of the Paris area, France, using natural language processing algorithms based on artificial intelligence. We evaluated the relevance of these indicators conducting a retrospective cohort study. Considering 2,911,920 records contained in a common data warehouse, we tested for changes after the pandemic outbreak in the slope of the monthly number of suicide attempts by conducting an interrupted time-series analysis. We segmented the assessment time in two sub-periods: before (August 1, 2017, to February 29, 2020) and during (March 1, 2020, to June 31, 2022) the COVID-19 pandemic. We detected 14,023 hospitalisations caused by suicide attempts. Their monthly number accelerated after the COVID-19 outbreak with an estimated trend variation reaching 3.7 (95%CI 2.1–5.3), mainly driven by an increase among girls aged 8–17 (trend variation 1.8, 95%CI 1.2–2.5). After the pandemic outbreak, acts of domestic, physical and sexual violence were more often reported (prevalence ratios: 1.3, 95%CI 1.16–1.48; 1.3, 95%CI 1.10–1.64 and 1.7, 95%CI 1.48–1.98), fewer patients died (p = 0.007) and stays were shorter (p < 0.001). Our study demonstrates that textual clinical data collected in multiple hospitals can be jointly analysed to compute timely indicators describing mental health conditions of populations. Our findings also highlight the need to better take into account the violence imposed on women, especially at early ages and in the aftermath of the COVID-19 pandemic.

Identifying Suicide Ideation and Suicidal Attempts in a Psychiatric Clinical Research Database using Natural Language Processing

Article Open access 09 May 2018

Natural language processing applied to mental illness detection: a narrative review

Article Open access 08 April 2022

Use of natural language processing in electronic medical records to identify pregnant women with suicidal behavior: towards a solution to the complex classification problem

Article 10 December 2018

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Introduction

Since its outbreak, the COVID-19 pandemic raised concerns about the heavy toll it would take on mental health. Clinicians and policymakers strived to rapidly identify populations that were the most mentally at-risk in order to judiciously design prevention campaigns and optimally allocate scarce resources. In particular, they feared a raise of the incidence of suicide attempts (SA)¹. More than 3 years after the pandemic outbreak, strong evidence has been collected that confirmed the well-foundedness of these concerns², but the first studies were either inconclusive or contradictory^3,4,5. Moreover, only recently were youths, especially girls, unambiguously identified as being the most affected group^6,7,8,9,10. The underlying vulnerability mechanism inducing this sex- and age-dependent dynamics are still poorly understood although a possible impact of the pandemic context on some risk factors such as child abuse has been suggested¹¹. The time required for evidence collection impeded the adoption of timely and targeted measures to mitigate the impact of the pandemic on these populations. New tools are consequently required to better monitor the mental health of populations and facilitate the management of upcoming crises^12,13.

Prospects in that direction are opened by the collection in various databases of ever more naturalistic data that reflect the mental health conditions of populations, in particular textual data, and by the advent of natural language processing (NLP) algorithms to analyse them automatically and timely. Indeed, new NLP technologies increasingly enable the computation of structured variables using large text corpora that were primarily generated for another objective (e.g., for care, by users of social media, etc.). Algorithms based on artificial intelligence such as neural networks have achieved impressive performances on various tasks related to the analysis of human language, including in clinical applications^14,15.

Until now, most studies that pursued this line of enquiry have focused on social media data^16,17,18,19. Although they detected impacts of the COVID-19 outbreak on contents posted on these media, it remained unclear whether concrete guidelines for clinicians and policymakers could be deduced from these observations. Indeed, information collected using this media concerned only a few outcomes of interest whose reporting moreover depends on its perceived importance by the population. Moreover, contents posted on social media neither covered evenly the population of interest, over- or under-representing subgroups of different demographic and social conditions, nor allowed proper stratifications to identify populations at-risk. Applying NLP algorithms on data collected in hospitals’ electronic health records (EHR) instead of social media appears as a promising field of research to circumvent these limitations. Specific technical and methodological difficulties, however, have to be overcome^20,21,22.

First, although some projects have already demonstrated that NLP algorithms applied on clinical reports could extract variables of interest regarding mental health^{23,24,25,26,27}, many of these algorithms were developed using disease-, age- or hospital-specific cohorts and concerns have been raised regarding their generalisability²⁸. It is therefore necessary to train and validate NLP algorithms on datasets that are representative of various contexts of care to detect indications on mental health that are often scattered among millions of reports edited throughout care organisations. Second, platforms allowing for the analysis of EHR often encompass data collected in a single hospital that may not be sufficient to monitor mental health of populations at a regional or national level. Privacy and technical issues limit the sharing of data or algorithms among platforms, and developing any multi-hospital indicator may consequently require a costly and complex replication of developments. Even when data from multiple hospitals are available on a single platform, their aggregation in a joint analysis is rarely straightforward as local specificities shall still be accounted for to avoid biases.

We aim at demonstrating that mental health indicators can be computed timely at a population level analysing jointly millions of clinical reports collected in multiple hospitals. We therefore focused on a specific use case, the monitoring of suicidality during the COVID-19 crisis. First, we developed and validated new NLP algorithms that measured our main indicator, i.e., the monthly number of hospitalised SA in fifteen hospitals of the Paris area. Second, we conducted a retrospective study considering this indicator both before and after the COVID-19 outbreak and assessed whether we detected the now-established surge of SA among youths and especially girls. Third, we conducted an exploratory analysis to estimate whether further information on the underlying mechanisms inducing this surge could be obtained by detecting risk factors mentioned in clinical reports.

Results

Cohort of hospitalisations caused by SA

From August 1st, 2017 until June 31st, 2022, we included in our analysis 2,911,920 hospitalisations, gathering 14,023 hospitalisations linked to SA as classified by our main, hybrid NLP algorithm (0.5% of considered hospitalisations, related to 11,786 independent individuals). We observed less stays before than after the COVID-19 outbreak [5954 (42.5%) and 8069 (57.5%), respectively] (Table 1). The mean age at admission was 38.0 years (20.7 SD). When considering sex-ratio distribution along the observation period of our study, we observed that females counted approximately for two thirds of SA-caused stays (9015, 64.3%) and males for one third of them (5008, 35.7%).

Table 1 Number of hospitalisations caused by suicide attempts, seasonally-adjusted long-term means and trends, and post-pandemic trend variations.

Full size table

NLP algorithms

As shown in Table 2, the positive predictive value (PPV) of the main hybrid algorithm for SA detection (0.85) was superior to the PPV of an alternative rule-based algorithm (0.51). Interestingly, NLP algorithms featured constant PPV before and after the COVID-19 outbreak for the detection of both SA-caused stays and risk factors. The inter-annotator positive and negative agreements were [0.92;0.5] for SA detection, [1.0;1.0] for four of the algorithms detecting risk factors (i.e., history of SA, physical, sexual, and domestic violence) and [1.0;-] for the algorithm detecting social isolation (i.e., no false positive detection was observed).

Table 2 Positive predictive values of the natural language processing algorithms.

Full size table

Mental health indicators

Figure 1 shows the sex- and age-stratified time-series of the monthly number of hospitalised SA. We noticed a modification of SA dynamics after the COVID-19 outbreak with the slope variation indicating a statistically significant increase of SA affected the overall population (3.7, 95%CI 2.1–5.3). This effect was mainly driven by girls aged 8–17 (1.8, 95%CI 1.2–2.5) and young women aged 18–25 (1.1, 95%CI 0.7–1.5), and marginally by men (0.9, 95%CI 0.2–1.6). The residuals did not feature any noticeable time trend indicating that temporal variations were correctly accounted for by the linear, seasonally-adjusted model (Supplementary Figs. 1, 2).

Sensitivity analysis

As shown in Supplementary Figs. 3–13 and Supplementary Tables 1–3, four sensitivity analyses confirmed the robustness of the results we reported: they were either still present (analysis with an alternative rule-based NLP algorithm, adjusting for a potential completeness-induced bias or conducting per-hospital analysis) or not significant (analysis with an alternative claim-based algorithm). Considering single hospitals often led to results that were not statistically significant, highlighting the interest of considering jointly multiple hospitals.

Characteristics of SA-caused hospitalisations

The variety and proportion of methods used to attempt suicide were similar across time but featured a stronger amount of intentional drug overdose among women (Supplementary Fig. 14). The time-to-exit indicated shorter stays after the COVID-19 outbreak and the survival analysis showed that stays ended less often by patient’s death ($p\le 0.001$ and $p=0.007$, respectively). Kaplan-Meier curves indicated that short stays were on average shorter after COVID-19 outbreak while the duration of longer stays remained unaffected (see Supplementary Fig. 15).

Exploratory analysis of risk factors

Finally, we explored how known SA risk factors were reported by clinicians in discharge summaries of SA-caused stays. Whereas personal suicide attempt history and social isolation were equally mentioned for males and females, acts of domestic, physical and sexual violence were more often reported for females both before and after the outbreak. However, the prevalence of reported risk factors evolved during the study period we examined (Fig. 2). We observed a strong increase of any kind of violence after COVID-19 outbreak (prevalence ratios: 1.3, 95%CI 1.16–1.48; 1.3, 95%CI 1.10–1.64 and 1.7, 95%CI 1.48–1.98 for domestic, physical and sexual violence, respectively - see Table 3). Interestingly, we only observed a marginal effect of the COVID-19 outbreak on the prevalence of personal suicide attempt history and social isolation (prevalence ratio 1.1, 95%CI 1.05–1.14; 1.2, 95%CI 1.09–1.39, respectively).

**Fig. 2: Monthly prevalence of risk factors reported in discharge summaries of hospitalised suicide attempts.**

Table 3 Prevalence of risk factors in pre- and post-pandemic periods.

Full size table

Discussion

Retrospective validation of indicators related to suicidality

The aim of our study was to demonstrate that surveillance indicators describing mental health of populations could be computed analysing jointly clinical reports edited in multiple hospitals. We therefore developed and applied NLP algorithms on data collected retrospectively during an observation period of 5 years which encompassed approximately three million hospital stays. We assessed whether this methodology could detect known variations in the dynamics of SA in children and adults after the COVID-19 outbreak and provide information on the mechanisms underlying this phenomenon. We identified a large-scale sample of 14,023 SA-caused hospitalisations during the whole period covered by the analysis. We observed an increase in the monthly number of hospitalised SA after the COVID-19 outbreak, especially among females aged 8–25 years old in accordance with previous findings^2,6,7,9. Interestingly, we detected that violence -which was known as a risk factor of SA that strongly affects girls and young women-^29,30 was more frequently reported in the discharge summaries of SA-caused stays after the pandemic outbreak, stressing its role in the atypical dynamic of SA in this population.

Impact of the COVID-19 context and violence on mental health

The detected associations highlight retrospectively the major impact of the COVID-19 pandemic on women’s mental health. Although a direct link between lockdown and violence imposed on females was not established in our study, our findings emphasised that this period was critical for women’s safety and resulted in an exacerbation of SA in the immediate aftermath of the pandemic^31,32,33. According to the World Health Organization (WHO), violence affects one in three women worldwide, who are domestically, physically or sexually intimidated³⁴. The impact of violence is major on mental health of girls and women, but more generally alters their health by increasing their risk at long-term of diabetes, chronic pain, and cardiovascular disease among others^35,36. Actions to prevent SA among girls and young women must include measures to protect them from violence: challenging discriminatory gender norms and attitudes that condone violence against women, reforming discriminatory family laws, promoting women’s access to gainful employment and secondary education, reducing exposure to violence during childhood, and addressing substance abuse^37,38. If our methodology were applied during the COVID-19 crisis, the computed indicators would have raised an early alert helping thus clinicians and policymakers to mitigate the impact of the pandemic on the mental health of girls.

NLP to study mental health of populations

Our study validated retrospectively the feasibility of analysing automatically EHR contained in multi-hospital clinical data warehouses to provide epidemiological insights on mental health of populations and early identify at-risk groups during crises. Using NLP algorithms allowed us to summarise data collected in millions of reports without introducing additional constraints in clinical practices. We confirmed that NLP algorithms could efficiently detect SA and some of its known risk factors^24,25,26, showing that we could both detect rare events among hospitals’ EHRs and circumvent the limited completeness of claim data³⁹. Even better, the adoption of a hybrid architecture that contained artificial intelligence components improved the algorithms’ performances compared to purely rule-based approaches²⁵. The statistical analysis relying on data provided by NLP algorithms was robust to technical and methodological choices, indicating in particular that concerns about the reliability of indicators computed using multi-hospital EHR could be addressed efficiently. In the future, applying prospectively these algorithms could allow us to compute real-time mental health indicators that would complement already available surveillance epidemiological tools. The methodology of this study could moreover be extended to further process clinical notes using NLP algorithms, for instance to extract information related to the consumption of care (medications, previous visits, etc.) or to socioeconomic determinants (unemployment, dwelling type, etc.). NLP algorithms could moreover be implemented in a clinical setting to better target the prevention of SA⁴⁰.

Limitations

This study has several limitations. First, it was an observational study that was not designed to test causality. Second, we conducted a retrospective study that should be extended by a prospective study to fully demonstrate the usefulness of the developed mental health indicators to address upcoming crises. Third, we detected SA and risk factors analysing their reporting by clinicians, but this reporting may depend on varying clinical practices, experience of physicians, ease of use of the EHR, etc. Fourth, visits to emergency departments that were not followed by hospitalisations were discarded due to the current availability of emergency reports in the database, thus limiting our analysis to the most severe SA resulting in a hospitalisation stay and reducing the exploration of the whole severity spectrum of SA.

Conclusions

In conclusion, we demonstrated that naturalistic data collected in the EHR of multiple hospitals, both structured and unstructured, could be leveraged to compute indicators describing mental health conditions of populations. To achieve this, we analysed retrospectively millions of clinical reports using NLP algorithms to identify populations whose suicidality was the most affected during a critical period, the COVID-19 pandemic. We detected some risk factors associated with the variation of the number of severe SA being hospitalised. Our results also highlighted the need to better take into account violence imposed on women, especially at early ages, to prevent the occurrence of severe SA in this at-risk group.

Methods

This study followed the REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) reporting guideline (checklist available in the Supplement)⁴¹. The methods were performed in accordance with relevant guidelines and regulations and approved by the institutional review board of the Greater Paris University Hospitals (IRB00011591, decision CSE 21-13). French regulation does not require the patient’s written consent for this kind of research but in accordance with the European General Data Protection Regulation the patients were informed and those who opposed the secondary use of their data for research were excluded from the study. Data was pseudonymised by replacing names and places of residence by aliases.

Choice of primary measure

We chose as primary measure the monthly number of hospitalisations caused by SA. Since the seminal work of Durkheim⁴², suicidality is indeed considered as a compelling marker of a population’s mental health. Many studies have been dedicated to the description of its variation with time and among subgroups, in particular when mentally at-risk populations needed to be identified (e.g., in the aftermath of economic crises or infectious epidemics)^43,44. Some of these studies focused on completed suicide attempts but this measure has limitations that makes it less suited for the monitoring of the mental health of populations as, (i) the reporting of SA on death certificates may be poorly reliable and is often delayed, (ii) and only a few events can be observed for some at-risk populations such as youths. Our choice to focus on suicide attempts increased the number of observed events and allowed us to leverage high-quality and timely data collected within EHRs.

Study design, setting, and participants

We conducted a multicentre observational retrospective cohort study. We considered hospitalisation stays caused by SA defined as a non-fatal self-directed potentially injurious behaviour with any intent to die as a result of the behaviour⁷. This definition discarded hospitalisations that only mentioned self-harm or suicide ideation. We considered data collected in the EHR of all patients hospitalised between August 1st, 2017 and June 31st, 2022 in the AP-HP (Assistance Publique-Hôpitaux de Paris) university hospitals. We selected only adult and paediatric hospitals for which the deployment of the EHR ensured a temporal stability of data collection during the study period (n = 15, see Supplementary Table 4 and Supplementary Fig. 16).

We examined administrative data (age, sex, dates of stay, death during stay), diagnoses issued from claim data (coded using the International Classification of Diseases 10th revision) and clinical reports. All reports were considered at the screening stage but only the last-edited discharge summary of each stay was used by the stay-classification algorithm.

To identify which stays were relative to a SA, we first detected patients whose clinical reports contained at least one keyword relative to SA (screening stage, see Supplementary Fig. 17 and Supplementary Table 5). Second, to discard the numerous false positive detections of the first stage we applied a stay-classification algorithm based on artificial intelligence. Third, we discarded patients who were aged under eight at admission. We chose this intermediate threshold as the intentionality of a SA is difficult to establish for very young children but SA are nevertheless reported by clinicians before the age of ten. Finally, if two SA-caused hospitalisations occurred within a period of 15 days for the same patient, we considered only the first occurrence to avoid spurious multiple counting.

Data sources

AP-HP comprises 38 hospitals spread across the Paris area (gathering more than 22,000 beds). Data collected in the EHR software and in the claim database are merged together in the AP-HP clinical data warehouse on a daily and monthly basis, respectively. The research database follows the Informatics for Integrating Biology & the Bedside standard⁴⁵. Data was extracted on July 4th, 2022.

Development and validation of natural language processing algorithms

At the screening stage, a dictionary of keywords relative to SA and grouped by modality was looked for in the clinical reports (e.g., “suicide attempts by jumping from height”, “intentional drug overdose” as well as orthographic variations of these terms, see Supplementary Table 5). At the stay-classification stage, keywords were extended as regular expressions (i.e., sequences of characters that define a search pattern) and looked for among the discharge summaries of the screened patients. Each detected mention was classified as valid (i.e., a stay really caused by a SA) or invalid using a NLP algorithm that followed the RoBERTa neural network architecture (see Supplementary Fig. 18)⁴⁶. It detected false detections of SA-caused by negative mentions, mentions not relative to the patient, relative to the patient’s history, expressed as reported speech or as an hypothesis. A stay was classified as SA-caused if at least one valid mention of SA was detected in its discharge summary. The NLP algorithm was developed using EDS-NLP v0.6.1⁴⁷.

The dataset was divided by hospital into training (n = 10) and validation (n = 5) sets (see Supplementary Table 4). The SA dictionary was initiated before accessing data by a group of four expert senior clinicians (V.T., P.-A.G., B.L., R.D.) and used to screen a first set of eligible stays. Then, 465 eligible stays were randomly selected among the training set and were annotated by authors. Each SA-keyword found in the documents was labelled as a true or false detection of a SA mention and each SA mention was qualified when relevant as negation, relative to a family member, relative to patient history, reported speech, hypothesis (1,571 mentions labelled, see the Annotation Guidelines in the Supplement). New SA-keywords detected during annotation were added to the dictionary. The annotations were then used to train the neural network that was initialised with a CamemBERT language model pre-trained on 21 million clinical reports written in French^48,49. The PPV of the algorithm used to detect SA-caused stays was assessed by a chart review by two expert clinicians (V.T., B.L.) of a sample of 162 discharge summaries randomly drawn from the validation set and divided in pre- and post-pandemic periods. Among them, 15 stays were blindly annotated by the two clinicians to measure inter-annotator agreement (positive and negative agreements)⁵⁰. The sensitivity of the algorithm could not be measured as SA-caused hospitalisations represented a tiny proportion of the hospitalisation stays and a massive dataset should consequently be annotated to estimate it.

Variables

We collected the following variables for each SA-caused stay: age at admission, sex, admission date, length of stay, death during the stay, known SA risk factors reported in discharge summaries [social isolation, domestic, sexual & physical violence and personal suicide attempt history (see Supplementary Table 6). Risk factors were detected in the last-edited discharge summaries of SA-caused stays using rule-based NLP algorithms (see Supplement)], claim diagnostic codes, hospital location.

Statistical analysis

Continuous variables were reported as means with standard deviations (SD). Qualitative variables were reported as numbers (%). To estimate the impact of the COVID-19 pandemic on the dynamics of SA-caused hospitalisations we conducted a single-group interrupted time-series analysis⁵¹. We considered the monthly numbers of SA-caused hospitalisations ${N}_{T}$ divided into a pre-pandemic period (after August 1st, 2017 and before February 29th, 2020) and a post-pandemic period (after March 1st, 2020 and before June 31st, 2022). We adjusted for the mean, the long-term trend and the seasonality⁵², and tested the presence of a post-pandemic variation of the trend using the following equation:

$${N}_{T}={\alpha }_{0}+{\alpha }_{1}T+{\alpha }_{2}(T-\tilde{T})\times 1\{T \ge \tilde{T}\}+\mathop{\sum }\limits_{m=Jan}^{Dec}\,{\beta }_{m}\times 1\{T=m\}+{e}_{T}$$

(1)

with $T$ a running counter of months since August, 2017; $\widetilde{T}$ the date of the pandemic outbreak (March 2020); ${\alpha }_{0}$ and ${\alpha }_{1}$ parameters characterising respectively the mean and the long-term trend; ${\alpha }_{2}$ the trend variation after the COVID-19 outbreak; ${\beta }_{m}$ parameters adjusting for seasonal variations with $m$ standing for months from January to December, and ${e}_{T}$ a random error. We estimated the model and confidence intervals (CI) via ordinary least-squares regression.

We conducted a subgroup analysis according to sex and age (8–17, 18–25, 26–65 and 66–). To compare the severity of SA-caused hospitalisations in the pre- and post-pandemic periods, accounting for the potential bias induced by the censoring of non-terminated stays, we considered length of stay as censored data and compared groups using log-rank tests. The same strategy was applied for the stays that ended with the patient’s death.

In an exploratory analysis we compared the pre- and post- pandemic prevalence ratios of the reported risk factors using Fisher’s exact tests. All tests were 2-sided and p-values were considered statistically significant when $\le 0.05$. All estimations were reported with their 95% CI. Statistical analysis was performed using the statsmodels v0.13.2 and lifelines v0.26.4^53,54.

Sensitivity analysis

We conducted four sensitivity analyses and assessed whether the statistically significant effects detected in the main analysis were still consistent in these sub-analyses (see Supplement for details): (i) we considered an alternative stay-classification algorithm that used only diagnosis claim codes; (ii) we evaluated an alternative stay-classification algorithm replacing the neural network by a rule-based NLP algorithm that did not rely on machine learning (we estimated the algorithm’s PPV annotating additional records); (iii) we adjusted for potential bias induced by missing data by dividing each monthly number of SA by the average completeness of discharge summaries⁵⁵; (iv) we examined each hospital separately.

Data availability

Access to the clinical data warehouse’s de-identified raw data can be granted following the process described on its website: www.eds.aphp.fr. A prior validation of the access by the local institutional review board is required. In the case of non-AP-HP researchers, the signature of a collaboration contract is moreover mandatory. The analysis code used to conduct this analysis is made freely available with publication.

Code availability

The underlying code for this study is available in https://doi.org/10.5281/zenodo.8283122 and can be accessed via this link https://github.com/aphp-datascience/study-nlp-suicidality-surveillance. The parameters of the machine learning model are not published due to privacy considerations, but they are available upon request for other projects conducted on the secure platform of the AP-HP’s clinical data warehouse on condition of a prior validation by the institutional review board.

References

Moutier, C. Suicide Prevention in the COVID-19 Era: Transforming Threat Into Opportunity. JAMA Psychiatry https://doi.org/10.1001/jamapsychiatry.2020.3746 (2020).
Dubé, J. P., Smith, M. M., Sherry, S. B., Hewitt, P. L. & Stewart, S. H. Suicide behaviors during the COVID-19 pandemic: a meta-analysis of 54 studies. Psychiatry Res. 301, 113998 (2021).
Article PubMed PubMed Central Google Scholar
Leske, S., Kõlves, K., Crompton, D., Arensman, E. & de Leo, D. Real-time suicide mortality data from police reports in Queensland, Australia, during the COVID-19 pandemic: an interrupted time-series analysis. Lancet Psychiatry 8, 58–63 (2021).
Article PubMed Google Scholar
Pirkis, J. et al. Suicide trends in the early months of the COVID-19 pandemic: an interrupted time-series analysis of preliminary data from 21 countries. Lancet Psychiatry 8, 579–588 (2021).
Article PubMed PubMed Central Google Scholar
Jollant, F. et al. Hospitalization for self-harm during the early months of the COVID-19 pandemic in France: A nationwide retrospective observational cohort study. Lancet Reg. Health Eur 6, 100102 (2021).
Article PubMed PubMed Central Google Scholar
Gracia, R. et al. Is the COVID-19 pandemic a risk factor for suicide attempts in adolescent girls? J. Affect. Disord. 292, 139–141 (2021).
Article CAS PubMed PubMed Central Google Scholar
Cousien, A., Acquaviva, E., Kernéis, S., Yazdanpanah, Y. & Delorme, R. Temporal trends in suicide attempts among children in the decade before and during the COVID-19 pandemic in Paris, France. JAMA Netw. Open 4, e2128611 (2021).
Article PubMed PubMed Central Google Scholar
Ridout, K. K. et al. Emergency department encounters among youth with suicidal thoughts or behaviors during the COVID-19 pandemic. JAMA Psychiatry 78, 1319–1328 (2021).
Article PubMed Google Scholar
Yard, E. et al. Emergency department visits for suspected suicide attempts among persons aged 12-25 years before and during the COVID-19 pandemic—United States, January 2019-May 2021. MMWR Morb. Mortal. Wkly. Rep. 70, 888–894 (2021).
Article CAS PubMed PubMed Central Google Scholar
Bersia, M. et al. Suicide spectrum among young people during the COVID-19 pandemic: a systematic review and meta-analysis. EClinicalMedicine 54, 101705 (2022).
Article PubMed PubMed Central Google Scholar
Swedo, E. Trends in U.S. emergency department visits related to suspected or confirmed child abuse and neglect among children and adolescents aged 18 years before and during the COVID-19 pandemic — United States, January 2019–September 2020. MMWR Morb. Mortal. Wkly. Rep. 69, 1841–1847 (2020).
John, A., Pirkis, J., Gunnell, D., Appleby, L. & Morrissey, J. Trends in suicide during the covid-19 pandemic. BMJ 371, m4352 (2020).
Article PubMed Google Scholar
Bitsko, R. H. et al. Mental health surveillance among children—United States, 2013-2019. MMWR Suppl. 71, 1–42 (2022).
Article PubMed PubMed Central Google Scholar
Spasic, I. & Nenadic, G. Clinical text data in machine learning: systematic review. JMIR Med. Inform. 8, e17984 (2020).
Article PubMed PubMed Central Google Scholar
Jiang, L. Y. et al. Health system-scale language models are all-purpose prediction engines. Nature 619, 357–362 (2023).
Article CAS PubMed PubMed Central Google Scholar
Shakeri Hossein Abad, Z. et al. Digital public health surveillance: a systematic scoping review. NPJ Digit. Med. 4, 41 (2021).
Article PubMed PubMed Central Google Scholar
Syrowatka, A. et al. Leveraging artificial intelligence for pandemic preparedness and response: a scoping review to identify key use cases. NPJ Digit. Med. 4, 96 (2021).
Article PubMed PubMed Central Google Scholar
Yin, Z., Sulieman, L. M. & Malin, B. A. A systematic literature review of machine learning in online personal health data. J. Am. Med. Inform. Assoc. JAMIA 26, 561–576 (2019).
Article PubMed Google Scholar
Edo-Osagie, O., De La Iglesia, B., Lake, I. & Edeghere, O. A scoping review of the use of Twitter for public health research. Comput. Biol. Med. 122, 103770 (2020).
Article PubMed PubMed Central Google Scholar
Kohane, I. S. et al. What every reader should know about studies using electronic health record data but may be afraid to ask. J. Med. Internet Res. 23, e22219 (2021).
Article PubMed PubMed Central Google Scholar
Wismüller, A. et al. Early-stage COVID-19 pandemic observations on pulmonary embolism using nationwide multi-institutional data harvesting. NPJ Digit. Med. 5, 120 (2022).
Article PubMed PubMed Central Google Scholar
Desai, R. J. et al. Broadening the reach of the FDA Sentinel system: a roadmap for integrating electronic health record data in a causal analysis framework. NPJ Digit. Med. 4, 170 (2021).
Article PubMed PubMed Central Google Scholar
Bittar, A., Velupillai, S., Downs, J., Sedgwick, R. & Dutta, R. Reviewing a decade of research into suicide and related behaviour using the South London and Maudsley NHS Foundation Trust Clinical Record Interactive Search (CRIS) System. Front. Psychiatry 11, 553463 (2020).
Article PubMed PubMed Central Google Scholar
Metzger, M.-H. et al. Use of emergency department electronic medical records for automated epidemiological surveillance of suicide attempts: a French pilot study. Int. J. Methods Psychiatr. Res. 26, e1522 (2017).
Article PubMed Google Scholar
Xie, F., Ling Grant, D. S., Chang, J., Amundsen, B. I. & Hechter, R. C. Identifying suicidal ideation and attempt from clinical notes within a large integrated health care system. Perm. J. 26, 85–93 (2022).
Article PubMed PubMed Central Google Scholar
Karystianis, G. et al. Automated analysis of domestic violence police reports to explore abuse types and victim injuries: text mining study. J. Med. Internet Res. 21, e13067 (2019).
Article PubMed PubMed Central Google Scholar
Downs, J. et al. Detection of suicidality in adolescents with autism spectrum disorders: developing a natural language processing approach for use in electronic health records. AMIA Annu. Symp. Proc. 2017, 641–649 (2017).
PubMed Google Scholar
Finlayson, S. G. et al. The clinician and dataset shift in artificial intelligence. N. Engl. J. Med. 385, 283–286 (2021).
Article PubMed PubMed Central Google Scholar
Angelakis, I., Gillespie, E. L. & Panagioti, M. Childhood maltreatment and adult suicidality: a comprehensive systematic review with meta-analysis. Psychol. Med. 49, 1057–1078 (2019).
Article PubMed PubMed Central Google Scholar
Serafini, G. et al. Life adversities and suicidal behavior in young individuals: a systematic review. Eur. Child Adolesc. Psychiatry 24, 1423–1446 (2015).
Article PubMed Google Scholar
Lawn, R. B. & Koenen, K. C. Violence against women and girls has long term health consequences. BMJ 375, e069311 (2021).
Article PubMed Google Scholar
Sardinha, L., Maheu-Giroux, M., Stöckl, H., Meyer, S. R. & García-Moreno, C. Global, regional, and national prevalence estimates of physical or sexual, or both, intimate partner violence against women in 2018. Lancet Lond. Engl. 399, 803–813 (2022).
Article Google Scholar
Di Fazio, N. et al. Mental health consequences of COVID-19 pandemic period in the European population: an institutional challenge. Int. J. Environ. Res. Public. Health 19, 9347 (2022).
Article PubMed PubMed Central Google Scholar
Violence Against Women Prevalence Estimates. World Health Organization https://www.who.int/publications-detail-redirect/9789240022256 (2021).
RESPECT women – Preventing violence against women. World Health Organization https://www.who.int/publications-detail-redirect/WHO-RHR-18.19 (2019).
Stubbs, A. & Szoeke, C. The effect of intimate partner violence on the physical health and health-related behaviors of women: a systematic review of the literature. Trauma Violence Abuse 23, 1157–1172 (2022).
Article PubMed Google Scholar
Jewkes, R. et al. Effective design and implementation elements in interventions to prevent violence against women and girls associated with success: reflections from the what works to prevent violence against women and girls? Global programme. Int. J. Environ. Res. Public Health 18, 12129 (2020).
Krug, E., Dahlberg, L., Mercy, J., Zwi, A. & Lozano, R. World report on violence and health. http://apps.who.int/iris/bitstream/handle/10665/42495/9241545615_eng.pdf?sequence=1 (2002).
Anderson, H. D. et al. Monitoring suicidal patients in primary care using electronic health records. J. Am. Board Fam. Med. 28, 65–71 (2015).
Article PubMed Google Scholar
Fradera, M. et al. Can Routine Primary Care Records Help in Detecting Suicide Risk? A Population-Based Case-Control Study in Barcelona. Arch. Suicide Res. 26, 1395–1409 (2022).
Article PubMed Google Scholar
Benchimol, E. I. et al. The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. PLoS Med. 12, e1001885 (2015).
Article PubMed PubMed Central Google Scholar
Durkheim, E. Suicide: A Study of Sociology (Routledge, 1897).
Zortea, T. C. et al. The impact of infectious disease-related public health emergencies on suicide, suicidal behavior, and suicidal thoughts. Crisis 42, 474–487 (2021).
Article PubMed Google Scholar
Laliotis, I., Ioannidis, J. P. A. & Stavropoulou, C. Total and cause-specific mortality before and after the onset of the Greek economic crisis: an interrupted time-series analysis. Lancet Public Health 1, e56–e65 (2016).
Article PubMed Google Scholar
Murphy, S. N. et al. Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2). J. Am. Med. Inform. Assoc. JAMIA 17, 124–130 (2010).
Article PubMed Google Scholar
Liu, Y. et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach. http://arxiv.org/abs/1907.11692 (2019). https://doi.org/10.48550/arXiv.1907.11692.
Dura, B. et al. EDS-NLP: efficient information extraction from French clinical notes. https://doi.org/10.5281/zenodo.6818507 (2022).
Martin, L. et al. CamemBERT: a Tasty French Language Model. in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 7203–7219 https://doi.org/10.18653/v1/2020.acl-main.645 (2020).
Dura, B. et al. Learning structures of the French clinical language:development and validation of word embedding models using 21 million clinical reports from electronic health records. http://arxiv.org/abs/2207.12940 (2022). https://doi.org/10.48550/arXiv.2207.12940.
Cicchetti, D. V. & Feinstein, A. R. High agreement but low kappa: II. Resolving the paradoxes. J. Clin. Epidemiol. 43, 551–558 (1990).
Article CAS PubMed Google Scholar
Bernal, J. L., Cummins, S. & Gasparrini, A. Interrupted time series regression for the evaluation of public health interventions: a tutorial. Int. J. Epidemiol. 46, 348–355 (2017).
PubMed Google Scholar
Galvão, P. V. M., Silva, H. R. S. E. & Silva, C. M. F. P. d. Temporal distribution of suicide mortality: a systematic review. J. Affect. Disord 228, 132–142 (2018).
Article PubMed Google Scholar
Seabold, S. & Perktold, J. Statsmodels: econometric and statistical modeling with python. In Proc 9th Python in Science Conference, (2010).
Davidson-Pilon, C. lifelines: survival analysis in Python. J. Open Source Softw 4, 1317 (2019).
Article Google Scholar
Weiskopf, N. G., Hripcsak, G., Swaminathan, S. & Weng, C. Defining and measuring completeness of electronic health records for secondary use. J. Biomed. Inform. 46, 830–836 (2013).
Article PubMed Google Scholar

Download references

Acknowledgements

We thank the clinical data warehouse (Entrepôt de Données de Santé, EDS) of the Greater Paris University Hospitals for its support and the realisation of data management and data curation tasks. This study received funding from the AP-HP Foundation. The funder had no role in the analysis and interpretation of data. We thank Julie Tort, Remi Flicoteaux and Etienne Audureau for fruitful discussions and Nicolas Bain for proofreading.

Author information

Authors and Affiliations

Innovation and Data unit, IT Department, Assistance Publique-Hôpitaux de Paris, Paris, France
Romain Bey, Ariel Cohen, Basile Dura, Charline Jean, Thomas Petit-Jean & Gilles Chatellier
Child and Adolescent Psychiatry Department, Robert Debré University Hospital, Assistance Publique-Hôpitaux de Paris, Paris, France
Vincent Trebossen, Benjamin Landman & Richard Delorme
Département de psychiatrie et d’addictologie, GHU Paris Nord, DMU neurosciences, Bichat - Claude Bernard Hospital, Assistance Publique-Hôpitaux de Paris, 75018, Paris, France
Pierre-Alexis Geoffroy
GHU Paris – psychiatry & neurosciences, 1, rue Cabanis, 75014, Paris, France
Pierre-Alexis Geoffroy
NeuroDiderot, Inserm, FHU I2-D2, université Paris Cité, 75019, Paris, France
Pierre-Alexis Geoffroy
CNRS UPR 3212, Institute for cellular and integrative neurosciences, 67000, Strasbourg, France
Pierre-Alexis Geoffroy
Université Paris-Est Créteil, INSERM, IMRB U955, Créteil, France
Charline Jean
Service Santé Publique & URC, Hôpital Henri Mondor, Assistance Publique-Hôpitaux de Paris, Créteil, France
Charline Jean
Université Paris Cité, Paris, France
Gilles Chatellier & Aurelie Bourmaud
URC PNVS, CIC-EC 1425, INSERM, Bichat - Claude Bernard Hospital, Assistance Publique-Hôpitaux de Paris, Paris, France
Kankoe Sallah
Sorbonne Université, Inserm, Université Sorbonne Paris Nord, Laboratoire d’Informatique Médicale et d’Ingénierie des Connaissances pour la e-Santé (LIMICS), Paris, France
Xavier Tannier
Clinical Epidemiology Unit, Robert Debré University Hospital, Assistance Publique-Hôpitaux de Paris, Paris, France
Aurelie Bourmaud
CIC 1426, Inserm, Paris, France
Aurelie Bourmaud
Human Genetics and Cognitive Functions, Institut Pasteur, Paris, France
Richard Delorme

Authors

Romain Bey
View author publications
You can also search for this author in PubMed Google Scholar
Ariel Cohen
View author publications
You can also search for this author in PubMed Google Scholar
Vincent Trebossen
View author publications
You can also search for this author in PubMed Google Scholar
Basile Dura
View author publications
You can also search for this author in PubMed Google Scholar
Pierre-Alexis Geoffroy
View author publications
You can also search for this author in PubMed Google Scholar
Charline Jean
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Landman
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Petit-Jean
View author publications
You can also search for this author in PubMed Google Scholar
Gilles Chatellier
View author publications
You can also search for this author in PubMed Google Scholar
Kankoe Sallah
View author publications
You can also search for this author in PubMed Google Scholar
Xavier Tannier
View author publications
You can also search for this author in PubMed Google Scholar
Aurelie Bourmaud
View author publications
You can also search for this author in PubMed Google Scholar
Richard Delorme
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.B., A.C., G.C., K.S., X.T., A.B., R.D. designed the study. R.B., A.C., V.T., B.L., B.D., C.J., T.P.J. annotated data with the methodological support of P.A.G. and R.D. R.B. drafted the manuscript. R.B. did the literature review. A.C. and R.D. identified relevant literature. A.C. developed the natural language processing algorithms and the computer code with the advice of R.B., B.D., C.J., T.P.J., X.T. All authors interpreted data and made critical intellectual revisions of the manuscript. R.D. supervised the project. R.B., A.C. had full access to the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.

Corresponding author

Correspondence to Ariel Cohen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

RECORD_Checklist

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bey, R., Cohen, A., Trebossen, V. et al. Natural language processing of multi-hospital electronic health records for public health surveillance of suicidality. npj Mental Health Res 3, 6 (2024). https://doi.org/10.1038/s44184-023-00046-7

Download citation

Received: 01 February 2023
Accepted: 06 December 2023
Published: 14 February 2024
DOI: https://doi.org/10.1038/s44184-023-00046-7
Springer Nature Limited

Natural language processing of multi-hospital electronic health records for public health surveillance of suicidality

Abstract

Similar content being viewed by others

Identifying Suicide Ideation and Suicidal Attempts in a Psychiatric Clinical Research Database using Natural Language Processing

Natural language processing applied to mental illness detection: a narrative review

Use of natural language processing in electronic medical records to identify pregnant women with suicidal behavior: towards a solution to the complex classification problem

Explore related subjects

Introduction

Results

Cohort of hospitalisations caused by SA

NLP algorithms

Mental health indicators

Sensitivity analysis

Characteristics of SA-caused hospitalisations

Exploratory analysis of risk factors

Discussion

Retrospective validation of indicators related to suicidality

Impact of the COVID-19 context and violence on mental health

NLP to study mental health of populations

Limitations

Conclusions

Methods

Choice of primary measure

Study design, setting, and participants

Data sources

Development and validation of natural language processing algorithms

Variables

Statistical analysis

Sensitivity analysis

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

RECORD_Checklist

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation