Introduction

A nasogastric tube (NGT) is a flexible rubber or plastic tube that is passed through the nostril, down the esophagus, and into the stomach. It is indicated in various scenarios, such as gastric decompression or administration of medications, in intensive care units (ICUs) and emergency departments (EDs).

An NGT malpositioned in a main bronchus may lead to complications such as pneumonia, respiratory failure, and death [1]. The prevalence of NGT placement errors in adults has been estimated to vary from 1.3% [2] to 89.5% [3], depending on the error definition. In the UK, feeding through a misplaced NGT is recognized by the National Patient Safety Agency (NPSA) of the National Health Service [4] as a serious patient safety issue that is “wholly preventable if guidance or safety recommendations that provide strong systemic protective barriers are available at a national level [5].”

A chest X-ray (CXR) is considered the gold standard for verifying NGT position [6], and the importance of the radiologist’s role in verification has been emphasized by the NPSA [7, 8]. Between April 2021 and March 2022, 31 incidents of feeding through a misplaced NGT were reported to the NPSA [9]; among these, CXR misinterpretation accounted for 14 (45%) and was the most frequently encountered error [9]. For patients in ICUs or EDs, CXRs are predominantly obtained using a portable X-ray machine. However, Torsy et al. [10] showed that in 16.9% of portable CXRs, the image quality was insufficient to conclusively determine the NGT position. While well-trained radiologists are crucial for confirming NGT placement, they may not always be readily available.

Few studies have employed deep learning to localize an NGT and detect its malposition. Most previous studies [11,12,13] focused on detecting NGT presence, using small datasets of 25–107 images. These models employed outdated image processing techniques [14, 15], which may fail when the NGT forms a loop or similar objects are present [13]. Consequently, these models achieved only moderate performance [11,12,13]. Singh et al. [16] used 5475 radiographs to develop deep learning models (Inception V3, ResNet50, DenseNet121) for detecting bronchial insertion of an NGT. When tested on 100 images, Inception V3 showed the highest AUC (0.87), with a sensitivity of 0.88 and specificity of 0.76.

To the best of our knowledge, there is a lack of deep learning models capable of localizing an NGT and simultaneously detecting its malposition. Therefore, this study aimed to develop a deep learning–based computer-aided detection (CAD) system to assist in the localization of NGTs and identify their malposition on portable supine CXRs.

Materials and Methods

Study Design and Setting

This retrospective study received approval from the Research Ethics Committee of National Taiwan University Hospital (reference number: 202003106RINC), with a waiver of consent granted. Portable supine CXRs were obtained from the Picture Archiving and Communication System (PACS) database of National Taiwan University Hospital and its Yunlin Branch. The study findings are presented in accordance with the Checklist for Artificial Intelligence in Medical Imaging (CLAIM) [17]. The methods for building the datasets, annotating the images, and developing the CAD system have been detailed in previous studies [18,19,20] conducted for different study purposes.

Image Acquisition and Dataset Construction

A radiology information system was employed to search the PACS databases for candidate CXRs (Fig. 1). Candidate positive images showing NGT malposition were identified through a keyword search (Supplemental Table 1). The inclusion criteria for these images were as follows: (1) a text report indicating NGT malposition, (2) a portable supine CXR, (3) obtained in an ED or ICU, (4) taken between 1 January 2015 and 31 December 2019, and (5) patient age ≥ 20 years. The inclusion criteria for the candidate group negative for NGT malposition were the same as for the candidate positive group, except that the text reports of the candidate negative group had to indicate the presence of at least one of the following devices: NGT, central venous catheter (CVC), or endotracheal tube (ETT). Because the number of images without NGT malposition was far greater than that with NGT malposition, we randomly selected 6000 images that met the inclusion criteria of the candidate negative group. The selected lists of the candidate groups were further examined to avoid overlap between the positive and negative groups; i.e., for each patient, only one image was selected for analysis. After duplicate images were excluded, the positive and negative groups comprised the National Taiwan University Hospital-1519 training dataset for model development.

Fig. 1

Flowchart of the image inclusion process and dataset designation. CVC, central venous catheter; CXR, chest X-ray; ED, emergency department; ETT, endotracheal tube; ICU, intensive care unit; NGT, nasogastric tube; NTUH, National Taiwan University Hospital; NTUH-YB, National Taiwan University Hospital-Yunlin Branch; PACS, Picture Archiving and Communication System

Table 1 Basic characteristics of the training and testing datasets

In order to test the performance of the CAD system in a simulated real-world setting, a random sampling method was used to construct the testing datasets, National Taiwan University Hospital-20 and National Taiwan University Hospital-Yunlin Branch. As suggested by the guidelines [21], external validation can involve data collected by the same team, using identical predictors and outcome definitions, but typically sampled from a different timeframe (temporal validation) or setting (geographical validation). In our study, the National Taiwan University Hospital-20 dataset included CXRs taken in 2020 at National Taiwan University Hospital, while the National Taiwan University Hospital-Yunlin Branch dataset comprised CXRs taken from 2015 to 2020 at National Taiwan University Hospital-Yunlin Branch. Compared with the National Taiwan University Hospital-1519 training dataset, the former was sampled from a different period (2015–2019 vs. 2020) and the latter from a different location. According to the guidelines [21], these temporally and geographically distinct datasets can be used to assess the external generalizability of the CAD system. All eligible images were exported in Digital Imaging and Communications in Medicine format for annotation.

The Catheter and Line Position (CLiP) dataset developed by Tang et al. [22] provided CXRs selected from the NIH ChestXray14 dataset [23]. The CLiP dataset includes CXRs with NGTs as well as nasoenteric tubes, such as nasojejunal and nasoduodenal tubes. These CXRs were obtained from individuals older than 10 years, and not all images were taken with portable CXR machines. Tang et al. [22] defined any nasoenteric tube within the airway system or the esophagus, or coiled anywhere above the gastroesophageal sphincter, as being in an abnormal position. The dataset includes 30,083 CXRs from 3791 patients with a median age of 49 years; among these, 267 (0.9%) have abnormally positioned nasoenteric tubes. Few public datasets are available for examining NGT position on portable anteroposterior CXRs. Although the CXRs in the CLiP dataset did not meet the inclusion criteria of our study, they may still serve as a resource for external validation to some extent.

Image Annotation and Ground Truth

For segmentation tasks, a sequential annotation procedure was employed. Each image was first randomly assigned to nurse practitioners, who added pixel-level labels for the NGT, NGT tip, lung, and diaphragm. These annotated images were then randomly assigned to emergency medicine (EM) physicians for review and adjustment if necessary.

For classification tasks, each image was classified based on the presence and position of the NGT. At the clinicians’ discretion, images were annotated with image-level labels as positive (malpositioned NGT), negative (correctly positioned NGT), or absent (no NGT visible). NGT malposition was further categorized as bronchial or esophageal. Each image was randomly assigned to one EM senior resident and one EM attending physician for annotation, with both annotators blinded to each other’s results.

A total of ten nurse practitioners, eight EM senior residents, and eight EM attending physicians participated in the annotation process, each with a minimum of 4 years of clinical experience. All annotated images for both segmentation and classification tasks were reviewed by a thoracic radiologist with 15 years of clinical experience for final approval and used as the ground truth.

Development of the Algorithm

Following annotation, the National Taiwan University Hospital-1519 dataset was randomly divided into five subgroups (folds), ensuring similar numbers of annotated images across all strata, for model development. As shown in Fig. 2, the CAD system consisted of two models: the first was trained to segment the NGT, NGT tip, lung, and diaphragm (segmentation model); the resulting segmentation masks then served as input to help detect the presence as well as the malposition of the NGT (classification model).
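For illustration, a minimal sketch of how such a two-stage pipeline could be wired is shown below. The fusion mechanism (concatenating the four mask channels with the grayscale image as extra input channels of DenseNet121) and all names are assumptions for illustration, not the exact implementation used in this study.

```python
import torch
import torch.nn as nn
from torchvision.models import densenet121

class TwoStageCAD(nn.Module):
    """Illustrative wiring of the two-stage pipeline (names and fusion are assumed)."""

    def __init__(self, seg_model: nn.Module, n_masks: int = 4, n_classes: int = 3):
        super().__init__()
        # seg_model: any network mapping (B, 1, 512, 512) -> (B, 4, 512, 512),
        # e.g., DeepLabv3+ with a ResNeSt50 backbone.
        self.seg_model = seg_model
        self.cls_model = densenet121(num_classes=n_classes)
        # Widen the classifier's first convolution so it accepts the grayscale
        # image plus the four mask channels (NGT, NGT tip, lung, diaphragm).
        self.cls_model.features.conv0 = nn.Conv2d(
            1 + n_masks, 64, kernel_size=7, stride=2, padding=3, bias=False
        )

    def forward(self, image: torch.Tensor):
        masks = torch.sigmoid(self.seg_model(image))           # mask probabilities
        logits = self.cls_model(torch.cat([image, masks], 1))  # presence/malposition/absence
        return masks, logits
```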

Fig. 2

The two-stage pipeline of the CAD system. The CAD system comprised a segmentation model and a classification model. DeepLabv3 + with ResNeSt50 backbone served as the basic architecture of the segmentation model, and DenseNet121 was the basic architecture for the classification model. The input of CXR image (A) was first passed into the segmentation model to derive the segmentation masks of NGT, NGT tip, lung, and diaphragm (B). The segmentation masks (B) were then passed into the classification model along with the input image (A) to assess NGT positioning, providing the probabilities of NGT presence, NGT malposition, and NGT absence (C). Concurrently, the classification model produced CAM (D) to visualize and explain the classification result. CAD, computer-aided detection; CXR, chest X-ray; NGT, nasogastric tube; CAM, class activation mapping

Figure 3 shows the training pipeline. In preprocessing, images were first ensured to have a photometric interpretation of Monochrome2, then resized to 512 × 512 pixels, and finally transformed by contrast limited adaptive histogram equalization (CLAHE) [24]. DeepLabv3 + [25] with backbone ResNeSt50 [26] and DenseNet121 [27] were selected as the model architecture for segmentation and classification models, respectively. The training process used a batch size of 36, the AdamW [28] optimizer, and a learning rate of 3e−4, adjusted by Cosine Annealing with Warm Restarts [29]. Images with NGT malposition were oversampled to balance the image number across each annotated group.
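The preprocessing steps described above could, for example, be implemented as in the sketch below; the CLAHE clip limit and tile size are assumptions, as they are not specified in this study.

```python
import cv2
import numpy as np
import pydicom

def preprocess_cxr(dicom_path: str, size: int = 512) -> np.ndarray:
    """Enforce MONOCHROME2, resize to 512 x 512, then apply CLAHE."""
    ds = pydicom.dcmread(dicom_path)
    img = ds.pixel_array.astype(np.float32)
    # MONOCHROME1 stores inverted intensities; flip so that higher values are brighter.
    if ds.PhotometricInterpretation == "MONOCHROME1":
        img = img.max() - img
    # Rescale to 8-bit before CLAHE (cv2's CLAHE expects uint8 or uint16 input).
    img = cv2.normalize(img, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)
    img = cv2.resize(img, (size, size), interpolation=cv2.INTER_AREA)
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))  # assumed parameters
    return clahe.apply(img)
```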

Fig. 3

The training pipeline for the CAD system (A). CXR images and ground-truth labels (red rounded rectangle) were sampled from the dataset (red rounded rectangle). Images and labels then underwent preprocessing (orange rectangle) for training (yellow rectangle). In preprocessing (B), images were ensured to be Monochrome2, resized to 512 × 512 pixels, and transformed by CLAHE. In preprocessing (C), segmentation labels were interpolated into masks; then, NGT tips were extracted. After preprocessing, images were passed into the segmentation model with DeepLabv3 + and ResNeSt50 backbone as the model architecture (blue rectangle). The segmentation model produced segmentation masks of NGT, NGT tip, lung, and diaphragm (D). These segmentation masks and preprocessed images were passed into the classification model (blue rectangle) to detect NGT presence and NGT malposition (E). Loss functions were used to supervise the learning process, including spatial-weighted Dice loss for segmentation and focal loss with label smoothing for classification (green rectangle). Segmentation masks were preprocessed by morphological dilation (orange rectangle) before being passed into loss functions. The training algorithm included AdamW as an optimizer, Cosine Annealing with Warm Restarts as a learning rate scheduler, and Loss Weights Scheduler as a mechanism to dynamically weight losses (cyan rectangle). In the flowchart, the rectangle is used to specify a procedure or algorithm, while the rounded rectangle is used to specify the data. CAD, computer-aided detection; CXR, chest X-ray; CLAHE, contrast limited adaptive histogram equalization; NGT, nasogastric tube

The NGT tip was assumed to play a critical role in detecting NGT malposition. Therefore, we modified Dice loss [30] to be spatially weighted, assigning more weight to pixels around the NGT tip during model training. Two loss functions supervised the learning process: spatial-weighted Dice loss [30] for segmentation and focal loss [31] with label smoothing [32] for classification. The losses were dynamically weighted to optimize the segmentation model before the classification model. The training procedure was halted after 32 epochs.
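To illustrate the idea behind the spatial weighting, the per-pixel weight map can up-weight pixels near the annotated tip before the Dice terms are accumulated. The weighting scheme sketched below (a uniform boost within a fixed radius of the tip) and its parameter values are assumptions, not the study's exact formulation.

```python
import torch
import torch.nn.functional as F

def spatial_weighted_dice_loss(pred, target, weight, eps: float = 1e-6):
    """Dice loss with a per-pixel weight map; pred, target, weight are (B, H, W)."""
    inter = (weight * pred * target).sum(dim=(1, 2))
    denom = (weight * pred).sum(dim=(1, 2)) + (weight * target).sum(dim=(1, 2))
    return 1 - ((2 * inter + eps) / (denom + eps)).mean()

def tip_weight_map(tip_mask, radius: int = 15, boost: float = 5.0):
    """Weight map equal to 1 everywhere and `boost` within `radius` pixels of the tip.

    tip_mask: float binary mask of the annotated NGT tip, shape (B, H, W).
    """
    kernel = torch.ones(1, 1, 2 * radius + 1, 2 * radius + 1, device=tip_mask.device)
    near_tip = F.conv2d(tip_mask.unsqueeze(1), kernel, padding=radius).squeeze(1) > 0
    return torch.where(near_tip, torch.full_like(tip_mask, boost), torch.ones_like(tip_mask))
```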

The best parameters were obtained through fivefold cross-validation on the National Taiwan University Hospital-1519 dataset and later used for model ensembling on the testing datasets. Gradient-weighted class activation mapping (CAM) [33] was employed to inspect the image areas activated by the network and to understand how the algorithm made inferences.
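A minimal sketch of the ensembling step, assuming the five fold models' class probabilities are simply averaged (the exact ensembling scheme is not specified here):

```python
import torch

@torch.no_grad()
def ensemble_predict(models, image: torch.Tensor) -> torch.Tensor:
    """Average the softmax probabilities of the five fold models (all in eval mode)."""
    probs = [torch.softmax(model(image), dim=1) for model in models]
    return torch.stack(probs).mean(dim=0)
```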

The model was trained on an Ubuntu 16.04.7 LTS operating system, using the PyTorch 1.12.1 deep learning framework [34] with CUDA 11.6. The training utilized four Intel(R) Xeon(R) CPU E5-2650 v4 @ 2.20 GHz processors, 256 GB of hard disk space, 16 GB of RAM, and an Nvidia Titan V graphics processing unit (Nvidia Corporation, Santa Clara, CA, USA).

Evaluation Metrics of the Algorithm

The segmentation model’s performance was measured using the Dice coefficient, which is calculated as twice the overlap area divided by the sum of the pixels in both the predicted and ground-truth masks. Additionally, the accuracy of NGT tip localization was evaluated by measuring the absolute distance between the predicted and ground-truth NGT tips (tip-tip distance).
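These two metrics can be expressed compactly as follows; the conversion from pixels to centimeters assumes an isotropic pixel spacing taken from the image metadata.

```python
import numpy as np

def dice_coefficient(pred: np.ndarray, truth: np.ndarray) -> float:
    """Dice = 2 * |pred AND truth| / (|pred| + |truth|) for binary masks."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    denom = pred.sum() + truth.sum()
    return float(2 * np.logical_and(pred, truth).sum() / denom) if denom else 1.0

def tip_tip_distance(pred_tip, truth_tip, pixel_spacing_cm: float) -> float:
    """Euclidean distance in cm between predicted and ground-truth tip coordinates."""
    return float(np.linalg.norm(np.asarray(pred_tip) - np.asarray(truth_tip)) * pixel_spacing_cm)
```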

The classification model’s performance was evaluated using the area under the receiver operating characteristic curve (AUC) and the area under the precision-recall curve. Other metrics included sensitivity, specificity, positive predictive value, and negative predictive value. The optimal threshold for these evaluation metrics was determined using Youden’s index [35] on the National Taiwan University Hospital-1519 dataset.
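For reference, a minimal sketch of deriving the operating threshold with Youden's index on the training dataset and applying it to a testing dataset is shown below (using scikit-learn's ROC utilities; the study reports SciPy only for the statistical analyses, so the exact toolkit for this step is an assumption).

```python
import numpy as np
from sklearn.metrics import roc_curve, roc_auc_score

def youden_threshold(y_true, y_score) -> float:
    """Threshold maximizing Youden's J = sensitivity + specificity - 1."""
    fpr, tpr, thresholds = roc_curve(y_true, y_score)
    return float(thresholds[np.argmax(tpr - fpr)])

# Example: threshold chosen on the training dataset, applied to a testing dataset.
# auc = roc_auc_score(y_test, scores_test)
# y_pred = (scores_test >= youden_threshold(y_train, scores_train)).astype(int)
```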

Statistical Analysis

Continuous variables are shown as mean and standard deviation, while categorical variables are displayed as counts and percentages. Continuous variables were compared using analysis of variance (ANOVA), and categorical variables were compared using the chi-squared test. The kappa coefficient was calculated to evaluate inter-annotator agreement for classifying NGT malposition. All statistical measures are reported as point estimates with 95% confidence intervals (CIs) derived using the bootstrap method with 1000 repetitions. All statistical analyses were carried out with SciPy version 1.8.1 [36].
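A bootstrap CI with 1000 resamples can be sketched as below; the percentile method is assumed here, and any metric function (e.g., AUC or sensitivity) can be plugged in.

```python
import numpy as np

def bootstrap_ci(y_true, y_score, metric, n_boot: int = 1000, seed: int = 0):
    """Point estimate and percentile 95% CI of metric(y_true, y_score)."""
    rng = np.random.default_rng(seed)
    y_true, y_score = np.asarray(y_true), np.asarray(y_score)
    stats = []
    for _ in range(n_boot):
        idx = rng.integers(0, len(y_true), len(y_true))  # resample with replacement
        stats.append(metric(y_true[idx], y_score[idx]))
    return metric(y_true, y_score), np.percentile(stats, [2.5, 97.5])
```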

Results

As depicted in Fig. 1, a total of 7378 images were retrieved from the PACS database, with 5767 images designated for training and 1611 images for testing. Table 1 highlights significant differences between the training and testing datasets.

Figure 4 showcases three sets of representative images, with overlaid segmentation masks designed to aid in verifying NGT positions. According to Table 2, the Dice coefficient indicated that the segmentation model could accurately outline the NGT (National Taiwan University Hospital-20: 0.665, 95% CI 0.630–0.696; National Taiwan University Hospital-Yunlin Branch: 0.646, 95% CI 0.614–0.678) and lung, although its performance in delineating the diaphragm was less satisfactory. The tip-tip distance further demonstrated accurate localization of the NGT tip (National Taiwan University Hospital-20: 1.64 cm, 95% CI 0.99–2.41; National Taiwan University Hospital-Yunlin Branch: 2.83 cm, 95% CI 1.94–3.76).

Fig. 4

Example images stratified by prediction results of the classification model, including A true-positive, B false-positive, and C true-negative results. There were no false-negative cases in the testing results. The first (leftmost) column presents the original images. The second column shows the pixel-level labels used by annotators, including the nasogastric tube (green), lung (yellow), and diaphragm (orange); the dark green dot represents the nasogastric tube tip. The third column demonstrates the segmentation masks output by the segmentation model. The fourth column shows the results of gradient-weighted class activation mapping (CAM), which reveal that the areas around the nasogastric tube tip were used by the classification model to make inferences

Table 2 Performance of the segmentation model on the testing datasets

Table 3 shows that the model could classify the presence of the NGT with high accuracy (AUC: National Taiwan University Hospital-20: 0.998, 95% CI 0.995–1.000; National Taiwan University Hospital-Yunlin Branch: 0.998, 95% CI 0.995–1.000; CLiP dataset: 0.991, 95% CI 0.990–0.992). Additionally, for images containing an NGT, the classification model demonstrated high accuracy in detecting NGT malposition (AUC: National Taiwan University Hospital-20: 0.964, 95% CI 0.917–1.000; National Taiwan University Hospital-Yunlin Branch: 0.991, 95% CI 0.970–1.000), effectively identifying both bronchial and esophageal insertions. In the CLiP dataset, the model also performed well in detecting abnormal positions of nasoenteric tubes (AUC 0.839, 95% CI 0.807–0.869). The CAM analysis indicated that the areas around the NGT tip were the primary focus of the classification model (Fig. 4). Table 4 also shows substantial agreement between annotators regarding NGT malposition [37].

Table 3 Performance of the classification model on the testing datasets
Table 4 Interrater agreement between two annotators

Discussion

Image Annotation

Compared to previous research [11,12,13, 16], our study incorporated the largest dataset of images annotated with pixel-level labels. This extensive annotation contributed to achieving strong performance with fewer images than anticipated. The quality of a portable supine CXR is known to be highly variable because of differences in image exposure and scattered radiation, leading to poor NGT visibility [10, 38]. As the esophageal portion of an NGT is embedded within the mediastinum, the silhouettes of surrounding structures with similar radiopacity, such as the heart, may further reduce NGT visibility. In addition, portable supine CXRs may contain various overlaps or interconnections between NGTs and other tubes commonly used in critically ill patients, such as ETTs or CVCs. All these factors make accurate NGT detection substantially difficult, not only in clinical practice [10, 38] but also during annotation. Hence, to facilitate annotation and model development, CLAHE [39] was employed to enhance image contrast details while avoiding the noise amplification caused by ordinary histogram equalization. As shown in Table 4, the kappa value for NGT malposition was 0.821, indicating substantial inter-annotator agreement [37].

Dataset Construction

As NGTs may be easily overlooked on portable supine CXRs [10, 38] and not mentioned in clinical radiology reports, using NGT as the only keyword to search for candidate images may inadvertently select only those with an easily visible NGT, resulting in selection bias. In our candidate negative group, the keywords ETT, NGT, and CVC were all used to establish the candidate list. As NGT placement was frequently accompanied by ETT or CVC placement on portable supine CXRs among critically ill patients, using a combination of these keywords for random sampling may have increased our chances of including images with various radiological appearances of NGTs, regardless of whether or not they were mentioned in the report. As shown in Fig. 1, the number of images with NGT malposition was higher in the annotated group than in the candidate positive group. This gap may be explained by the absence of the relevant keywords in the clinical reports. Hence, a multi-keyword search strategy may yield datasets with less selection bias.

We randomly selected images from different time periods (National Taiwan University Hospital-20) and locations (National Taiwan University Hospital-Yunlin Branch). Additionally, for these two testing datasets, we included only images taken in the ED. Table 1 highlights significant differences among these datasets, indicating their suitability for evaluating the external generalizability of the CAD system.

Performance of Segmentation Model

Our CAD system could segment the NGT, NGT tip, lungs, and diaphragm. By visualizing the complete path of the NGT, including the tip, the system makes it easier for clinicians to verify the detection results. Tracing an NGT is not easy, as it may loop on itself or take other aberrant courses, and NGTs may be confused with other linear structures, such as CVCs. Therefore, we used several approaches to improve NGT segmentation. First, CLAHE enhanced linear structures in the images, providing a clearer segmentation target. Second, the segmentation masks were morphologically dilated during training to smooth the loss, since thin linear structures such as the NGT otherwise yield an oversensitive loss. Lastly, tip attention was applied through the spatial-weighted Dice loss, as the portion of the NGT closer to the tip was more challenging to segment. Accurate segmentation, as reflected by the Dice coefficient and the NGT tip-tip distance, was critical for the subsequent classification model to detect NGT malposition.
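The dilation step mentioned above can be sketched as a simple morphological operation applied to the masks before the loss computation; the kernel shape and size below are assumptions.

```python
import cv2
import numpy as np

def dilate_mask(mask: np.ndarray, kernel_size: int = 5) -> np.ndarray:
    """Thicken a thin-line mask (e.g., the NGT) before computing the loss,
    so that small spatial offsets are not penalized as complete misses."""
    kernel = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (kernel_size, kernel_size))
    return cv2.dilate(mask.astype(np.uint8), kernel, iterations=1)
```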

Besides delineating the NGT course, the CAD system could also segment the lung and diaphragm. Based on the relative positions of the NGT tip, diaphragm, and lung, the classification model could determine whether the NGT was malpositioned. It may be a concern that the low Dice coefficients for the diaphragm were not adequate for the classification tasks. However, suboptimal Dice coefficients for the diaphragm should not be surprising, as the portion where the diaphragm adjoins the lower heart border may be difficult to discern radiologically, making the ground truth difficult to annotate and, in turn, difficult for the algorithm to learn.

Performance of Classification Model in the National Taiwan University Hospital-20 and National Taiwan University Hospital-Yunlin Branch Testing Datasets

The detection task was split into two stages to evaluate its performance. In the first stage, the classification model detected the presence of an NGT, demonstrating excellent performance with AUCs above 0.99 (Table 3). In the second stage, the model identified NGT malposition in the images in which an NGT was present. In this study, NGT malposition was defined as bronchial or esophageal insertion, as it is crucial to ensure that the distal side holes of the NGT are positioned in the stomach to prevent aspiration. The AUCs of our CAD system for detecting NGT malposition were 0.964 and 0.991 in National Taiwan University Hospital-20 and National Taiwan University Hospital-Yunlin Branch, respectively, demonstrating excellent performance and consistent external generalizability. In a previous study, the model of Singh et al. [16] detected bronchial insertion with an AUC of 0.87. Bronchial insertion may cause more harm to patients than esophageal insertion and necessitates immediate attention and adjustment. Our analysis demonstrated that the performance of the CAD system was similarly high in detecting bronchial and esophageal insertion, with AUCs above 0.96.

Our testing datasets contained few images with NGT malposition: one (1.2%) in National Taiwan University Hospital-20 and six (5.1%) in National Taiwan University Hospital-Yunlin Branch. Since healthcare providers have many other methods to check the NGT position before obtaining a CXR, the number of images with malposition is expected to be small. Previous studies [40, 41] have reported that daily CXRs in ICUs reveal NGT malposition in about 0.3–0.4% of cases. Therefore, the low prevalence of NGT malposition in our testing datasets may simply reflect real-world settings.

Performance in the CLiP Dataset

To the best of our knowledge, the CLiP dataset was the only public dataset with misplaced tubes annotated that allowed external testing of our model. In the CLiP dataset, the classification model could detect the presence of nasoenteric tubes with performance similar to that for NGTs in the National Taiwan University Hospital-20 and National Taiwan University Hospital-Yunlin Branch datasets. However, the model’s performance in detecting malpositioned nasoenteric tubes decreased slightly. The difference in tube types may account for this gap. The CLiP dataset contained CXRs with nasoduodenal and nasojejunal tubes, which are longer than NGTs. The proximal parts of NGTs, nasoduodenal tubes, and nasojejunal tubes may appear similar on CXRs but differ substantially distal to the gastroesophageal sphincter. Because the tube tip was important for the model in determining malposition, the different tube types may have led to less favorable performance in detecting malpositioned nasoenteric tubes. Finally, in the CLiP dataset, each patient contributed an average of eight CXRs. These correlated images may lead to over- or underestimation of classification performance, which could not be accounted for because the dataset does not provide source information for these images. Given the limitations of the CLiP dataset, the external generalizability of the classification model may be considered good to excellent [42, 43]. Moreover, the classification model reached a negative predictive value above 0.990 in all three testing datasets, which may be useful for confirming tube position.

Future Applications

Yi et al. [44] suggested that a clinically effective tube assessment model should be capable of performing five tasks: (1) detecting the presence of the tube, (2) localizing its tip, (3) tracing the tube’s course, (4) identifying the tube, and (5) determining whether the tube is correctly positioned. Previous studies [11,12,13, 16] have addressed some of these tasks but have not provided all the necessary information. To advance this goal, we employed a sequential inference strategy that integrates deep learning–based segmentation and classification models, providing clinicians with the most comprehensive results. The CAD system can be utilized to (1) prioritize CXRs, highlighting those requiring immediate review by the radiologist, or (2) send notifications to treating clinicians. When clinicians review the classification results, the segmentation masks of the NGT, lung, and diaphragm can be displayed to aid in verifying the findings.

Study Limitations

First, the datasets were inherently imbalanced, as clinical protocols had already been established for checking NGT positioning, which may have reduced the number of malpositioned NGTs detected on CXRs. Second, the testing datasets only included images obtained in EDs. Most portable CXRs obtained in ICUs are used to follow pulmonary disease [40, 41], while those obtained in EDs are more likely to be used to check the position of a newly placed NGT. Random sampling of portable CXRs obtained in EDs may increase the probability of obtaining images with an NGT and with NGT malposition.

Conclusions

The developed deep learning–based CAD system effectively localizes NGTs and identifies any malposition on portable supine CXRs taken in the ED and ICU. The consistent performance observed across different time periods and locations indicates that the system has strong potential for external generalizability.