Background

Chronic obstructive pulmonary disease (COPD) is the third leading cause of mortality worldwide, with more than 3 million deaths per year [1, 2]. To reduce its global burden, the recently revised Global Initiative for Chronic Obstructive Lung Disease (GOLD) 2023 have proposed a new taxonomy (etiotype) to emphasize early recognition of various COPD etiologies. Among them, COPD due to infections (COPD-I) includes childhood infections, tuberculosis-associated COPD, and human immunodeficiency virus-associated COPD [3]. However, other studies have suggested that infections other than these diseases are also associated with obstructive lung disease [4, 5]. The GOLD committee has recommended increased attention to these other infections linked to COPD [6, 7].

Nontuberculous mycobacterial pulmonary disease (NTM-PD) is a chronic and progressing infectious respiratory disease caused by various NTM species, with an increasing incidence worldwide [8]. Airflow obstruction or physician-diagnosed COPD is common in patients with NTM-PD [9]. However, since COPD is a common risk factor for NTM-PD, [10] whether NTM-PD precedes COPD or vice versa remains unclear. Although previous studies have suggested that patients with NTM-PD experience a faster decline in forced expiratory volume in 1 s (FEV1) than that of the general population, [11,12,13,14] and coexisting NTM-PD is associated with the faster decline in FEV1 in patients with established COPD, [15, 16] only a few studies have explored whether NTM-PD can serve as an infectious etiotype for COPD [4].

Therefore, the present study aimed to investigate the incidence and risk of developing COPD in patients with NTM-PD with normal lung function. To achieve this, we conducted a retrospective single-center cohort involving a large number of patients with NTM-PD and matched controls and compared the incidences of spirometry-defined COPD between them. To validate the main outcomes, we conducted a nationwide population-based cohort study using the health insurance claim database.

Materials and methods

Data source

Two study designs were developed to investigate the incidence and risk of developing COPD. Study 1 aimed to compare the incidence of COPD between patients with NTM-PD and matched controls, who had undergone a comprehensive health screening examination at the Samsung Medical Center (1,960-bed university-affiliated, tertiary referral hospital). Data were extracted from the Clinical Data Warehouse (CDW) Darwin-C at the Samsung Medical Center [17].

Study 2 aimed to validate the findings of study 1 by evaluating the incidence and risk of newly diagnosed and treated COPD among individuals with and without NTM-PD. We used a nationwide representative dataset of approximately 1,000,000 individuals, the Korean National Health Insurance Service National Sample Cohort (NHIS-NSC), which contains information on demographics, healthcare utilization, health screening examinations, disease diagnosis, drug prescription, and death [18]. Supplementary Method details the methodology of study 2.

Study 1 was approved by the Institutional Review Board of the Samsung Medical Center (IRB no.: SMC 2023-08-084). Study 2 was approved by the Institutional Review Board of the Hanyang University Hospital (IRB no.: HYUH 2023-06-007). The requirement for obtaining informed consent was waived because the CDW and NHIS databases had been constructed after anonymization. These studies were conducted in accordance with the principles of the Declaration of Helsinki.

Study population

In study 1, the NTM-PD group initially included 8,184 patients with NTM-PD who had undergone a spirometry test between November 1994 and January 2022. Exclusion criteria were as follows: (1) an abnormal baseline spirometry result, defined as an FEV1/forced vital capacity (FVC) ratio < 0.7 or a predicted percentage value (%pred) of FEV1 < 80 (n = 5,009); (2) no follow-up spirometry test for at least 1 year after the baseline spirometry (n = 274); and (3) a previous diagnosis of COPD (n = 869). Additionally, exclusion criteria from the NTM-PD group included a history of lung resection surgery (n = 61) and missing data on smoking status (n = 256).

The control group initially included 323,862 participants who underwent a health screening examination with a baseline spirometry test during the same period. After applying the same exclusion criteria and excluding those with an established diagnosis of NTM-PD (n = 687), 90,088 participants were enrolled in this study. We performed propensity score-matching at a ratio of 1:4 based on age, sex, body mass index (BMI), and smoking status, resulting in 1,715 patients in the NTM-PD group and 6,860 participants in the matched control groups (Fig. 1). In study 1, both patients with NTM-PD and controls were followed up from the date of the baseline spirometry until COPD diagnosis, death, or December 31, 2022, whichever occurred first.

Fig. 1
figure 1

Flowchart of the study population. Abbreviations: NTM-PD, nontuberculous mycobacterial pulmonary disease; COPD, chronic obstructive pulmonary disease; FEV1, forced expiratory volume in 1 s; FVC, forced vital capacity; BMI, body mass index

In study 2, patients with NTM-PD who were free from COPD at baseline and within 1 year after enrollment (n = 386) and 1:3 age- and sex-matched controls without NTM-PD or COPD (n = 1,133) were included (Supplementary Method and Supplementary Fig. 1). To minimize the risk of reverse causality, we implemented a 12-month COPD-free washout period; consequently, patients were followed up from 12 months after the enrollment with the endpoint being death, a COPD diagnosis, or December 31, 2019, whichever occurred first.

Exposure

The exposure variable was NTM-PD. In study 1, we defined NTM-PD as the presence of both International Classification of Disease Codes-10th Revision (ICD-10) codes of A31.0, A31.8, or A31.9 and microbiological findings (≥ 2 positive culture results from separately expectorated sputum samples with the same species, or ≥ 1 positive culture result from bronchial washing, bronchoalveolar lavage, or lung biopsy with species identification) [17]. NTM species were identified using a nested multiplex polymerase chain reaction and reverse hybridization assay of the internal transcribed spacer region of rpoB [19]. In study 2, NTM-PD was defined as a case in which the diagnostic code A31.0, A31.8, or A31.9 was assigned for the main- or sub-diagnosis, and specific medical practice codes (e.g., microbial culture or identification codes) were entered together (Supplementary Method).

Study outcome

The primary outcome was newly diagnosed COPD during the follow-up period. In study 1, COPD was defined as a pre-bronchodilator FEV1/FVC ratio < 0.7. When multiple spirometry results were present, the first date of reporting pre-bronchodilator FEV1/FVC ratio < 0.7 was considered the newly diagnosed COPD date. In study 2, COPD was defined as the presence of ICD-10 codes J43–J44, except J43.0, and COPD medication prescribed at least twice within 1 year [20, 21].

Spirometry and annual lung function decline

The baseline spirometry was defined as the first spirometry taken between 6 months before and 6 months after the date of NTM-PD diagnosis for patients with NTM-PD and as the spirometry taken on the same day as the health screening exam for controls.

Spirometry was performed using the Vmax 22 system (SensorMedics, Yorba Linda, CA, USA) according to the American Thoracic Society/European Respiratory Society guidelines [22]. The absolute values of FEV1 and FVC were obtained, and %pred FEV1 and FVC were calculated using the equation obtained by analyzing the representative values of the Korean population [23]. Lung function decline was calculated as follows: [(last FEV1 or FVC) – (FEV1 or FVC at baseline)]/follow-up duration (years) [11, 24]. Rapid FEV1 decline was defined as a loss in FEV1 of > 60 mL/year [25].

Covariates

In study 1, covariate data were extracted from the CDW including age, sex, BMI, smoking status, and comorbidities (pulmonary tuberculosis, diabetes mellitus, hypertension, ischemic heart disease, and cerebrovascular disease), defined using ICD-10 codes [26,27,28,29,30]. Supplementary Method outlines the definitions of the covariates for study 2.

Statistical analysis

Baseline characteristics were expressed as mean ± standard deviation for continuous variables and number (percentage) for categorical variables. We compared the two groups using the χ2 test for categorical variables and t-tests for continuous variables.

In study 1, we performed a 1:4 propensity score-matching between the NTM-PD and control cohorts based on age, sex, BMI, and smoking status. In study 2, we performed a 1:3 matching between the two cohorts based on age and sex. Standardized mean difference (SMD) was used to examine the balance of covariate distributions between the groups, and an SMD > 0.1 was considered to indicate an imbalance [31].

The cumulative incidence of COPD was determined using the Kaplan–Meier curve. Cox proportional-hazards regression analyses were used to estimate the hazard ratios (HRs) and 95% confidence intervals (CIs) for the comparison of incident COPD between the NTM-PD and matched control cohorts, adjusted for age, sex, BMI, smoking status, and comorbidities (tuberculosis, hypertension, diabetes mellitus, ischemic heart disease, and cerebrovascular disease).

In study 2, the Kaplan–Meier curve and multivariable Cox proportional-hazards regression analyses were used to estimate the comparative risks of incident COPD in the NTM-PD and matched control cohorts. Detailed descriptions of statistical analyses are provided in Supplementary Method. Statistical significance was defined as a two-sided p-value < 0.05, and all analyses were conducted using SAS version 9.4 (SAS Institute Inc., Cary, NC, USA) and R software version 4.2.1 (Vienna, Austria).

Results

Baseline characteristics

Table 1 presents the baseline characteristics of the patients from study 1. Patients with NTM-PD were more likely to be older, women, or had a lower BMI than those without NTM-PD. Most (85.0%) of the patients had never smoked, with no significant difference in the smoking status between the two groups. All comorbidities, except cerebrovascular disease, were more common in the NTM-PD group than in the control group (all p < 0.001). Despite matching, variables including age and sex continued to exhibit differences between the two groups (SMD > 0.1). However, the SMDs markedly decreased (Supplementary Fig. 2), and the SMD for the smoking status was < 0.1.

Table 1 Baseline characteristics of propensity score-matched cohorts

Lung function decline

At baseline, FEV1, and FEV1/FVC ratio were significantly lower in patients with NTM-PD than in those without NTM-PD (NTM-PD vs. matched control; FEV1, 2.6 vs. 2.7 L, p = 0.017; FEV1/FVC, 0.79 vs. 0.81, p < 0.001), although all patients had normal range of spirometry results. During a mean spirometry follow-up of 3.3 years, the annual decline in FEV1 was greater in patients with NTM-PD than in those without NTM-PD (NTM-PD vs. matched control, − 77.6 vs. −47.3 mL, p < 0.001). Patients with NTM-PD exhibited a higher proportion of rapid FEV1 decline than that of matched controls (45.3% vs. 37.9%, p < 0.001; Table 2).

Table 2 Result of pulmonary function test in the study population

Incidence and risk of COPD

In study 1, during a mean follow-up of 3.3 years, 14.0% of patients in the NTM-PD group (241/1,1715) and 4.3% of patients in the matched control group (293/6,860) developed COPD. In the multivariable-adjusted model, the adjusted HR (aHR) was 2.57 (95% CI: 2.15–3.09) for the comparison of incident COPD between the NTM-PD and matched control groups. In the subgroup analysis, the risk of incident COPD was more evident in women (aHR = 5.44; 95% CI: 4.11–7.23) than in men, and in the age group < 60 years (aHR = 6.94; 95% CI: 4.84−9.94) compared to the other group (Table 3). Kaplan–Meier curves also demonstrated a significant difference in the cumulative incidence rate of COPD between the two groups (log-rank p < 0.001, Fig. 2 and Supplementary Fig. 3A).

Table 3 Association between NTM-PD and COPD development
Fig. 2
figure 2

Cumulative incidence of COPD according to the presence or absence of NTM-PD in study 1 (NTM-PD cohort versus matched control cohort). Abbreviations: NTM-PD, nontuberculous mycobacterial pulmonary disease; COPD, chronic obstructive pulmonary disease

Validation with NHIS-NSC data (study 2)

Supplementary Table 1 presents the baseline characteristics of the patients with NTM-PD and 1:3 age- and sex-matched controls of study 2. Although the smoking status did not differ significantly, patients with NTM-PD exhibited a lower BMI and higher Charlson comorbidity index than those of the matched controls. Furthermore, 6.2% (24/386) and 2.5% (28/1,133) of the patients with and without NTM-PD, respectively, developed COPD with respective incidence rates of 14.8 and 5.3 per 100,000 person-years (Table 4). The NTM-PD cohort exhibited a higher risk of COPD (aHR = 2.04; 95% CI: 1.21–3.42) than that of the matched control cohort. Similarly, the cumulative incidence rate of COPD differed significantly between the two groups (log-rank p < 0.001, Supplementary Fig. 3B). In the subgroup analysis, the risk of incident COPD was more evident in women (aHR = 3.55; 95% CI: 1.65−7.62) than in men.

Table 4 Overall and sex- and age-specific incidence rates and HR for COPD in the NTM-PD and matched control cohorts using a nationwide cohort dataset

Discussion

The present study provided longitudinal evidence for the association between NTM-PD and the development of COPD, utilizing a single-center database with a large number of patients with NTM-PD. Patients with NTM-PD exhibited a greater decline in lung function and approximately 2.5-fold higher risk of COPD incidence than those of the matched controls without NTM-PD. These findings were validated using a nationally representative cohort with similar hazard magnitudes. Notably, most (74–85%) patients in both cohorts were never-smokers, underscoring the potential role of NTM-PD as a major etiological factor in COPD-I.

Previous studies have reported lung function decline in patients with NTM-PD based on clinical outcomes or disease severity [11,12,13,14]. In a previous study where patients with NTM-PD (n = 358) were categorized into treatment-failure (n = 68), observation (n = 118), and treatment-success (n = 172) groups, FEV1 decline was faster in the treatment-failure group compared to other groups (52.2, 30.8, and 28.2 mL/year, respectively) [11]. Even after adjusting for multiple confounders, NTM-PD treatment failure was significantly associated with a greater decline in lung function. Additionally, in the treatment failure group, approximately 16% (5/31) of patients with normal spirometry results at baseline transitioned to obstructive spirometry. A recent study in South Korea categorized patients with NTM-PD (n = 354) into mild, moderate, and severe groups based on the BMI, age, cavity, erythrocyte sedimentation rate, and sex (BACES) severity score system [32] and reported that the annual declines in FEV1, FVC, and diffusing capacity for carbon monoxide increased with increasing disease severity of NTM-PD [13]. FEV1 decreased by 26.4, 31.3, and 35.7 mL/year in mild, moderate, and severe groups, respectively [13]. By demonstrating that treatment failure and severe NTM-PD were associated with a faster FEV1 decline, these studies supported the notion that NTM-PD is a risk factor for COPD development. However, these studies focused on lung function decline among patients with NTM-PD and did not compare them with healthy controls. A comparative study from Japan included 22 never-smokers with NTM-PD and 9 healthy never-smokers. Although the sample size was small, the FEV1 decline was significantly greater in the NTM-PD group than in the control group (− 70 vs. 20 mL/year) [12]. Our present study, which included a large number of patients with NTM-PD and matched controls, confirmed a greater lung function decline in patients with NTM-PD than in those without NTM-PD. Notably, we excluded patients with abnormal lung function or the diagnosis of COPD at baseline.

Regarding the association between NTM-PD and COPD development, a previous study using Taiwanese National Health Insurance data reported that the incidence of COPD was approximately 3.0-fold higher in the NTM cohort than in the non-NTM cohort [4]. However, this study did not include information on the smoking status and BMI. Since smoking exposure is a major risk factor for COPD, the lack of smoking status data was a notable limitation of this study. The present study overcomes these limitations and is the first to report a higher incidence of COPD in patients with NTM-PD than in smoking status-matched controls. An important strength of the present study was its longitudinal cohort design, which utilized a relatively large-scale and comprehensive dataset from a tertiary referral center and validation with a nationwide dataset. In study 1, the NTM-PD diagnosis was objectively defined using ICD-10 codes and microbiological findings. Using spirometry data, we objectively measured COPD development and excluded those with abnormal spirometry results at baseline. In study 2, we demonstrated the association between NTM-PD and COPD treatment, extending beyond spirometry-defined COPD in real-world clinical settings. These findings suggest that, akin to pulmonary tuberculosis, NTM-PD serves as a risk factor for COPD.

Considering the substantial incidence of radiographic airway involvement, including bronchiolitis and parenchymal destruction involving the alveoli [33], an association between NTM-PD and airflow obstruction is predictable. In patients with NTM-PD whose spirometry test is apparently “normal,” a mild obstructive pattern and gas trapping may be present [14]. Although the pathogenesis of NTM-PD has not been fully investigated, recurrent and persistent NTM infections are related to impaired mucociliary clearance and Th1 immune deficiency in the host [34]. Moreover, NTM species can damage the airway cilia, creating a vicious cycle that further impairs mucociliary clearance [35]. NTM infection induces airway inflammation and excessive neutrophilic response, which can compromise the airways, leading to pathological remodeling similar to that in tuberculosis (e.g., bronchiolitis, bronchiectasis, or cavity formation), [36] which is now recognized as a major etiotype of COPD-I. Therefore, persistent NTM infection and NTM-related inflammation may be responsible for COPD development.

This study has several clinical implications. First, it highlights the need for healthcare workers, patients, and local communities to recognize that NTM-PD can be associated with the development of COPD. Considering the increasing global prevalence of NTM-PD, COPD related to this disease might have been on the rise; however, without awareness of this association, its burden is likely to remain under-recognized. We suggest two strategies to ameliorate COPD progression in patients with NTM-PD. The first involves regular measurement of lung function in this population. Although the current guidelines for the NTM-PD diagnosis and treatment focus on identification and eradication of the causative microorganism, they do not mention lung function measurement or COPD development in patients with NTM-PD [7]. Close monitoring of symptoms, radiologic progression, and lung function could aid in the early detection of COPD. The second strategy entails the appropriate treatment of NTM-PD, as inappropriate treatment of NTM-PD is relatively common in real-world settings [37]. Because treatment failure could more rapidly decline lung function, which might lead to the COPD development, emphasizing appropriate NTM-PD treatment is crucial. Second, this study suggests the requirement for more basic research to elucidate the underlying mechanisms of COPD development in patients with NTM-PD. Unveiling specific molecular signaling pathways may help identify early biomarkers predicting COPD development and potential treatment-targeting molecules for this specific phenotype of COPD in patients with NTM-PD.

Despite its strengths, this study had several limitations. First, NTM-PD was diagnosed based on the presence of ICD-10 codes and microbiological findings. Therefore, patients with mild or asymptomatic NTM-PD or undiagnosed NTM-PD, such as cases where NTM species were isolated from only one sputum specimen, might not have been included in this study. Second, the incidence of COPD was higher in study 1 than in study 2. The potential reasons for overestimation in study 1 are as follows: (1) Study 1 was conducted at a tertiary referral hospital with a large number of patients with NTM-PD. The patients with NTM-PD in the present study were likely to experience more severe disease (and were more prone to developing COPD) compared to those with NTM-PD in general. (2) We defined newly diagnosed COPD as the first detected airflow obstruction using pre-bronchodilator spirometry results. Thus, a transient airflow obstruction could have been captured as COPD, and pre-bronchodilator airflow obstruction could have overestimated COPD. (3) Study 2 defined COPD as the ICD-10 codes for COPD and use of bronchodilators [20, 21, 38]. Since patients with ICD-10 codes for COPD who were not under treatment were not considered as having COPD, this could also lead to the discrepancy in the prevalence of COPD between the studies [39]. Fourth, as this study utilized data extracted from the CDW and NHIS-NSC, we lacked information on the presence and extent of coexisting bronchiectasis, pulmonary fibrosis, clinical phenotype of NTM-PD (nodular bronchiectasis or fibrocavitary), and severity of NTM-PD. In evaluating the association between NTM-PD and COPD risk, we were unable to consider some important variables, such as bronchiectasis and pulmonary fibrosis [40, 41] due to the following reasons: Our study relied on formal interpretations of chest CT findings by radiologists, who sometimes omit detailed descriptions of bronchiectasis and/or fibrosis when interpreting CT findings of patients with NTM-PD. Consequently, the presence of bronchiectasis and/or fibrosis may be underreported. Additionally, clinicians often record the ICD-10 code for NTM-PD while omitting codes for bronchiectasis or fibrosis. Given these constraints, bronchiectasis and pulmonary fibrosis were not included in our study. Further studies with more detailed information are required to explore the association between NTM-PD and COPD. Fifth, our study spans a considerable period during which both the prevalence and awareness of NTM-PD have increased substantially. This could have influenced the timing of diagnosis and initiation of antibiotic treatment for NTM-PD, potentially affecting the observed association between NTM-PD and the development of COPD. Future studies are warranted to address these remaining questions. Finally, as this study was conducted in South Korea, the generalizability of the results to other ethnic groups or countries may be limited.

Conclusion

In conclusion, patients with NTM-PD experienced a greater decline in lung function and a significantly higher incidence rate of COPD compared to those without NTM-PD. These findings emphasize the importance of monitoring lung function in patients with NTM-PD to improve their clinical prognosis.