Abstract
Background
Geographical and meteorological factors have been reported to influence the prevalence of echinococcosis, but there’s a lack of indicator system and model.
Objective
To provide further insight into the impact of geographical and meteorological factors on AE prevalence and establish a theoretical basis for prevention and control.
Methods
Principal component and regression analysis were used to screen and establish a three-level indicator system. Relative weights were examined to determine the impact of each indicator, and five mathematical models were compared to identify the best predictive model for AE epidemic levels.
Results
By analyzing the data downloaded from the China Meteorological Data Service Center and Geospatial Data Cloud, we established the KCBIS, including 50 basic indicators which could be directly obtained online, 15 characteristic indicators which were linear combination of the basic indicators and showed a linear relationship with AE epidemic, and 8 key indicators which were characteristic indicators with a clearer relationships and fewer mixed effects. The relative weight analysis revealed that monthly precipitation, monthly cold days, the difference between negative and positive temperature anomalies, basic air temperature conditions, altitude, the difference between positive and negative atmospheric pressure anomalies, monthy extremely hot days, and monthly fresh breeze days were correlated with the natural logarithm of AE prevalence, with sequential decreases in their relative weights. The multinomial logistic regression model was the best predictor at epidemic levels 1, 3, 5, and 6, whereas the CART model was the best predictor at epidemic levels 2, 4, and 5.
Similar content being viewed by others
Introduction
Echinococcosis, also known as hydatid disease, is caused by Echinococcus spp. and has been regarded as one of the most important parasitic zoonotic diseases worldwide [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17]. Among the major species identified Echinococcus granulosus and Echinococcus multilocularis pose a substantial threat to public health, respectively leading to cystic echinococcosis (CE) and alveolar echinococcosis (AE). China bears the highest global burden of echinococcosis worldwide [18], with a higher prevalence of both CE and AE in the north and northwest regions compared to other areas [17]. According to the former research, there were approximately 18,235 (CIS 11,900–28,200) new cases of AE per annum globally with 16,629 (91%) occurring in China [19].
Previous studies have demonstrated that various geographical and meteorological factors can impact the transmission of echinococcosis by affecting host distribution and exposed egg activity. It is relatively easy for humans to control the transmission of Echinococcus in residential areas, but is very difficult to affect the transmission between host animals in the wild. Therefore, the impact of geographical and meteorological factors on the distribution and survival of animal hosts and eggs is greater in the wilderness, in turn leading to a greater impact on transmission. For example, field studies have reported poor survival of E. multilocularis eggs at temperatures >25 °C and following exposure to extreme cold conditions (≤−83 °C), whereas temperatures of –18 °C to 4 °C are well-tolerated [20]. In the same study, the authors reported that E. multilocularis eggs are also sensitive to different levels of moisture and humidity [20]. Other studies have reported a considerable correlation between mean annual air temperature and the prevalence of E. multilocularis in red foxes, as well as a correlation between prevalence and mean annual precipitation [21].
Further, when examined as independent variables, grassland area ratio and land surface temperature have been shown to exhibit positive and negative correlations with the prevalence of human CE, respectively [22]. By combining maps of environmental and biological covariates with information concerning known cases of human CE in the Xizang Autonomous Region of China, researchers have reported significant associations between CE prevalence and annual average precipitation, elevation, and water accessibility (P < 0.05) [23]. However, another previous study conducted in Xizang reported that minimum temperature, maximum temperature, relative humidity, precipitation, terrain, land use, and normalized difference vegetation index (NDVI) had little impact on the prevalence of AE, although the interactions between them enhanced their separate effects on AE prevalence [24]. Despite this finding, few studies overall have examined geographical and meteorological risk factors for AE in China.
To provide further insight into the impact of geographical and meteorological factors on AE prevalence and establish a theoretical basis for appropriate prevention and control measures, we conducted an exploratory analysis of these factors across all six affected provinces in China, based on an epidemiological survey conducted from 2012 to 2016: the Xizang Autonomous Region, Qinghai Province, Sichuan Province, Gansu Province, Xinjiang Uygur Autonomous Region, and Ningxia Hui Autonomous Region [17].
Materials and methods
Collection and processing of population infection data
The county-level prevalence data of AE from 2012 to 2016 were collected from recent scientific papers and reports [17, 25]. The same methodology was used for all the epidemiological surveys carried out in these counties, and the details of the diagnosed cases and human prevalence estimates are described in Wu and Zheng [17, 25].
Sources of geographical/meteorological data and processing
Echinococcosis is a chronic disease that is difficult to detect during its initial stage, therefore the mean value from 1981 to 2010 of the meteorological variables of county-level administrative regions were used for analyzing. Surface meteorological data from 1981 to 2010 (ID:1.2.156.416. CMA. D3. A002.001. OB. WB. CHN. MUL. MON. ZD. 1) were downloaded from the China Meteorological Data Service Center (http://data.cma.cn/), including 50 variables related to pressure, temperature, relative humidity, precipitation, and wind at all meteorological stations in the provinces with echinococcosis prevalence (Supplementary Table 1).
SRTMDEM and MODIS 500M monthly synthetic NDVI data for 2000 to 2015 were downloaded from the Geospatial Data Cloud (https://www.gscloud.cn/).
Statistical analysis and modeling
R-4.1.2 (R Foundation for Statistical Computing, Vienna, Austria) was used for statistical analysis. The base and dplyr packages were used for data cleaning, while the krige function of the gstat package was used for Kriging spatial interpolation. The base and dplyr packages were used to calculate the mean values of NDVI, altitude (DEM), and meteorological data. The principal function of the psych package was used to perform principal component analysis, and the lm function of the stats package was used to perform one-way linear regression analysis. The cv.glmnet and glmnet functions of the glmnet package were used to establish a least absolute shrinkage and selection operator (LASSO) regression model, while the lm function of the stats package was used to establish a multivariable linear regression model. The step function of the stats package was used to establish a stepwise regression model. The multinom function of the nnet package was used to establish a multinomial logistic regression model. The lda function of the MASS package was used to establish a Bayesian discriminant model, and the rpart function of the rpart package was used to establish a Classification and Regression Tree Model (CART). Relative weight analysis was also performed using the rwa package. Map created using the Free and Open Source QGIS-3.22.14 (Open Source Geospatial Foundation). The significance level is 0.05.
Epidemic levels for Echinococcus multilocularis
Using the k-means function in the statistics rwa package, AE epidemic level was classified based on the natural logarithm of the prevalence rate. Classification performance was evaluated using the goodness of classification. The minimum number of clusters with a goodness of classification >95% was selected as the optimal number of prevalence levels, as follows:
where k is the number of cluster centers, \({n}_{k}\) is the number of observations in cluster k, and n is the total number of observations.
Indicator library
NDVI, DEM, and all the meteorological variables were used as basic indicator library. To establish a composite indicator library, the principal components of atmospheric pressure, temperature, precipitation, and wind were extracted from 49 climatic variables, and the significance of each principal component was analyzed. The single variable indicators were NDVI, DEM, and relative humidity, which were labeled as composite indicators. Effective composite indicators that had a linear relationship with the natural logarithm of the AE prevalence rate were selected via single-factor linear regression of all composite indicators.
Key indicators were selected based on the λ with the smallest mean error via 10-fold cross-validation, and a multivariate LASSO regression model was established using the λ. Composite indicators with non-zero coefficients were selected as key indicators.
Establishment of a three-level indicator system
The Key-Characteristic-Basic Indicators System(KCBIS) was Established based on the results of principal components analysis and mathematical models. The key indicators were defined as effective composite indicators with a non-zero coefficients in the LASSO regression model. Define effective composite indicators selected in 2.5 part using single-factor linear regression models as characteristic indicators. Basic indicators were defined as those who linearly combined to create an effective composite indicator. On the other hand, characteristic indicators are linear combinations of basic indicators, and the combination method is based on the results of principal component analysis which described in 2.5 part.
Analysis of relative weights for key indicators
The relative weights of the key indicators were analyzed using the rwa package and RWA function in R-4.1.2.
Establishment and comparison of multiple models
Using the natural logarithm of AE prevalence rate as the dependent variable and the key indicators as independent variables, we established a multivariable linear regression model and stepwise regression model. Using the prevalence level as the dependent variable and the key indicators as independent variables, we also established a naive Bayesian classification model using the proportion of each prevalence category as the a priori weight. Using the prevalence level as the dependent variable and the key indicators as independent variables, we further established multinomial logistic regression and CART models.
All samples were used to predict the prevalence rate or level. Model performance was evaluated by calculating and comparing accuracy, precision, sensitivity, specificity, and the F1-Score.
Results
Composite indicators of atmospheric pressure
The principal component analysis of the basic indicators of atmospheric pressure revealed that two composite indicators could explain more than 90% of the variance in the original five basic indicators (Supplementary Table 2 and Fig. 1a). RC1 represented the basic atmospheric pressure condition, recorded as atoms_Val, while RC2 represented the difference between the positive and negative atmospheric pressure anomalies, recorded as atmos_an.
Composite indicators of temperature
The basic temperature indicators were divided into two categories. The first category included indicators of temperature measurements, with three composite indicators extracted via the principal component analysis(Supplementary Table 2 and Fig. 1b). In this analysis, RC1 represented the basic temperature condition, recorded as temp_Val, RC2 represented the difference between the negative and positive temperature anomalies, recorded as temp_anom, and RC3 represented the daily temperature range, recorded as temp_DRg. The second category included temperature duration indicators, with three composite indicators extracted via the principal component analysis(Supplementary Table 2 and Fig. 1c). In this analysis, RC1 represented the duration index of cold days monthly, recorded as CldD (i.e., the comprehensive index of days with temperatures not exceeding 2 °C, 0 °C, –2 °C, and –15 °C). Similarly, RC2 represented the duration index of extremely hot days monthly, recorded as ExtrHtD (i.e., the comprehensive index of days with temperatures not less than 30 °C, 35 °C, 37 °C, and 40 °C). Lastly, RC3 represented the duration index of extremely cold days monthly, recorded as ExtrCldD and reflecting the comprehensive index of days with temperatures not exceeding –30 °C and –40 °C.
Composite indicators of precipitation
The basic indicators of precipitation were divided into two categories. The first category included precipitation measurements, with two composite indicators extracted using principal component analysis (Supplementary Table 2 and Fig. 1d). In this analysis, RC1 represented the monthly precipitation index, recorded as prec_Val_M, while RC2 represented the daily precipitation index, recorded as prec_Val_D. The second category included indicators of precipitation duration, with three composite indicators extracted using principal component analysis(Supplementary Table 2 and Fig. 1e). In this analysis, RC1 represented the duration index of monthly rainy days, recorded as Rainy_Ds and reflecting the comprehensive index of rainy days. RC2 represented the monthly rainstorm days index, recorded as rainform_Ds and reflecting the comprehensive index of days with daily precipitation levels not less than 50 mm, 100 mm, and 150 mm. RC3 represented heavy rain days index monthly, recorded as Hv_Rny_Ds and reflecting the comprehensive index of days with daily precipitation of not less than 25 mm.
Composite wind indicators
In the principal component analysis, three composite indicators could be extracted from the basic indicators of wind factors (Supplementary Table 2 and Fig. 1f). In this analysis, RC1 represented the fresh breeze days index monthly, recorded as Fsh_brz_Ds and reflecting the comprehensive index of days with wind speeds of not less than 15 m/s and 17 m/s. RC2 represented the monthly gentle breeze days index, recorded as Gt_brz_Ds and reflecting the comprehensive index of days with wind speeds not less than 5 m/s, 10 m/s, and 12 m/s. RC3 represented the basis of wind speed, recorded as Wnd_spd_val and reflecting the comprehensive index of average and maximum wind speeds.
Effective composite indicators and characteristic indicators
All the composite indicators mentioned above are of particular realistic significance (Table 1). The single-factor linear regression analysis revealed a significant linear relationship between the natural logarithm of AE prevalence and the following, which were thus identified as effective composite indicators: atoms_Val, atmos_an, temp_Val, temp_anom, CldD, ExtrHtD, ExtrCldD, prec_Val_M, prec_Val_D, Rainy_Ds, rainstorm_Ds, Gt_brz_Ds, Fsh_brz_Ds, Wnd_spd_val, DEM respectively (Table 2). Among these, atoms_Val, atmos_an, temp_Val, and ExtrHtD were negatively correlated with the natural logarithm of AE prevalence. In contrast, temp_anom, ExtrCldD, prec_Val_M, prec_Val_D, Rainy_Ds, rainstorm_Ds, Gt_brz_Ds, Fsh_brz_Ds, Wnd_spd_Val, and DEM were positively correlated with the natural logarithm of AE prevalence (Table 2, Supplementary Fig. 1, Supplementary Fig. 2). These 15 effective composite indicators were used as characteristic indicators.
Key indicators
In the 10-fold cross-validation, the λ corresponding to the minimum of the mean error of the LASSO regression was 0.01484477 (Supplementary Fig. 3). In the LASSO regression model based on λ, the variables with non-zero coefficients were atmos_an, temp_Val, temp_anom, CldD, ExtrHtD, prec_Val_M, Fsh_brz_Ds, and DEM, all of which were used as key indicators (Table 3).
Establishment of a three-level indicator system
By calculating and screening the key indicators, characteristic indicators, and related basic indicators, we established a three-level indicator system. This Key-Characteristic-Basic Indicator System (KCBIS) included 50 basic indicators, 15 characteristic indicators, and 8 key indicators (Supplementary Fig. 4). Obviously, the 50 basic indicators could be directly observed at meteorological stations, the 15 characteristic indicators could be produced by linear combination of the basic indicators and showed a linear relationship with AE epidemic, and the 8 key indicators were characteristic indicators with clearer relationships and fewer mixed effects.
Analysis of relative weights for key indicators
In the analysis of relative weights for key indicators, we observed a positive correlation between the natural logarithm of AE prevalence and the following, with sequential decreases in the relative weight of each: prec_Val_M, CldD, temp_anom, DEM, Fsh_brz_Ds. Similarly, we observed negative correlations between the natural logarithm of AE prevalence and temp_Val, atoms_an, and ExtrHtD, again with sequential decreases in the relative weight of each (Fig. 2).
Classification of epidemic levels for Echinococcus multilocularis
When divided into 6 categories based on the natural logarithm of AE prevalence, prevalence (levels 1–6), the goodness of classification for R2 reached over 95% (Supplementary Fig. 5). The AE prevalence rates represented by levels 1~6 are ~0.01%, ~0.05%, ~0.15%, ~0.38%, ~1.73%, and 1.73%~, respectively (Supplementary Table 3).
Model establishment and comparison
The multivariable linear regression model and the stepwise regression model, the multinomial logistic regression model, the naive Bayesian classification model, and the CART model was established (the parameters showed in Supplementary Tables 4, 5, 6, and Supplementary Fig. 6 respectively). Among the models compared, the CART model had the highest accuracy, sensitivity, and specificity values, and the multinomial logistic regression model had the highest precision value and F1-Score. According to accuracy and F1-Score, the best predictor for levels 1, 3, 5, and 6 was the multinomial logistic regression model, while the best predictor for levels 2, 4, and 5 was the CART model (Table 4).
Discussion
In this study, we established a geographical–meteorological indicator system named Key-Characteristic-Basic Indicators System(KCBIS) to identify which factors exert a significant impact on the prevalence of AE in China, based on the results of regression and relative weight analyses. The KCBIS included 50 basic indicators, 15 characteristic indicators, and 8 key indicators.
The basic indicators included altitude (DEM) and 49 meteorological indicators, all of which can be directly measured at meteorological stations in China, making them easy to obtain. Different basic indicators can reflect a certain meteorological or geographical phenomenon to a certain extent, respectively. It should be emphasized here that echinococcosis is a chronic disease that is not easily detected in the early stages. Only when the lesion reaches a certain size can it be determined through imaging methods(B-ultrasound or computed X-ray tomography). Therefore, the basic indicators used in this study are not simple observation values of that year, but the average value of 30 years before the year of AE epidemic data. This study used geographic and meteorological data from 1981 to 2010 to analyze the AE epidemic from 2012 to 2016. Inevitably, there will be a very small number of cases infected before 1981 or after 2010, which will have a subtle impact on the results of this study. To solve this problem, it is necessary to conduct long-term monitoring of population infections and geographical or meteorological changes in epidemic areas, which is our further research direction.
The relationship between different geographical and meteorological data can seriously affect data analysis, such as the collinearity between factors (temperature decreases linearly with altitude to a certain extent) and within factors (certain linear relationship between average daily precipitation and maximum or minimum daily precipitation). If these single indicators are directly screened and then one or some indicators of a certain factor are roughly deleted, a large amount of information in that factor is often lost. Based on observations of different factors, most of them have several indicators, which could be classified to 2 or 3 dimensions such as basic conditions, variation, and duration. Then, principal component analysis is used to extract principal components from different dimensions, replacing the original indicators. On the one hand, this can reduce collinearity while retaining as much information as possible about the factor, and on the other hand, it can more clearly reflect the characteristics of the indicator from different dimensions. This is the basic idea of how we use basic indicators to generate a comprehensive indicator, from which we screened the characteristic indicators. So, from a mathematical perspective, the characteristic indicators were linear combinations of the basic indicators. The characteristic indicator could be regarded as a representation of a certain meteorological or geographical phenomenon, eliminating the collinearity that exists when observing the meteorological phenomenon at different scales. There is a significant correlation between the characteristic indicators and the epidemic of AE, showed the characteristic indicators had reference sense in indicating the strength of local AE prevalence. When evaluating the impact of individual geographical and meteorological phenomenon on the epidemic of AE, full consideration should be given to each characteristic indicators.
The key indicators were those characteristic indicators with significant influence, which were used in the main analyses and to develop the predictive models. The key indicators further reduced the collinearity between observations of different geographical and meteorological phenomena on the basis of characteristic indicators, and eliminated the mixed effect of different geographical and meteorological phenomena on the epidemic of AE to some extent, so that the key indicators has stronger indication and pertinence during the modeling process.
The relative weight analysis also revealed sequential decreases for the correlation between the natural logarithm of AE prevalence and prec_Val_M, CldD, temp_anom, temp_Val, DEM, atmos_an, ExtrHtD, and Fsh_brz_Ds.
Previous studies have found that different levels of water or humidity can affect [20] or interact with other environmental factors [23, 24] to affect the activity of Echinococcus eggs or disease transmission, reporting significant correlations between disease incidence and annual average precipitation [5]. Our study further identified a significant positive correlation between characteristic indicators of monthly precipitation (prec_Val_M) and the natural logarithm of AE prevalence.
Research has demonstrated that Echinococcus eggs can tolerate temperatures ranging from –18 °C to 4 °C [20], and a significant correlation between annual average temperature and rates of Echinococcus multilocularis infection has been observed in red foxes [5]. Another study have revealed that minimum temperature and maximum temperature have little effect on the incidence of AE [24]. However, in the current study, the characteristic indicator representing the basic temperature condition (temp_Val) and the characteristic indicator representing the number of extremely hot days each month (ExtrHtD) were significantly negatively correlated with the natural logarithm of AE prevalence. While the characteristic indicators representing the index of cold days (CldD) presented significant positive correlations with the natural logarithm of the AE prevalence rate, consistent with previous study on the temperature tolerance of eggs [20]. The difference between the negative and positive temperature anomalies (temp_anom) exhibited significant positive correlations with the natural logarithm of the AE prevalence rate, further supporting the notion that the minimum and highest temperatures can enhance the impact of other factors on AE prevalence rates [24]. Further, relative weights were higher for the characteristic indicators with positive effects than for those with negative effects.
Although high rates of AE prevalence in China have mainly been reported in the Qinghai-Tibet Plateau, the relationship between DEM and the incidence of AE is not a simple linear one. In the current study, the DEM exhibited a significant positive correlation with the natural logarithm of AE prevalence. It is well known that increases in altitude are associated with decreases in atmospheric pressure. However, in our study, the change of atmospheric pressure was a more significant indicator of AE prevalence compared with the atmospheric pressure itself. Specifically, the difference between positive and negative atmospheric pressure anomalies(atmos_an) was negatively correlated with the natural logarithm AE prevalence.
Our analysis identified a weak positive correlation between the characteristic indicator of days of fresh breeze(wind speed ≥15 m/s) and AE prevalence levels. There are currently very few studies reporting the relationship between wind and prevalence. However, the wind can affect egg dispersal and vapor evaporation, which in turn affects the environment risk and ultimately leads to changes in prevalence. Further studies are required to determine whether this indicator influences AE prevalence via humidity, egg dispersal, or other factors.
In this research, 5 models were developed using the key indicators and AE epidemic levels, and be compared using accuracy and the F1-Score. Compared with other research [26], what goes further is we assume that the impact of each variable varies at different epidemic levels and attempt to find well-performing models for each level. The results indicate that no model outperforms other models at all levels (Level 1~Level 6). Overall, the accuracy and F1-Score were highest for the multinomial logistic regression model and CART model. The multinomial logistic regression model was the best predictor of epidemic levels 1, 3, 5, and 6, whereas the CART model was the best predictor of epidemic levels 2, 4, and 5. This indicates that the influence of various factors is not constant at different epidemic levels and that a combination of multiple segmented models must be considered in future studies. It is also possible that some factors only begin to exert a certain effect when other factors are considered or when their weights reach a certain threshold which needs to be further researched. Although the accuracies and F1-Scores of the multinomial logistic regression model and CART model were same at epidemic level 5, the precision of the multinomial logistic regression model is higher while the sensitivity of CART model is higher. Complex indicators often lead to overfitting of the model. To avoid this phenomenon, we have taken a series of measures. Firstly, when selecting key indicators (independent variables), LASSO regression was used, which introduced regularization penalty terms, and a 10-fold cross-validation method was used to estimate regularization parameters. Secondly, using the large sample size, the prevalence data in this study was obtained from the largest epidemiological survey of echinococcosis in the world to date. Thirdly, choose models that have already been applied in published studies or other simple models that have not yet been applied in echinococcosis prediction. Echinococcosis is a chronic disease with insignificant early symptoms. Although the exact infection time of cases is unknown, it is generally within 30 years before the B-ultrasound examination. So the proportion of cases infected before 1981 or after 2010 must be very low. In addition, the AE prevalence data used in this study was not obtained from the overall population, but was credible due to the scientific sampling method. Therefore, although there may be some bias, it does not affect the validity of the results of this study.
At present, China’s control measures for livestock and dogs are very effective, and the next focus is on the prevention and control of wild animal hosts. However, the field investigation of the transmission cycle in the wild is very difficult. This study can, to a certain extent, conduct risk assessment on vast outdoor areas with unknown epidemic levels through geographic and meteorological models. Combined with sampling verification, high-risk areas for Echinococcus multilocularis in the wild can be identified, and precise prevention and control measures such as deworming wild canines can be carried out. High-risk areas could be marked to issue warnings to nomadic populations. Focus on people and dogs returning from nomadic grazing with high-risk, carry out targeted screening and dogs deworming. The future research should consider the combination of multiple segmented models, while focusing on evaluating the impact of geographic and meteorological factors in some special habitats, in order to further improve and optimize the model. The effectiveness of geo-meteorological models is reflected in the fact that geo-meteorological factors can significantly affect the infection of human echinococcosis by affecting the ecological distribution of animal hosts and the activity of pathogens. It has significant reference value for other countries or regions which are similar to the epidemic areas of AE in China, such as vast animal husbandry areas with grasslands, mountains, and cold weather, and is also applicable to the areas where human infections are mainly caused by transmission cycles composed of dogs, foxes, wolves, and rodents. This study also provides a feasible reference method for the establishment of other geo-meteorological prediction models for infectious diseases transmitted through animal hosts or vectors.
The Intelligent Prediction Large Model (IPLM) triggered by multiple points is one of the important development directions for future disease monitoring and early warning. As one of the important foundations of IPLM, the combination of geographic meteorological models with animal index system models, human behavior system models, and socio-economic system models can effectively improve the effectiveness of IPLM.
Conclusion
In this study, we established a Key-Characteristic-Basic Indicators System(KCBIS) of 50 basic, 15 characteristic, and 8 key geographical and meteorological indicators exerting a significant impact on AE epidemic levels. The 50 basic indicators could be directly observed at meteorological stations, the 15 characteristic indicators could be produced by linear combination of the basic indicators and showed a linear relationship with AE epidemic, and the 8 key indicators were characteristic indicators with a clearer relationships and fewer mixed effects. Comparison of five mathematical models revealed that the influence of various factors was not constant across different epidemic levels. The best predictor at epidemic levels 1, 3, 5, and 6 was the multinomial logistic regression model, whereas the best predictor at epidemic levels 2, 4, and 5 was the CART model. Thus, future studies should consider a combination of multiple segmented models.
Data availability
The data that support the findings of this study are available from the author of the study (Mr. Chuizhao Xue) but restrictions apply to the availability of these data, which are not publicly available. Data are however available from the authors upon reasonable request and with permission of the National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research).
References
Abdulhameed MF, Habib I, Al-Azizz SA, Robertson I. A retrospective study of human cystic echinococcosis in Basrah province, Iraq. Acta Trop. 2018;178:130–3. https://doi.org/10.1016/j.actatropica.2017.11.011.
Al-Qaoud KM, Craig PS, Abdel-Hafez SK. Retrospective surgical incidence and case distribution of cystic echinococcosis in Jordan between 1994 and 2000. Acta Trop. 2003;87:207–14. https://doi.org/10.1016/S0001-706X(03)00022-6.
Altintas N. Past to present: echinococcosis in Turkey. Acta Trop. 2003;85:105–12. https://doi.org/10.1016/S0001-706X(02)00213-9.
Ammann RW, Eckert J. Cestodes: Echinococcus. Gastroenterol Clin North Am. 1996;25:655–89. https://doi.org/10.1016/s0889-8553(05)70268-5.
Antolová D, Miterpakova M, Radoňak J, Hudačkova D, Szilagyiova M, Žáček M. Alveolar echinococcosis in a highly endemic area of northern Slovakia between 2000 and 2013. Eurosurveillance. 2014;19:20882.
Azlaf R, Dakkak A. Epidemiological study of the cystic echinococcosis in Morocco. Vet Parasitol. 2006;137:83–93. https://doi.org/10.1016/j.vetpar.2006.01.003.
Bardonnet K, Benchikh-Elfegoun MC, Bart JM, Harraga S, Hannache N, Haddad S, et al. Cystic echinococcosis in Algeria: cattle act as reservoirs of a sheep strain and may contribute to human contamination. Vet Parasitol. 2003;116:35–44. https://doi.org/10.1016/s0304-4017(03)00255-3.
Battelli G, Mantovani A, Seimenis A. Cystic echinococcosis and the Mediterranean Region: a long-lasting association. Parassitologia. 2002;44:43–58.
Bold B. Burden, cost of disease and optimization of clinical practice of human cystic echinococcosis in Mongolia. Basel: University of Basel; 2018.
Bresson-Hadni S, Vuitton D-A, Bartholomot B, Heyd B, Godart D, Meyer J-P, et al. A twenty-year history of alveolar echinococcosis: analysis of a series of 117 patients from eastern France. Eur J Gastroenterol Hepatol. 2000;12:327–36.
Brundu D, Piseddu T, Stegel G, Masu G, Ledda S, Masala G. Retrospective study of human cystic echinococcosis in Italy based on the analysis of hospital discharge records between 2001 and 2012. Acta Trop. 2014;140:91–6.
Craig PS, McManus DP, Lightowlers MW, Chabalgoity JA, Garcia HH, Gavidia CM, et al. Prevention and control of cystic echinococcosis. Lancet Infect Dis. 2007;7:385–94. https://doi.org/10.1016/S1473-3099(07)70134-2.
Kern P, Bardonnet K, Renner E, Auer H, Pawlowski Z, Ammann RW, et al. European echinococcosis registry: human alveolar echinococcosis, Europe, 1982–2000. Emerg Infect Dis. 2003;9:343–9. https://doi.org/10.3201/eid0903.020341.
Larrieu E, Zanini F. Critical analysis of cystic echinococcosis control programs and praziquantel use in South America, 1974-2010. Rev Panam de Salud Pública. 2012;31:81–7.
Mahmoudi S, Mamishi S, Banar M, Pourakbari B, Keshavarz H. Epidemiology of echinococcosis in Iran: a systematic review and meta-analysis. BMC Infect Dis. 2019;19:929.
Manciulli T, Mariconti M, Vola A, Lissandrin R, Brunetti E. Cystic echinococcosis in the Mediterranean. Curr Trop Med Rep. 2017;4:235–44. https://doi.org/10.1007/s40475-017-0129-z.
Wu W, Wang H, Wang Q, Zhou X, Wang L, Zheng C, et al. A nationwide sampling survey on echinococcosis in China during 2012-2016. Chin J Parasitol Parasit Dis. 2018;36:1.
Zhang M, Wang L, Guan Y, Wu W. Meta-analysis on severity of hydatidosis. Chin J Parasitol Parasit Dis. 2018;36:156–60.
Torgerson PR, Keller K, Magnotta M, Ragland N. The global burden of alveolar echinococcosis. PLoS Negl Trop Dis. 2010;4:e722. https://doi.org/10.1371/journal.pntd.0000722.
Veit P, Bilger B, Schad V, Schafer J, Frank W, Lucius R. Influence of environmental factors on the infectivity of Echinococcus multilocularis eggs. Parasitology. 1995;110:79–86. https://doi.org/10.1017/s0031182000081075.
Miterpáková M, Dubinsky P, Reiterová K, Stanko M. Climate and environmental factors influencing Echinococcus multilocularis occurrence in the Slovak Republic. Ann Agric Environ Med. 2006;13:235–42.
Huang D, Li R, Qiu J, Sun X, Yuan R, Shi Y, et al. Geographical environment factors and risk mapping of human cystic echinococcosis in Western China. Int J Environ Res Public Health. 2018;15:1729. https://doi.org/10.3390/ijerph15081729.
Ma T, Jiang D, Gongsang Q, Xue C, Han S, Wu W, et al. Factors influencing the spatial distribution of cystic echinococcosis in Tibet, China. Sci Total Environ. 2021;754:142229. https://doi.org/10.1016/j.scitotenv.2020.142229.
Ma T, Jiang D, Hao M, Fan P, Zhang S, Gongsang Q, et al. Geographical detector-based influence factors analysis for echinococcosis prevalence in Tibet, China. PLoS Negl Trop Dis. 2021;15:e0009547. https://doi.org/10.1371/journal.pntd.0009547.
Canjun Z, Chuizhao X, Shuai H, Zhongjie L, Hu W, Liying W, et al. National alveolar echinococcosis distribution — China, 2012−2016. China CDC Wkly. 2019;2:1–7. https://doi.org/10.46234/ccdcw2020.001.
Fischer I, Graeter T, Kratzer W, Stark K, Schlingeloff P, Schmidberger J. Distribution of alveolar echinococcosis according to environmental and geographical factors in Germany, 1992-2018. Acta Trop. 2020;212:105654.
Acknowledgements
We gratefully acknowledge Shanghai Municipal Health Commission(20204Y0051), and National Key Research and Development Program of China (No. 2021YFC2300800, 2021YFC2300804). We would like to thank Editage (www.editage.cn) for English language editing.
Funding
This work was supported by Shanghai Municipal Health Commission(20204Y0051), and National Key Research and Development Program of China (No. 2021YFC2300800, 2021YFC2300804). The funding bodies did not have any role in the design of the study and collection, analysis, and interpretation of data or in writing the manuscript.
Author information
Authors and Affiliations
Contributions
CX participated in the data collection, cleaning, analysis, design, and the paper writing. BL, YK, WW, XZ, and NX participated in survey quality control. CZ, SH participated in supervision and design of this study, and quality control of the manuscript. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Consent for publication
The publication of this article has been agreed by authors, participants and others.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Xue, C., Liu, B., Kui, Y. et al. Developing a geographical–meteorological indicator system and evaluating prediction models for alveolar echinococcosis in China. J Expo Sci Environ Epidemiol (2024). https://doi.org/10.1038/s41370-024-00664-z
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41370-024-00664-z
- Springer Nature America, Inc.