Abstract
Non-alcoholic fatty liver disease (NAFLD) is the most common chronic liver disease, which can progress from simple steatosis to advanced cirrhosis and hepatocellular carcinoma. Clinical diagnosis of NAFLD is crucial in the early stages of the disease. The main aim of this study was to apply machine learning (ML) methods to identify significant classifiers of NAFLD using body composition and anthropometric variables. A cross-sectional study was carried out among 513 individuals aged 13 years old or above in Iran. Anthropometric and body composition measurements were performed manually using body composition analyzer InBody 270. Hepatic steatosis and fibrosis were determined using a Fibroscan. ML methods including k-Nearest Neighbor (kNN), Support Vector Machine (SVM), Radial Basis Function (RBF) SVM, Gaussian Process (GP), Random Forest (RF), Neural Network (NN), Adaboost and Naïve Bayes were examined for model performance and to identify anthropometric and body composition predictors of fatty liver disease. RF generated the most accurate model for fatty liver (presence of any stage), steatosis stages and fibrosis stages with 82%, 52% and 57% accuracy, respectively. Abdomen circumference, waist circumference, chest circumference, trunk fat and body mass index were among the most important variables contributing to fatty liver disease. ML-based prediction of NAFLD using anthropometric and body composition data can assist clinicians in decision making. ML-based systems provide opportunities for NAFLD screening and early diagnosis, especially in population-level and remote areas.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Introduction
Non-alcoholic fatty liver disease (NAFLD)‒ the hepatic manifestation of metabolic syndrome‒ is the most common chronic liver disease1,2. Worldwide prevalence of metabolic syndrome and NAFLD has increased in parallel with increased obesity prevalence3,4,5, which is about 20–30% in developed countries and one-third among American adults6,7,8.
Obesity is a common metabolic risk factor associated with NAFLD9,10,11. The prevalence of NAFLD is directly related to increased body mass index (BMI) and central obesity12,13,14. Most studies have shown that visceral fat is an independent factor in generating hepatic steatosis, independent of BMI15,16. The amount of adipose tissue and its distribution differs between men and women17. Women have higher overall fat tissue with relatively more subcutaneous adipose tissue in the hips and thighs. At the same time, men accumulate visceral and subcutaneous fat mainly in the trunk and abdomen with continuous changes before and after puberty17,18,19. The increased fat distribution around the waist (i.e. apple-shaped body) is linked to NAFLD in both genders20. In a pear-shaped body, the subcutaneous fat accumulates mainly in the thighs and buttocks21,22, which is typical among females but can increase metabolic syndrome in males, which is a risk factor for NAFLD independent of central obesity23. In support of the role of fat distribution and anthropometric measures in NAFLD, studies have found several contributing factors, including abdomen circumferences, waist, neck and fat accrual in trunk and arms24,25,26,27,28,29.
Most people with NAFLD, including both children or adults, do not have differential symptoms at the early stages of the disease30. Notably, after the development of cirrhosis, different symptoms such as caput medusa, spider angioma, palmar erythema, ascites, and jaundice appear31. Therefore, early diagnosis is critical to prevent severe complications.
Ultrasonography and laboratory tests are typical diagnostic methods for detecting fatty liver disease. Ultrasound technique has relatively high accuracy in detecting the moderate-to-severe steatosis level and lower accuracy in earlier stages of NAFLD32. Notably, hepatic fibrosis cannot be diagnosed by ultrasonography14,33. Although typically used to detect fatty liver disease, laboratory tests are not useful for all ages and gender groups due to low accuracy34,35. Therefore, a precise, cost-effective, and non-invasive method to analyze symptoms of various stages of the fatty liver for NAFLD diagnosis is desirable. Such an approach is important to help with early diagnosis of NAFLD, which could help prevent hepatic steatosis progression to fibrosis, advanced cirrhosis, and hepatocellular carcinoma.
In recent years, machine learning (ML) models have been used as a novel approach in predicting NAFLD36,37,38,39. However, all of these studies have focused mainly on laboratory outcomes and have not considered body composition and anthropometric factors. Therefore, the primary aim of this study is to identify essential ML classifiers of NAFLDs using body composition and anthropometric indices. The secondary aim is to identify feature contributions to the prediction of NAFLDs.
Materials and methods
Study design and participants
This cross-sectional study was conducted to explore NAFLD phenotypes based on body composition and anthropometric indices. Participants were recruited from the eastern (Khorasan Razavi) and southern (Hormozgan) provinces of Iran, through advertisement on the notice boards of the university clinics, as well as via phone or email contact to potential participants. A total of 593 individuals aged above 13 years old were initially recruited. Eighty individuals were excluded from the study and 513 participants were remained. Exclusion criteria were as follows: the presence of underlying liver disease, taking medications (anti-hypertensive and anti-arrhythmic, anti-glycaemic, corticosteroids, nervous system agents, chemotherapy, Methotrexate and Tamoxifen), alcoholic patients with more than twice-a-week consumption, previous history of any type of cancer during the last year, history of surgery during the last 6 months, pregnant women. The study was conducted following the Declaration of Helsinki, and ethical approval was granted by Mashhad University of Medical Sciences following (Code: IR.MUMS.fm.REC.1395.64). All participants provided written informed consent. Also, written informed consent was obtained from guardians of participants aged under 18 years.
Data collection
At each medical clinic, eligibility, demographics questionnaire, anthropometric, and body composition measurements were assessed by two trained nutritionists. Medical examination and disease diagnosis were performed by a general physician and an internal specialist, respectively. Demographic information, including sex, age, education, disease history and medications, were assessed by researcher using a questionnaire. Weight was measured using a digital weighing scale (Seca 704; Hamburg, Germany), height was measured using a wall height chart, and the body composition measures were assessed using InBody 270 (Inbody Co. Ltd, South Korea) body analyzer to measure per cent (%) body fat, total fat mass, muscle mass, as well as fat mass in the right/left leg, right/left arm and trunk with light clothing and without shoes. The circumferences of neck, chest, arm, wrist, waist, hips, abdomen, thighs, and length of ulna and leg were measured using a flexible tape measure with an accuracy of 0.1 cm. BMI was calculated by dividing weight (kilograms) by the height in meters squared26. Subcutaneous fat in the area below the scapula, arms biceps and triceps and the upper iliac crest was measured using a Saehan calliper (Saehan SH5020, Korea). Participants were also examined for acanthosis in the back of the neck and armpits and the presence of subcutaneous fat under the chin and at the back of the neck. A Fibroscan equipped with the M and XL probes (Echosens 504, Paris, France) was used to assess both controlled attenuation parameter (CAP) (dB/m) and liver stiffness measurement (LSM) (kPa) values simultaneously. A reliable LSM was defined as the median liver stiffness of the 10 measurements (a success rate of greater than 60%, and an IQR < 30% of the median LSM value)40. CAP values range from 100 to 400 dB/m and the following cut-off values were used for the diagnosis of steatosis stages: Stage 0, < 238 dB/m, Stage 1, ≥ 238 to 260 dB/m, Stage 2, ≥ 260 to 292 dB/m, and Stage 3, ≥ 292 dB/m41. LSM values range from 1.5 to 75 kPa, and the following cut-off values were used for the diagnosis of liver fibrosis stages: no significant fibrosis or F0 < 6.2 kPa, mild fibrosis or F1 ≥ 6.2 to 7.6 kPa, moderate fibrosis or F2 ≥ 7.6 to 8.8 kPa, severe fibrosis or F3 ≥ 8.8 to 11.8 kPa and cirrhosis or F4 ≥ 11.8 kPa42.
Statistical analysis
Descriptive and non-predictive data analysis was performed using SPSS version 21 software (SPSS, Inc., Chicago, IL). Data were expressed as mean ± standard deviation or frequencies. Between-group comparisons were performed using an independent sample t test and analysis of variance (ANOVA), followed by Tukey’s post hoc test. A P-value of less than 0.05 was considered statistically significant.
Machine learning models
Three label variables were considered: fatty liver (stage I, II and III vs. no steatosis), steatosis, and fibrosis stages. Eight ML techniques were applied to the dataset to identify the best modelling approach. To this end, k-Nearest Neighbor (kNN), Support Vector Machine (SVM), Radial Basis Function (RBF) SVM, Gaussian Process (GP), Random Forest (RF), Neural Network (NN), AdaBoost and Naïve Bayes were tested. An extant explanation of these classifiers can be found elsewhere43. Testing of these models was performed using the Scikit-learn library in Python programming language44.
To comprehensively compare different classifiers, we trained and evaluated dataset 50 times. This is because different classifiers sometimes predict slightly different outputs and initial points are different for a specific classifier in each run. Thus, a reliable output can be estimated by averaging each classifier several times. Model accuracy and area under the curve (AUC) are reported for each ML technique. Importance values are reported for individual feature variables.
Pre-processing involved data normalization and segmentation. The few missing values in the numerical results of the experiments were replaced using the Linear Interpolation method45. Principal component analysis (PCA) was used to extract the attribute of the data46,47. Data were divided into two parts, train and test. Processing involved feature selection and classification with the best feature. The model processing involved a variety of models. The model with the highest performance was selected.
Patient consent
All patients provided written consent for participation in this study. For participants aged under 18 years, written informed consent was obtained from their guardians.
Ethics approval
Ethical approval was received from the research ethics committee at Mashhad University of Medical Sciences (Code: IR.MUMS.fm.REC.1395.64).
Results
In total 513 participants (240 males and 273 females) took part in the study, of whom 169 (74.1%) male and 220 (80.6%) female cases had a degree of hepatic steatosis. The mean age, weight, and BMI were 37.04 ± 15.44 years, 77.26 ± 17.31 kg, and 28.15 ± 4.89 kg m2, respectively. Overall demographic characteristics and biochemical measures are presented in Table 1. Significant differences were found in most anthropometric variables between male and female participants (see Tables 2 and 3).
Machine learning results
Figures 1, 2, 3 present box plots for each classification method applied to three outcomes. Random Forest (RF) method generated the most accurate ML model for fatty liver (presence of any stage), steatosis stage and fibrosis stage. Average accuracy and AUC values resulted from RF were 0.82 and 0.84 for fatty liver, 0.52 and 0.69 for steatosis stages, 0.57 and 0.58 for fibrosis stages, respectively. Average accuracy and AUC are presented in the Supplemental file (Model Iterations) for all conditions. Moreover, sensitivity, specificity, true positive and true negative measures were presented for fatty liver disease.
Feature variables with the highest predictability for fatty liver were abdomen circumference (IV; average importance value = 0.061), waist circumference (IV = 0.061), chest circumference (IV = 0.054), trunk fat (IV = 0.056) and BMI (IV = 0.053); for steatosis, the stage was abdominal circumference (IV = 0.053), waist circumference (IV = 0.052), chest circumference (IV = 0.052), trunk fat (IV = 0.051) and BMI (IV = 0.050); and for fibrosis were abdominal circumference (IV = 0.049), waist circumference (IV = 0.049), chest circumference (IV = 0.043), BMI (IV = 0.045) and weight (IV = 0.045). See Figs. 4, 5, 6 and Tables 4, 5, 6.
Further assessment identified gender-specific features (see Supplemental Figs. 1–6; Supplemental Tables 1–6). Important predictor factors for fatty liver disease among females were waist circumference (IV = 0.057), abdomen circumference (IV = 0.056), trunk fat (IV = 0.055), fat mass (IV = 0.052), chest circumference (IV = 0.048), and BMI (IV = 0.048) were the most important features. Among males, waist circumference (IV = 0.053), chest circumference (IV = 0.052), trunk fat (IV = 0.051), BMI (IV = 0.052), abdomen circumference (IV = 0.049) and fat mass (IV = 0.048) had the highest predictive value for fatty liver. Important predictor factors for steatosis among females were abdomen circumference (IV = 0.048), waist circumference (IV = 0.047), weight (IV = 0.046), trunk fat (IV = 0.045), fat mass (IV = 0.044), and BMI (IV = 0.043) were the most important features. Among males, waist circumference (IV = 0.051), chest circumference (IV = 0.050), abdomen circumference (IV = 0.049), trunk fat (IV = 0.048), BMI (IV = 0.048), and fat mass (IV = 0.046) had the highest predictive value for steatosis. Important predictor factors for fibrosis among females were abdomen circumference (IV = 0.048), waist circumference (IV = 0.047), BMI (IV = 0.046), trunk fat (IV = 0.045), chest circumference (IV = 0.043), and muscle mass (IV = 0.043) were the most important features. Among males, abdomen circumference (IV = 0.045), waist circumference (IV = 0.043), weight (IV = 0.043), BMI (IV = 0.043), right arm fat (IV = 0.042) and fat mass (IV = 0.042) had the highest predictive value for fibrosis.
Discussion
This study applied ML techniques to determine the optimal body composition and anthropometric classifier of NAFLD and identify feature contribution to the prediction of the disease. RF generated the most accurate ML model to predict fatty liver presence, steatosis (stages) and fibrosis. To our knowledge, this is the first study applying ML on body composition and anthropometric data to predict NAFLD. High accuracy (82%) highlights the potential for applying ML techniques for the primary prevention and screening of NAFLD using anthropometric measurements.
Previous studies using ML techniques to predict fatty liver disease have mainly focused on biochemical measurements, with similar levels of accuracy (83.0%) using Bayesian Network38, (76.3%) Logistic Regression37, (86.4%) RF39 and (80%) Classification Tree techniques36. However, we tested the predictive value of body composition and anthropometric measurements rather than biochemical variables. Anthropometry as a lower-cost and more feasible approach can be considered a primary screening method for fatty liver disease.
Abdominal obesity is a significant risk factor leading to NAFLD27. Waist circumference and trunk fat have been shown to be significantly predicting the risk of NAFLD24. Although BMI is one of the risk factors of NAFLD31, it has been argued that BMI is limited compared to other anthropometric measures (e.g., waist circumference) in identifying lean NAFLD individuals25. In a similar vein, the findings of the present study clearly show the importance of these body composition and anthropometric measures and their relative contribution to the prediction of NAFLD.
Neck circumference reflects the amount of subcutaneous fat in the upper body, and is a reliable factor in determining central obesity48. A positive correlation has been shown between neck circumference and hepatic steatosis26,28. Neck circumference showed a positive association with other anthropometric components, such as BMI and waist and waist-to-hip circumference. In the present study, neck circumference contributed almost equally to hepatic steatosis and fibrosis.
A study by Subramanian revealed that the level of arm fat index in both males and females had a negative association with the degree and severity of NAFLD29. In our study, a strong and positive relationship between arm circumference and the severity of steatosis and fibrosis was detected, validated by the ML model. Rafiee et al. showed that the amount of fat in hips and legs and circumference of hip negatively associated with fatty liver and the severity of the disease. In contrast, the waist-to-hip ratio was closely associated with fatty liver. They also showed that the accuracy of this ratio in predicting NAFLD was greater than BMI and waist-to-height ratio49.
Most ML studies for the prediction of NAFLD have used the ultrasonography technique to diagnose fatty liver disease36,37,38,39. Ultrasound is a commonly used method for the diagnosis of hepatic steatosis50. Ultrasonography is a safe, well-tolerated, non-invasive and low-cost technique50; however, there are limitations associated with ultrasound use, including limited capability in detecting fatty infiltration (less than 20% steatosis), operator dependency and subjective assessment51,52, and ML is expected to minimise some of these. Application of ML techniques on body composition and anthropometric measures as a less time-consuming and easy to undertake method can help physicians in their clinical decision making.
The presence of liver fibrosis in patients with NAFLD is considered the strongest predictor of long-term outcome53. NAFLD Fibrosis Score (NFS) and Fibrosis-4 (FIB-4) have been recommended as appropriate methods for the initial assessment of fibrosis in NAFLD patients54. Both of these methods use a combination of variables including age, BMI and biochemical measures (i.e. aspartate aminotransferase (AST), alanine aminotransferase (ALT), platelets, etc.). Graupera et al. concluded that NFS and FIB-4 are not optimal for screening as they correlate poorly with liver stiffness55. In their study, waist circumference was found to be the ideal measure for fibrosis screening among high risk people from general population55. However, other studies found that NFS and FIB-4 have the potential to detect advanced fibrosis and the progression of fibrosis among people with NAFLD56. It seems that NFS and FIB-4 are more useful in the diagnosis of fibrosis in NAFLD but not for fibrosis screening among the general populations. The present study showed suboptimal accuracy (57%) in detecting fibrosis using less expensive and non-invasive factors i.e. anthropometric and body composition measures. Further studies might explore a combination of these methods including anthropometric, body composition and biochemical variables altogether.
The proposed algorithm identified in this research can be used by the health systems for several reasons. Screening of the presence or absence of NAFLD with the help of non-invasive anthropometric measurements can be achieved with simple and cheap equipment57. Moreover, performing the measurement task needs less specialty knowledge therefore can be implemented in several health centres (e.g., primary practice) and also remote areas. Once validated, the resulted assistive technology can serve the clinicians in the prevention of liver diseases. There are limitations of the present research that need to be addressed. A small sample size might have potentially limited the results of ML prediction. Although, the small sample size was accounted for by multiple cross-validations, which reduced potential errors. Future studies with larger sample sizes can allocate separate validation sets and evaluate the model. Moreover, even though the most common method for fatty liver diagnosis, the ultrasound technique is not the gold standard. Using liver biopsy outcomes would generate more valid results. Also, to increase the predictive accuracy of the proposed model for NAFLD prediction, future studies should include other body composition and anthropometric measures such as sagittal abdominal diameter (SAD) and peri-renal fat58.
Conclusion
Present findings show that applying a ML classification model on anthropometric and body composition variables predicted the presence of fatty liver disease. ML-based decision support systems offer potential to assist physicians with screening, diagnosis and prevention of NAFLD. ML-based decision support systems could be of particular value for providing services at a population level and remote health care where there is a lack of trained specialists.
Data availability
The datasets used and/or analyzed in the current study are available from the corresponding author upon reasonable request.
Abbreviations
- NAFLD:
-
Non-alcoholic fatty liver disease
- BMI:
-
Body mass index
- LSM:
-
Liver stiffness measurement
- ROCc:
-
Receiver operating characteristic curve
- PCA:
-
Principal component analysis
- CAP:
-
Controlled attenuation parameter
- CI:
-
Confidence interval
- ANOVA:
-
Analysis of variance
- DCL:
-
Dorso-cervical lipohypertrophy
References
Aggarwal, A., Puri, K., Thangada, S., Zein, N. & Alkhouri, N. Nonalcoholic fatty liver disease in children: Recent practice guidelines, where do they take us?. Curr. Pediatr. Rev. 10(2), 151–161 (2014).
Khashab, M. A., Liangpunsakul, S. & Chalasani, N. Nonalcoholic fatty liver disease as a component of the metabolic syndrome. Curr. Gastroenterol. Rep. 10(1), 73–80 (2008).
Wagenknecht, L. E. et al. Correlates and heritability of nonalcoholic fatty liver disease in a minority cohort. Obesity 17(6), 1240–1246 (2009).
Abdelmalek, M. F. & Diehl, A. M. Nonalcoholic fatty liver disease as a complication of insulin resistance. Med. Clin. North Am. 91(6), 1125–1149 (2007).
Milić, S. & Štimac, D. Nonalcoholic fatty liver disease/steatohepatitis: Epidemiology, pathogenesis, clinical presentation and treatment. Dig. Dis. 30(2), 158–162 (2012).
Clark, J. M., Brancati, F. L. & Diehl, A. M. The prevalence and etiology of elevated aminotransferase levels in the United States. Am. J. Gastroenterol. 98(5), 960–967 (2003).
Kim, W. R., Brown, R. S. Jr., Terrault, N. A. & El-Serag, H. Burden of liver disease in the United States: Summary of a workshop. Hepatology 36(1), 227–242 (2002).
McCullough, A. J. Pathophysiology of nonalcoholic steatohepatitis. J. Clin. Gastroenterol. 40, S17–S29 (2006).
Chalasani, N. et al. The diagnosis and management of non-alcoholic fatty liver disease: Practice Guideline by the American Association for the Study of Liver Diseases, American College of Gastroenterology, and the American Gastroenterological Association. Hepatology 55(6), 2005–2023 (2012).
Ertle, J. et al. Non-alcoholic fatty liver disease progresses to hepatocellular carcinoma in the absence of apparent cirrhosis. Int. J. Cancer 128(10), 2436–2443 (2011).
Bellentani, S. & Marino, M. Epidemiology and natural history of non-alcoholic liver disease (NAFLD). Ann. Hepatol. 8(S1), 4–8 (2009).
Patton, H. M. et al. Pediatric nonalcoholic fatty liver disease: A critical appraisal of current data and implications for future research. J. Pediatr. Gastroenterol. Nutr. 43(4), 413–427 (2006).
Shiotani, A., Motoyama, M., Matsuda, T. & Miyanishi, T. Brachial-ankle pulse wave velocity in Japanese university students. Intern. Med. 44(7), 696–701 (2005).
Razmpour, F., Abbasi, B. & Ganji, A. Evaluating the accuracy and sensitivity of anthropometric and laboratory variables in diagnosing the liver steatosis and fibrosis in adolescents with non-alcoholic fatty liver disease. J. Liver Res. Disord. Ther. 4(3), 121–125 (2018).
Bellentani, S. et al. Prevalence of and risk factors for hepatic steatosis in Northern Italy. Ann. Intern. Med. 132(2), 112–119 (2000).
Omagari, K. et al. Fatty liver in non-alcoholic non-overweight Japanese adults: Incidence and clinical characteristics. J. Gastroenterol. Hepatol. 17(10), 1098–1105 (2002).
Shaw, N. J., Crabtree, N. J., Kibirige, M. S. & Fordham, J. N. Ethnic and gender differences in body fat in British schoolchildren as measured by DXA. Arch. Dis. Child. 92(10), 872–875 (2007).
Chumlea, W. C., Siervogel, R., Roche, A. F., Webb, P. & Rogers, E. Increments across age in body composition for children 10 to 18 years of age. Hum. Biol. 55, 845–852 (1983).
Van der Sluis, I., De Ridder, M., Boot, A., Krenning, E. & de Muinck, K.-S. Reference data for bone density and body composition measured with dual energy x ray absorptiometry in white children and young adults. Arch. Dis. Child. 87(4), 341–347 (2002).
Alferink, L. J. M. et al. Nonalcoholic fatty liver disease in the Rotterdam study: About muscle mass, sarcopenia, fat mass, and fat distribution. J. Bone Miner. Res. 34(7), 1254–1263 (2019).
He, Q. et al. Sex and race differences in fat distribution among Asian, African-American, and Caucasian prepubertal children. J. Clin. Endocrinol. Metab. 87(5), 2164–2170 (2002).
Płudowski, P., Matusik, H., Olszaniecka, M., Lebiedowski, M. & Lorenc, R. S. Reference values for the indicators of skeletal and muscular status of healthy Polish children. J. Clin. Densitom. 8(2), 164–177 (2005).
Yang, K. C. et al. Association of non-alcoholic fatty liver disease with metabolic syndrome independently of central obesity and insulin resistance. Sci. Rep. 6(1), 1–10 (2016).
Balakrishnan, M. et al. Obesity and risk of nonalcoholic fatty liver disease: A comparison of bioelectrical impedance analysis and conventionally-derived anthropometric measures. Clin. Gastroenterol. Hepatol. 15(12), 1965–1967 (2017).
Brambilla, P., Bedogni, G., Heo, M. & Pietrobelli, A. Waist circumference-to-height ratio predicts adiposity better than body mass index in children and adolescents. Int. J. Obes. 37(7), 943–946 (2013).
Huang, B.-A. et al. Neck circumference, along with other anthropometric indices, has an independent and additional contribution in predicting fatty liver disease. PLoS One 10(2), e0118071 (2015).
Sookoian, S. & Pirola, C. J. Systematic review with meta-analysis: Risk factors for non-alcoholic fatty liver disease suggest a shared altered metabolic and cardiovascular profile between lean and obese patients. Aliment. Pharmacol. Ther. 46(2), 85–95 (2017).
Stabe, C. et al. Neck circumference as a simple tool for identifying the metabolic syndrome and insulin resistance: Results from the Brazilian Metabolic Syndrome Study. Clin. Endocrinol. 78(6), 874–881 (2013).
Subramanian, V., Johnston, R., Kaye, P. & Aithal, G. Regional anthropometric measures associated with the severity of liver injury in patients with non-alcoholic fatty liver disease. Aliment. Pharmacol. Ther. 37(4), 455–463 (2013).
Borruel, S. et al. Surrogate markers of visceral adiposity in young adults: Waist circumference and body mass index are more accurate than waist hip ratio, model of adipose distribution and visceral adiposity index. PLoS One 9(12), e114112 (2014).
Rankinen, T., Kim, S., Perusse, L., Despres, J. & Bouchard, C. The prediction of abdominal visceral fat level from body composition and anthropometry: ROC analysis. Int. J. Obes. 23(8), 801 (1999).
Lee, S. S. & Park, S. H. Radiologic evaluation of nonalcoholic fatty liver disease. World J. Gastroenterol. WJG 20(23), 7392 (2014).
EskandarNejad, M. Correlation of perceived body image and physical activity in women and men according to the different levels of Body Mass Index (BMI). J. Health Promot. Manag. 2, 59–40 (2013).
Belghaisi-Naseri, M. et al. Plasma levels of vascular endothelial growth factor and its soluble receptor in non-alcoholic fatty liver. J. Fast. Health (2018).
Dehnavi, Z. et al. Fatty Liver Index (FLI) in predicting non-alcoholic fatty liver disease (NAFLD). Hepat. Mon. 18(2) (2018).
Birjandi, M., Ayatollahi, S. M. T., Pourahmad, S. & Safarpour, A. R. Prediction and diagnosis of non-alcoholic fatty liver disease (NAFLD) and identification of its associated factors using the classification tree method. Iran. Red Crescent Med. J. 18(11) (2016).
Islam, M., Wu, C.-C., Poly, T. N., Yang, H.-C. & Li, Y.-C.J. Applications of machine learning in fatty live disease prediction. Building Continents of Knowledge in Oceans of Data: The Future of Co-Created eHealth 166–170 (IOS Press, 2018).
Ma, H., Xu, C.-F., Shen, Z., Yu, C.-H. & Li, Y.-M. Application of machine learning techniques for clinical predictive modeling: A cross-sectional study on nonalcoholic fatty liver disease in China. BioMed Res. Int. 2018 (2018).
Wu, C.-C. et al. Prediction of fatty liver disease using machine learning algorithms. Comput. Methods Programs Biomed. 170, 23–29 (2019).
Gaia, S. et al. Reliability of transient elastography for the detection of fibrosis in non-alcoholic fatty liver disease and chronic viral hepatitis. J. Hepatol. 54(1), 64–71 (2011).
Sasso, M. et al. Controlled attenuation parameter (CAP): A novel VCTE™ guided ultrasonic attenuation measurement for the evaluation of hepatic steatosis: Preliminary study and validation in a cohort of patients with chronic liver disease from various causes. Ultrasound Med. Biol. 36(11), 1825–1835 (2010).
Hsu, C. et al. Magnetic resonance vs transient elastography analysis of patients with nonalcoholic fatty liver disease: A systematic review and pooled analysis of individual participants. Clin. Gastroenterol. Hepatol. 17(4), 630–637 (2019).
Shamsi, A. et al. An uncertainty-aware transfer learning-based framework for COVID-19 diagnosis. IEEE Trans. Neural Netw. Learn. Syst. 32(4), 1408–1417 (2021).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Noor, N. M. et al. (eds) (Trans Tech Publ, 2015).
Norazian, M. N. Comparison of linear interpolation method and mean method to replace the missing values in environmental data set (2007).
Cunningham, J. P. & Ghahramani, Z. Linear dimensionality reduction: Survey, insights, and generalizations. J. Mach. Learn. Res. 16(1), 2859–2900 (2015).
Onat, A. et al. Neck circumference as a measure of central obesity: Associations with metabolic syndrome and obstructive sleep apnea syndrome beyond waist circumference. Clin. Nutr. 28(1), 46–51 (2009).
Rafiei, R., Fouladi, L. & Torabi, Z. Which component of metabolic syndrome is the most important one in development of colorectal adenoma?
Albhaisi, S. Noninvasive imaging modalities in nonalcoholic fatty liver disease: Where do we stand?. EMJ 4(3), 57–62 (2019).
Ferraioli, G. & Monteiro, L. B. S. Ultrasound-based techniques for the diagnosis of liver steatosis. World J. Gastroenterol. 25(40), 6053 (2019).
Khov, N., Sharma, A. & Riley, T. R. Bedside ultrasound in the diagnosis of nonalcoholic fatty liver disease. World J. Gastroenterol. WJG 20(22), 6821 (2014).
Angulo, P. et al. Liver fibrosis, but no other histologic features, is associated with long-term outcomes of patients with nonalcoholic fatty liver disease. Gastroenterology 149(2), 389-397.e10 (2015).
Lee, J. et al. Prognostic accuracy of FIB-4, NAFLD fibrosis score and APRI for NAFLD-related events: A systematic review. Liver Int. 41(2), 261–270 (2021).
Graupera, I. et al. Low accuracy of FIB-4 and NAFLD fibrosis scores for screening for liver fibrosis in the population. Clin. Gastroenterol. Hepatol. 20(11), 2567–76.e6 (2022).
Siddiqui, M. S. et al. Diagnostic accuracy of noninvasive fibrosis models to detect change in fibrosis stage. Clin. Gastroenterol. Hepatol. 17(9), 1877–85.e5 (2019).
Eaton-Evans, J. Nutritional assessment | Anthropometry (2005).
Vitturi, N. et al. Ultrasound, anthropometry and bioimpedance: A comparison in predicting fat deposition in non-alcoholic fatty liver disease. Eat. Weight Disord. Stud. Anorex. Bulimia Obes. 20(2), 241–247 (2015).
Acknowledgements
We are grateful to all participants and assistants of this study.
Funding
The study was supported by Mashhad University of Medical Sciences, Mashhad, Iran (Code: IR.MUMS.fm.REC.1395.64). The funder was not involved in the study design, data analysis and interpretation, or manuscript writing.
Author information
Authors and Affiliations
Contributions
Study concept and design: F.R. and M.R. Analysis and interpretation of data: H.A., A.S., R.D., G.S. and D.S. Drafting the manuscript: R.D., M.N., S.M.S.I. Critical revision of the manuscript for important intellectual content: F.R., R.M., S.M.S.I., R.D. Study supervision: F.R. All authors provided final approval of the version to be published and agreed to be accountable for all aspects of the work.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Razmpour, F., Daryabeygi-Khotbehsara, R., Soleimani, D. et al. Application of machine learning in predicting non-alcoholic fatty liver disease using anthropometric and body composition indices. Sci Rep 13, 4942 (2023). https://doi.org/10.1038/s41598-023-32129-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-32129-y
- Springer Nature Limited