Machine learning allows robust classification of visceral fat in women with obesity using common laboratory metrics

Palmieri, Flavio; Akhtar, Nidà Farooq; Pané, Adriana; Jiménez, Amanda; Olbeyra, Romina Paula; Viaplana, Judith; Vidal, Josep; de Hollanda, Ana; Gama-Perez, Pau; Jiménez-Chillarón, Josep C.; Garcia-Roves, Pablo M.

doi:10.1038/s41598-024-68269-y

Machine learning allows robust classification of visceral fat in women with obesity using common laboratory metrics

Article
Open access
Published: 27 July 2024

Volume 14, article number 17263, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Machine learning allows robust classification of visceral fat in women with obesity using common laboratory metrics

Download PDF

Flavio Palmieri^1,2,
Nidà Farooq Akhtar³,
Adriana Pané^4,5,
Amanda Jiménez^4,5,6,
Romina Paula Olbeyra⁶,
Judith Viaplana⁶,
Josep Vidal^4,6,7,
Ana de Hollanda^4,5,6,
Pau Gama-Perez¹,
Josep C. Jiménez-Chillarón^1,8^na1 &
…
Pablo M. Garcia-Roves^1,2,5^na1

552 Accesses
4 Altmetric
Explore all metrics

Abstract

The excessive accumulation and malfunctioning of visceral adipose tissue (VAT) is a major determinant of increased risk of obesity-related comorbidities. Thus, risk stratification of people living with obesity according to their amount of VAT is of clinical interest. Currently, the most common VAT measurement methods include mathematical formulae based on anthropometric dimensions, often biased by human measurement errors, bio-impedance, and image techniques such as X-ray absorptiometry (DXA) analysis, which requires specialized equipment. However, previous studies showed the possibility of classifying people living with obesity according to their VAT through blood chemical concentrations by applying machine learning techniques. In addition, most of the efforts were spent on men living with obesity while little was done for women. Therefore, this study aims to compare the performance of the multilinear regression model (MLR) in estimating VAT and six different supervised machine learning classifiers, including logistic regression (LR), support vector machine and decision tree-based models, to categorize 149 women living with obesity. For clustering, the study population was categorized into classes 0, 1, and 2 according to their VAT and the accuracy of each MLR and classification model was evaluated using DXA-data (DXAdata), blood chemical concentrations (BLDdata), and both DXAdata and BLDdata together (ALLdata). Estimation error and \(\hbox {R}^{2}\) were computed for MLR, while receiver operating characteristic (ROC) and precision-recall curves (PR) area under the curve (AUC) were used to assess the performance of every classification model. MLR models showed a poor ability to estimate VAT with mean absolute error \(\ge 401.40\) and \(\hbox {R}^{2} \le 0.62\) in all the datasets. The highest accuracy was found for LR with values of 0.57, 0.63, and 0.53 for ALLdata, DXAdata, and BLDdata, respectively. The ROC AUC showed a poor ability of both ALLdata and DXAdata to distinguish class 1 from classes 0 and 2 (AUC = 0.31, 0.71, and 0.85, respectively) as also confirmed by PR (AUC = 0.24, 0.57, and 0.73, respectively). However, improved performances were obtained when applying LR model to BLDdata (ROC AUC \(\ge \) 0.61 and PR AUC \(\ge \) 0.42), especially for class 1. These results seem to suggest that, while a direct and reliable estimation of VAT was not possible in our cohort, blood sample-derived information can robustly classify women living with obesity by machine learning-based classifiers, a fact that could benefit the clinical practice, especially in those health centres where medical imaging devices are not available. Nonetheless, these promising findings should be further validated over a larger population.

Machine learning prediction of susceptibility to visceral fat associated diseases

Article Open access 30 July 2020

DXA-measured visceral adipose tissue predicts impaired glucose tolerance and metabolic syndrome in obese Caucasian and African-American women

Article 22 October 2014

Equations for predicting DXA-measured visceral adipose tissue mass based on BMI or weight in adults

Article Open access 16 May 2022

Discover the latest articles, news and stories from top researchers in related subjects.

Introduction

Obesity is a silent global pandemic estimated to affect about 4 billion people by 2035, compared with over 2.6 billion in 2020¹. Particularly, the excess of body fat mass increases the risk of chronic disorders including type 2 diabetes mellitus (T2D), fatty liver disease, hypertension, and cardiovascular disease^2,3,4. This is mainly due to obesity being associated with hyperglycemia^5,6, dyslipidemia⁷ and liver disease⁸. Although the risk of developing these obesity-related complications is proportional to the degree of adiposity, and more specifically to abdominal (or android) fat accumulation^9,10, the prevalence of these comorbidities is not uniform across people living with obesity^11,12,13,14. Previous studies have underscored the distribution of fat depots to correctly assess the risk of obesity-related health issues^15,16,17, giving special importance to the visceral adipose tissue (VAT)^18,19,20,21. Therefore, the correct assessment of VAT may play a relevant role in identifying and stratifying those individuals at higher risk of suffering obesity-associated comorbidities. Although obesity is defined as an excessive accumulation of body fat, the most commonly used measure to classify obesity is the body mass index (BMI), which is calculated by dividing a person’s weight in kilograms by the square of their height in meters. Multiple cut-points for BMI and other anthropometric measures have been employed in clinical practice for their non-invasive nature and simplicity of use²². Nevertheless, they do not allow us to distinguish the two main compartments of abdominal fat: VAT and subcutaneous adipose tissue^23,24,25. Image techniques, like dual-energy X-ray absorptiometry (DXA) have gained popularity since they provide an accurate measurement of body composition^26,27,28. However, due to its costs and the need for trained personnel, they are not always available. Conversely, blood analyses are easy to obtain and could provide an alternative worth investigating since multiple parameters are known to be altered in the context of obesity.

Over the past decades, considerable efforts have been made in constructing classification and prediction models especially for men living with obesity due to their predisposition to accumulate VAT^29,30. Consequently, women have been largely underrepresented in these models, which draws attention to the need for a better characterisation of this population. In this sense, our study aims to develop supervised classification techniques in the female population based on blood chemical concentrations, and a few clinical parameters, and compare the results to the same classification models obtained from DXA-based data. The goal is to determine the ability of risk stratification of both blood-derived and DXA-derived data.

Materials

Study population

This study included a total of 149 bariatric surgery candidate women living with obesity. This cohort was already presented in Osorio-Conles et al.³¹ and in Pané et al.³². Inclusion criteria were female patients, aged between 18 and 70 years, BMI \(\ge 40\) kg/\(\hbox {m}^{2}\) or \(\ge 35\) kg/\(\hbox {m}^{2}\) in the presence of obesity-related comorbidities. The Institutional Ethics Committee approved the study protocol in both cohorts (ID. Reg. HCB/2017/0984 and Reg. HCB/2019/0137) and written consent was obtained from all participants. For each patient, DXA-estimated data (DXAdata) and chemical concentrations from blood samples (BLDdata) were collected. Total body fat and lean mass were measured by DXA using a Lunar iDXA scan (GE Healthcare, Madison, WI, USA). The estimated VAT content was computed by the validated CoreScan software (EnCore version 17.0). All the procedures and all the methods were performed following the Helsinki Declaration³³.

Variables included in the study

DXAdata included values for: fat mass arms (Arms_FM), fat mass trunk (Trunk_FM), fat mass android (Android_FM), total fat mass (Total_FM), muscular mass trunk (Trunk_MM), muscular mass android (Android_MM), tissue mass trunk (Trunk_TM), tissue mass android (Android_TM), total tissue mass (Total_TM), free-fat mass trunk (Trunk_FFM), free-fat mass android (Androrid_FFM), total mass trunk (Trunk_TotalM), total mass android (Android_TotalM), Overall mass (OverallMass). BLDdata included: ultrasensitive reactive C protein (usCRP), fasting plasma glucose (FPG), glycosylated hemoglobin (HbA1c), colesterol (COLT), triglycerides (TG), low-density lipoproteins (LDL), aspartate aminotransferase (AST), alanine aminotransferase (ALT), gamma glutamyltransferase (GGT). A summary list of the variables included in this study, with the corresponding unit of measurement and abbreviation adopted in the text, can be found in Table 2.

Methods

Data preprocessing and analysis, model construction, and performance evaluation were performed by using customized algorithms based on the Scikit-learn python library³⁴.

Preprocessing

Patients were grouped according to their VAT weight into three classes: class 0 (VAT < 1854.60 g); class 1 (1854.60 g \(\le \) VAT \(\le \) 2495.88 g) and class 2 (VAT > 2495.88 g), corresponding to the first, second and third tertile, respectively, of the VAT distribution in our population as shown in Fig. 1. In each class, missing values were replaced by using the k-nearest neighbors method^34,35,36 with default values for (1) the number of neighboring samples to use for imputation (2) the weight function used in prediction, and (3) the distance metric for searching neighbors, and all of them can be found in the help page³⁷. Next, the Kruskal-Wallis test among each variable in both DXAdata and BLDdata was performed to select only those showing statistically significant variations across the three classes. Finally, a panel of trained doctors validated the clinical significance of the selected DXAdata and BLDdata variables.

Regression models

Multilinear regression analysis (MLR) was initially performed to directly estimate the VAT weight from each ALLdata, DXAdata and BLDdata by using the built-in function already provided in the Scikit-learn python library³⁴. For that, we first divide each dataset into training (90%) and test (10%) sets with the same random state for each dataset. This ensures that the models were trained and then tested over the same subgroup of subjects, thus allowing a proper comparison of the results.

The performance of each regression model was assessed by computing the widely used evaluation metrics: R-squared (\(\hbox {R}^{2}\)) and adjusted \(\hbox {R}^{2}\), Mean Absolute Error (MAE), Mean Square Error (MSE) and Root Mean Square Error (RMSE)³⁴.

Classification models

Logistic regression (LR), support vector machine (SVM), decision tree, random forest, k-nearest neighbors, and XG Boost classifiers were evaluated over DXAdata, BLDdata alone, and after joining them (ALLdata) and including routine clinical information: age, T2D, weight, BMI, waist and hip circumferences. VAT weight values were not included in any DXAdata, BLDdata, or ALLdata. Each dataset was normalized with the MinMaxScaler function as it preserves the relative relationships and distributions of the variables^34,35,36, and grid search analysis was performed to find the best hyper-parameter (HP) configuration for each model with a 10-fold cross-validation strategy, being this latter already used in several previous medical-related studies³⁸. During every cross-validation, each dataset was partitioned into training and test (80% and 20% of the data, respectively) and the model was trained and evaluated with each set of partitions. The HP (Table 1) were either already employed in previous studies on related topics^39,40,41 or selected according to their suitability to the analyzed dataset^34,35,36.

Table 1 List of HP systematically evaluated during the 10-fold cross-validation grid search analysis to find the best-performing configuration for each model.

Full size table

For every ALLdata, DXAdata, and BLDdata, the model (with its optimal HP configuration) showing the highest test accuracy from the 10-fold cross-validation grind search analysis, was selected. The HPs control the learning process of a classification model in order to optimally map the input variables (i.e. those in the ALLdata, DXAdata, and BLDdata) to the output labels (i.e. class 0, class 1, and class 2)^36,42. The test accuracy, on the other hand, provides us with information, in terms of percentage, on how well each model classifies each patient by comparing the model-predicted class with the true class in the test group^36,42. Then, we assessed the performance of each selected model over their corresponding dataset by evaluating the confusion matrix. For that purpose, we used 80% of each dataset for training the model and the remaining 20% for testing.

Next, the one-vs-rest analysis was executed^36,42. This test involves splitting the multi-class dataset (i.e. class 0, class 1, and class 2) into multiple binary classification problems and then training the selected model on each problem: (1) class 0 vs [1 and 2]; (2) class 1 vs [0 and 2] (3) class 2 vs [0 and 1]. Thus, the one-vs-rest analysis provides information on the ability of a classifier to distinguish one class from the other two classes considered together.

Finally, for every ALLdata, DXAdata, and BLDdata, a confusion matrix was obtained for the selected model. Confusion matrix is a simple performance analysis tool which is used to represent the test result of a classification model: each column represents the instances in a predicted class, while each row represents the instances in an actual class.

Classification models interpretability and performance assessment

SHapley Additive exPlanations (SHAP)⁴³ analysis was conducted to understand the contribution of each variable in ALLdata, DXAdata, and BLDdata to the classification made by the selected model. SHAP was calculated by comparing the classification made with and without including a given variable in the selected model and considering the difference between these two classifications as the contribution of that variable^43,44. Moreover, SHAP considered all possible combinations of variables when calculating their contribution to the classification⁴⁴.

To assess the performance from the one-vs-rest analysis, both receiver operating characteristic (ROC) and precision-recall (PR) graphs were computed. ROC graphs are useful tools for investigating a classification model according to their performance with respect to the false positive rate and true positive rate⁴⁵. The diagonal of a ROC graph can be interpreted as random guessing, and classification models that fall below the diagonal are considered worse than random guessing. A perfect classifier would fall into the top-left corner of the graph with a true positive rate of 1 and a false positive rate of 0. Based on the ROC curve, the so-called ROC area under the curve (AUC) can be computed to characterize the performance of a classification model. A PR graph reports precision values, defined as the proportion of correct positive classifications (true positive) divided by the total number of predicted positive classifications that were made (true positive + false positive), and the recall values, that is the proportion of correct positive classifications (true positive) divided by the total number of the truly positive classifications (true positive + false negative) and can be considered equivalent to the false discovery rate curve^46,47.

Results

Analysed variables

Out of 166 variables initially collected in Ref.³¹, only 29, including 14 from DXA and 9 from blood sample analysis, fulfilled the Kruskal-Wallis test and thus included in the study. These variables are shown in Table 2 grouped according to their dataset (i.e. DXAdata and BLDdata) and classes 0, 1, and 2, and the values are expressed as median [25-th–75-th percentile]. Although a slight overlap of the percentiles, an increment of the medians across classes of the DXAdata parameters can be observed. Analogous considerations can be made for most of the variables in the BLDdata, including FPG, TG, COLT, and GGT.

Table 2 Distributions of age, number of patients with T2D, weight, BMI, waist and hip dimensions, DXA-estimated data, and blood chemicals concentrations for each class. Data are presented as median [25-th–75-th percentile] or number of patients.

Full size table

Table 3 shows the P-value obtained from the Wilcoxon signed-rank test, a non-parametric statistical test used to check statistical differences in the median values of two populations. Here the test was applied to each variable and for every couple of classes: class 0 vs 1, class 0 vs 2, and class 1 vs 2. In addition, Bonferroni’s correction for multiple testing was also performed and results were reported in the same table. When testing class 0 vs 1, only 10 of the proposed parameters showed significant statistical differences but all of them failed Bonferroni’s correction. A similar observation can be made when comparing class 1 vs 2. However, when comparing class 0 vs 2, all but two parameters were found to be statistically different, and 13 of them did pass Bonferroni’s correction. Nevertheless, only FPG in the BLDdata, when comparing class 0 vs 2, passed the multiclass Bonferroni’s correction.

Table 3 P-value for the Wilcoxon signed-rank test for each variable and couples of classes, along with multiple testing corrections adjusted P-value by Bonferroni’s correction. ns denotes no statistically significant values.

Full size table

Regression models

MLR model coefficients obtained during the training for each variable and dataset are reported in Supplementary Table 1. Notably, some of the DXA-derived variables (e.g. Trunk_FM, Trunk_MM, and Trunk_TM) showed a relevant effect on the MLR model in both ALLdata and DXAdata, since the absolute value of their coefficients is considerably greater than the other variables, with few exceptions for usCRP and HbA1c in the case of ALLdata and T2D for DXAdata. Analogous consideration can be made for usCRP and HbA1c in the BLDdata.

Supplementary Table 2 shows the actual and estimated VAT weights for the same group of patients. In most of the cases, DXAdata-based MLR resulted in VAT weight values closer to the actual ones, the opposite of the BLDdata-derived MLR.

Table 4 presents the evaluation metrics computed for each MLR model. DXAdata showed the best results in terms of errors (MEA = 401.40, MSE = 305505.12 and RMSE = 552.73) respect to ALLdata and BLDdata as well as comparable \(\hbox {R}^{2}\) (0.58) and \(\hbox {R}^{2}\) adjusted (0.52) to ALLdata (\(\hbox {R}^{2}\) = 0.62 and \(\hbox {R}^{2}\) adjusted 0.53), which match with what observed in Supplementary Table 2.

Table 4 MLR models evaluation metrics for each ALLdata, DXAdata and BLDdata.

Full size table

Classification models

Optimal HP configuration for each classification model obtained from the 10-cross-validation grid search analysis, along with the corresponding test accuracy values, are reported in Table 5. LR and SVM models trained over the DXAdata showed the highest test accuracy (0.63 and 0.60, respectively), followed by their equivalent evaluated on the ALLdata (0.57 and 0.53) and BLDdata (0.53 and 0.47), being this value always \(\le 0.47\) for every other combination of model and dataset. In addition, similar HPs were found for LR (i.e. solver = saga and penalty = l1) and SVM (i.e. kernel =linear) when comparing ALLdata and DXAdata but not for BLDdata where solver = liblinear and penalty = l2 for LR and kernel = rbf for SVM.

Table 5 Optimal HP configuration and corresponding test accuracy for each classification model and dataset obtained from 10-fold cross-validation grid search analysis. \(^*\) denotes the accuracy of the selected classification model for each dataset.

Full size table

Confusion matrices, presenting the relationship between actual and predicted classes, are in Fig. 2 for each ALLdata DXAdata and BLDdata. Similar results are for both Fig. 2a,b showing that the majority of class 1 patients were misclassified as class 2. On the contrary, 6 out of 10 class 1 patients from BLDdata (Fig. 2c) were accounted as class 0.

Classification models interpretability and performance assessment

The SHAP values, which quantify the magnitude of each variable’s contribution towards the correct patients’ classification, for each ALLdata, DXAdata, and BLDdata are presented in the diagrams in Fig. 3a–c, respectively. Three legends can be found in the figure, namely, class 0 (marked with the colour blue); class 1 (marked with the colour orange); and class 2 (marked with the colour green). Mean absolute SHAP values are displayed as proportional to the length of every colour-coded bar and variables are ranked according to their importance: in the upper part of the plot are the most influential to the classification model. According to these diagrams, Trunk_FM, OverallMass, Trunk_TM, Total_TM, and Trunk_TotalM were the top five variables in the ALLdata (Fig. 3a). Analogous distribution was found for variables in the DXAdata (Fig. 3b) being Trunk_FM, OverallMass, Trunk_TM, Total_TM, Trunk_TotalM the most relevant ones. As for the BLDdata (Fig. 3c), the first three most relevant variables were Weight, Age, and Hip followed by T2D and AST.

The ROC curves for each dataset and class are in Fig. 4a–c and show the trade-off between specificity and sensitivity for one-vs-rest analysis. PR curves are in Fig. 4d,e and show the precision values for corresponding sensitivity (recall). The same three-colour legend as in the SHAP diagrams was here adopted. Similar performances were obtained when using the LR model in the one-vs-rest analysis for both ALLdata and DXAdata with class 1 being the most difficult to distinguish (ROC AUC = 0.31 and PR AUC = 0.24), indicating that both models could distinguish the ensemble of class 0 + class 2 as not class 1 but they were specific for class 1 only. However, better results can be observed for the BLDdata having ROC AUC \(\ge \) 0.61 and PR AUC \(\ge \) 0.42.

Discussion

The main aim of this study was to investigate the possibility of combining the results from routine blood tests and basic clinical information, for the classification of women living with obesity based on their VAT weight by MLR models and machine-learning-based classification techniques. Both MLR and classification hold significance from a clinical standpoint for risk stratification of those subjects, given the well-established associations of VAT accumulation with metabolic^48,49 and clinical outcomes¹⁸.

Analysed variables

On the one hand, all but one of the variables in the DXAdata were estimated from the core region of the body which does not seem too far removed from previously reported studies in different populations, as in Refs.^50,51,52 to mention few. On the other hand, the variables included in the BLDdata were blood chemicals whose concentration can be altered by obesity and/or its comorbidities. Interestingly, the adoption of these blood biochemical concentrations for the classification of overweight and obesity, considering both males and females together, was already proposed in previous investigations^53,54, thus corroborating their usage in the context of this research.

Although the Wilcoxon signed-rank showed that most of the variables were statistically different across the three groups, only a few variables did pass Bonferroni’s correction for multiple testing and the majority of them were for class 0 vs 2 comparison (see Table 3). In other words, single variables, either from DXA or blood samples, taken on their own might not be sufficient to robustly classify people living with obesity probably due to their physiological interconnection. Nevertheless, machine learning-based classification models can capture these complex relationships between variables more effectively than other statistical models, as confirmed by previous investigations including Ferenci et al.⁵¹ and Mitu et al.⁵⁵, thus justifying their applications in this type of analysis.

Regression models

In this study, we chose the MLR since (1) it can model the complex relationships existing among all the DXA-based and blood chemicals, (2) it can determine individual variable effects on the estimated VAT weight values, and (3) its ability to control for confounding variables. Indeed, by simultaneously testing the effects of multiple variables, MLR enhances predictive accuracy and strengthens the robustness of the model, while also accommodating non-linear relationships³⁴.

Due to the high estimation error and the poor \(\hbox {R}^{2}\), none of the MLR models yielded estimated VAT weight values sufficiently accurate to be reliably used in the medical decision-making process. Specifically, the estimated VAT weight values can differ by up to twice the actual VAT weight, as demonstrated by the case of patient 12 in Supplementary Table 2. This discrepancy may be attributed to the limited sample size (N = 149), which potentially compromised the accuracy of the MLR models, making them unreliable for VAT prediction and risk stratification within our study population.

Selection of the number of classes

As a preliminary study, four different threshold settings were investigated to define the most suitable number of clusters. The thresholds were based on (1) the mean, (2) the tertiles, (3) quartiles, and (4) pentiles values to obtain equally sized classes, hence avoiding any potential bias due to unbalanced data. The statistical power of the resulting groups was tested by using the “PWR” package in R⁵⁶ obtaining 23% for mean-based, 22% for tertiles-based, 18% for quartiles-based, and 13% for pentiles-based grouping. Thus a more granular subdivision of the population would diminish the statistical power of the results.

On the one hand, a two-class approach generates an increment of the intra-class heterogeneity hence increasing the complexity of correctly classifying patients due to the remarkable overlap of their VAT weight distribution (see Supplementary Fig. 1a). This fact could have potentially lowered the clinical significance of the outcomes.

On the other hand, a three-class approach allows better discrimination of the two classes of major interest—class 0, including subjects with lower VAT weight and so lower risk, and class 2, containing subjects with higher VAT and so higher risk— as the amount of overlap is drastically reduced (see Supplementary Fig. 1b). Moreover, it adds an intermediate class that can contribute to a more refined classification (e.g. low, medium, and high-risk level) without losing statistical power.

Classification models

In general, LR works well on a wide variety of datasets, being its performance better than that of decision trees (and its derived random forest and XGBoost) and k-nearest neighbors⁵⁷, as also noticeable from results in Table 5. This can be explained by the fact that LR regression is a classification technique where the target variable (the classes in this study) is assumed to be categorical (i.e. class 0, class 1, and class 2). Specifically, LR is best suited in the context of a binomial problem, as in the case of this study where the one-vs-rest analysis was employed^34,42. The performance of k-nearest neighbors is generally worse on high-dimensional data especially in the presence of outliers, as it could be in this study, since they may negatively influence the calculation of the distance function⁴². SVM, on the other hand, has shown performance comparable to LR in the different studies on medical data sets^58,59 being particularly useful in those scenarios where the number of variables was much greater than the number of samples. Nevertheless, LR remains the most familiar to clinicians given the relatively straightforward relationship between the inputs and output⁶⁰.

We found remarkable similarities in the resulting HPs configuration for LR between ALLdata and DXAdata as well as in the confusion matrices, ROC, and PR graphs. This is probably because DXA-based variables could display a better correlation with VAT weight than blood chemicals thus reducing their contribution to the final classification (see Fig. 3a). Nevertheless, the LR model derived from BLDdata showed comparable test accuracy and similar classification performance (see Fig. 2) to ALLdata and DXAdata, at least for classes 0 and 2. In any case, class 1 appears to be the most difficult to assess correctly. This could be a consequence of the VAT distribution within class 1 where, as can be noticed in Fig. 1, most of the values lay near the boundary with either class 0 or class 2.

Classification models interpretability and performance assessment

SHAP was employed to better understand the underlying mechanisms of the LR classification and analyze the influence of each variable on the classification, sorting them from the most to the least relevant. Moreover, the length of each coloured bar relative to the others provides information on the importance of a given variable to that particular class. By visual inspection of SHAP diagrams, it is clear that both LR classification models derived from ALLdata (Fig. 3a) and DXAdata (Fig. 3b) are more biased toward class 0 and 2 as blue and green bars overshadow the orange one across all the dataset. On the contrary, classes in BLDdata (Fig. 3c) appear to be more balanced across variables, although those not related to blood (i.e. Weight, Age and Hip) still lean toward classes 0 and 2.

The one-vs-rest analysis allows us to extend any binary classifier to multi-class problems and has been extensively employed in multi-class classification problems^61,62,63 and to evaluate the ability of the selected LR model to correctly distinguish patients in a specific class from all the others. Similar results between ALLdata and DXAdata can be observed when comparing AUCs for ROC and PR curves, see Fig. 4, being the classification by the BLDdata, on average, more accurate (i.e. higher AUCs). Interestingly, BLDdata-derived LR showed better performance in classifying class 1 than the same model obtained from any of the other two databases.

One could argue that when considering the ALLdata, blood chemicals appear to be less relevant than DXA-based variables (see Fig. 3a). However, despite the small reduction in the AUC for class 2, both the ROC and PR graphs point to better general outcomes for BLDdata-derived LR especially when it comes to class 1 patients which were almost completely classified thanks to the contribution to the model of blood chemicals such as AST, COLT, usCRP, LDL, and FPG, (see Fig. 3c) thus confirming their crucial role in the classification task in the BLDdata.

Clinical significance

This study demonstrated the usefulness of machine learning algorithms in building risk stratification models based not only on variables directly connected to VAT weight (i.e. DXAdata) but also on those derived from non-imagining techniques, such as blood chemical concentration, that can be altered by an excess of VAT. Indeed, thanks to machine learning, healthcare professionals can take advantage of subtle blood variations, which may remain unnoticed at first glance, and stratify women living with obesity according to their VAT weight using only common laboratory metrics and routine clinical information. In clinical practice, those non-imaging-based models could be employed in primary health care centers for early-stage risk stratification, enabling timely interventions to prevent severe clinical outcomes.

Furthermore, our results may represent an advancement in assessing obesity-related comorbidities, particularly cardiovascular disease because of its clear association with VAT. This gains significant relevance in the context of women, as data from the Framingham Heart Study⁶⁴ unveils a heightened likelihood of developing cardiovascular disease due to obesity in women (64%) compared to men (46%). Obesity, alongside other contributing factors, markedly influences the prevalence and mortality of cardiovascular diseases in women, establishing it as a crucial focal point for health interventions^65,66.

Limitations

First, it is important to note that usCRP may not consistently be included in routine blood sample tests, particularly in primary health care centres. Second, the relatively small number of subjects included might have hampered our ability to find significant changes in some of the parameters proposed in the original study and thus, have limited the total amount of possible variables to include in the final model. Third, the study population included people living with a severe degree of obesity making it impossible to test the model performance over subjects with lower degrees of obesity. Fourth, the threshold values used to define the three proposed categories may not be suitable for other cohorts, limiting the extrapolation of our results to other populations. Additionally, only women were considered to avoid the confounding effect of gender. Finally, the reduced size of the study population prevents us from applying more sophisticated techniques such as those based on deep learning analysis as proposed in similar studies like Agrawa et al.⁶⁷ or in Klarqvist et al.⁶⁸.

Conclusions and future research

This study aimed to assess the feasibility of data-driven machine learning models to directly estimate VAT weight as well as classify women with obesity based on their VAT content, using the concentration of specific blood parameters. Additionally, we sought to compare the performance of these models with variables derived from DXA.

MLR models failed to robustly predict VAT weight in our cohort independently of the dataset thus preventing their usage in clinical practice. Out of the six classification models LR was found to be the most accurate. Furthermore, the classification obtained through blood chemicals appeared more robust than DXA variables displaying higher accuracy, recall, and precision. Accordingly, the usage of machine learning together with non-imaging techniques can enhance early risk stratification of women living with obesity. This represents a significant advancement in the context of preventive and personalized medicine, offering an easier and more effective approach to managing a life-threatening condition like excess VAT content.

Future investigations, incorporating a larger participant pool and a control group, would strengthen the statistical power of our findings. Moreover, this expansion would facilitate the exploration of more advanced classification and, possibly, direct VAT estimation techniques, including those based on neural networks.

References

World Obesity Federation. World obesity atlas 2023 (2023). https://data.worldobesity.org/publications/?cat=19, Accessed: 2024-01-20.
Blüher, M. Obesity: global epidemiology and pathogenesis. Nat. Rev. Endocrinol. 15, 288–298. https://doi.org/10.1038/s41574-019-0176-8 (2019). Number: 5 Publisher: Nature Publishing Group.
Hecker, J., Freijer, K., Hiligsmann, M. & Evers, S. M. A. A. Burden of disease study of overweight and obesity; the societal impact in terms of cost-of-illness and health-related quality of life. BMC Public Health 22, 46. https://doi.org/10.1186/s12889-021-12449-2 (2022).
Article CAS PubMed PubMed Central Google Scholar
GBD 2019 Universal Health Coverage Collaborators. Measuring universal health coverage based on an index of effective coverage of health services in 204 countries and territories, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet (London, England) 396, 1250–1284, https://doi.org/10.1016/S0140-6736(20)30750-9 (2020).
Kahn, S. E., Hull, R. L. & Utzschneider, K. M. Mechanisms linking obesity to insulin resistance and type 2 diabetes. Nature 444, 840–846. https://doi.org/10.1038/nature05482 (2006).
Article ADS CAS PubMed Google Scholar
Després, J.-P. & Lemieux, I. Abdominal obesity and metabolic syndrome. Nature 444, 881–887. https://doi.org/10.1038/nature05488 (2006).
Article ADS CAS PubMed Google Scholar
Klop, B., Elte, J. W. F. & Castro Cabezas, M. Dyslipidemia in obesity: Mechanisms and potential targets. Nutrients 5, 1218–1240. https://doi.org/10.3390/nu5041218 (2013).
Article CAS PubMed PubMed Central Google Scholar
Jalili, V. et al. The association between obesity with serum levels of liver enzymes, alanine aminotransferase, aspartate aminotransferase, alkaline phosphatase and gamma-glutamyl transferase in adult women. Endocrinol. Diabetes Metab. 5, e367. https://doi.org/10.1002/edm2.367 (2022).
Article CAS PubMed PubMed Central Google Scholar
Fox, C. S. et al. Abdominal visceral and subcutaneous adipose tissue compartments: Association with metabolic risk factors in the Framingham Heart Study. Circulation 116, 39–48. https://doi.org/10.1161/CIRCULATIONAHA.106.675355 (2007).
Article PubMed Google Scholar
Primeau, V. et al. Characterizing the profile of obese patients who are metabolically healthy. Int. J. Obesity 35, 971–981. https://doi.org/10.1038/ijo.2010.216 (2011).
Article CAS Google Scholar
Ferrannini, E. et al. Insulin resistance and hypersecretion in obesity. European Group for the Study of Insulin Resistance (EGIR). J. Clin. Investig. 100, 1166–1173. https://doi.org/10.1172/JCI119628 (1997).
Article CAS PubMed PubMed Central Google Scholar
Bonora, E. et al. Prevalence of insulin resistance in metabolic disorders: The Bruneck Study. Diabetes 47, 1643–1649. https://doi.org/10.2337/diabetes.47.10.1643 (1998).
Article CAS PubMed Google Scholar
Karelis, A. D. Metabolically healthy but obese individuals. Lancet (London, England) 372, 1281–1283. https://doi.org/10.1016/S0140-6736(08)61531-7 (2008).
Article PubMed Google Scholar
Blüher, M. Metabolically healthy obesity. Endocr. Rev. 41, bnaa004. https://doi.org/10.1210/endrev/bnaa004 (2020).
Article PubMed PubMed Central Google Scholar
Jensen, M. D. Role of body fat distribution and the metabolic complications of obesity. J. Clin. Endocrinol. Metab. 93, S57–S63. https://doi.org/10.1210/jc.2008-1585 (2008).
Article CAS PubMed PubMed Central Google Scholar
Després, J.-P. Body fat distribution and risk of cardiovascular disease. Circulation 126, 1301–1313. https://doi.org/10.1161/CIRCULATIONAHA.111.067264 (2012).
Article PubMed Google Scholar
Frank, A. P., de Souza Santos, R., Palmer, B. F. & Clegg, D. J. Determinants of body fat distribution in humans may provide insight about obesity-related health risks. J. Lipid Res. 60, 1710–1719. https://doi.org/10.1194/jlr.R086975 (2019).
Article CAS PubMed Google Scholar
Shuster, A., Patlas, M., Pinthus, J. H. & Mourtzakis, M. The clinical importance of visceral adiposity: A critical review of methods for visceral adipose tissue analysis. Br. J. Radiol. 85, 1–10. https://doi.org/10.1259/bjr/38447238 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dhawan, D. & Sharma, S. Abdominal obesity, adipokines and non-communicable diseases. J. Steroid Biochem. Mol. Biol. 203, 105737. https://doi.org/10.1016/j.jsbmb.2020.105737 (2020).
Article CAS PubMed PubMed Central Google Scholar
Moreira, V. C., Silva, C. M. S., Welker, A. F. & da Silva, I. C. R. Visceral Adipose Tissue Influence on Health Problem Development and Its Relationship with Serum Biochemical Parameters in Middle-Aged and Older Adults: A Literature Review. J. Aging Res. 2022, e8350527. https://doi.org/10.1155/2022/8350527 (2022). Publisher: Hindawi.
Chen, Q. et al. Effect of visceral adipose tissue mass on coronary artery disease and heart failure: A Mendelian randomization study. Int. J. Obesity 46, 2102–2106. https://doi.org/10.1038/s41366-022-01216-x (2022).
Article CAS Google Scholar
Sommer, I. et al. The performance of anthropometric tools to determine obesity: A systematic review and meta-analysis. Sci. Rep. 10, 12699. https://doi.org/10.1038/s41598-020-69498-7 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Frankenfield, D. C., Rowe, W. A., Cooney, R. N., Smith, J. S. & Becker, D. Limits of body mass index to detect obesity and predict body composition. Nutrition 17, 26–30. https://doi.org/10.1016/S0899-9007(00)00471-8 (2001).
Article CAS PubMed Google Scholar
Okorodudu, D. O. et al. Diagnostic performance of body mass index to identify obesity as defined by body adiposity: A systematic review and meta-analysis. Int. J. Obesity 34, 791–799. https://doi.org/10.1038/ijo.2010.5 (2010). Number: 5 Publisher: Nature Publishing Group.
Nuttall, F. Q. Body mass index: Obesity, BMI, and health: A critical review. Nutr. Today 50, 117–128. https://doi.org/10.1097/NT.0000000000000092 (2015).
Article PubMed PubMed Central Google Scholar
Rothney, M. P., Brychta, R. J., Schaefer, E. V., Chen, K. Y. & Skarulis, M. C. Body composition measured by dual-energy X-ray absorptiometry half-body scans in obese adults. Obesity (Silver Spring, Md.) 17, 1281–1286. https://doi.org/10.1038/oby.2009.14 (2009).
Article PubMed Google Scholar
Ponti, F. et al. DXA-assessed changes in body composition in obese women following two different weight loss programs. Nutrition (Burbank, Los Angeles County, Calif.) 46, 13–19. https://doi.org/10.1016/j.nut.2017.07.016 (2018).
Article PubMed Google Scholar
Messina, C. et al. Body composition with dual energy X-ray absorptiometry: From basics to new tools. Quant. Imaging Med. Surg. 10, 1687698. https://doi.org/10.21037/qims.2020.03.02 (2020).
Article Google Scholar
Palmer, B. F. & Clegg, D. J. The sexual dimorphism of obesity. Mol. Cell. Endocrinol. 402, 113–119. https://doi.org/10.1016/j.mce.2014.11.029 (2015).
Article CAS PubMed Google Scholar
Chang, E., Varghese, M. & Singer, K. Gender and sex differences in adipose tissue. Curr. Diabetes Rep. 18, 69. https://doi.org/10.1007/s11892-018-1031-3 (2018).
Article CAS Google Scholar
Osorio-Conles, O. et al. Positive effects of a mediterranean diet supplemented with almonds on female adipose tissue biology in severe obesity. Nutrients 14, 2617. https://doi.org/10.3390/nu14132617 (2022).
Article CAS PubMed PubMed Central Google Scholar
Pané, A. et al. Effects of bariatric surgery on blood and vascular large extracellular vesicles according to type 2 diabetes status. J. Clin. Endocrinol. Metab. 109, e107–e118. https://doi.org/10.1210/clinem/dgad473 (2023).
Article PubMed Google Scholar
World Medical Association Declaration of Helsinki - Ethical Principles for Medical Research Involving Human Subjects (2022). https://www.wma.net/policies-post/wma-declaration-of-helsinki-ethical-principles-for-medical-research-involving-human-subjects/. Accessed: 2024-03-08.
Jolly, K. Machine Learning with scikit-learn Quick Start Guide: Classification, regression, and clustering techniques in Python (Packt Publishing Ltd, 2018).
Google Scholar
Geron, A. Hands-on Machine Learning with Scikit-Learn, Keras & TensorFlow (O’Reilly Media, Inc., 2019).
Google Scholar
Kapoor, A., Gulli, A. & Pal, S. Deep Learning with TensorFlow and Keras: Build and Deploy Supervised, Unsupervised, Deep, and Reinforcement Learning Models 3rd edn. (Packt Publishing Ltd, 2022).
Google Scholar
sklearn.impute.knnimputer - user manual. https://scikit-learn.org/stable/modules/generated/sklearn.impute.KNNImputer.html, Accessed: 2024-05-02.
Al-Zaiti, S. S. et al. A clinician’s guide to understanding and critically appraising machine learning studies: A checklist for Ruling Out Bias Using Standard Tools in Machine Learning (ROBUST-ML). Eur. Heart J. Digital Health 3, 125–140. https://doi.org/10.1093/ehjdh/ztac016 (2022).
Article Google Scholar
Ferdowsy, F., Rahi, K. S. A., Jabiullah, M. I. & Habib, M. T. A machine learning approach for obesity risk prediction. Curr. Res. Behav. Sci. 2, 100053. https://doi.org/10.1016/J.CRBEHA.2021.100053 (2021).
Article Google Scholar
Jeon, J., Lee, S. & Oh, C. Age-specific risk factors for the prediction of obesity using a machine learning approach. Front. Public Health.https://doi.org/10.3389/FPUBH.2022.998782/PDF (2023).
Article PubMed PubMed Central Google Scholar
Bag, H. G. G. et al. Estimation of obesity levels through the proposed predictive approach based on physical activity and nutritional habits. Diagnostics.https://doi.org/10.3390/DIAGNOSTICS13182949 (2023).
Article Google Scholar
Mitchell, T. Machine Learning (McGraw-Hill, New York, 1997).
Google Scholar
Lundberg, S. M. & Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems, 4768—4777. NIPS 2017 (Long Beach, CA, USA, 2017).
Li, X. et al. Efficient Shapley Explanation For Features Importance Estimation Under Uncertainty. In Proceedings of the Medical image computing and computer-assisted intervention, 792–801, https://doi.org/10.1007/978-3-030-59710-8_77 (2020).
Bradley, A. P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 30, 1145–1159. https://doi.org/10.1016/S0031-3203(96)00142-2 (1997).
Article ADS Google Scholar
He, H. & Garcia, E. A. Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21, 1263–1284 (2009).
Article Google Scholar
Bleakley, K., Biau, G. & Vert, J.-P. Supervised reconstruction of biological networks with local models. Bioinformatics 23, i57–i65. https://doi.org/10.1093/bioinformatics/btm204 (2007).
Article CAS PubMed Google Scholar
De Lorenzo, A. et al. New obesity classification criteria as a tool for bariatric surgery indication. World J. Gastroenterol. 22, 681–703. https://doi.org/10.3748/wjg.v22.i2.681 (2016).
Article CAS PubMed PubMed Central Google Scholar
Longo, M. et al. Adipose tissue dysfunction as determinant of obesity-associated metabolic complications. Int. J. Mol. Sci. 20, 2358. https://doi.org/10.3390/ijms20092358 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wan, C. S. et al. Bioelectrical impedance analysis to estimate body composition, and change in adiposity, in overweight and obese adolescents: comparison with dual-energy x-ray absorptiometry. BMC Pediatr. 14, 249. https://doi.org/10.1186/1471-2431-14-249 (2014).
Article PubMed PubMed Central Google Scholar
Ferenci, T. & Kovács, L. Predicting body fat percentage from anthropometric and laboratory measurements using artificial neural networks. Appl. Soft Comput. 67, 834–839. https://doi.org/10.1016/j.asoc.2017.05.063 (2018).
Article Google Scholar
Minetto, M. A., Busso, C., Lalli, P., Gamerro, G. & Massazza, G. DXA-derived adiposity and lean indices for management of cardiometabolic and musculoskeletal frailty: Data interpretation tricks and reporting tips. Front. Rehabilit. Sci. 2, 712977. https://doi.org/10.3389/fresc.2021.712977 (2021).
Article Google Scholar
Chen, H. et al. Using blood indexes to predict overweight statuses: An extreme learning machine-based approach. PLoS ONE 10, e0143003. https://doi.org/10.1371/journal.pone.0143003 (2015).
Article CAS PubMed PubMed Central Google Scholar
Schrover, I. M., van der Graaf, Y., Spiering, W. & Visseren, F. L. The relation between body fat distribution, plasma concentrations of adipokines and the metabolic syndrome in patients with clinically manifest vascular disease. Eur. J. Prevent. Cardiol. 25, 1548–1557. https://doi.org/10.1177/2047487318790722 (2018).
Article Google Scholar
Mitu, I. et al. Artificial Neural Network Models for Accurate Predictions of Fat-Free and Fat Masses. Using Easy-to-Measure Anthropometric Parameters. Biomedicines 11, 489. https://doi.org/10.3390/biomedicines11020489 (2023). Number: 2 Publisher: Multidisciplinary Digital Publishing Institute.
pwr: Basic Functions for Power Analysis. https://CRAN.R-project.org/package=pwr. Accessed: 2024-05-03.
Dreiseitl, S. & Ohno-Machado, L. Logistic regression and artificial neural network classification models: A methodology review. J. Biomed. Inform. 35, 352–359. https://doi.org/10.1016/S1532-0464(03)00034-0 (2002).
Article PubMed Google Scholar
Dreiseitl, S. et al. A comparison of machine learning methods for the diagnosis of pigmented skin lesions. J. Biomed. Inform. 34, 28–36. https://doi.org/10.1006/jbin.2001.1004 (2001).
Article CAS PubMed Google Scholar
Chang, R.-F., Wu, W.-J., Moon, W. K., Chou, Y.-H. & Chen, D.-R. Support vector machines for diagnosis of breast tumors on US images. Acad. Radiol. 10, 189–197. https://doi.org/10.1016/S1076-6332(03)80044-2 (2003).
Article PubMed Google Scholar
Battineni, G., Chintalapudi, N. & Amenta, F. Machine learning in medicine: Performance calculation of dementia prediction by support vector machines (SVM). Inform. Med. Unlocked 16, 100200. https://doi.org/10.1016/j.imu.2019.100200 (2019).
Article Google Scholar
Varpa, K., Joutsijoki, H., Iltanen, K. & Juhola, M. Applying one-vs-one and one-vs-all classifiers in k-nearest neighbour method and support vector machines to an otoneurological multi-class problem. Stud. Health Technol. Inform. 169, 579–583 (2011).
PubMed Google Scholar
Liu, Z., Bensmail, H. & Tan, M. Efficient feature selection and multiclass classification with integrated instance and model based learning. Evolut. Bioinform. Online 8, 197–205. https://doi.org/10.4137/EBO.S9407 (2012).
Article Google Scholar
Xue, Y. & Hauskrecht, M. Active learning of multi-class classification models from ordered class sets. Proc. AAAI Conf. Artif. Intelligence. 33, 5589–5596 (2019).
Article Google Scholar
Wilson, P. W. F., D’Agostino, R. B., Sullivan, L., Parise, H. & Kannel, W. B. Overweight and obesity as determinants of cardiovascular risk: The Framingham experience. Arch. Internal Med. 162, 1867–1872. https://doi.org/10.1001/archinte.162.16.1867 (2002).
Article Google Scholar
Bhupathiraju, S. N. & Hu, F. B. Epidemiology of Obesity and Diabetes and Their Cardiovascular Complications. Circulation Research 118, 1723–1735. https://doi.org/10.1161/CIRCRESAHA.115.306825 (2016). Publisher: American Heart Association.
Vogel, B. et al. The Lancet women and cardiovascular disease Commission: Reducing the global burden by 2030. Lancet 397, 2385–2438. https://doi.org/10.1016/S0140-6736(21)00684-X (2021).
Article PubMed Google Scholar
Agrawal, S. et al. Bmi-adjusted adipose tissue volumes exhibit depot-specific and divergent associations with cardiometabolic diseases. Nat. Commun. 14, 1–10. https://doi.org/10.1038/s41467-022-35704-5 (2023).
Article CAS Google Scholar
Klarqvist, M. D. et al. Silhouette images enable estimation of body fat distribution and associated cardiometabolic risk. NPJ Digital Med. 5, 1–9. https://doi.org/10.1038/s41746-022-00654-1 (2022).
Article Google Scholar

Download references

Acknowledgements

This study received support from the Spanish Ministry of Science and Innovation (MCIN)/FEDER projects: PID2022-138537OB-I00 granted to P.M.G.-R. and PID2021-126441NB-I00 awarded to J. C.J.-C., co-funded by the European Regional Development Fund (ERDF). Funding was also provided by the Instituto de Salud Carlos III (ISCIII) through the projects PI20/00424 and PI17/00279 to A.J., and PI22/00394 to A.d.H., co-funded by the European Union. Additional support for this study was obtained from the “Pla Estratègic de Recerca i Innovació en Salut” (PERIS) (SLT008/18/00127 to A.J.), the “Ajut a la Recerca Josep Font” (2018, Hospital Clínic de Barcelona to A.P.), the CIBEROBN initiative of ISCIII (Spain), the Government of Catalonia (2017SGR845 to J.C.J.-C.), and Pfizer Global Grants (Pfizer Tracking No. 76587213 to J.C.J.-C.). F. P. expresses gratitude for the support received through the Juan de la Cierva fellowship ID JDC2022-049039-I, co-funded by MCIN/AEI/10.13039/501100011033 and the European Union-NextGenerationEU/PRTR.

Author information

These authors jointly supervised this work: Josep C. Jiménez‑Chillarón and Pablo M. Garcia‑Roves.

Authors and Affiliations

Biophysics unit, Department of Physiological Sciences, Faculty of Medicine and Health, Universitat de Barcelona, Bellvitge campus, 08907, Barcelona, Spain
Flavio Palmieri, Pau Gama-Perez, Josep C. Jiménez-Chillarón & Pablo M. Garcia-Roves
Nutrition, Metabolism and Gene Therapy Group; Diabetes and Metabolism Program; Bellvitge Biomedical Research Institute (IDIBELL), 08908, Barcelona, Spain
Flavio Palmieri & Pablo M. Garcia-Roves
Escola d’Enginyeria de Barcelona Est (EEBE) Universitat Politècnica De Catalunya. Barcelona Tech-UPC, 08019, Barcelona, Spain
Nidà Farooq Akhtar
Obesity Unit, Endocrinology and Nutrition Department, Hospital Clínic de Barcelona, 08036, Barcelona, Spain
Adriana Pané, Amanda Jiménez, Josep Vidal & Ana de Hollanda
Centro de Investigación Biomédica en Red de la Fisiopatología de la Obesidad y Nutrición (CIBEROBN), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain
Adriana Pané, Amanda Jiménez, Ana de Hollanda & Pablo M. Garcia-Roves
Fundació Clínic per a la Recerca Biomèdica (FCRB)-Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), 08036, Barcelona, Spain
Amanda Jiménez, Romina Paula Olbeyra, Judith Viaplana, Josep Vidal & Ana de Hollanda
Centro de Investigación Biomédica en Red de Diabetes y Enfermedades Metabólicas Asociadas (CIBERDEM), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain
Josep Vidal
Metabolic diseases of pediatric origin unit, Institut de Recerca Sant Joan de Déu - Barcelona Children’s Hospital, 08950, Esplugues del Llobregat, Spain
Josep C. Jiménez-Chillarón

Authors

Flavio Palmieri
View author publications
You can also search for this author in PubMed Google Scholar
Nidà Farooq Akhtar
View author publications
You can also search for this author in PubMed Google Scholar
Adriana Pané
View author publications
You can also search for this author in PubMed Google Scholar
Amanda Jiménez
View author publications
You can also search for this author in PubMed Google Scholar
Romina Paula Olbeyra
View author publications
You can also search for this author in PubMed Google Scholar
Judith Viaplana
View author publications
You can also search for this author in PubMed Google Scholar
Josep Vidal
View author publications
You can also search for this author in PubMed Google Scholar
Ana de Hollanda
View author publications
You can also search for this author in PubMed Google Scholar
Pau Gama-Perez
View author publications
You can also search for this author in PubMed Google Scholar
Josep C. Jiménez-Chillarón
View author publications
You can also search for this author in PubMed Google Scholar
Pablo M. Garcia-Roves
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Each of the authors gave substantial contributions to the study design, and data interpretation and discussion. In particular: F. P. and N. F. A. developed the algorithms and analysed the data; F. P., A. P., A. J., R. P. O., J. V., J. V., A. d. H., P. G. P., J. C. J.-C. and P. M. G.-R. conceived and conducted the study, and discussed and interpreted the results; F. P. structured and drafted the paper. All authors critically reviewed the article and gave final approval to the submitted version.

Corresponding authors

Correspondence to Flavio Palmieri or Pablo M. Garcia-Roves.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Palmieri, F., Akhtar, N.F., Pané, A. et al. Machine learning allows robust classification of visceral fat in women with obesity using common laboratory metrics. Sci Rep 14, 17263 (2024). https://doi.org/10.1038/s41598-024-68269-y

Download citation

Received: 28 February 2024
Accepted: 22 July 2024
Published: 27 July 2024
DOI: https://doi.org/10.1038/s41598-024-68269-y
Springer Nature Limited

Machine learning allows robust classification of visceral fat in women with obesity using common laboratory metrics

Abstract

Similar content being viewed by others

Machine learning prediction of susceptibility to visceral fat associated diseases

DXA-measured visceral adipose tissue predicts impaired glucose tolerance and metabolic syndrome in obese Caucasian and African-American women

Equations for predicting DXA-measured visceral adipose tissue mass based on BMI or weight in adults

Explore related subjects

Introduction

Materials

Study population

Variables included in the study

Methods

Preprocessing

Regression models

Classification models

Classification models interpretability and performance assessment

Results

Analysed variables

Regression models

Classification models

Classification models interpretability and performance assessment

Discussion

Analysed variables

Regression models

Selection of the number of classes

Classification models

Classification models interpretability and performance assessment

Clinical significance

Limitations

Conclusions and future research

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation