Introduction

The proportion of the Brazilian adults and elders population has increased due to an ongoing epidemiological transition process in Brazil and abroad [1, 2]. The Brazilian Unified Health System (SUS) has been providing universal coverage for the population since 2004 by the Family health strategy (FHS) [3, 4]. Despite little improvements in oral health and the declining prevalence of edentulism in the population, after the Brazilian Smiling Program, elders had not changed the DMF-T index. In 2003, it was 27,8 (92% of tooth loss), maintaining stable at 27,5 (91,9% of tooth loss) in 2010. Indeed, prevalence of edentulous changed from 53,2% in 2003 to 47,9% in 2010, corroborating that edentulism in the Brazilian population is a structural issue [5].

Despite advancements in Brazilian public policy, it is challenging to set priorities on the dental agenda for the whole population, especially concerning adults and elders, due to working time hours or mobility barriers. Primary Health Care (PHC) units need to equilibrate those who have already concluded dental treatment and need to be monitored through time on recalls intervals to prevent oral diseases [6,7,8,9]. Furthermore, it is necessary to take into account the social determinants of oral health [10, 11] that modulate access to oral health services in terms of structural, psychological and literacy barriers to a dental appointment [12, 13]. Educational-related inequalities in access to oral health services were found in several countries in elders [14]. Even in high-income countries, adults have neglected oral health, with 21% not attending any year in the past six years in Scotland [15]. A machine learning approach would help predict those adults and elders at higher risk for tooth loss and helps schedule a dental visit and establish masticatory function and quality of life. This approach has been tested with US data only for adults [16] and elders in Japan [17]. It would be great to organise dental agenda and set priorities. Tooth loss is a paramount public health concern for global health [18, 19] that impacts peoples’ quality of life and is influenced by socioeconomic position [20, 21].

The aims of the present study were to investigate last dental visit as a mediator in the relationship between socioeconomic status and lack of functional dentition/severe tooth loss and use a machine learning approach to predict those adults and elders at higher risk of tooth loss in Brazil.

Methods

The data analysed in this study came from the 2019 National Health Survey (Pesquisa Nacional de Saúde - PNS), designed to have a nationally representative sample of the Brazilian population. The 2019 PNS is a cross-sectional household survey with a sampling process carried out in three stages. More detailed information can be found elsewhere [22]. The PNS data is freely available on the Brazilian Institute of Geography and Statistics (IBGE) website: https://www.ibge.gov.br/estatisticas/downloads-estatisticas.html.

Tooth loss assessment

Tooth loss was measured by asking: “Have you lost any of your permanent upper teeth?“ Response options were (1) No; (2) Yes, I have lost (number) teeth; (3) Yes, I have lost all my upper teeth. The same question was asked for the lower permanent teeth. Upper and lower self-reported tooth loss count was analysed classified into two levels of severity, regardless of the tooth position: (1) Functional dentition, defined as a loss of up to 12 permanent teeth; and (2) Severe tooth loss considered the loss of 23-32 [18].

Covariates

Gender (men or women); age; schooling was classified into formative years (0 to 8) and formal education (8 or more years) for both adults and elders. Income was dichotomized according to the official Brazilian minimum wage (MW) in 2019, into equal or higher than one per capita MW and less than one. Socioeconomic status was a latent variable measured by 2 indicators: schooling and income. The last dental visit was categorized into two groups: up to two years and more than two years. Officially named in Brazil as colour/race, the self-described race was assessed using five options according to the categories proposed by the Brazilian Institute of Geography and Statistics (IBGE): (1) white; (2) black; (3) yellow; (4) brown, and (5) indigenous. Self-reported smoking status was measured as those who never smoked and individuals who smoked in the past and/or are current smokers and use of dental floss (yes/no) only for adults due to the percentage of edentulous in the elders (almost half of the sample that does not have chance to use dental floss due to the loss of all teeth). Dental floss use was considered as an oral health behaviour and was stratified as (yes/no).

Statistical analysis

Age-standardised estimates for functional dentition and severe tooth loss (including edentulous) were reported for each covariate. Data analysis for the Structural Equation Modelling (SEM) was performed using the Stata software version 14.2 (StataCorp LP, College Station, United States) using the survey module that considers the effects of stratification and conglomeration in estimating indicators and their precision measures. This method allows the simultaneous testing of complex interrelationships between variables in public health. The variables were tested using the social determinants of oral health [10] and Sisson’s model as theoretical explanations [11]. Figure 1 exemplifies the finals SEM models for adults and elders. Standardised coefficients (SCs) were interpreted as being a small association (SC < 0.10), medium association (SCs between 0.30 and 0.50), and a strong association (SC > 0.50) [23].

Fig. 1
figure 1

The social determinants of oral health adapted to individual-level predictors

Machine learning approach

The Extreme gradient boost algorithm (Xgboost), based on sequential models of decision trees, was used to predict the outcome for adults and elders. Previous research has shown this algorithm has the highest area under the receiver operating characteristic curve (AUC) than others [16]. Appendix 1 shows the strategy for tuning all hyperparameters and the grids created. All of the results presented here are from the test set. Finally, to assess the predictive performance of the trained algorithm, the AUC, accuracy, sensitivity and specificity were calculated.

Furthermore, we computed the importance of each covariate in predicting our study outcomes, one for the adults and the elders. For the first sensitivity analysis, we stratify our dataset into the five main regions in Brazil (a proxy for human development levels). Southeast and South regions have better development levels than Northeast and North and different concentrations of dental surgeons workforce in both public and private sectors [24]. We used R (R Foundation for Statistical Computing, Vienna, Austria) software for our machine learning approach. We followed the STROBE guidelines for human observational studies and the checklist for the artificial intelligence approach [25].

Ethical aspects

The Brazilian Committee (Health National Council Committee) approved the Ethics in Human Research (protocol number 3.529.376).

Results

Out of the 90,846 individual who have answered the questionnaire, the analytical sample comprised 65,803 adults, and 22,728 elders. Table 1 shows the sample characteristics. The age-standardised estimates of having a functional dentition were higher among white participants, men, earning above one minimum wage, higher schooling, non-smokers, for both adults and elders.

Table 1 Descriptive characteristics and proportions of Brazilians adults and elders and age-standardised prevalence of levels of tooth loss in Brazilian National Health Survey 2019 (n = 88,531)

Table 2 showed the direct, indirect and total effects of SES, last dental visit, race and gender on the tooth loss outcomes, adjusted for smoke and use of dental floss (only for the adult’s population). Socioeconomic statuses SC = 0.20 (CI 95% 0.16; 0.23) was directly associated to the presence of functional dentition in adults and with last dental visit up to 2 years SC = 0.62 (CI 95% 0.56; 0.68). The last dental visit mediated the effect of race on tooth loss, favouring the Whites. Concerning the elders, better SES SC = − 0.71 (CI 95% -0.79; -0.62) and last dental visit up to 2 years SC = − 0.17 (CI 95% -0.20; -0.14) were associated with less severe tooth loss. The Last dental visit up to 2 years mediated the effect of race SC = − 0.15 (CI 95% -0.16; -0.14) and SES SC = − 0.17 (CI 95% -0.20; -0.14) on severe tooth loss, favouring the Whites and the higher socioeconomic position. The fit indices of the models were considered appropriate and are presented in Table 2.

Table 2 Structural Equation models with association of SES, race, gender and LDV on tooth loss, Brazilian National health survey, 2019

Table 3 shows the machine learning approach. For the adults, our trained Xgboost algorithm showed that the accuracy was 90%, predicting those with a lack of functional dentition, which means that in every ten adults, we correct classified nine in the Brazilian context. The AUC was 90%, sensitivity 38% and specificity 97%, and age was the main contributor. For the elders, our trained algorithm showed an accuracy of 70%. It means that we corrected classified seven in ten elders. The AUC was 76%, sensitivity 68% and specificity of 73%, and interestingly, the last dental visit was the main contributor. Sensitivity analysis by regions (a proxy for the human development levels) showed increased AUC by 91% and accuracy of 93% in the Southeast region in adults. Elders showed the same patterns. Indeed, regions changed the importance of individual variables in the machine learning approach (Appendix 1). In the second sensitivity analysis, with under sample the majority class in the training set, no differences were found in the testing set in terms of collected metrics.

Table 3 Machine learning approach and metrics for adults and elders in the whole country and divided by geographic regions

Discussion

This present research showed two critical findings. First, the last dental visit was a mediator between SES and severe tooth loss in elders. Secondly, in the machine learning approach, the trained Xgboost algorithm had good metrics predicting the lack of functional dentition in adults, with age, schooling and dental floss as the main contributors, indicating that it should be used for FHS in Brazil.

It is of fundamental importance to identify adults without functional dentition to schedule an appointment in primary health care. This can help with the principle of equity, giving more attention to those who need it most, considering that the absence of functional dentition affects the quality of life of adults and elders [26,27,28,29,30]. The algorithm trained for the Brazilian context is easy to use and with only a few input variables, generally collected in one visit by community health workers(CHW), an accuracy of 90% was achieved, that is, getting 90 out of 100 adults right. Moreover, for elders, identifying those with a higher risk for severe tooth loss in the adscript area of FHS could improve the planning and management for Dental Specialties Centers in Brazil. They are responsible for secondary care (prosthesis, for example) and give agility and resolution for elders to reestablish masticatory function and quality of life. For example, even without a dental consultation by a dentist in the PHC, the Family Health Strategy could use the trained algorithm, and target all individuals at higher risk for tooth loss. If correctly implemented, the work process has potential to be changed in the country. And the consultations could be target on those who need it more to prevent tooth loss. The algorithm can function as a support for care coordination.

Some structural barriers and contextual factors could be related to tooth loss, as recent evidence on the commercial determinants of health and its association with oral health [31]. The last dental visit was a mediator of SES for severe loss in elders and a borderline in adults for functional dentition (indirect effect p = 0.10). However, it is unrealistic for the health sector to be able to change people’s socioeconomic condition. Identifying those with more than 2 years of last dental visit, carrying out an active search in FHS and scheduling appointments, maybe with help of machine learning, could help adults and elders to prevent tooth loss. Future research needs to explore the role of contextual characteristics at different levels. An important finding of the machine learning approach for adults and elders was that schooling is one of the three most crucial predictors of lack of functional dentition/severe tooth loss in the Brazilian context. In a multicountry study, there were significant education-related inequalities in using oral health services by elders [14], corroborating our findings. Monitoring these inequalities is critical to planning and delivering dental services, organizing dental agenda, and setting priorities.

Limitations

This study has some strengths and limitations that should be acknowledged. The strengths were using a nationally representative sample of Brazilian adults and elders, the use of two definitions of tooth loss internationally to compare populations [18, 33,34,35] and promote machine learning as a potential path to set priorities on the dental agenda in the Unified health system in Brazil and care coordination.

One limitation was the self-reported nature of tooth loss assessment that may lead to information bias, but previous research has shown a good concordance between self-reported tooth loss and clinical evaluation in national surveys [32]. Another limitation of our study is its cross-sectional design, which may not establish a temporal relationship between exposures and outcomes.

Policymakers should use implementation science frameworks [36, 37] to train community health workers and measure organisational readiness [38, 39] for using the machine learning approaches. The algorithm could be retrained at local levels and improve current performance in each context of primary care in Brazil. Future works using tooth loss and machine learning approaches need to be further investigated.

In conclusion, more than two years of last dental visit appears to be associated with a severe loss in elders and lack of functional dentition in adults. The machine learning approach had a good performance to predict those individuals. The Family Health Strategy should use the trained algorithm to target those individuals in the Brazilian context and set priorities on dental agendas.