Background

According to a report by Mark Welch et al. [1], there are over 700 prokaryotic species that inhabit the human oral cavity. The oral microbiome plays a crucial role in maintaining not only oral health, but also overall bodily health [1, 2]. The human body is known to harbor numerous microorganisms that are associated with both health and disease [1, 3]. Several studies have suggested that oral microbiomes may be linked to various systemic diseases [3,4,5].

Research has shown that bacteria reside in specific niches within the human mouth [1, 6]. Moreover, differences in local sites within the oral cavity can lead to distinct distribution patterns as revealed by 16s rRNA sequencing [7,8,9,10]. Current studies are focused on bacterial colonization of various essential areas of the oral cavity, including dental surfaces, buccal mucosa, saliva, and the tongue [1, 2, 6]. However, unifying sampling methods in large-scale studies poses challenges, as there is a lack of standardized sampling protocols and regions for healthy individuals. Therefore, our upcoming research aims to address this gap [11, 12]. Investigating the microbiome that represents oral health in large populations is time-consuming and labor-intensive, highlighting the urgent need for a standardized reference. When it comes to the health of an individual, major bacterial sequencing is identical, supporting the opinion that a core microbiome exists in healthy individuals [13, 14].

The Human Microbiome Project Consortium revealed that the unique feature of an individual’s microbiome is stable and characteristic of healthy people [7, 15]. Accumulating evidence has shown that bacteria can choose the habitat that suit them best [1, 6, 13, 16]. At the micron scale, the distribution between Gram-positive and Gram-negative bacteria differs [17]. A study based on five individuals proved that Streptococcus mitis and Gemella hemolysans are the predominant species of the buccal epithelium [18]. Meanwhile, different niches are meaningful for studying the representatives of the oral microbiome [1, 19]. Selecting different sampling areas and techniques in the oral cavity is crucial for research and clinical applications. It was once believed that saliva represents the microbiome on the oral surface. However, Segata and his colleagues’ study implied that saliva has more bias on the tongue and palate [20]. The oral cavity’s diverse microenvironments, such as teeth and buccal mucosa, each host distinct microbial compositions. Using swabs or curette sampling techniques appropriately ensures comprehensive and reliable data, as recommended by the World Health Organization [21]. Efficient sampling is vital for epidemiological surveys, especially under time constraints. Optimized sampling strategies reduce survey time and resources, crucial for timely data collection during outbreaks or monitoring infectious diseases. Swab sampling is more convenient and faster than curette sampling, making it ideal for large-scale surveys.

Therefore, our hypothesis is that in individuals with good oral health, swab sampling will show no significant variation in the distribution of oral core microbiota across different regions of the oral cavity. This explores the feasibility of using swab sampling instead of curette sampling in large populations of healthy individuals.

Methods

Study design and participants

Five persons with no oral diseases were enrolled at the Stomatology Hospital of Tianjin Medical University, and informed consent was obtained from all subjects. The inclusion criteria is as follows: (1) People who had no oral diseases particularly caries, periodontitis, and oral mucosal diseases. (2) Teeth without previous restorations. Exclusion criteria is as follows: (1) People who are pregnant. (2) People who suffer from severe systematic diseases, such as diabetes, immune system diseases, and genetic diseases. (3) People who have taken antibiotics within two weeks before sampling.(4) People who have had or currently receiving radiotherapy or chemotherapy. (5) People with complete or incomplete absence of dentition. (6) Individuals who smoke or have smoked within the past six months.

Sample collection and clinical grouping

Ultimately, five teeth were included. One trained clinician examined the cases and collected the samples under aseptic conditions. We prevented subjects from eating and drinking for at least 2 h before sampling. Two sampling methods successively sampled each patient. The buccal plaque of the left and right lateral mandibular first molars was taken with curettes and swabs, respectively, and the buccal mucosa of the left and right lateral teeth was taken with swabs, respectively. Finally, the colonies of the left buccal swab, left tooth curette, left tooth swab, right buccal swab, right tooth curette, and right tooth swab were counted. All samples were transported to the laboratory on ice within 2 h of collection. Moreover, they were stored at -80℃ in a refrigerator before 16 S rDNA sequencing.

16 S rDNA sequencing

PCR (TransGenAP221-02: Transstart Fastpfu DNA Polymerase) was used to amplify the extracted genomic DNA.DNA concentration and purification were monitored on 1% agarose gels. The whole gene sequencing was conducted on the Miseq platform. Synthesize specific primers with barcodes according to the designated sequencing region, or synthesize fusion primers with mispaired bases. In our study, we used universal primers targeting the 16S rRNA gene for bacterial identification and diversity analysis. Specifically, the primers used were: Forward Primer: 338F(5’-ACTCCTACGGGAGGCAGCAG-3’); Reverse Primer: 806R(5’-GGACTACHVGGGTWTCTAAT-3’). These primers are well-established in microbiome research for their broad range and ability to amplify bacterial DNA across a wide variety of species. This choice was made to ensure a comprehensive analysis of the microbial communities present in the oral cavity.

Primers performed PCR with barcodes, which targeted the V3-V4 region of the 16 S rDNA region. PCR was performed under the following conditions: 94℃ for 5 min, followed by 28 cycles of 94℃ for 30 s, 55℃ for 30 s, and 72℃ for 1 min, and then a final elongation step at 72℃ for 7 min and 4℃ in the end. Using fast length adjustment of short reads (FLASH) software merged primitive DNA sequence. Using the database SRA (Sequence Read Archive, http://www.ncbi.nlm.nih.gov/Traces/sra) established by the NCBI data center to store shared raw sequencing data. The Quantitative Insights Into Microbial Ecology (QIIME) software were applied to analyze the results of sequencing. The data obtained from the MiSeq sequencing is in the form of paired-end sequences.

Data management and statistical analysis

To enhance data processing efficiency, we assigned numbers to samples collected in the same manner. The left tooth curette sample was labeled as AZ, the left buccal swab as AS, the left tooth swab as AX, the right tooth curette as BS, the right buccal swab as BZ, and the right tooth swab as BX. AZ comprised of samples A1, A4, A7, A11, and A13. AS comprised of samples A2, A5, A8, A10, and A14. AX comprised of samples A3, A6, A9, A12, and A15. BS comprised of samples A17, A20, A23, A25, and A29. BZ comprised of samples A16, A19, A22, A26, and A28. BX comprised of samples A18, A21, A24, A27, and A30 (Table S1).

Alpha diversity was used to estimate the richness and diversity of microbial species. We use QIIME (Version 1.8), Uclust (Version 1.2.22), and Usearch (Version 10.0.240) to analyze the results and generate rarefaction curves, and the R package to visualize the difference of each sample. Beta diversity was analyzed to evaluate differences among samples in species complexity. We used Qiime to analyze the results and rarefaction curves, and the R package to exhibit the differences of each sample. Weighted UniFrac distance was calculated using Qiime. Anonymous analysis was used to compare differences among groups with differences within groups. Linear discriminant (LDA) effect size analysis was performed, setting the LDA score to 3 by default (Fig S1). P-values less than 0.05 were considered statistically significant.

Results

The overall diversity of bacterial communities across samples

Initially, the obtained FASTQ data is filtered by removing the bases with quality scores below 20 from the end of the reads. A 50 bp window is set, and if the average quality score within the window is below 20, the bases at the end of the window are trimmed. Reads with less than 50 bp after quality control are filtered out. Then, the paired sequences are merged into a single sequence based on their PE sequencing overlap relationship. The raw data (Raw PE) is processed by removing the barcodes and primers and merging them to obtain raw tags. The specific result statistics are shown in Table S2. In our study, we collected 30 specimens for each of the 5 individuals who had a healthy oral status. The baseline information of the subjects is presented in Table 1, while the distribution statistics of high-quality sequences obtained after data preprocessing are shown in Table S3.

Table 1 Demographic and clinical parameters found no statistically significant differences in age

We utilized various indexes, namely Shannon, Specaccum, and Chao1, to estimate the α-diversity. The rarefaction curves of observed OTUs per sample indicated that additional sequencing would be necessary to capture all phylotypes (Fig. 1A). The Shannon-Wiener curve had reached a plateau, suggesting that the sample size was sufficient to capture species diversity (Fig. 1B). The species accumulation curves reflected the biodiversity within the sample. In this experiment, the total number of OTUs detected increased steadily with the increase of the sample size, and the trend of increase was flat. This indicated that the sample size in this experiment was sufficient to support the conclusion. The results showed that regardless of the sampling tools used, the microbiome diversity between the tooth surface and buccal mucosa was different. The group that took the right buccal mucosal swab sample had a faster decrease in the Rank-Abundance curve than the other groups, indicating that collecting samples in this way in clinic may result in uneven bacterial diversity. The most abundant sample content was obtained when the left side teeth were scraped and sampled. As revealed by the Shannon diversity index, there were significant differences in diversity between BZ and AZ or BS. The AZ-BS, AS-BZ, and Shannon values were statistically different from the AS-BZ group (P = 0.004, P = 0.003; Fig. 1C). This indicates that the biodiversity exhibited by the right buccal swab sampling was not abundant, and the biodiversity between the buccal mucosa and tooth surface was uneven. Overall, the sampling results of the AZ group were significantly better than the other groups in terms of microbial diversity and the authenticity of the measured data. The OTUs detected by collecting bacteria on teeth were higher than those collected on the buccal mucosa.

Fig. 1
figure 1

Description of oral microbiota structures in the samples. (a) Rarefaction analysis of bacterial 16 S rDNA gene sequences was applied. Suggested by a plateau, it told us whether additional taxa were found. Different colors represent different samples. (b) Shannon index curves were performed to assess whether the samples were sufficient, indicated by a plateau. (c) Box plots depict differences in bacterial diversity among AZ-AX, AS-BZ, BS-BX. BS represents the right tooth curette; AZ represents the left tooth curette; BZ represents the right buccal swab; AS represents the left buccal swab; AX represents the left tooth swab; BX represents the right tooth swab

Taxonomic analysis in each group

Firmicutes, Proteobacteria, Bacteroidetes, Actinobacteria, and Fusobacteria were the top five most abundant phyla, accounting for 98.26% of all sequences. Firmicutes was the most abundant phylum, comprising 41.25% of sequences (Fig. 2A). Neisseria, Streptococcus, Haemophilus, Veillonella, and Rothia were the five most abundant genera at the genus level (Fig. 2B).

Fig. 2
figure 2

Composition of bacterial communities across samples at the phylum and genus levels. (a) Relative abundance of bacterial phyla among each group. (b) Relative abundance of bacterial genus among each group. BS represents the right tooth curette; AZ represents the left tooth curette; BZ represents the right buccal swab; AS represents the left buccal swab; AX represents the left tooth swab; BX represents the right tooth swab

Association between groups of samples and indication of specific pathogens detection

Compared with right tooth sampling with a curette, swab sampling is more efficient in detecting Fusobacteria (Fig. 3A). There was a difference in abundance, including Bacteroidetes, Firmicutes, and Fusobacteria between AS-BZ/AZ-BS (Fig. 3B). Bacteroidetes were more abundant in the left tooth compared to the right. Heatmap analysis was used to exhibit the pathogen that was considered to lead to the disease. As the outcome revealed, Bacteroidetes, Fusobacteria, Spirochaetes, Cyanobacteria, Firmicutes, and Actinobacteria were part of the top 20 phyla (Fig. 4A). The Firmicutes and Proteobacteria were measured to have a similar distribution in each group. Prevotella intermedia was more abundant in the AX and AZ groups. Enterococcus faecalis and Porphyromonas endodontalis, two pathogens known to cause pulp diseases, showed significant abundance in the samples taken from the tooth surface (Fig. 4B). There was little difference in the sampling method for testing these two bacteria alone.

Fig. 3
figure 3

The taxonomic analysis of microbiome relations. (a) The Wilcoxon test of AX-BX and AS-BZ of the group on the phylum level. (b) The Wilcoxon test of AX-BX and AZ-BS of the group on the phylum level. BS represents the right tooth curette; AZ represents the left tooth curette; BZ represents the right buccal swab; AS represents the left buccal swab; AX represents the left tooth swab; BX represents the right tooth swab

Fig. 4
figure 4

The taxonomic analysis of colony composition. (a) The heatmap of the top 20 species of each group. (b) The heatmap of bacterial genera. BS represents the right tooth curette; AZ represents the left tooth curette; BZ represents the right buccal swab; AS represents the left buccal swab; AX represents the left tooth swab; BX represents the right tooth swab

Metastats analysis for inter-group significance indicated that the differences between groups were greater than the differences within groups, suggesting that the grouping was meaningful (Table 2). Buccal mucosa and tooth surface microorganisms share a great degree of similarity in composition and structure (Fig. 5A and B). The colonies obtained by the two methods of sampling at the teeth were more similar (Fig. 5C).

Table 2 Microbial community dissimilarity comparison among three groups using Anosim statistical method
Fig. 5
figure 5

Interpretation of Intra-Group Taxa. (a) Unifrac distance analysis of each group. (b) NMDS of oral microbiota in groups. (c) PCA analysis in different groups. BS: right tooth curette; AZ: left tooth curette; BZ: right buccal swab; AS: left buccal swab; AX: left tooth swab; BX: right tooth swab

After the LDA test, no significant difference was detected between the AX and BX groups (Fig. 6A). LDA analysis showed a significant difference in the Mycobacteria among the AS, AZ, BS, and BZ groups. In the AS group, Mycobacteriaceae had a high LDA score. In the samples taken from AZ, there were significant differences in the detection of Cardiobacterium, Corynebacterium, and Capnocytophaga. Among the samples obtained from the BS group, Fusobacteria and Porphyromonadaceae had a high LDA score. In the BZ group, the significant difference between Lactobacillus and Streptococcus was higher than in other groups. In the BS-BX group, the class level of Betaproteobacteria and Fusobacteria, the genus level of Neisseria, Pseudopropionibacterium, Leptotrichia, Corynebacterium, and Porphyromonas, the order level of Neisseriales, Fusobacteria, and Corynebacteriales, the family level of Neisseriaceae, Leptotrichiaceae, Corynebacteriaceae, and Porphyromonadaceae, and the species level of Pseudopropionibacterium propionicum, Corynebacterium matruchotii, Neisseria elongata, and Leptotrichia 225 were more abundant than in the tooth group (Fig. 6B). As for swab-sampling on teeth, the genus level of Capnocytophaga had a high LDA score. As for buccal mucosa, the species level of Mycobacterium neoaurum, Haemophilus 036, and Streptococcus 071, the family level of Streptococcaceae, the genus level of Haemophilus and Streptococcus, the order level of Lactobacillales, the phylum of Firmicutes, and the class level of Bacilli had a high LDA score. Bacteroidetes and Fusobacteria on the phylum level, and Porphyromonadaceae on the family level, were more abundant. The LDA score is high for Firmicutes on the phylum level, Bacilli on the class level, and Streptococci on the genus level. (Fig. 6C)

Fig. 6
figure 6

LDA Analysis of the Samples. (a) LDA scores predict gene function enriched in different groups. BS: right tooth curette; AZ: left tooth curette; BZ: right buccal swab; AS: left buccal swab; AX: left tooth swab; BX: right tooth swab. (b) Analysis of each group. (c) Difference between AS-BZ and AZ-BS groups. BS: right tooth curette; AZ: left tooth curette; BZ: right buccal swab; AS: left buccal swab; AX: left tooth swab; BX: right tooth swab

Discussion

The study aimed to analyze the composition and distribution of microbial communities in the buccal mucosa and tooth surfaces within the healthy oral cavity. The findings from this investigation provide valuable insights into the microbial diversity and its implications for oral health research.

The results indicated that there is a considerable similarity in the microbial composition between the buccal mucosa and tooth surfaces, with a total of 223 OTUs found in both regions. This suggests that these two sites share a core microbiome, despite variations in microbial abundance. Firmicutes were more prevalent in swab samples, while Fusobacteria and Bacteroidetes were more abundant in curette samples from the tooth surface. This distribution aligns with previous studies highlighting the dominance of these phyla in different oral niches [1, 22, 23].

The presence of significant microbial taxa such as Prevotella intermedia and Porphyromonas endodontalis on the tooth surface indicates the potential for using specific sampling methods to detect oral pathogens effectively. This is crucial for diagnosing and monitoring oral diseases. The study’s detection of these pathogens is consistent with findings from other research that identifies specific niches within the oral cavity for different bacterial species [2, 7, 18, 24].

Swab samples were found to be easier and less invasive to collect compared to curette samples, making them more suitable for large-scale epidemiological studies. However, the detection of disease-associated pathogens such as Enterococcus faecalis and Porphyromonas gingivalis was more efficient with curette sampling. This is consistent with previous studies, which suggest that the method of sampling can significantly impact the detection and analysis of microbial communities [4].

The similarities in microbial composition between the buccal mucosa and tooth surfaces support the use of buccal mucosa sampling as a representative method for studying the oral microbiome. This approach is particularly advantageous for large-scale studies due to its non-invasive nature and ease of collection. However, the differences in microbial abundance and the efficiency of pathogen detection between the two sampling methods highlight the need for careful consideration of the sampling technique based on the research objective. For instance, while swab samples are ideal for general microbiome studies, curette samples may be preferred for pathogen-specific investigations.

Research by the Human Microbiome Project Consortium has revealed that the unique feature of an individual’s microbiome is stable and characteristic of healthy people, supporting the idea of a core microbiome in healthy individuals [3, 21]. This consistency further validates the use of buccal mucosa sampling as a reliable method for microbiome studies.

While this study provides a comprehensive comparison of microbial communities in different oral regions, it has several limitations. The sample size was relatively small, which may affect the generalizability of the findings. Future studies should include a larger and more diverse population to validate these results. Additionally, incorporating other oral sites such as the tongue and saliva could provide a more complete understanding of the oral microbiome.

In conclusion, our study underscores the importance of selecting appropriate sampling sites and methods to accurately characterize the oral microbiome. These findings contribute to the foundation for future research and clinical applications aimed at understanding the role of the oral microbiome in health and disease. Effective sampling strategies are essential for obtaining reliable data, particularly in large-scale epidemiological surveys where time and resources are limited. Swab sampling’s convenience and efficiency make it an ideal choice for such studies, although curette sampling may be necessary for more detailed pathogen analysis.

Conclusions

Our study provides preliminary evidence that there is no significant difference between oral buccal mucosa and hard surfaces for bacteria composition. This may be used in oral epidemiological surveys of large healthy populations. The detection of the distribution of pathogenic bacteria in oral cavity must utilize the appropriate sampling method depending on the bacteria under study.