Performance of convolutional neural network (CNN) and performance influencing factors for wood species classification of Lepidobalanus growing in Korea

Kim, Jong-Ho; Park, Wan-Geun; Kim, Nam-Hun

doi:10.1038/s41598-024-69281-y

Performance of convolutional neural network (CNN) and performance influencing factors for wood species classification of Lepidobalanus growing in Korea

Article
Open access
Published: 05 August 2024

Volume 14, article number 18141, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Performance of convolutional neural network (CNN) and performance influencing factors for wood species classification of Lepidobalanus growing in Korea

Download PDF

Jong-Ho Kim¹,
Wan-Geun Park² &
Nam-Hun Kim³

299 Accesses
1 Altmetric
Explore all metrics

Abstract

This study aimed to investigate the performance and factors affecting the species classification of convolutional neural network (CNN) architecture using whole-part and earlywood-part cross-sectional datasets of six Korean Quercus species. The accuracy of species classification for each condition was analyzed using the datasets, data augmentation, and optimizers—stochastic gradient descent (SGD), adaptive moment estimation (Adam), and root mean square propagation (RMSProp)—based on a CNN architecture with three to four convolutional layers. The model trained with the augmented dataset yielded significantly superior results in terms of classification accuracy compared to the model trained with the non-augmented dataset. The augmented dataset was the only factor affecting classification accuracy in the final five epochs. In contrast, four factors in the entire epoch, such as the Adam and SGD optimizers and the earlywood-part and whole-part datasets, affected species classification. The arrangement of earlywood vessels, broad rays, and axial parenchyma was identified as a major influential factor in the CNN species classification using gradient-weighted class activation mapping (Grad-CAM) analysis. The augmented whole-part dataset with the Adam optimizer achieved the highest classification accuracy of 85.7% during the final five epochs of the test phase.

Wood identification based on longitudinal section images by using deep learning

Article 16 February 2021

Using machine learning to predict the density profiles of surface-densified wood based on cross-sectional images

Article Open access 04 June 2022

BACNN: Multi-scale feature fusion-based bilinear attention convolutional neural network for wood NIR classification

Article Open access 06 December 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Introduction

The importance of identifying wood species is being emphasized globally to prevent the distribution of illegally harvested timber^1,2. Conventional wood identification based on light microscopy is difficult for the public to access because of the requirements of experimental techniques and a high level of wood anatomical knowledge^3,4,5,6,7. Therefore, various species identification protocols, such as DNA barcoding, principal component analysis, and comparison of extractive components, are being discussed worldwide to replace traditional species identification methods^8,9,10. Recently, deep learning-based species classification has become the focus among these alternatives. This method minimizes the difficulty of species identification by optimizing the requirements for specialized knowledge, experimental techniques, and time-consuming species analysis required for the conventional species identification process¹¹.

The deep learning-based species identification method processes visual traits, such as anatomical features on the wood surface, that can be visualized into feature maps, recognizes repetitive patterns as unique features of the species, and utilizes them as classification indices for species^12,13,14. The diversity of the constituent cells observed on the wood surface enables convolutional layers to recognize and learn more diverse patterns, structures, and features while extracting features. This enables generalizing a wide range of features, a more precise classification of species, and increasing classification performance. Various feature maps of anatomical traits of hardwood, which exhibits more diverse and detailed anatomical features than softwood^15,16, can be generated during feature extraction. This implies that hardwoods can be classified more precisely than softwoods.

Six oak species—Quercus serrata, Quercus dentata, Quercus mongolica, Quercus variabilis, Quercus aliena, and Quercus acutissima—are distributed over an area of 975,181 ha in Korea, accounting for approximately 22.3% of the total forest area¹⁷. Oakwood accounts for approximately 10.8% of the total domestic timber distribution volume¹⁸ and plays a crucial role across various sectors of the Korean wood industry, from low-value applications such as firewood, pulp, and fiberboard resources to high-value uses such as charcoal, engineered wood, and wood slabs.

These six oak woods have similar anatomical characteristics, making it difficult to classify them without specialized knowledge. In conventional species identification, oak species have been classified based on anatomical characteristics such as the arrangement of growth rings, development of large vessels in the earlywood, tyloses, and broad rays (more than 10 seriates in width and over 1 mm in height)^{19,20,21,22,23,24,25}. However, the conventional procedure based on wood anatomy can generally identify up to the genus level, owing to the similarity of anatomical characteristics²⁶.

In the mid-2000s, studies on the application of computer vision and artificial neural networks to identify wood species began to be reported. In 2004, Clark classified wood species based on a multi-layer perceptron (MLP) and suggested using it as an auxiliary method to conventional wood species identification²⁷. In 2007, Tou et al. compared the wood species classification performance using a small dataset of five wood species from Malaysia based on a gray-level co-occurrence matrix (GLCM) and a multilayer perceptron (MLP)²⁸. In 2009, Esteban et al. used an artificial neural network to classify two Juniperus species with similar anatomical characteristics with a high accuracy of over 90%²⁹. Entering the 2010s, artificial neural networks and computer vision-based methods for wood species classification had developed into more sophisticated and diverse directions. In 2012, Ma et al. reported excellent classification performance of over 90% using the near-infrared spectroscopy (NIR) dataset of wood species based on backpropagation artificial neural network and regression neural network model³⁰. In 2013, Yadav et al. classified 25 hardwood species with excellent accuracy of over 90% using a gray-level co-occurrence matrix (GLCM) and a multilayer perceptron (MLP) and compared the performance for each condition by dividing the dataset into various ratios during the wood species classification process³¹. In 2017, Esteban et al. again used an artificial neural network to report that Pinus sylvestris and Pinus nigra, which were difficult to classify due to similar anatomical characteristics, could be classified with an accuracy of over 80%³². In the late 2010s, wood species identification studies based on convolutional neural networks began to be reported, and active research is being conducted to this day^7,33,34.

Recently, several studies have been conducted on deep learning technology to identify wood species in Korea. Kwon et al. reported the classification performance of softwood species using convolutional neural network architectures, including LeNet and miniVGGNet^35,36. We also conducted a study on the classification of commercial softwood species using convolutional neural networks to explore the potential of deep learning for species identification³⁷. Subsequently, we reported the performance of species classification using the bark of domestic oak species as a follow-up study³⁸.

However, no studies have been conducted on classifying oak species using convolutional neural networks (CNNs) with wood datasets. Therefore, the aim of this study was to verify the potential of CNN-based species classification using a wood dataset of six Korean oak species to provide a convenient identification method for their efficient utilization. In this context, the factors influencing classification performance were statistically analyzed to investigate the optimal performance conditions, and important classification factors were analyzed through visualization.

Materials and methods

Materials

Six Korean oak species were harvested from the research forest of Kangwon national university located across Chuncheon and Hongcheon in Gangwon State. After confirming the species by an expert in dendrology, three stems of each species were obtained, and sample discs were collected from the breast height of each stem. Comprehensive information about the samples is presented in Table 1.

Table 1 Sample Information.

Full size table

Image and dataset preparation

Wood samples were collected from the normal parts of the mature wood to avoid the influence of juveniles and abnormal parts during collection. Five to eight blocks (R × T × L = 15 − 20 × 10 × 10 mm) for each species were randomly separated from each disc using a cutter knife and hand hammer. More than 20 wooden blocks were obtained for each species.

Sample preparation for optical microscopy was performed by following the guidelines of conventional methods^39,40. The wood blocks were softened in a boiling mixture of glycerin and water (a 1:4 ratio) and sliced into approximately 30 μm thick slices using a sliding microtome (Lab Model, WSL, Birmensdorf, Switzerland). The sections were stained with a 1% safranin solution, dehydrated using an ethanol series, cleared with xylene, and mounted on permanent slides using Canada balsam.

The cross-section was observed using a digital camera (IMTCam 6.3MP camera, IMT, British Columbia, Canada) connected to an optical microscope (ECLIPSE E600, NIKON, Tokyo, Japan) with 4× (Plan Fluor model; NA 0.13; WD 17.2, NIKON, Tokyo, Japan) and 10× (Plan Fluor model; NA 0.30; WD 16.0, NIKON) objective lenses to collect the dataset. Micrographs were obtained and analyzed using IMT i-Solution Lite image analyzer (Version 26.1, IMT Inc., Burnaby, British Columbia, http://www.imt-digital.com/).

Two types of datasets (whole-part and earlywood) were prepared to analyze the effect of micrograph capture location on the performance of the artificial neural networks. The whole-part dataset comprised earlywood and latewood, captured using the 4× objective lens. The earlywood dataset was captured using the 10× objective lens. The micrographs with defects such as cracks and contamination were excluded. Finally, the dataset was constructed with 150 micrographs by species and collected part.

In this study, all methods were performed in accordance with the relevant guidelines and regulations.

Dataset pretreatment

Before training, the RGB coefficients were reduced to a ratio of 1/255 to decrease the range from zero to one for normalization. Dataset augmentation is used to directly increase the size of a dataset for training by applying random variables to the images that constitute the dataset to improve the generalization performance of neural networks⁴¹. This study examined and analyzed the impact of dataset augmentation on classification accuracy. Recently, various modules and libraries such as tf.image, albumentations, torchvision, Augmentor, and imgaug have been widely used for dataset augmentation. However, we augmented the dataset using the ImageDataGenerator function to maintain consistency with the Keras library employed in the neural network model in this study. Datasets of 80% for training and 20% for testing, based on the size of the non-augmented dataset with 150 images, were prepared. Data augmentation was only applied to the training dataset. The dataset augmentation using the ImageDataGenerator function was set to a tenfold increase. The options applied included a rotation range of 10°, a width shift range of 10%, a height shift range of 10%, a zoom range of 20%, horizontal flip, vertical flip, and fill mode set to 'nearest'. These settings expanded the dataset from 120 images to over 1,300 images per species before and after augmentation, respectively.

The effects of other verification conditions, such as the composition of the dataset and type of optimization function, on the classification accuracy were also investigated by augmenting the training dataset and comparing it with the non-augmented dataset.

Micrographs of the dataset were constructed with 3072 × 2084 pixels (6,402,048 pixels) and resized to 224 × 224 pixels (50,176 pixels) to conserve system resources used during training. The actual size of each pixel increased from approximately 1.3 µm²/pixel to approximately 160 µm²/pixel (15.5 × 10.3 µm/pixel) as the resolution of individual datasets decreased.

Verification factors influencing classification performance

The classification performance and factors influencing the performance were analyzed using the CNN architecture. The convolutional layer extracts the features from an image and transforms them into feature maps. Therefore, as the number of convolutional layers in a convolutional neural network increases, the features of the image can be extracted more deeply, leading to improved accuracy. However, there was also a negative relationships with an increase in the number of convolutional layers, showing an increase in loss⁴². Therefore, three or four convolutional layers constituting the convolutional neural network were applied depending on the variation of classification accuracy and loss by the verification condition. The number of convolutional layers by the verification condition is shown in Table 2.

Table 2 Applied conditions for verification in this study.

Full size table

Dropout and batch normalization techniques were partially applied to the convolutional and fully connected layers to normalize the verification data. Finally, the Softmax activation function was applied.

The verification variables were set to optimizer, dataset augmentation, and collected part of the dataset to analyze the factors affecting the classification performance of the convolutional neural network for hardwood species. Three types of optimizers—stochastic gradient descent (SGD), adaptive moment estimation (Adam), and root mean square propagation (RMSProp)—were used to compare the classification performance according to the optimizer type. The classification accuracy was compared based on the augmentation of the training dataset. In addition, low-magnification (4× objective lens) images for the entire cross-section, and high-magnification (10× objective lens) images for earlywood-centered images were constructed for each dataset to analyze the difference in classification performance according to the collected part of the microscopic images that comprise the dataset, and the classification accuracy of the trained model was compared.

The gradient-weighted class activation mapping (Grad-CAM) technique was used to verify the factors affecting species classification^43,44. The Grad-CAM technique extracts the output of the feature map in the convolutional layer from the input image and works on the principle of multiplying the gradient average of the class for all channels, creating a spatial map of how much the input image activates the class⁴⁵. Therefore, the Grad-CAM technique is widely used to analyze influential factors in the process of classifying labels based on machine learning and deep learning^46,47,48. An arbitrary image per species was selected, and the Grad-CAM technique was applied to each verification condition to analyze the area recognized as a common classification indicator and the classification indicators by species.

The Softmax function was used as the activation function to classify the six oak species using the results from the artificial neural networks. A categorical cross-entropy function was applied as a multiclassification loss function for compilation.

Statistical analysis

The Pearson correlation coefficients among the variables were analyzed using a bivariate correlation analysis in IBM SPSS Statistics for Windows (Version 26.0, IBM, Armonk, New York, USA, https://www.ibm.com/spss/). Nominal variables, including the type of optimizer (SGD, Adam, and RMSProp), dataset collection region (whole and earlywood), and augmentation for analysis, were applied, whereas accuracy and loss were applied as scale variables. Furthermore, homogeneous subsets among the results were obtained using Duncan's post hoc analysis with one-way ANOVA.

Results and discussion

Comparison of classification performance by the applied condition

Classification performance of oak species using CNN

Figure 1 shows the results of comparing the classification accuracy and loss in the test phase of the CNN architecture using the whole-part and earlywood-part datasets of the six oak woods. As the number of epochs increased in both dataset conditions, the loss decreased and the accuracy increased. This trend appeared in the process of updating weights and biases repeatedly with increasing epochs of the CNN⁴⁹, indicating proper performance in learning and classification through the CNN.

In the verification results of the whole-part dataset, the classification accuracy and loss under the conditions of the Adam and RMSProp optimizers rapidly stabilized within the range of 10–20 epochs, whereas the conditions of SGD were relatively gently stabilized. The learning speed according to the type of optimizer generally depends on the difference in the operational principle^50,51. The conditions trained by the augmented dataset tended to stabilize in the range of 80–100 epochs regardless of the optimizer. In particular, the SGD non-augmented dataset condition showed the gentlest stabilization, with a slope close to linear. In the testing phase, the augmented dataset tended to stabilize relatively quickly in the range of 20–40 epochs compared to the non-augmented dataset for all optimizer conditions, whereas the Adam and RMSProp conditions using the non-augmented dataset showed a pattern of overfitting after 20–40 epochs. In particular, the classification accuracy at the final stage in the validation condition using the augmented dataset was nearly 20% higher than that using the non-augmented dataset, which was due to the improvement in the model’s generalization performance owing to the increase in the diversity of the dataset⁵².

Whereas, in the verification results of the earlywood-part dataset, the fluctuation trend of classification accuracy and loss was more clearly observed than that in the whole-part dataset. The classification accuracy and loss were observed to stabilize at approximately 40 epochs, regardless of the optimizer and dataset augmentation. However, when the augmented dataset was applied, the loss was lower, and the classification accuracy was higher than when the non-augmented dataset was applied. Meanwhile, the difference in classification accuracy and loss between the augmented and non-augmented dataset conditions decreased significantly compared with the validation condition using the whole-part dataset. This implies that even when using the non-augmented dataset, similar levels of performance to the augmented dataset can be achieved in the earlywood dataset for learning, and it can be interpreted that the convolutional layer extracts various features in earlywood, achieving excellent generalization performance even with a small dataset⁵².

Anatomical factors affecting wood species classification performance

Grad-CAM analysis of whole-part dataset

Table 3 shows the weights of the parts recognized as classification indicators using the Grad-CAM technique, which was applied to classify the six oak species based on earlywood and latewood cross-sectional images. As a result of the analysis of the factors affecting species classification in the Grad-CAM technique using the cross-sections of the oak species, the earlywood vessels and well-developed broad rays over 10 seriates were identified as common classification indicators in most species. The arrangement of earlywood vessels in the earlywood of Q. acutissima acted as a factor influencing the classification, whereas the area composed of only fibers without broad-ray tissue and axial parenchyma cells was involved in the classification of Q. aliena. The arrangement of earlywood vessels and distribution of axial parenchyma cells around the latewood affected the classification of Q. dentata. The fiber area without axial parenchyma cells and broad rays in the cross-section was identified as a classification indicator for Q. mongolica. The arrangement of earlywood vessels, axial parenchyma cells, and broad rays were identified as classification factors for Q. serrata. Axial parenchymal cells adjacent to vessels around the broad rays did not affect the classification of Q. serrata. Most traits such as the arrangement of vessels, axial parenchyma cells, and fibers were confirmed as classification indicators in Q. variabilis. However, the parenchymal cells distributed around the vessels did not affect the classification.

Table 3 Analysis of the classification factors of six oak species using the whole-part micrographs.

Full size table

In the whole part dataset, species classification based on convolutional neural network was affected by the arrangement of pores, broad rays, and axial parenchyma cells.

Table 4 lists the weights of the parts recognized as classification indicators in the Grad-CAM technique using the earlywood dataset. Compared with the whole-part images, the arrangement of earlywood vessels, which is a major characteristic, was more clearly observed, and classification indicators, such as wood fiber and axial parenchyma, were found around the earlywood vessels. Oak species undergo rapid growth from spring to summer owing to seasonal factors⁵³, leading to the significant development of earlywood in the xylem, which contributes prominently to the classification of the species; thus, it is regarded as a determining factor for classification among the oak species in this study. Tyloses in the earlywood vessels and the axial parenchyma cells around the earlywood vessels were also classified as indicators of Q. acutissima. The classification accuracy of Q. aliena was affected by earlywood vessels, tyloses in earlywood vessels, and axial parenchyma cells around latewood vessels. Tyloses in earlywood vessels and axial parenchyma cells around the earlywood vessels were also classification indicators of Q. dentata. Q. mongolica is affected by its overall structural components, such as the arrangement of earlywood vessels, fibers, and axial parenchyma cells. Q. serrata was characterized by a lower occurrence rate of tyloses in earlywood vessels than in other species and did not affect the classification of axial parenchyma cells. Although the arrangement of earlywood vessels was confirmed as a classification indicator in Q. variabilis, the axial parenchymal cells around the earlywood vessels were excluded from the classification indicators.

Table 4 Analysis of the classification factors of six oak species using the earlywood micrographs.

Full size table

The results suggested that species classification based on the convolutional neural network using an earlywood dataset was affected by the arrangement of the pores, broad rays, and axial parenchyma cells.

Statistical analysis

Correlation among the factors

Table 5 presents the correlations between the variables applied to the test process of the CNN architectures. The loss tended to decrease with an increasing number of epochs in whole epochs verification, whereas the accuracy tended to increase in proportion to the number of epochs.

Table 5 Correlation of the factors influencing classification performance.

Full size table

Among the optimizers in whole epochs condition, Adam (0.127**) and SGD (− 0.160**) had the highest and lowest impact on classification accuracy, respectively, whereas RMSProp did not show a significant difference in classification accuracy. Dataset augmentation showed a relatively higher impact (0.351**) on classification accuracy than the other factors. In contrast, the accuracy tended to decrease with the application of the whole part or non-augmented dataset, whereas it increased with the application of the earlywood part or augmented dataset. The loss tended to be opposite to that of accuracy.

The factors affecting classification accuracy, such as epochs, optimizer, and dataset composition (whole-part, earlywood), disappeared, and the impact of dataset augmentation increased more than twice from 0.351** to 0.747**. The increase in impact is expected to be attributed to the variation in classification accuracy and loss minimization after reaching the convergence point. Dataset augmentation could be a major factor affecting classification performance.

Homogeneous subset

Table 6 presents the results for the homogeneous subsets among the conditions of the dataset based on the verification results shown in Fig. 1.

Table 6 Comparison of average loss and accuracy among optimizers.

Full size table

In the results of whole-epochs verification, losses were classified into multiple subsets. The first identified subset included conditions such as whole-part-SGD, earlywood-Adam, and earlywood-RMSProp, which utilized an augmented dataset. The second subset included four conditions: two conditions for the augmented dataset, whole-part-RMSProp and earlywood-SGD, and two conditions for the non-augmented dataset, earlywood-SGD and earlywood-RMSProp. In the third subset, three conditions were classified: earlywood-SGD and earlywood-Adam for the non-augmented dataset and earlywood-SGD for the augmented dataset. The fourth subset was identified according to three conditions: whole-part-Adam and earlywood-Adam for the non-augmented dataset and earlywood-SGD for the augmented dataset. The fifth subset identified three conditions: whole-part-SGD, whole-part-Adam, and earlywood-Adam, which utilized a non-augmented dataset.

Classification accuracy was divided into two major subsets. The first homogeneous subset consisted of five of the six earlywood dataset conditions, excluding the ADAM-augmented condition. The second homogeneous subset also consisted of five conditions: whole-part-SGD, earlywood-SGD, earlywood-Adam, and earlywood-RMSProp conditions that utilized an augmented dataset, and the earlywood-RMSProp condition that utilized a non-augmented dataset. Based on these results, it was concluded that applying the earlywood dataset produced similar results without a significant impact on the conditions.

In the results of last five epochs verification, most conditions during the test phase had a classification accuracy of around 70%. Only some conditions, such as whole-part-SGD, whole-part-Adam, whole-part-RMSProp, and earlywood-SGD that applied an augmented dataset, had a classification accuracy of over 80%. The condition with the highest classification accuracy among the conditions was whole-part-Adam that applied an augmented dataset, showing an accuracy of approximately 85.7%. However, there was no significant difference in classification accuracy between the conditions that produced a classification accuracy of over 80% mentioned earlier and the whole-part-Adam condition.

Table 7 presents the results of the homogeneous subset analysis between the indicators using the average accuracy in the final five epochs of the test phase, as shown in Table 6. Data augmentation directly affected the classification accuracy, and the classification accuracy before and after augmentation was verified as a significantly independent subset. There was no significant difference in classification accuracy between the whole-part and earlywood datasets. The classification accuracy according to the optimizers SGD, Adam, and RMSProp was classified as a homogeneous subset with no difference.

Table 7 Comparison of average accuracy in the final five epochs per factor in the test phase.

Full size table

Conclusions

The results of the CNN classification trained with the datasets of the six Korean oak species are as follows:

The classification accuracy ranged from 61.5 to 85.7% for the whole-part dataset and from 71.6 to 83.3% for the earlywood-part dataset based on the final five epochs. The whole-part dataset had a larger deviation for each condition than the earlywood-part dataset. However, the whole-part dataset exhibited excellent classification accuracy in the augmented dataset. In particular, the classification accuracy in the augmented condition increased significantly compared with that in the non-augmented condition.

The arrangement of pores, broad rays, and axial parenchyma was verified as a species classification factor from Grad-CAM analysis results.

The factors affecting the classification accuracy included the epoch, optimizer type, and dataset augmentation and composition. Epochs showed the highest influence (0.533**), followed by dataset augmentation (0.351**), the Adam optimizer (0.127**), the earlywood-part dataset (0.070**), the whole-part dataset (− 0.070**), and the SGD optimizer (− 0.160**).

Based on the final five epochs, dataset augmentation was proven to have a significant effect on classification accuracy, with a value of 0.747**, indicating a strong correlation.

Four validation conditions were used in the augmented dataset: three for the whole-part dataset, Adam (85.7%), RMSProp (84.9%), and SGD (81.9%), and one for the earlywood dataset, SGD (83.3%), which showed classification accuracies of over 80%.

It was concluded that a whole-part dataset with augmented conditions should be used for training, and Adam or RMSProp optimizers can be used to obtain the best classification accuracy for the six Korean oak wood species.

Data availability

The datasets and codes used and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Low, M. C. et al. Tracing the world’s timber: The status of scientific verification technologies for species and origin identification. IAWA J. 44(1), 63–84. https://doi.org/10.1163/22941932-bja10097 (2023).
Article Google Scholar
United Nations Office on Drugs and Crime (UNODC) Best practice guide for forensic timber identification. (United Nations Office on Drugs and Crime, 2016).
Kim, J. H. et al. Comparison of anatomical features in the three Syzygium species. BioResources 16(2), 3631–3642. https://doi.org/10.15376/biores.16.2.3631-3642 (2021).
Article CAS Google Scholar
Purusatama, B. D., Febrianto, F., Lee, S. H. & Kim, N. H. Hardness and fracture morphology of reaction wood from Pinus merkusii and Agathis loranthifolia. Wood Sci. Technol. 56(5), 1331–1351. https://doi.org/10.1007/s00226-022-01413-x (2022).
Article CAS Google Scholar
Savero, A. M. et al. Characterization of anatomical and non-anatomical properties for the identification of six commercial wood species from Vietnamese plantation forests. Forests 14(3), 496. https://doi.org/10.3390/f14030496 (2023).
Article Google Scholar
Kobayashi, K., Hwang, S. W., Okochi, T., Lee, W. H. & Sugiyama, J. Non-destructive method for wood identification using conventional X-ray computed tomography data. J. Cult. Herit. 38, 88–93. https://doi.org/10.1016/j.culher.2019.02.001 (2019).
Article Google Scholar
Fabijańska, A., Danek, M. & Barniak, J. Wood species automatic identification from wood core images with a residual convolutional neural network. Comput. Electron. Agric. 181, 105941. https://doi.org/10.1016/j.compag.2020.105941 (2021).
Article Google Scholar
Tang, X., Zhao, G. & Ping, L. Wood identification with PCR targeting noncoding chloroplast DNA. Plant Mol. Biol. 77(6), 609–617. https://doi.org/10.1007/s11103-011-9837-2 (2011).
Article PubMed CAS Google Scholar
Yang, Z., Liu, Y., Pang, X. & Li, K. Preliminary investigation into the identification of wood species from different locations by near infrared spectroscopy. BioResources 10(4), 8505–8517. https://doi.org/10.15376/biores.10.4.8505-8517 (2015).
Article CAS Google Scholar
Yin, X., Huang, A., Zhang, S., Liu, R. & Ma, F. Identification of three Dalbergia species based on differences in extractive components. Molecules 23(9), 2163. https://doi.org/10.3390/molecules23092163 (2018).
Article PubMed PubMed Central CAS Google Scholar
Hwang, S. W. & Sugiyama, J. Computer vision-based wood identification and its expansion and contribution potentials in wood science: A review. Plant Methods 17(1), 47. https://doi.org/10.1186/s13007-021-00746-1 (2021).
Article PubMed PubMed Central Google Scholar
Buduma, N. Fundamentals of Deep Learning 2nd edn, 108–114 (Hanbit Publishing Network, 2018).
Google Scholar
Fujita, K. & Takahara, A. Deep Learning Boot Camp with Keras 1st edn, 62–68 (Gilbut Publishing, 2017).
Google Scholar
Moroney, L. AI and Machine Learning for Coders 1st edn, 63–67 (Hanbit Publishing Network, 2022).
Google Scholar
IAWA Committee. IAWA list of microscopic features for hardwood identification. IAWA J. 10(3), 219–332 (1989).
Google Scholar
IAWA Committee. IAWA list of microscopic features for softwood identification. IAWA J. 25(1), 1–70. https://doi.org/10.1163/22941932-90000349 (2004).
Article Google Scholar
Korea Forestry Promotion Institute (KOFPI). Forest resources of Korea, xi (Korea Forest Promotion Institute, 2017).
Korea Forest Service. Market survey of timber product in 2021, 26. (Korea Forest Service, 2022).
Chong, S. H. & Park, B. S. Wood Properties of the Useful Tree Species Grown in Korea 108–125 (National Institute of Forest Science, 2008) (in Korean).
Google Scholar
Eom, Y. G. Wood Anatomy of Korean Species 132–143 (Mediawood, 2015) (in Korean).
Google Scholar
Jeon, W. S., Lee, H. M. & Park, J. H. Comparison of anatomical characteristics for wood damaged by oak wilt and sound wood from Quercus mongolica. J. Korean Wood Sci. Technol. 48(6), 807–819. https://doi.org/10.5658/WOOD.2020.48.6.807 (2020).
Article Google Scholar
Luo, J., Li, D. & Cui, Y. Anatomical Database and Atlas of Chinese Woods. 1048–1053, 1086–1087, 1112–1113 (Kaiseisha Press, 2022).
Savero, A. M. et al. Macroscopic and microscopic anatomical characteristics of six Korean oak species. Forests 14(12), 2449. https://doi.org/10.3390/f14122449 (2023).
Article Google Scholar
Shim, K. The variation of wood qualities in the stems of Lepidobalanus, Ph.D. Dissertation. (Chonnam National University, 1991).
Savero, A. M. et al. Radial variation of wood anatomical characteristics and maturation ages of six Korean oak species. Forests 15(3), 433. https://doi.org/10.3390/f15030433 (2024).
Article Google Scholar
Gasson, P. How precise can wood identification be? Wood anatomy’s role in support of the legal timber trade especially CITES. IAWA J. 32(2), 137–154. https://doi.org/10.1163/22941932-90000049 (2011).
Article Google Scholar
Clark, J. Y. Identification of botanical specimens using artificial neural networks. In Proceeding of 2004 Symposium on Computational Intelligence in Bioinformatics and Computational Biology, California, United States of America https://doi.org/10.1109/CIBCB.2004.1393938 (2004).
Tou, J. Y., Lau, P. Y., & Tay, Y. H. Computer vision-based wood recognition system. In Proceeding of international workshop on advanced image technology 2007 (IWAIT 2007); 2007. Bangkok, Thailand, 1–6 (2007) https://www.researchgate.net/publication/264886592_Computer_Vision-based_Wood_Recognition_System (Accessed 11 July 2024).
Esteban, L. G., Fernández, F. G., de Palacios, P. D. P., Romero, R. M. & Cano, N. N. Artificial neural networks in wood identification: The case of two Juniperus species from the Canary Islands. IAWA J. 30(1), 87–94. https://doi.org/10.1163/22941932-90000206 (2009).
Article Google Scholar
Ma, M. Y. et al. Study on artificial neural network combined with near infrared spectroscopy for wood species identification. Spectrosc. Spectral Anal. 32(9), 2377–2381. https://doi.org/10.3964/j.issn.1000-0593(2012)09-2377-05 (2012).
Article CAS Google Scholar
Yadav, A. R., Dewal, M. L., Anand, R. S. & Gupta, S. Classification of hardwood species using ANN classifier. In Proceeding of 2013 Fourth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG); 2013. Jodhpur, India, 1–5 https://doi.org/10.1109/NCVPRIPG.2013.6776231 (2013).
Esteban, L. G. et al. Application of artificial neural networks as a predictive method to differentiate the wood of Pinus sylvestris L. and Pinus nigra Arn subsp. salzmannii (Dunal) Franco. Wood Sci. Technol. 51, 1249–1258. https://doi.org/10.1007/s00226-017-0932-7 (2017).
Article CAS Google Scholar
He, T., Mu, S., Zhou, H. & Hu, J. Wood species identification based on an ensemble of deep convolution neural networks. Wood Res. 66(1), 1–14. https://doi.org/10.37763/66.1.0114 (2021).
Article Google Scholar
Okataria, A. S. et al. Wood species identification using convolutional neural network (CNN) architectures on macroscopic images. J. Inf. Technol. Comput. Sci. 4(3), 274–283. https://doi.org/10.25126/jitecs.201943155 (2019).
Article Google Scholar
Kwon, O. K. et al. Automatic wood species identification of Korean softwood based on convolutional neural networks. J. Korean Wood Sci. Technol. 45(6), 797–808. https://doi.org/10.5658/WOOD.2017.45.6.797 (2017).
Article Google Scholar
Kwon, O. K. et al. Performance enhancement of automatic wood classification of Korean softwood by ensembles of convolutional neural networks. J. Korean Wood Sci. Technol. 47(3), 265–276. https://doi.org/10.5658/WOOD.2019.47.3.265 (2019).
Article Google Scholar
Kim, J. H. et al. Performance influencing factors of convolutional neural network models for classifying certain softwood species. Forests 14(6), 1249. https://doi.org/10.3390/f14061249 (2023).
Article Google Scholar
Kim, J. H. et al. Convolutional neural network performance and the factors affecting performance for classification of seven Quercus species using sclereid characteristics in the bark. BioResources 19(1), 510–524. https://doi.org/10.15376/biores.19.1.510-524 (2023).
Article MathSciNet CAS Google Scholar
Gartner, H. & Schweingruber, F. H. Microscopic Preparation Techniques for Plant Stem Analysis 18–76 (Verlag Dr. Kessel, 2013).
Google Scholar
von Arx, G. V., Crivellaro, A., Prendin, A. L., Čufar, K. & Carrer, M. Quantitative wood anatomy—practical guidelines. Front. Plant Sci. 7, 781. https://doi.org/10.3389/fpls.2016.00781 (2016).
Article Google Scholar
Yoon, S. J. Do it! Deep Learning Textbook 188–190 (Easys Publishing, 2021).
Google Scholar
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceeding of International Conference on Learning Representations (ICLR) 2015, San Diego, United States of America https://doi.org/10.48550/arXiv.1409.1556 (2015).
Selvaraju, R. R. et al. Grad-CAM: visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Vancouver, BC, Canada, 618–626 https://doi.org/10.1109/ICCV.2017.74 (2001).
Grad-CAM class activation visualization. https://keras.io/examples/vision/grad_cam/ (Accessed 31 Dec 2023).
Chollet, F. Deep Learning with Python 1st edn, 235–241 (Gilbut Publishing, 2018).
Google Scholar
El-Madafri, I., Peña, M. & Olmedo-Torre, N. The wildfire dataset: Enhancing deep learning-based forest fire detection with a diverse evolving open-source dataset focused on data representativeness and a novel multi-task learning approach. Forests 14(9), 1697. https://doi.org/10.3390/f14091697 (2023).
Article Google Scholar
Cui, Z., Li, X., Li, T. & Li, M. Improvement and assessment of convolutional neural network for tree species identification based on bark characteristics. Forests 14(7), 1292. https://doi.org/10.3390/f14071292 (2023).
Article Google Scholar
Zhu, X. et al. Identification of oil tea (Camellia oleifera C. Abel) cultivars using EfficientNet-B4 CNN model with attention mechanism. Forests 13(1), 1. https://doi.org/10.3390/f13010001 (2022).
Article Google Scholar
Loy, J. Neural Network Projects With Python 32 (Gilbut Publishing, 2020).
Google Scholar
Ruder, S. An overview of gradient descent optimization algorithms. https://doi.org/10.48550/arXiv.1609.04747 (2016).
Yang, L. & Shami, A. On hyperparameter optimization of machine learning algorithms theory and practice. Neurocomputing 415, 295–316. https://doi.org/10.1016/j.neucom.2020.07.061 (2022).
Article Google Scholar
Shorten, C. & Khoshgoftaar, T. M. A survey on image data augmentation for deep learning. J. Big Data 6(1), 1–48. https://doi.org/10.1186/s40537-019-0197-0 (2019).
Article Google Scholar
Cochard, H. & Tyree, M. T. Xylem dysfunction in Quercus: Vessel sizes, tyloses, cavitation and seasonal changes in embolism. Tree Physiol. 6(4), 393–407. https://doi.org/10.1093/treephys/6.4.393 (1990).
Article PubMed CAS Google Scholar

Download references

Acknowledgements

This research was supported by the Science and Technology Support Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science and ICT (MSIT) (No. 2022R1A2C1006470), the Basic Science Research Program through the NRF funded by the Ministry of Education (No. 2018R1A6A1A03025582), and the R&D Program for Forest Science Technology (Project Nos. 2021350C10-2323-AC03 and 2021311A00-2122-AA03) provided by the Korea Forest Service (Korea Forestry Promotion Institute).

Author information

Authors and Affiliations

Institute of Forest Science, Kangwon National University, Chuncheon, 24341, Republic of Korea
Jong-Ho Kim
Department of Forestry, Kangwon National University, Chuncheon, 24341, Republic of Korea
Wan-Geun Park
Department of Forest Biomaterials Engineering, Kangwon National University, Chuncheon, 24341, Republic of Korea
Nam-Hun Kim

Authors

Jong-Ho Kim
View author publications
You can also search for this author in PubMed Google Scholar
Wan-Geun Park
View author publications
You can also search for this author in PubMed Google Scholar
Nam-Hun Kim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: J.H.K. and N.H.K.; methodology: J.H.K. and N.H.K.; species classification in harvesting process: W.G.P.; formal analysis and investigation: J.H.K.; writing—original draft preparation: J.H.K. and N.H.K.; writing—review and editing: J.H.K. and N.H.K.; funding acquisition: J.H.K. and N.H.K.; resources: J.H.K.; supervision: N.H.K.

Corresponding author

Correspondence to Nam-Hun Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Kim, JH., Park, WG. & Kim, NH. Performance of convolutional neural network (CNN) and performance influencing factors for wood species classification of Lepidobalanus growing in Korea. Sci Rep 14, 18141 (2024). https://doi.org/10.1038/s41598-024-69281-y

Download citation

Received: 02 April 2024
Accepted: 02 August 2024
Published: 05 August 2024
DOI: https://doi.org/10.1038/s41598-024-69281-y
Springer Nature Limited

Performance of convolutional neural network (CNN) and performance influencing factors for wood species classification of Lepidobalanus growing in Korea

Abstract

Similar content being viewed by others

Wood identification based on longitudinal section images by using deep learning

Using machine learning to predict the density profiles of surface-densified wood based on cross-sectional images

BACNN: Multi-scale feature fusion-based bilinear attention convolutional neural network for wood NIR classification

Introduction

Materials and methods

Materials

Image and dataset preparation

Dataset pretreatment

Verification factors influencing classification performance

Statistical analysis

Results and discussion

Comparison of classification performance by the applied condition

Classification performance of oak species using CNN

Anatomical factors affecting wood species classification performance

Grad-CAM analysis of whole-part dataset

Statistical analysis

Correlation among the factors

Homogeneous subset

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Performance of convolutional neural network (CNN) and performance influencing factors for wood species classification of Lepidobalanus growing in Korea

Abstract

Similar content being viewed by others

Wood identification based on longitudinal section images by using deep learning

Using machine learning to predict the density profiles of surface-densified wood based on cross-sectional images

BACNN: Multi-scale feature fusion-based bilinear attention convolutional neural network for wood NIR classification

Explore related subjects

Introduction

Materials and methods

Materials

Image and dataset preparation

Dataset pretreatment

Verification factors influencing classification performance

Statistical analysis

Results and discussion

Comparison of classification performance by the applied condition

Classification performance of oak species using CNN

Anatomical factors affecting wood species classification performance

Grad-CAM analysis of whole-part dataset

Statistical analysis

Correlation among the factors

Homogeneous subset

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation