Abstract
Functional traits are the result of evolution and adaptation, providing important ecological insights into how organisms interact with their environment. Benthic macroinvertebrates, in particular, have garnered attention as biomonitoring indicators for freshwater ecosystems. This study presents a functional trait dataset for benthic macroinvertebrates, comprising 447 taxa (393 at genus level, 53 at family level and one at class level) from five phyla (Annelida, Arthropoda, Mollusca, Nematomorpha, and Platyhelminthes), characterized by nine traits related to life history, morphology, and habit. To account for variation in available trait information, we assigned confidence levels to each taxon and functional trait based on the level of evidence using fuzzy coding. Our dataset provides an important resource for understanding the ecology of benthic macroinvertebrates in South Korea, serving as a valuable baseline dataset for studying their biodiversity, conservation, and biomonitoring in freshwater ecosystems.
Background & Summary
Functional traits are any characteristics of an organism, such as morphological, physiological, biochemical, behavioural, and phenological traits, that influence its fitness or survival1. They aid in understanding a species’ ecological adaptation to its environment and the community’s response to eco-environmental change2,3. They are considered a currency of functional ecology for assessing the functional properties of ecological communities4,5, and they are used to measure functional diversity, which helps to explain how an ecosystem functions6.
Functional traits bridge the gap between ecology and evolution, providing insight into various scientific questions related to biogeography, ecosystem health, and conservation7,8,9,10. Furthermore, the functional trait-based approach to understanding ecology enables global comparisons of ecological responses, despite taxonomic differences in species assemblages8,11. Given the immense importance of functional traits, there is a growing demand for trait datasets to advance the field of functional ecology. However, collecting trait data requires significant cost and time investment, resulting in a limited number of trait datasets covering only a few taxa and biogeographic regions.
The diversity of benthic macroinvertebrates and their functional traits makes them an ideal model group for biomonitoring freshwater ecosystems12, as they have an intermediate lifespan and a diverse array of functional traits that help measure changes in ecosystems13,14. Despite the immense importance of trait data for freshwater benthic macroinvertebrates, only a few datasets exist, and they cover a small biogeographic portion of the globe: CESTES (Mediterranean rivers, Catalonia, Spain; Segura River basin, Spain; Ebro river, Mediterranee, Spain; Ponds, agricultural areas, Brie, Seine-et-Marne, France; Wu Stream, central Taiwan; and Ponds, 200-ha section of the Yale-Myers Research Station in Union, Connecticut, USA)15; the European aquatic macroinvertebrate dispersal-related trait dataset16; the European freshwater organisms trait dataset17; stream macroinvertebrates of the Han River basin, China18; lotic insects of North America19,20; and freshwater macroinvertebrates of New Zealand21. This limited number of datasets for a small part of the world underscores the need for a worldwide aquatic macroinvertebrate data collection program to develop a global dataset. Such a dataset would help fill a significant gap in functional ecology and enable a better understanding of the consequences of environmental change due to different drivers, such as climate change and anthropogenic activities, on benthic macroinvertebrates worldwide.
In this study, we developed a functional trait dataset for benthic macroinvertebrates in South Korean streams. The dataset consists of functional traits of 447 taxa and was constructed using occurrence data of macroinvertebrates collected from 3032 locations throughout South Korea as part of the National Aquatic Ecological Monitoring Program (NAEMP) from 2008 to 2021. We considered nine traits across three categories, namely life history, morphology, and habit, and obtained trait data from various literature sources. Besides filling the gap in macroinvertebrate trait data, the dataset can be used in scientific studies of the autecology of benthic macroinvertebrates in Asian streams, including Korea, in comparisons with global counterparts, and in biomonitoring and conservation planning.
Methods
Taxonomic and geographical coverage
The dataset covered almost all streams of South Korea (Fig. 1) and was compiled from biomonitoring data available on the National Institute of Environmental Research (NIER) website (https://water.nier.go.kr/web/bioMeasure?pMENU_NO=586). These data were collected collaboratively according to NIER guidelines under the NAEMP from 2008 to 2021, covering 3032 sampling locations22. In addition, eight genera were included from another published article23.
Taxonomy and systematics
The compiled data includes 908 macroinvertebrate taxa. However, due to the unavailability of species-level trait data for many species, we established the taxonomic resolution of our dataset at the genus level, resulting in 455 genera. In some instances, the specimens were identified only up to the subfamily (e.g., Acentropinae), family (e.g., Saldidae), or class level (e.g. Collembola) in the original dataset. We used “genera” to refer to the lowest identifiable level in our dataset. These genera were classified according to the GBIF backbone taxonomy into four taxonomic hierarchies: Family, Order, Class, and Phylum. We updated some genus names to match those used in GBIF and corrected seven inconsistent genera, resulting in a final dataset with the data for 393 taxa at genus level, 53 taxa at family level and one taxon at class level. We removed two genera due to their synonymy with existing genera, four genera for spelling errors, and one genus that was not a macroinvertebrate.
Functional traits
Based on available data, we selected nine functional traits and sorted them into three categories: life history, morphology and habit (Table 1). These traits were selected based on existing literature and data availability. While some traits such as fecundity, environmental tolerance, synchronization of emergence, resistance form, and the propensity of drift have been excluded due to data scarcity, we intend to expand our dataset in the future as more data becomes available.
Life history contains three traits: voltinism, life span and aquatic stages. Voltinism indicates the number of generations per year24, which positively impacts intraspecific size structure variation and negatively affects intraspecific competition and carnivory25,26. Life span is the average life cycle duration and is linked to a species’ reproductive potential27. Generally, species with shorter life spans are more tolerant to disturbance28. Aquatic stages indicate dispersal capability, and non-aquatic adults with flying ability typically have higher dispersal capability29,30.
Morphology encompasses four traits: maximum size, respiratory organ, shape and armouring. Maximum size is positively related to fecundity31, trophic level32,33, and mobility34 in aquatic macroinvertebrates. The respiratory organ denotes how an organism adapts to various environmental conditions and its oxygen tolerance35. Shape constrains mobility and reflects an organism’s adaptation to differing water flow levels36,37, while armouring conveys its capacity to withstand mechanical and environmental stresses38,39.
Habit contains two traits: locomotion and functional feeding habit. Locomotion mode and substrate relation affect microhabitat selection40 and ecosystem resilience by connecting habitats41. In contrast, functional feeding groups provide insights into trophic dynamics42 and response to perturbations43.
Trait information collection
Initially, we searched macroinvertebrate datasets15,16,17,18,19,20,21 to gather trait information for various genera. Despite our efforts, trait information for numerous novel genera remained incomplete. We turned to Korean books44,45 and web resources46,47 to fill these gaps, and then we scoured journal articles and books. Since Korea, Japan, and China share similar species composition, we preferred trait information sourced from species in these regions. Additionally, we consulted numerous websites, as listed in the attached dataset’s reference sheet. Unfortunately, for many genera, we were unable to locate trait information. In such cases, we used trait information for higher taxonomic categories marked with a fuzzy code, with some exceptions outlined in the next section.
Fuzzy coding of the modalities
We used a fuzzy coding framework to express the confidence level in trait modalities within our dataset, a method commonly employed in similar datasets15,16,21. Each modality receives one of four codes: 0 indicates absence, and 1, 2 and 3 indicate low, moderate and high levels of confidence, respectively. We established rules for the fuzzy coding process as follows:
1. If no reference supports the presence of a particular trait for a genus, it is denoted with 0.
2. If only one reference indicates a particular trait modality and there is no evidence about other modalities of that trait, then it is denoted as 2.
3. If multiple references indicate a particular modality without evidence for other modalities, it is coded as 3.
4. If the majority of evidence supports one modality while a single reference indicates the presence of another, the former is coded as 3, and the latter is coded as 1.
5. If the evidence for two different modalities is equal, both modalities are coded as 2, unless all references indicate the presence of both modalities, in which case they are coded as 3.
6. If one modality has the most evidence, while another has less, and a third has the least, they are coded as 3, 2, and 1, respectively. If there is no evidence for the third modality, the three are coded as 3, 2, and 0, respectively.
7. If a modality is inferred from a higher taxonomic level, such as a family, order, class, or phylum, it is coded with less confidence, unless it applies to all members of that group, in which case it is coded as 3 (e.g., hair in mammals).
8. In some cases, trait modalities were inferred from other databases, some of which used fuzzy coding. In this case, fuzzy codes across all modalities are summed up and then individual references are added as a single score against each modality. Then the fuzzy codes are inferred as per the above rules (Table 2).
By applying these rules, our fuzzy coding framework provides a flexible and consistent approach to representing the confidence in trait modalities within our dataset.
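The rules above can be read as a small decision procedure. The following Python sketch is one possible interpretation of rules 1 to 6 only and is not the procedure used to build the dataset. The function name `fuzzy_codes`, the representation of each reference as a set of supported modalities, and the handling of cases the rules leave open (for example, two minority modalities with one reference each, or a single reference that reports two modalities) are assumptions made for illustration. Rules 7 and 8 are not covered.

```python
from collections import Counter


def fuzzy_codes(references, modalities):
    """Assign a 0-3 confidence code to each modality of one trait.

    references: list of sets; each set holds the modalities that one
                reference supports (an assumed representation).
    modalities: all possible modalities of the trait.
    """
    refs = [set(r) for r in references if r]
    counts = Counter(m for r in refs for m in r)
    codes = {m: 0 for m in modalities}            # rule 1: no support -> 0
    supported = [m for m in modalities if counts[m] > 0]
    if not supported:
        return codes

    if len(supported) == 1:                       # rules 2 and 3
        m = supported[0]
        codes[m] = 2 if counts[m] == 1 else 3
        return codes

    levels = sorted({counts[m] for m in supported}, reverse=True)
    if len(levels) == 1:                          # rule 5: equal evidence
        # Assumption: code 3 needs at least two references, all of which
        # report every supported modality.
        unanimous = len(refs) > 1 and all(set(supported) <= r for r in refs)
        for m in supported:
            codes[m] = 3 if unanimous else 2
        return codes

    for m in supported:                           # rules 4 and 6
        if counts[m] == levels[0]:
            codes[m] = 3                          # most evidence
        elif counts[m] > 1 and counts[m] == levels[1]:
            codes[m] = 2                          # less evidence, several refs
        else:
            codes[m] = 1                          # single reference or least
    return codes
```

With placeholder modality names, `fuzzy_codes([{"A"}, {"A"}, {"B"}], ["A", "B", "C"])` returns `{"A": 3, "B": 1, "C": 0}`, which corresponds to rule 4.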
Data Records
Dataset
The dataset48 is available in Excel Workbook (*.xlsx) format and includes five sheets: Trait dataset, Datakey, Reference, Source reference and Korean endemics. The first sheet contains taxon names, lowest taxonomic ranks, and classifications in the first eight columns, while the remaining columns have trait modalities and references supporting the fuzzy coding of each modality (Table 3). Trait modalities are represented by abbreviations, with explanations available in the second sheet (Datakey). References in the Trait dataset sheet are identified by reference numbers, with corresponding details available in the third sheet (Reference). The fourth sheet contains the source references used by the large databases cited in the ‘Reference’ sheet. It has four columns: the first gives the taxon name, the second the trait name, the third the reference to the database cited in the ‘Reference’ sheet, and the last the actual source reference. The last sheet lists the Korean endemic species that are included in this work.
Data summary
The dataset includes 447 taxa (393 at genus level, 53 at family level and one at class level) from five phyla. Arthropoda has the largest representation with 367 genera, followed by Mollusca (49 genera), Annelida (29 genera), Platyhelminthes (3 genera), and Nematomorpha (2 genera). Of the 6,616 non-zero records, 24.14% have a low level of confidence (1), 49.18% have a moderate level of confidence (2), and 26.68% have a high level of confidence (3). See Fig. 2 for a summary of the different traits.
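A breakdown of this kind can be recomputed from the fuzzy codes themselves. The sketch below shows the arithmetic, assuming the codes have already been read into a flat list of integers; the function name `confidence_shares` and the toy input are illustrative and are not taken from the dataset.

```python
from collections import Counter


def confidence_shares(codes):
    """Return the percentage of non-zero records at each confidence level."""
    nonzero = [c for c in codes if c != 0]        # zero means absence, so skip it
    if not nonzero:
        return {1: 0.0, 2: 0.0, 3: 0.0}
    counts = Counter(nonzero)
    return {level: round(100 * counts[level] / len(nonzero), 2)
            for level in (1, 2, 3)}


# Toy input, not real data: four non-zero records out of six.
print(confidence_shares([0, 1, 2, 2, 3, 0]))
```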
Technical Validation
The biomonitoring data were collected through the NAEMP following the NIER guidelines22. Taxonomic experts identified all the specimens, and trait information was collected from a total of 154 sources, including journal articles, datasets, books, and web resources. To ensure accuracy, the resulting dataset underwent cross-checking for any mistakes. About 77% of the data in the dataset were sourced from the references, while the remaining 23% were inferred from higher taxonomic-level characteristics (Fig. 3). This indicates the dataset needs periodic updates to include trait data from more recent research.
Usage Notes
The dataset we have compiled contains a wealth of information on genera that have not yet been included in other existing trait datasets. As a result, it can help to fill some critical gaps towards developing an integrated global trait dataset. Our biomonitoring data include 51 endemic species belonging to 34 macroinvertebrate genera (see 'Korean endemic' sheet of the dataset48). While only one of these genera is endemic to Korea (Koreanomelania), the others also contain species found in other countries, particularly Japan and China. This broadens the applicability of the dataset and enhances its usefulness in different contexts.
This dataset provides a unique opportunity to better understand functional diversity, as well as the responses of different functional groups to environmental perturbations. It also enables researchers to compare similar functional groups at a global level, providing valuable insights into their responses to different stressors such as pollution and climate change.
The database uses a fuzzy coding system to indicate the confidence in different trait modalities. For applications, we advise using modalities with higher confidence (codes 2 and 3). The data are provided in Excel workbook format (*.xlsx).
Lastly, this database is a pioneering effort to develop a functional trait dataset for the streams and rivers of South Korea. It is not yet comprehensive, and much of the trait information is inferred from higher taxonomic levels because of insufficient data. The dataset therefore requires periodic updates to add more detailed information about the existing traits, to include additional traits, to increase the taxonomic resolution, and to add genera that are not yet included.
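As a minimal sketch of this recommendation, the following Python function keeps only modalities coded 2 or 3. It assumes the trait sheet has already been loaded into a nested dictionary of the form taxon, then modality, then code; the function name `keep_confident`, the threshold parameter `min_code`, and the placeholder taxon and modality names are all hypothetical.

```python
def keep_confident(trait_codes, min_code=2):
    """Drop modalities whose fuzzy code is below min_code.

    trait_codes: {taxon: {modality: code}} (an assumed in-memory layout).
    """
    return {
        taxon: {m: c for m, c in mods.items() if c >= min_code}
        for taxon, mods in trait_codes.items()
    }


# Placeholder records, not real data.
toy = {
    "Taxon A": {"mod1": 3, "mod2": 1},
    "Taxon B": {"mod1": 0, "mod2": 2},
}
print(keep_confident(toy))
```

Raising `min_code` to 3 would restrict an analysis to the high-confidence records only, at the cost of dropping roughly half of the non-zero records according to the data summary.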
Data availability
The dataset is accessible from Figshare48.
Code availability
No custom code has been used.
References
Nock, C. A., Vogt, R. J. & Beisner, B. E. Functional Traits. in eLS 1–8, https://doi.org/10.1002/9780470015902.a0026282 (Wiley, 2016).
Díaz, S. et al. Functional traits, the phylogeny of function, and ecosystem service vulnerability. Ecol. Evol. 3, 2958–2975 (2013).
Violle, C. et al. Let the concept of trait be functional! Oikos 116, 882–892 (2007).
Mammola, S., Carmona, C. P., Guillerme, T. & Cardoso, P. Concepts and applications in functional diversity. Funct. Ecol. 35, 1869–1885 (2021).
de Bello, F. et al. Handbook of Trait-Based Ecology: From Theory to R Tools. https://doi.org/10.1017/9781108628426 (Cambridge University Press, 2021).
Lee, D.-Y., Lee, D.-S. & Park, Y.-S. Taxonomic and Functional Diversity of Benthic Macroinvertebrate Assemblages in Reservoirs of South Korea. Int. J. Environ. Res. Public Health 20, 673 (2022).
Edwards, K. F. et al. Evolutionarily stable communities: a framework for understanding the role of trait evolution in the maintenance of diversity. Ecol. Lett. 21, 1853–1868 (2018).
Soriano-Redondo, A., Gutiérrez, J. S., Hodgson, D. & Bearhop, S. Migrant birds and mammals live faster than residents. Nat. Commun. 11, 5719 (2020).
Kosman, E., Burgio, K. R., Presley, S. J., Willig, M. R. & Scheiner, S. M. Conservation prioritization based on trait‐based metrics illustrated with global parrot distributions. Divers. Distrib. 25, 1156–1165 (2019).
Violle, C., Reich, P. B., Pacala, S. W., Enquist, B. J. & Kattge, J. The emergence and promise of functional biogeography. Proc. Natl. Acad. Sci. 111, 13690–13696 (2014).
Kenis, M., Rabitsch, W., Auger-Rozenberg, M.-A. & Roques, A. How can alien species inventories and interception data help us prevent insect invasions? Bull. Entomol. Res. 97, 489–502 (2007).
Buss, D. F. et al. Stream biomonitoring using macroinvertebrates around the globe: a comparison of large-scale programs. Environ. Monit. Assess. 187, 4132 (2015).
Morse, J. C. et al. Freshwater biomonitoring with macroinvertebrates in East Asia. Front. Ecol. Environ. 5, 33–42 (2007).
Freshwater biomonitoring and benthic macroinvertebrates. (eds. Rosenberg, D. M. & Resh, V. H.) (Springer New York, 1993).
Jeliazkov, A. et al. A global database for metacommunity ecology, integrating species, traits, environment and space. Sci. Data 7, 6 (2020).
Sarremejane, R. et al. DISPERSE, a trait database to assess the dispersal potential of European aquatic macroinvertebrates. Sci. Data 7, 386 (2020).
Schmidt-Kloiber, A. & Hering, D. www.freshwaterecology.info – An online tool that unifies, standardises and codifies more than 20,000 European freshwater organisms and their ecological preferences. Ecol. Indic. 53, 271–282 (2015).
Li, Z. et al. The drivers of multiple dimensions of stream macroinvertebrate beta diversity across a large montane landscape. Limnol. Oceanogr. 66, 226–236 (2021).
Poff, N. L. et al. Functional trait niches of North American lotic insects: traits-based ecological applications in light of phylogenetic relationships. J. North Am. Benthol. Soc. 25, 730–755 (2006).
Vieira, N. K. M. et al. A database of lotic invertebrate traits for North America. US Geol. Surv. Data Ser. 187, 1–15 (2006).
Phillips, N. & Smith, B. New Zealand freshwater macroinvertebrate trait database. https://niwa.co.nz/freshwater/management-tools/aquatic-invertebrate-traits-database (2018).
National Institute of Environmental Research. 수생태계 현황 조사 및 건강성 평가 방법 등에 관한 지침: 하천편 (Guidelines for aquatic ecosystem survey and health assessment methods: stream/river) [In Korean language]. https://dl.nanet.go.kr/file/fileDownload.do?linkSystemId=NADL&controlNo=MONO1202054287 (2019).
Kim, P. J., Lee, J. H., Huh, I. A. & Kong, D. Development of benthic macroinvertebrates sediment index (BSI) for bioassessment of freshwater sediment. Int. J. Sediment Res. 34, 368–378 (2019).
Encyclopedia of Insects. (eds. Resh, V. H. & Cardé, R. T.). https://doi.org/10.1016/B978-0-12-374144-8.X0001-X (Elsevier, 2009).
Wissinger, S. A. Life History and Size Structure of Larval Dragonfly Populations. J. North Am. Benthol. Soc. 7, 13–28 (1988).
Purse, B. V. & Thompson, D. J. Voltinism and larval growth pattern in Coenagrion mercuriale (Odonata: Coenagrionidae) at its northern range margin. Eur. J. Entomol. 99, 11–18 (2002).
Öckinger, E. et al. Life-history traits predict species responses to habitat area and isolation: a cross-continental synthesis. Ecol. Lett. no-no, https://doi.org/10.1111/j.1461-0248.2010.01487.x (2010).
Rijnsdorp, A. D. et al. Estimating sensitivity of seabed habitats to disturbance by bottom trawling based on the longevity of benthic fauna. Ecol. Appl. 28, 1302–1312 (2018).
Miller, M. P., Blinn, D. W. & Keim, P. Correlations between observed dispersal capabilities and patterns of genetic differentiation in populations of four aquatic insect species from the Arizona White Mountains, USA. Freshw. Biol. 47, 1660–1673 (2002).
Kelly, L. C., Bilton, D. T. & Rundle, S. D. Population structure and dispersal in the Canary Island caddisfly Mesophylax aspersus (Trichoptera, Limnephilidae). Heredity (Edinb). 86, 370–377 (2001).
Gotthard, K., Berger, D. & Walters, R. What Keeps Insects Small? Time Limitation during Oviposition Reduces the Fecundity Benefit of Female Size in a Butterfly. Am. Nat. 169, 768–779 (2007).
Akin, S. & Winemiller, K. O. Body size and trophic position in a temperate estuarine food web. Acta Oecologica 33, 144–153 (2008).
Keppeler, F. W., Montaña, C. G. & Winemiller, K. O. The relationship between trophic level and body size in fishes depends on functional traits. Ecol. Monogr. 90 (2020).
McPeek, M. A., Schrot, A. K. & Brown, J. M. Adaptation to Predators in a New Community: Swimming Performance and Predator Avoidance in Damselflies. Ecology 77, 617–629 (1996).
Graham, J. B. Ecological, Evolutionary, and Physical Factors Influencing Aquatic Animal Respiration. Am. Zool. 30, 137–146 (1990).
Statzner, B. & Bêche, L. A. Can biological invertebrate traits resolve effects of multiple stressors on running water ecosystems? Freshw. Biol. 55, 80–119 (2010).
Feio, M. J. & Dolédec, S. Integration of invertebrate traits into predictive models for indirect assessment of stream functional integrity: A case study in Portugal. Ecol. Indic. 15, 236–247 (2012).
Céréghino, R. et al. Desiccation resistance traits predict freshwater invertebrate survival and community response to drought scenarios in a Neotropical ecosystem. Ecol. Indic. 119, 106839 (2020).
Rico, A. & Van den Brink, P. J. Evaluating aquatic invertebrate vulnerability to insecticides based on intrinsic sensitivity, biological traits, and toxic mode of action. Environ. Toxicol. Chem. 34, 1907–1917 (2015).
Forcellini, M. et al. Microhabitat selection by macroinvertebrates: generality among rivers and functional interpretation. J. Ecohydraulics 7, 28–41 (2022).
Belmar, O. et al. Functional responses of aquatic macroinvertebrates to flow regulation are shaped by natural flow intermittence in Mediterranean streams. Freshw. Biol. 64, 1064–1077 (2019).
Tomanova, S., Goitia, E. & Helešic, J. Trophic Levels and Functional Feeding Groups of Macroinvertebrates in Neotropical Streams. Hydrobiologia 556, 251–264 (2006).
Rawer-Jost, C., Böhmer, J., Blank, J. & Rahmann, H. Macroinvertebrate functional feeding group methods in ecological assessment. Hydrobiologia 422, 225–232 (2000).
Kwon, S.-J., Jeon, Y.-C. & Kim, M.-C. 물속 생물 도감: 저서성 대형무척추동물 (Underwater creature encyclopedia: Benthic macroinvertebrates) [In Korean Language]. (자연화생태 (Nature and Ecology), 2013).
Kwon, S.-J., Jeon, Y.-C. & Kim, M.-C. 화살표 물속생물 도감 (Encyclopedia of underwater creatures) [In Korean Language]. (자연화생태 (Nature and Ecology), 2017).
National Institute of Biological Resources. 국립생물자원관 한반도의 생물다양성 (National Museum of Biological Resources: Biodiversity of the Korean Peninsula). https://species.nibr.go.kr/ (2011).
National Biodiversity Center. 국가 생물다양성 정보공유체계 (National Biodiversity Information Sharing System) [In Korean Language]. https://www.kbr.go.kr (2018).
Adhurya, S., Lee, D-Y., Lee, D-S. & Park, Y-S. Macroinvertebrate functional trait database of South Korean stream. Version 1.2, figshare, https://doi.org/10.6084/m9.figshare.22010822 (2023).
Acknowledgements
We express our gratitude to Hun-Jeong Song, Jin A Yun, Yewon Kim, Yu-Jin Kim and Daeun Park for their invaluable assistance in data collection. We also extend our appreciation to Kyung Hee University in South Korea for providing the essential infrastructure and support for this research. Funding for this work was provided by the National Research Foundation of Korea (NRF) through the Korean government (MSIP) (grant number NRF-2019R1A2C1087099) and the Korea Environment Industry & Technology Institute (KEITI) through the Aquatic Ecosystem Conservation Research Program, which is supported by the Korean Ministry of Environment (MOE) (2020003050003).
Author information
Contributions
Sagar Adhurya: Conceptualization, Methodology, Data curation, Writing, Analysis, Visualisation; Da-Yeong Lee: Methodology, Validation, Visualisation, Data curation; Dae-Seong Lee: Visualisation, Writing, Data curation; Young-Seuk Park: Conceptualization, Supervision, Project administration, Funding acquisition.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Adhurya, S., Lee, DY., Lee, DS. et al. Functional trait dataset of benthic macroinvertebrates in South Korean streams. Sci Data 10, 838 (2023). https://doi.org/10.1038/s41597-023-02678-y
DOI: https://doi.org/10.1038/s41597-023-02678-y