Abstract
Load forecasting is crucial for the economic and secure operation of power systems. Extreme weather events, such as extreme heat and typhoons, can lead to more significant fluctuations in power consumption, making load forecasting more difficult. At present, due to the lack of relevant public data, the research on load forecasting under extreme weather events is still blank, so it is necessary to release a large-scale load dataset containing extreme weather events. The dataset includes electricity consumption data of industrial and commercial users under extreme weather events such as typhoons and extreme heat, which are collected at 15-minute intervals. The data is collected over six years from smart meters installed at the power entry points of users in southern China. The dataset consists of electricity consumption data from 386 industrial and commercial users in 17 industries, with more than 50 million records. During the recording period, extreme weather events such as typhoons and extreme heat are marked to form a total of 5,741 event records.
Similar content being viewed by others
Background & Summary
With the increasing penetration of distributed energy resources (DER), the need for load forecasting at all levels of the distribution system has increased dramatically1. Underestimating electricity demand can lead to low-quality service (even blackouts) provided by utility companies and pose a severe threat to the safe and stable operation of the power grid. Therefore, accurate short-term load forecasting is the basis of power system operation and planning, supporting many services such as:
-
unit commitment and scheduling of maintenance, i.e., electricity utility companies make unit commitment decisions and device maintenance plans based on the result of load forecasting2,3;
-
demand-side management, i.e., electricity market participants can adjust their bidding strategies based on demand-side management models supported by load forecasting4;
-
cogeneration, i.e., planning and optimizing electricity and heat production in combined heat and power plants based on the forecasting results to increase economic efficiency5;
-
system stability and security, accurate forecast results can help the grid balance generation and demand, thus ensuring system stability and security6,7,8.
Currently, there are some publicly available load datasets that provide such data at different scales and details for the above services. Most of these datasets contain appliance power consumption data of residential users. For example, the REFIT9 electrical load measurement dataset consists of eight-second interval electrical measurements of the total load of the entire house, collected continuously from 20 houses over two years. In BLUED10, voltage and current measurements of a US residence are sampled at 12 kHz for a week. Load data is recorded in UK-DALE11 for five residential users with a sampling rate of 16 kHz, one of which is recorded for 655 days. There are few non-residential load datasets, and they are mainly medium voltage or higher voltage load data12. ISO provides utility-scale load data for a total of 46 months from March 1, 2003, to December 31, 2006 in New Zealand13. In particular, since core information such as sales and operating conditions of a company can be analyzed from its load data, such kind of data is regarded as a commercial secret and is rarely disclosed14. To the best of the authors’ knowledge, public industrial load datasets are limited. A one-year dataset of electricity load curves with a temporal resolution of 15 minutes for 50 small and mid-size enterprises in Germany is released15. The electricity data of 10 manufacturing companies is collected at one-minute intervals for seven months from 1 March to 30 September 201914. The electricity consumption data of food and paper industries is presented16 and machine-level load data of a paper manufacturing factory in Brazil is investigated17.
To make matters worse, load forecasting is more difficult because extreme weather events can greatly affect inherent electricity usage patterns, such as extremely hot weather, which may lead to increased electricity consumption. Moreover, there are almost no relevant studies and public data on load forecasting under extreme weather events, which further increases the difficulty of load forecasting under extreme weather conditions. To solve the above problems, load data including large-scale industrial and commercial users under extreme weather events is collected and formed a dataset named Extreme Weather Events Load Dataset (EWELD) in this paper. EWELD is a public load dataset consisting of electricity consumption data from different industrial and commercial users with a sampling interval of 15 minutes. The electricity consumption data of 386 companies in 17 industries over a period of six years is recorded in this dataset, extreme weather events such as typhoons and extreme heat in the corresponding period are also recorded. The main contributions of this paper are as follows:
-
The load data of 386 enterprises in 17 industries for more than six years is collected and released. Extreme weather events during the load data collection time are collected and analyzed.
-
Electricity consumption patterns under extreme weather events and those under non-extreme weather events are compared and analyzed.
-
All data is provided in a unified and structured format with no missing or abnormal data and is easy to use.
Methods
In this section, methods used to create the Extreme Weather Events Load Dataset (EWELD) are introduced. First, the communication platform for collecting meteorological data and users’ electricity consumption data is described. Second, a measurement campaign conducted in large-scale industrial and commercial users in South China is presented. Finally, the processing method of the collected data and the generation method of the dataset are introduced in detail. The data in this dataset comes from an industrial partner, a power retailer with many industrial and commercial customers operating mainly in South China. This dataset focuses solely on commercial and industrial users and does not include residential data. The dataset contains data from industrial and commercial users in 17 different industries. Before data collection, each business and industry user in the dataset was informed of the data collection procedures and gave their consent. Furthermore, to protect the privacy of these users, we have implemented strict anonymization measures to ensure the identification of users cannot be identified from the data. These anonymization measures include data masking, redaction, and no direct identifiers. For data masking, the identifiable data, such as a user’s name, address, or other direct identifiers, are replaced by encoded references. These coded references do not correlate with the original data and cannot be reverse-engineered. For redaction, specific data values, especially outliers that might identify a specific user, are redacted or modified for consistency and anonymity. For no direct identifiers, direct identifiers such as phone numbers, addresses, or specific geographical locations, are omitted or transformed when forming the dataset. By employing these anonymization procedures, we aim to strike a balance between data utility and privacy, ensuring that the data is useful for research without compromising the identity and confidentiality of the users involved.
The communication platform
In the proposed communication platform, there are four main components: users, weather stations, the cloud system and the cloud database. The overall communication platform is illustrated in Fig. 1. For users, power grids provide them with electricity to supply the devices they belong to. The electricity consumption of the user is measured by the smart meter installed at the power entry point of the user. From December 2015 to April 2018, smart meters were gradually installed in multiple industrial and commercial users to collect electricity consumption. Smart meter using RS485 as its communication protocol collects data about electricity usage at a sampling interval of 15 minutes. Then, the collected data is transmitted via the RS485 protocol. To connect the smart meters to routers, an RS485 to RJ45 converter is used, converting the differential signals of RS485 to the voltage levels used by Ethernet. The router then receives the collected data and forwards it to the cloud system over a standard internet connection. For weather stations, m sensors are installed in three locations respectively and meteorological data is collected from 2015. Each sensor is responsible for collecting different meteorological characteristics such as temperature, humidity, and rainfall. Similarly, these sensors using RS485 as their communication protocol collect data about weather conditions at a sampling interval of 15 minutes. Then, the collected data is transmitted via the RS485 protocol. To connect the smart meters to routers, an RS485 to RJ45 converter is also used, converting the differential signals of RS485 to the voltage levels used by Ethernet. The router then transmits the meteorological data collected by these sensors to the cloud system through the Internet. After the cloud system receives the collected data, it transmits the data to the cloud database through the Internet for storage. Figure 1 illustrates the proposed communication platform.
Measurement of selected users
386 users from three cities in southern China are chosen for measurement, and smart meters are used to collect 15-minute sampling electricity consumption data. These users cover 17 sections and 45 divisions, according to the International Standard Industrial Classification of All Economic Activities (ISIC)18. These users are all industrial or commercial electricity customers, such as factories, retailers, enterprises, and public organizations. To maintain information security, the user names and locations are not disclosed in this paper. The detailed categories and corresponding numbers of users are listed in Table 1 (Production-related sections) and Table 2 (Services-related sections).
Data processing and generation
After data is collected from smart meters and sensors, data preprocessing is required to deal with missing data, outliers, and frequency inconsistencies. Next, methods for data processing and data generation are introduced.
Data preprocessing
After smart meter data is recorded and collected, differential operations are first conducted to obtain the electricity consumption data within the time period. Then data cleaning is performed on missing data, duplicated data and outliers. Missing values are filled with the last valid observation forward. Duplicated data is removed except for the first occurrence. Outliers are first detected based on Z-score19 which is shown in Eq. (1). x is the data, μ is the mean value of x and σ is the standard deviation of x. Z is set to three. The detected outliers are replaced with the mean value of the previous two observations. For data points not following a standard normal distribution, outliers are then detected using Interquartile Range (IQR) method20 and are replaced with the mean value of the previous 96 observations. Finally, the high-quality electricity consumption data of 386 users are obtained at a sampling interval of 15 minutes.
For the meteorological data collected by weather stations, data cleaning is also carried out in a similar manner as above. Duplicated meteorological data is first removed, and missing data is filled with the last valid observation. Based on the IQR method, outliers candidate sets are determined. If the value in the candidate set is not within the two times Z-score intervals of the previous two observations, the value will be replaced with the mean value of the previous two observations. The original meteorological data has a sampling interval of 30 minutes. To keep it consistent with the frequency of the load data for further analysis, linear interpolation is used to upsample it to obtain meteorological data with a sampling interval of 15 minutes21.
Selected extreme weather events
Studying the impact of extreme weather on electricity consumption patterns is very important for power supply security. There are 20 types of extreme weather events including high temperatures, high humidity, tropical storm, typhoon, and heavy rain are selected in this paper. The three cities where users in the dataset are all located in southern China and have a subtropical humid climate. Winters are short, mild, and relatively dry, while summers are long, hot, and very wet. The criteria of extreme weather events are defined based on statistical data, national standards, etc., combined with the characteristics of the three cities. The detailed descriptions of extreme weather events and their criteria are shown in Table 3. For EW1 to EW4, four extreme weather events are defined based on statistical values of temperature and humidity. For EW5 to EW7, three thunderstorm weather events are defined based on the wind gust data, according to the wind threat definition by National Oceanic and Atmospheric Administration (NOAA)22. For EW8 to EW12, some tropical storm and typhoon weather events are defined based on wind speed observations according to the Beaufort wind force scale. Moreover, typhoon data that occurred within a radius of 500 kilometers of these three cities from 2015 to 2022 is additionally collected as a supplement, including the typhoon name, time, and intensity. For EW13 to EW20, some other extreme weather events are identified based on the weather condition data.
Following these steps, a large-scale industrial and commercial load dataset in extreme weather events is constructed, named Extreme Weather Events Load Dataset (EWELD). It includes electricity consumption data of industrial and commercial users under extreme weather events such as typhoons and extreme heat, which are collected at 15-minute intervals. The data is collected over six years from smart meters installed at the power entry points of users in southern China. The dataset consists of electricity consumption data from 386 industrial and commercial users in 17 industries, with more than 50 million records. During the recording period, extreme weather events such as typhoons and extreme heat are marked to form a total of 5,741 event records.
Data Records
EWELD is available to the public through Figshare23 in .csv or .xlsx formats. Table 4 provides a summary of the folder structures, description of data files, output variables in each file, and the format of these files. The whole dataset includes four folders, among which the electricity consumption and weather data folders contain the basic data, extreme weather and user location folders contain the necessary supplementary information, including extreme weather filtering criteria and other useful indexes.
For the electricity consumption data of 386 users, some features are summarized in Table 5. The electricity consumption data covers from June 2016 to August 2022, with multiple variable records for different users. The statistics of the electricity consumption data for different types of users are shown in Table 6, including mean values, standard deviation, skew, kurtosis, and percentiles. Electricity consumption of manufacturing users (Section C) has a large standard deviation since there are various types and sizes of manufacturing factories. Users of Section C and G tend to operate with higher power consumption, and users of Section S (Other service activities) consume lower electricity. There are 274,752 weather data records with a 15-minute sampling interval for each city, covering the period from 01/01/2015 00:00 to 01/11/2022 23:30. The weather data statistics for different numerical meteorological indicators in three cities are shown in Table 7.
Technical Validation
The sensors and data acquisition devices are manufactured by well-known companies that produce meters and sensors for industrial and residential installations around the world. Meter calibration is done by the meter manufacturers before shipping to the factory. The calibration process is proprietary, and we are not privy to the process. Additionally, we checked if the measured values are within the range given by the manufacturers of the meters. Measurements outside the range or at its edge suggest a wrongly designed measurement infrastructure. All measurements are found to be well within the given range though. This section presents the data visualization to show the quality and technical validity of the dataset, including data integrity, reliability, and effectiveness. The data integrity is presented by data availability plots. The data reliability is verified by determining the discrepancy of annual total electricity consumption of these users of two metering systems. In addition, annual, seasonal, monthly, weekly, and daily pattern plots are represented to provide characteristic insights into electricity consumption and operating states of different industries. The data effectiveness is then verified by electricity consumption diagrams around extreme weather events to demonstrate the impact of extreme weather on electricity consumption.
Data integrity
EWELD mainly includes two types of data, electricity consumption data of 386 users and weather data from 3 cities where these users are located. After data cleaning, the weather data realizes approximately 100% data availability from January 2015 to October 2022. The electricity consumption data comprises 217,055 15-minute steps and shows various availability for the data collection period (02/06/2016 ~ 10/08/2022) due to some force majeure factors. These factors include but are not limited to the different installation time points of smart meters, closures of some factories, communication network interruptions, and data collection failures. Among 386 users, there are 30 users with data completeness greater than 99% and 63 users greater than 90%. For a better overview, data availability for each industrial section is visualized in Fig. 2. The available data plot for the entire data collection period (where the missing data is indicated using white lines) is shown on the left side of the figure. Most industrial sections have large data availability with three sections less than 50%, D (Electricity, gas, steam and air conditioning supply), J (information and communication), M (Professional, scientific and technical activities). Only one user is collected for each of these three industrial sections (U256, U280, U353). These users only have data available for some time for various reasons. For example, we started to collect smart meter data of U256 from customers in July 2016 but stopped collecting due to the user changed their business venue in November 2018. Note that data missing is mainly caused by data gaps larger than one day. The completeness criterion states that a complete day has at least 95% of the expected records24. Statistical results indicate that more than 99% of the days have more than 91 records, i.e. the vast majority of days meet the completeness criterion. Moreover, missing data can be further recovered through missing data imputation methods to guarantee data quality, such as regression-based and deep-learning-based methods25,26.
Data reliability
The annual total electricity consumption according to the dataset is compared with the value reported by the electric company. Very close values are obtained with errors less than 2%. For further validation, the annual, monthly, weekly, and daily pattern plots of some users are presented to illustrate the quality of records properly. The energy consumption discretized in 15-minute intervals for 18 representative users during the data collection period is presented in Fig. 3, covering 17 industrial sections. It shows an overview of the electricity consumption of various users these years. Different types of users show various electricity consumption characteristics. For instance, U165 (Steel pipe factory) uses more electricity in the mid-year and shows a typical single-peak electricity consumption curve throughout the year, while U258 (sewer systems) presents no significant seasonal variations in electricity consumption all year round. Next, monthly, weekly, and daily statistics of users are drawn to show the significant consumption patterns, aligning with the corresponding electricity consumption behavior of different industries and verifying the reliability of this dataset. Fig. 4 illustrates the monthly electricity consumption profiles of 9 users from different industrial sections, obtained by averaging the electricity consumption during the data collection period by day of the month. All users show a regular periodical change. Fig. 5 shows the weekly electricity consumption patterns, obtained by averaging the electricity consumption during the data collection period by day of the week. Daily electricity consumption patterns are shown in Fig. 6, with weekday and weekend differences for distinct users. U10 (timber production company), U364 (police station), and U381 (hospital) show little difference in electricity consumption on Monday and Sunday. At the same time, the other six users present lower electricity consumption on Sunday than Monday, especially for U165 (steel pipe factory) and U380 (primary school).
Data effectiveness
To show the effectiveness of the dataset in capturing dynamic changes and reflecting the impact of significant events on electricity consumption, U380 is chosen as an example to analyze the weather impact on its electricity consumption. U380 is a primary school located in City C2. Fig. 7 presents the relationship between electricity consumption and two important meteorological indicators, temperature and humidity, in 2018. The diagonal subplots show the distribution of values for different variables. For example, temperature data presents a Poisson Distribution in Fig. 7a. Non-diagonal subplots show the scatter plots between two variables and it can be found that the electricity consumption of U380 has a strong correlation with temperature and humidity, aligning with the law of facts. Specifically, higher temperatures and humidity tend to result in higher electricity consumption, which is caused by greater dependency on air conditioners and electric fans. Taking Fig. 7g as an example, temperature and electricity consumption are positively related. The chaotic points in Fig. 7d indicate that there is no evident relationship between temperature and humidity.
Next, the impact of different types of extreme weather on its electricity consumption is shown in Fig. 8, indicating the consistency with the reality that various extreme events lead to different impacts on electricity consumption. It presents the electricity consumption of the day extreme weather happened (D-0) and compares it with that of the previous day (D-0) and the same day of the last week (D-7). Low temperature has little influence on electricity consumption since some of the central air conditioners in these three cities do not have the heating capability, and the lowest temperatures in these cities in 2018 are still above 41 Fahrenheit degrees without the need to turn on the air conditioners for heating. On the other hand, high temperatures and high humidity will lead to higher electricity consumption due to air conditioning cooling and dehumidification. In addition, tropical storms and typhoons tend to result in lower electricity consumption since the school generally chooses to be closed to ensure safety after receiving the warning weather information.
Usage Notes
For the privacy protection of the users, the raw data collected is not publicly available. Although the EWELD dataset published here is the data preprocessed through some simple data cleaning procedures, significant information remained unchanged, such as the electricity consumption patterns in different cycles. The codes published are licensed under the MIT license. We recommend users follow the guidance on Github, including the details of installation, package usage, dataset navigation, and code navigation. The raw data for this project was obtained through a collaboration between the authors of this work and an industry partner. This partner is a well-respected power retailer and was integral in supplying the information we needed for our analyses. Unfortunately, while we are unable to share the exact source due to confidentiality agreements, researchers who wish to repeat our work or perform similar studies can approach such companies. This could be either through direct contact or by establishing collaborations as we did for this project. Please note that obtaining such data might require appropriate agreements regarding data privacy and usage to be put in place. We recommend researchers consider such ethical aspects when planning their projects.
The dataset can significantly contribute to multiple potential applications. Firstly, in user behavior analysis, the dataset can reveal valuable insights into patterns and habits of energy consumption. This can aid in the development of targeted energy conservation strategies, thereby optimizing the power system for user-specific demands. Secondly, the dataset can contribute greatly to load forecasting. Accurate load forecasting is essential in maintaining the balance between energy supply and demand, hence avoiding possible outages and inefficiencies. The rich data available can enhance the precision of short-term and medium-term load predictions, assisting utility companies in making more informed decisions about resource allocation and grid management. Lastly, the dataset can facilitate the analysis of extreme event impacts. Understanding how extreme weather events affect power loads is crucial in this era of growing climate uncertainties. This dataset allows scholars and policymakers to quantify and predict these impacts, leading to the development of more resilient power systems that can withstand such events. Each of these applications holds promise for improving the efficiency, resilience, and sustainability of our power systems.
Code availability
The code implementation was done using Python. Source codes that were used to develop and analyze the data are publicly available in the GitHub repository (https://github.com/Judy0718/EWELD).
References
Nti, I. K., Teimeh, M., Nyarko-Boateng, O. & Adekoya, A. F. Electricity load forecasting: A systematic review. Journal of Electrical Systems and Information Technology 7, 1–19, https://doi.org/10.1186/s43067-020-00021-8 (2020).
Lebotsa, M. E., Sigauke, C., Bere, A., Fildes, R. & Boylan, J. E. Short term electricity demand forecasting using partially linear additive quantile regression with an application to the unit commitment problem. Applied Energy 222, 104–118, https://doi.org/10.1016/j.apenergy.2018.03.155 (2018).
Padhy, N. P. Unit commitment-a bibliographical survey. IEEE Transactions on power systems 19, 1196–1205, https://doi.org/10.1109/TPWRS.2003.821611 (2004).
Ghasemi, A., Shayeghi, H., Moradzadeh, M. & Nooshyar, M. A novel hybrid algorithm for electricity price and load forecasting in smart grids with demand-side management. Applied energy 177, 40–59, https://doi.org/10.1016/j.apenergy.2016.05.083 (2016).
Żymełka, P. & Szega, M. Issues of an improving the accuracy of energy carriers production forecasting in a computer-aided system for monitoring the operation of a gas-fired cogeneration plant. Energy 209, 118431, https://doi.org/10.1016/j.energy.2020.118431 (2020).
Mohan, N., Soman, K. & Kumar, S. S. A data-driven strategy for short-term electric load forecasting using dynamic mode decomposition model. Applied energy 232, 229–244, https://doi.org/10.1016/j.apenergy.2018.09.190 (2018).
Gross, G. & Galiana, F. D. Short-term load forecasting. Proceedings of the IEEE 75, 1558–1573, https://doi.org/10.1109/PROC.1987.13927 (1987).
Hafeez, G. et al. A novel hybrid load forecasting framework with intelligent feature engineering and optimization algorithm in smart grid. Applied Energy 299, 117178, https://doi.org/10.1016/j.apenergy.2021.117178 (2021).
Murray, D., Stankovic, L. & Stankovic, V. An electrical load measurements dataset of united kingdom households from a two-year longitudinal study. Scientific data 4, 1–12, https://doi.org/10.1038/sdata.2016.122 (2017).
Filip, A. et al. Blued: A fully labeled public dataset for event-based nonintrusive load monitoring research. In 2nd workshop on data mining applications in sustainability (SustKDD), vol. 2012, https://api.semanticscholar.org/CorpusID:25397318 (2011).
Kelly, J. & Knottenbelt, W. The uk-dale dataset, domestic appliance-level electricity demand and whole-house demand from five uk homes. Scientific data 2, 1–14, https://doi.org/10.1038/sdata.2015.7 (2015).
Ren, Y., Suganthan, P. N., Srikanth, N. & Amaratunga, G. Random vector functional link network for short-term electricity load demand forecasting. Information Sciences 367, 1078–1093, https://doi.org/10.1016/j.ins.2015.11.039 (2016).
Zhang, Y.-F. & Chiang, H.-D. Enhanced elite-load: A novel cmpsoatt methodology constructing short-term load forecasting model for industrial applications. IEEE Transactions on Industrial Informatics 16, 2325–2334, https://doi.org/10.1109/TII.2019.2930064 (2019).
Lee, E., Baek, K. & Kim, J. Datasets on south korean manufacturing factories’ electricity consumption and demand response participation. Scientific Data 9, 1–11, https://doi.org/10.1038/s41597-022-01357-8 (2022).
Braeuer, F. Load profile data of 50 industrial plants in germany for one year. Zenodo, https://doi.org/10.5281/zenodo.3899018 (2020).
Valdes, J. & Camargo, L. R. Synthetic hourly electricity load data for the paper and food industries. Data in brief 35, 106903, https://doi.org/10.1016/j.dib.2021.106903 (2021).
Martins, P., Nascimento, V. B., Freitas, A., Silva, P. & Pinto, R. G. D. Industrial machines dataset for electrical load disaggregation. IEEE Dataport https://doi.org/10.21227/cg5v-dk02 (2018).
United Nations. International standard industrial classification of all economic activities (isic), rev.4. https://unstats.un.org/unsd/classifications/Econ/Download/In%20Text/ISIC_Rev_4_publication_English.pdf (2008).
Liu, G. et al. Real-time corporate carbon footprint estimation methodology based on appliance identification. IEEE Transactions on Industrial Informatics 19, 1401–1412, https://doi.org/10.1109/TII.2022.3154467 (2022).
Whaley, D. L. III The interquartile range: Theory and estimation. Ph.D. thesis, East Tennessee State University (2005).
Liu, G., Gu, J., Zhao, J., Wen, F. & Liang, G. Super resolution perception for smart meter data. Information Sciences 526, 263–273, https://doi.org/10.1016/j.ins.2020.03.088 (2020).
National Oceanic and Atmospheric Administration (NOAA). Wind threat defined. https://www.weather.gov/mlb/wind_threat (2020).
Liu, G. et al. Eweld: A large-scale industrial and commercial load dataset in extreme weather events, Figshare, https://doi.org/10.6084/m9.figshare.21893808.v3 (2023).
Chavat, J., Nesmachnow, S., Graneri, J. & Alvez, G. Ecd-uy, detailed household electricity consumption dataset of uruguay. Scientific Data 9, 1–16, https://doi.org/10.6084/m9.figshare.16965400 (2022).
Liang, G. et al. Super resolution perception for improving data completeness in smart grid state estimation. Engineering 6, 789–800, https://doi.org/10.1016/j.eng.2020.06.006 (2020).
Liu, G. et al. Super resolution perception for wind power forecasting by enhancing historical data. Frontiers in Energy Research 1403, https://doi.org/10.3389/fenrg.2022.959333 (2022).
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China (Key Program 72331009, 72171206 and 42105145), the Shenzhen Key Lab of Crowd Intelligence Empowered Low-Carbon Energy Network (No. ZDSYS20220606100601002) and the Shenzhen Institute of Artificial Intelligence and Robotics for Society (AIRS).
Author information
Authors and Affiliations
Contributions
G. Liu and J. Liu designed the general approach of developing the dataset and conducted the data collection, data processing, validation, and data analysis. Y. Bai performed data visualization. C. Wang and H. Sheng conducted the data processing. H. Zhao and J. Qiu supported data analysis and reviewed the manuscript. G. Liang and J. Zhao advised and managed the projects that funded this research and revised the manuscript. G. Liu and J. Liu contributed equally to this work. All authors participated in discussions and reviews during the development of this manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liu, G., Liu, J., Bai, Y. et al. EWELD: A Large-Scale Industrial and Commercial Load Dataset in Extreme Weather Events. Sci Data 10, 615 (2023). https://doi.org/10.1038/s41597-023-02503-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02503-6
- Springer Nature Limited