Abstract
To extract meaningful and reproducible models of brain function from stroke images, for both clinical and research proposes, is a daunting task severely hindered by the great variability of lesion frequency and patterns. Large datasets are therefore imperative, as well as fully automated image post-processing tools to analyze them. The development of such tools, particularly with artificial intelligence, is highly dependent on the availability of large datasets to model training and testing. We present a public dataset of 2,888 multimodal clinical MRIs of patients with acute and early subacute stroke, with manual lesion segmentation, and metadata. The dataset provides high quality, large scale, human-supervised knowledge to feed artificial intelligence models and enable further development of tools to automate several tasks that currently rely on human labor, such as lesion segmentation, labeling, calculation of disease-relevant scores, and lesion-based studies relating function to frequency lesion maps.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Background & Summary
Stroke is the 5th more frequent cause of death and a leading cause of long-term disability in the United States1. Extracting meaningful and reproducible models of brain function from stroke images is a daunting task severely hindered by the great variability of lesion frequency and patterns. A corollary to this problem is that large datasets are imperative to encompass the possible lesion-function relationships. While biomedicine has seen a shift from “anecdotal” experiences to objective, data-supported evidence based on large amounts of data, many lesion-based studies failed to weather this transition, as evidenced by a plethora of underpowered designs leading to inexact extrapolations, or to conclusions that cannot be validated on external populations2,3,4,5,6. In addition, technical developments with artificial intelligence (AI) depend on the availability of high quality, large scale, human-supervised dataset to generate and test meaningful and reproducible models7,8,9. Although unsupervised and self-supervised techniques can extract valuable insights from unannotated data, their success still currently depends on the specific task, as well as the quality and quantity of available data. For instance, domain adaptation or transfer learning from unsupervised models, or from models trained with unrelated data (e.g., Large Language Model Meta AI models, LLAMA210) might be highly efficient only if fine-tunned with expert labeled data. Therefore, combining these methods with expert-annotated data can further improve the accuracy and reliability of AI models in medical imaging applications.
A public dataset of acute stroke MRIs, associated with lesion delineation and organized non-image information will potentially enable clinical researchers to advance in clinical modeling and prediction. It will also enable the bioengineering community to develop and test AI algorithms of technical and clinical relevance, e.g., for lesion segmentation, brain mapping, and automatic generation of labels and scores. AI applications in various diseases, such as in chest X-ray, dermatology and histopathology images, and detection of breast cancer in mammography, have drastically increased, due to the availability of large image datasets11,12,13,14,15,16. Brain MRIs, particularly in acute conditions, offer extra challenges to the organization of large datasets, such as the lack of data (MRI scan is costly, therefore less common), the large variability among scanners and protocols, and the volumetric nature of the data which hinders annotation and expert labeling. As of today, the most successful examples of open-source collections of annotated MRIs are probably the brain tumor dataset of 750 patients included in the Medical Segmentation Decathlon (MSD)17, used in the Brain Tumor Image Segmentation (BraTS) challenge, and the FastMRI+18, a collection of about 7 thousand brain MRIs, with diverse pathologies, some of them with bound-box 2D annotation. In acute stroke, the lack of such large, annotated, high quality dataset, rather than mathematical or computational resources, is the current bottleneck for AI development.
The first efforts to create stroke repositories started with population-based epidemiological studies19,20 and did not include images. Starting in the 2,000’s, both the medical and the bioengineering communities acknowledged the need for a central repository for acute stroke images, in addition to metadata. Initiatives such as the “Acute Stroke Imaging Research Roadmap”21 initiated such effort, with the goal of standardizing imaging techniques, accessing the accuracy and clinical utility of imaging markers, and validating imaging biomarkers relevant to clinical outcomes. Since then, various consortiums and trials22,23,24,25,26,27 were able to accumulate large amounts of data, often available “after competition” and/or “upon request”. These conditions, however, do not guarantee that the data are shared under’FAIR’ principles28,29. FAIR stands for Findable, Accessible, Interoperable, and Reusable. Findable data are assigned persistent identifiers and well-described metadata, ensuring their easy discovery. Accessible data are openly available with clear access protocols, promoting transparency and inclusivity. Interoperable data are structured in a way that facilitates integration across diverse platforms and tools, enabling seamless collaboration and analysis. Lastly, reusable data are properly documented, allowing researchers to effectively understand, reproduce, and build upon previous work. Embracing the FAIR principles not only accelerates scientific discovery but also fosters a culture of responsible and efficient data sharing within the research community. A search in generalist repositories (e.g., Dataverse, Mendeley Data, Dryad, Open Science Framework, Vivli) or using tools suited to find “open data” (e.g., Google Dataset Search, Data Citation Index, Data.gov) mostly reveals end-analysis data that do not serve purposes such as technical development. In addition, datasets from published studies usually involve a modest number of subjects and are research-focused, acquired with homogeneous and particular protocols that do not reflect the noise and variability of clinical data, hindering the translational potential.
We share the first annotated large dataset of clinical acute stroke MRIs, associated to demographic and clinical metadata. Recently, a dataset of (mostly) chronic stroke lesions annotated in high resolution
T1-WIs (ATLAS30, followed by ATLASv231) under the ENIGMA Stroke Recovery initiative32 was well received by the neuroscience and bioengineering communities. The ATLAS has been used to improve lesion segmentation of chronic lesions in high resolution images, to create new tools for processing chronic stroke MRIs, as well as for education proposes33,34. Acute stroke MRIs, however, require specialized processing because of the particular lesion intensity characteristics, the images low-resolution, heterogeneity, and noise. The organizers of the Ischemic Stroke Lesion Segmentation Challenge 2022 (ISLES22) recently released 250 MRIs with acute stroke masks35. An analogous large, independent, multi-modality and clinical-representative dataset of acute strokes is highly anticipated.
The resource we present consists of 2,888 clinical MRIs of patients admitted with acute or early subacute stroke. It includes diverse protocols and MRI modalities, with typical clinical resolution. The large sample, as well as the technical and population heterogeneity, improve the potential generalization of models developed with these data. It includes diverse metadata, comprised of demographic information, basic clinical profile (including National Institutes of Health Stroke Scale (NIHSS) scores, 90 days follow up modified Rankin Score (mRS), hospitalization duration, biometric screening at hospital admission and discharge, and associated health conditions), and expert description of the acute lesion. The stroke lesion is manually defined in the diffusion weighted images (DWI); the images are provided in native subject space and in standard space (Montreal Neurological Institute, MNI). The data format and organization follows the Brain Imaging Data Structure, BIDS36 guidelines, facilitating navigation and sharing. To the best of our knowledge, this is the first large clinical MRI dataset shared under FAIR principles, and is available at the Inter-university Consortium for Political and Social Research, ICPSR (https://www.icpsr.umich.edu/web/ICPSR/studies/38464)37.
Methods
Cohort
Clinical data and MRIs were obtained retrospectively from patients admitted from 2009–2019 to the Johns Hopkins Comprehensive Stroke Center. The dataset creation, under waiver of informed consent, and its sharing model followed the recommendations the Johns Hopkins Internal Review Board and were approved by this board (IRB00228775). About 500 stroke patients are admitted annually, and an estimated 70% of them have MRI at admission, the majority between 6–24 hours after symptoms. To create the dataset presented here, we included patients admitted with the clinical diagnosis of acute stroke that had MRIs with DWI. A neuroradiologist (AVF) excluded those whose scans had artifacts considered impeditive of the visual analysis, as well as post-operative or strokes secondary to etiologies other than vascular, e.g., secondary to brain tumors (15% of cases). The final dataset includes 2,888 patients.
An expert neuroradiologist (AVF) and a stroke neurologist (RL), both with more than 20 years of experience, reviewed the lesions to provide qualitative descriptions of the type of lesion and location. According to their radiological appearance at MRI, the lesions were categorized as: (1) ischemic, which are lesions primarily hyperintense in DWI and hypo/isointense in the apparent diffusion coefficient (ADC); (2) hemorrhage, when any signal of bleeding, intra or extra-parenchymal was detected, or (3) “not visible” when the stroke lesion was not visually detected. Note that hemorrhage includes hemorrhagic transformation of ischemic strokes, as well as primary intraparenchymal, subarachnoid, subdural and intraventricular hemorrhages. The category “not visible” includes mostly transient ischemic attacks (TIA) or strokes with volume bellow the image resolution. Note that the radiological classification of “lesion type” aims to facilitate image-based organization and search, and does not necessarily corresponds to the clinical diagnosis of “stroke type” (Ischemic Stroke, Embolic Stroke, TIA, Intracranial Hemorrhage, Subarachnoid Hemorrhage). The “stroke type” was recorded at patient’s admission and is also provided with the dataset.
The demographic and clinical information recorded at admission and discharge is provided for each patient, following the BIDS36 recommendation. The itemized description of the information available is in the “Dictionary” (Supplementary Material). The population, image and lesions profiles are listed in Table 1. The mean age of the patients was 62.16 years-old (±14.68); ages ranged from 18 to 99 years-old. There was a slightly predominance of male (52.87%) over female. African American/Black was the predominant racial group (43.52%), followed by Caucasians (31.79%). The mean NIHSS score at admission was 5.80 ± 6.48. The “hemorrhage” group had the highest mean scores, followed by the “ischemic”; the “not-visible” had the lowest. The length of hospitalization, which indirectly reflects the severity of the stroke, was 6.63 days in average. The “hemorrhage” group had the longest length, followed by the “ischemic”; the “not-visible” had the shortest. The results of the following laboratorial tests were recorded at admission: cholesterol profile, hemoglobin a1c, serum creatinine, prothrombin international normalized ratio, fasting glucose; the means are listed in Table 1. We also report blood pressure at admission, ambulation status (prior, at admission and at discharge), body mass index (BMI), and modified Rankin scores (mRS) collected by phone interview 90 days after the stroke event. The previous medical condition most often reported was hypertension (60.80%), followed by dyslipidemia (31.68%) and diabetes (25.66%). We report time from symptoms to scan in patients who were (or whose caregiver was) highly confident about symptoms onset. In most of cases, the MRI was performed 6 or more hours after the initial symptoms. The MRI scan was performed after acute treatment (intravenous tissue plasminogen activator, ivtPA, in the majority, followed by thrombolysis) in 43.32% of patients.
MRIs
The MRIs were collected in 11 MRI scanners, over 10 years. This resulted in a large data variability, due to the various image protocols used over the years in different machines, scanners changes and updates, as well as modifications in acute stroke guidelines over this period. Shortly, the MRIs were performed in 1.5 T (61%) and 3 T (39%), whole-body Siemens (91.66%), Toshiba (0.21%), Phillips (7.20%), and GE (0.93%) scanners. Summaries of the acquisition parameters for all the MRI modalities in the Supplementary Table 1 and Fig. 1. The DWIs had high in plane (axial) resolution (1x1mm, or less), and typical clinical high slice thickness (2–7 mm).
Almost all patients (98.8%) had at least one MRI modality other than DWI that met the visual quality control standards and is provided with the dataset. Additional MRI sequences, and the percentage of scans that had these sequences, are: T1-weighted images (T1-WI, before or after exogenous contrast injection, n = 2,373, 82%), high resolution T1-WI MPRAGE (n = 1,298, 45%), T2-WI (n = 2,581, 90%), FLAIR (n = 2,746, 95%), Susceptibility-WI (SWI, n = 2,106, 73%), and Perfusion-WI (PWI, originally 34.2%; but only n = 531, 18.4% had “readable” quality PWIs, over passing the quality-control check).
The images were fully de-identified by removing all HIPAA (Health Insurance Portability and Accountability)-protected health information direct and indirect identifiers. The original DICOM files were converted to Neuroimaging Informatics Technology Initiative, Nifti format (nii.gz/json) using dcm2niix (https://github.com/rordenlab/dcm2niix) with the anonymization option according BIDS guide-lines. Note that the “.json” preserves the technical information from the image header. All the high resolution T1-WI MPRAGE, and the low resolution T1-WIs and FLAIR with full head coverage were defaced using FSL (https://surfer.nmr.mgh.harvard.edu/fswiki/mrideface). Another round of visual quality control was preformed to secure complete anonymization, including 3D reconstruction of each image to guarantee impossibility of face recognition. The overall structure of the archive is represented in Fig. 1 and detailed in the sections below, as well as in the “Data Availability”.
Lesion masks
Although there is no perfect method for defining the lesion core, we chose to use DWI and Apparent Diffusion Coefficient maps (ADC), based on the fact that DWI is the most informative and most common sequence performed for acute stroke detection. Likewise, prior acute stroke studies and trials defined the lesion core in DWI, so our data will be broadly comparable to those investigations. As the majority of MRIs are performed 6 or more hours after symptoms, the odds of significant change in the lesion volume is low38. Nevertheless, we tabulated the time between symptoms onset and the MRI and make it available, so one can estimate the stability of the DWI-defined lesion.
The methodological description of the lesion delineation procedures, inter- and intra-rater reliability measures, and additional technical validation of the lesion tracings are reported in “Technical Validation”. We note that the lack of ground truth for segmentation is a well know problem in imaging analysis and that lesion tracing is a subjective process, even across trained evaluators. This reinforces the importance of our exhaustive revision and final definition by consensus. Figure 2 shows the distribution of lesions according volumes and location. As shown in Table 1, the intra-parenchymal hemorrhagic lesions were significantly larger than the ischemic lesions. There was a slight predominance (not significant) of lesions in the left hemisphere compared to the right. The volumes of ischemic lesions showed a significant correlation with NIHSS (r = 0.57; p-value < 0.0001), as expected.
“Post-processed” images
Image mapping to common coordinates (e.g., to standard templates) and intensity normalization are common steps required in most pipelines for imaging processing. To expand the access to the dataset, in addition to the native data, we offer the DWI (plus B0 and ADC) and the stroke and brain masks mapped to standard MNI space. In order to convert the images to standard coordinates, we: (1) Resampled DWI, B0, and ADC into 1 × 1 × 1 mm3; (2) Skull-stripped with an in-house “UNet BrainMask Network”39; (3) Used sequential linear transformations40 to map B0 (less affected by the acute stroke) into JHUMNIB041, a template in MNI space.
Using the resulting transformation matrix, the brain and the lesion mask were registered to the MNI template by nearest neighborhood interpolation, to keep their binary nature. Because these are clinical low resolution images, with high slice thickness and, often, a fair amount of tilt on the z-axis (as they are axial oriented), the regular steps for linear transformation tend to perform less well than they do on high resolution images. Therefore, two rigorous steps of quality control were performed on the MNI-converted images: one qualitative, by visual analysis, and the other quantitative, based on how the global brain contour fits the template, as detailed in the Technical Validation section.
Regarding to the intensity normalization, it is unlikely that we can offer all the possible options that are ideal for each specific study. Some of these options are straightforward and can be easily generated by users (e.g., normalization using z-scores or maximum intensity). Others (e.g., by self-supervised methods), can be prospectively derived for particular studies using this resource. We opted by offering DWIs normalized by a method that proved successful in homogenizing images across different lesion types, in minimizing major differences in scanners (e.g. magnetic field), and in reducing the complexity and time to train Deep Learning Networks for lesion segmentation (Table 2), as detailed in in the Technical Validation section.
Probabilistic maps of lesions and “radiological normal” templates
Using the intensity-normalized DWIs in standard space (MNI) of the cases classified as “not-visible” strokes, we created average and standard deviation maps, here called “radiological normal” templates (Fig. 3). We note that “radiological normal” is an imperfect name, as these cases may still have abnormalities not directly related to the current stroke episode, such as white matter microvascular chronic lesions or brain atrophy. Nevertheless, such templates are representative of the radiological aspect of the brain tissue not directly affected by the acute stroke, of our population. Such templates are potentially useful for technical development, e.g., for modeling voxel classification by intensity.
We also created frequency maps of the “ischemic” and inrta-parenghymal “hemorrhage” lesions, with the simple purpose of visualizing the distribution of lesions across the dataset (Fig. 4). We performed a population-based averaging of the lesion masks in MNI space, producing a voxel-wise map where values can range from 0 at each voxel (no lesion in any subject) to 1 (100% presence of the lesion across subjects). The frequency maps and the “radiological normal” templates are provided with the dataset. Because the individual lesion masks are available, users are able to create multiple other types of templates and atlases that best fit their interest.
Data Records
The dataset is deposited in ICPSR37 (https://doi.org/10.3886/ICPSR38464). Because these data were originally assembled under a waiver of patient consent, the dataset is released as a restricted-use collection under a Data Use Agreement (DUA). Each subject is identified by an 8-digit random code. The data structure, format, and naming follow the BIDS guidelines, and is as follows (see Fig. 1):
-
1.
The main folder “raw-data” contains the image data (nii.gz and json files) in the native space of each subject in two subfolders:
“DWI”, with 4D DWI/B0 and ADC
“anat”, with T1-WI, MPRAGE, T2-WI, FLAIR, PWI, SWI
-
2.
The folder “DWI-mask” contains images in native space of manually-defined lesion masks
brain masks
3D DWI, B0, and “recalculated” ADC
-
3.
The folder “DWI-MNI-IntensityNormalized” contains images in standard MNI space, mapped to JHU_SS_MNI template41, of
DWI, B0, ADC, lesion mask, brain mask Intensity-normalized DWI
-
4.
The folder “phenotype” contains
Individual files with structured metadata of each subject
-
5.
The folder “templates” contains the following images in MNI space, according to JHU_SS_MNI template41
Average and standard deviation of “radiological normal” DWIs.
Frequency maps of ischemic and hemorrhage lesions.
-
6.
The folder “documentation” contains
The metadata dictionary (as in this Supplementary Material), in .txt and .json formats
The “dataset-description.json”, with the required description fields per BIDS specifications
The readme file, describing the general structure of the dataset
Technical Validation
Lesion delineation and agreement between lesion tracers
All the delineations were performed using ROIEditor (https://www.mristudio.org). A “seed growing” tool in ROIEditor was often used to achieve a broad segmentation, followed by manual adjustments. The segmentation was performed by two individuals highly experienced (more than 10 years) in lesion tracing (JH, XX). Additionally, they were trained by detailed instructions and illustrative files, in a subset of 220 cases (10% of the dataset cases with lesions). These cases were then revised by a neuroradiologist (AVF), discussed with the evaluators, and retrace and revised after 2 weeks. After achieving consensus, the evaluators started working on the whole dataset. The neuroradiologist revised all the segmentations and identified the suboptimum cases that were re-traced. The segmentations were revised as many times as necessary, until reaching final decision by the consensus of the tracers and the neuroradiologist. In the ischemic lesions, the evaluators looked for hyperintensities in DWI and/or hypointensities (<30% average brain intensity) in ADC. Additional contrasts were used to rule out chronic lesions or microvascular white matter disease. In the hemorrhage “lesion type”, extra modalities (SWI, T1WI, T2WI, FLAIR) were used to trace, in addition to DWI. Extra-parenchymal hemorrhage (intraventricular or subarachnoid) was not traced. The mean time for tracing was 7 min. The lesion definition was saved as a binary mask (lesion = 1, background = 0), in the original image space of each subject.
We calculated inter-and intra-rater reliability using the Dice similarity coefficient, which indicates if the same voxels are being selected as part of the lesion mask or not. Values range between 0 and 1 (1 is total agreement). For Dice calculation, we used the set of 220 lesions traced twice. This sample had the same proportion of ischemic and hemorrhagic lesions as the whole sample. The inter-rater Dice was 0.68 ± 0.23, while the intra-rater Dice was 0.72 ± 0.14. The agreement was better in ischemic lesions (0.76 ± 0.14 inter-evaluator and 0.79 ± 0.12 intra-evaluator) compared to hemorrhage. We also calculated the intraclass correlation coefficient (ICC) for the lesion volumes. The ICC ranges from 0–1; 1 is total agreement. The inter- and intra-rater ICC were 0.96 and 0.98, respectively. We reinforce that the final decision for the lesion segmentation in the whole dataset was made after many revisions and by consensus between the tracers and an expert neuroradiolist.
Automated skull stripping - brainmask network
The deep-learning method used for skull stripping, in order to reduce the complexity and computational time of the process, is described in our previous paper39. Briefly, to generate the gold standards brain masks, the DWI and B0 images from the “not-visible” cases were resampled into 1 × 1 × 1 mm3 and skull striped by a level set algorithm (available with ROIEditor), with W5 = 1.2 and 4, respectively (see explanation about choice of parameters in MRIstudio.org). The resulting brain masks (the union of masking on DWI and B0) were manually corrected by our annotators, serving as ground true for the “UNet BrainMask Network”. To train the network, all images are mapped to MNI and down-sampled to 4 × 4 × 4 mm3. The final brain mask inferenced by the network was then post-processed by the closing and the “binary_fill_holes” functions from Python scipy module, upsampled to 1 × 1 × 1 mm3, and dilated by one voxel with image smoothing. The Dice agreement between the “gold-standard” brain masks and those obtained with our network was above 99.9%, in an independent test set. The average processing time was about 19 seconds (against 4.3 min taken by the level-set algorithm), making it suitable for large scale, fast processing.
DWI intensity normalization
The process described here is similar to that descried in our previous publication39, now extended to the whole dataset, including the cases with hemorrhage lesions.
Intensity-normalization increases the comparability between subjects and, as normalizing images to a standardized space, is crucial for diverse image analytical processes. Although the lesion might affect intensity distribution, we assume that the majority of brain voxels are from healthy tissue and can be a good reference for intra- and inter-individual comparison. We used bimodal Gaussian function, as in42, in Eq. (1) to fit the intensity histogram of DWI and cluster two groups of voxels: the “brain tissue” (the highest peak) and “non-brain tissue” (the lowest peak at lowest intensities, composed mostly by cerebrospinal fluid).
where ai, bi, ci are the coefficients of the scale, mean, and standard deviation of Gaussian distribution. ai, bi, ci are calculated by least-square fitting the bimodal Gaussian function to the intensity histogram of individual DWI. DWI intensities are normalized to make the “brain tissue” intensity with zero mean and one standard deviation.
Figure 5a show that the DWI intensity distribution of voxels in a brain with ischemic lesions (blue), one with hemorrhage (green), and in a brain with “not visible” lesion (orange), prior-to (left column) and post-to (right column) intensity normalization. We note that the preservation of the minor peak at high intensities in the brain with ischemic lesion indicates the preservation of the lesion contrast after normalization. Figure 5b–d show the distribution of DWI intensities in groups of images, prior-to (left column) and post-to (right column) intensity normalization. We note that the distributions are much more homogeneous, and the individual variations are smaller after intensity normalization. More importantly, intensity differences between different magnetic fields and scan manufacturers are ameliorated after intensity normalization.
We also note that this normalization approach helped to reduce the complexity and time to train Deep Learning Networks for ischemic lesion segmentation. We experimented training UNet with DWIs normalized by our proposed method (‘ProposedNorm’) and three others: (1) standard z-score normalization on whole images (‘StandardNorm’), (2) standard z-score normalization on brain-masked region only (‘BrainMaskStandardNorm’), and (3) Max-Min normalization (‘MaxMinNorm’). We kept all other procedures for training the network, inferencing predicts, and the post-processing the same, as described in39. We used the intensity normalized DWI and ADC as inputs, and 5-fold cross-validation. Table 2 shows that the Dice scores between automated and manually traced images were higher when using images intensity-normalized with the proposed method.
Finally, in addition to the ADC from the scanners, we offer ADCs “recalculated” as:
where I(x, y, z), I0(x, y, z) are the intensity of DWI and B0 voxels, respectively, at (x, y, z)-coordinate, with b-value = 1000.
Quality control for images in standardized space, MNI
Clinical images offer extra challenges for brain mapping to standardized space, because of their high slice thickness and fair amount of tilt out-of-plane, therefore requiring further stringent quality control. The quality control of the normalized images was performed in two steps: (1) qualitative: a neuroradiologist looked at the MNI-normalized images with the MNI template brain mask overlaid, in order to rule-out obvious misalignments, and (2) quantitative: performed as described below.
-
a)
Three regions of 5 voxels bandwidth were defined in the template: the outside strip of the brain mask (OSBM), the inside strip of the brain mask (ISBM), and the outside strip of the lateral ventricles (OSLV), as shown in Fig. 6, top.
-
b)
The ratio of the mis-deformed voxels in OSBM, ISBM and OSLV for each subject, defined as γOSBM, γISBM, and γOSLV, was calculated as follows:
γOSBM indicates the ratio of the deformed B0 voxels aligned outside the template brain mask and γISBM indicates the ratio of the background voxels aligned inside the template brain mask. High γOSBM or γISBM indicte possible issues with the global brain mapping. γOSLV indicates the ratio of voxels from a subject’s deformed ventricles that exist outside the template lateral ventricle boundaries. High γOSLV is common in this population since aged subjects’ lateral ventricles are usually larger than the template’s lateral ventricles and indicate the need for local, and possibly non-linear deformation for brain mapping.
The average γOSBM was 0.2042 ± 0.0628 (medium = 0.1929, range [0.0723, 0.5070]). The average γISBM was 0.2683 ± 0.0426 (medium = 0.2666, range [0.1560, 0.4273]). This means that a small minority of voxels were outer or inner the template brain contour, when considering a stringent bandwidth of 5 voxels. Importantly, the small ratio of error was stable over stroke types, magnetic field, or scan manufacturer (Fig. 6).
Usage Notes
Multimodal public repositories for research data have been organized on the pillars of “FAIR” principles28: they are designed to be maximally “Findable, Accessible, Interoperable, and Reusable”. These repositories and centralized collections43,44,45,46,47, combined with initiatives to establish semantic and analytical consensus48,49, are likely to represent the core structure for future neuroscience research. The sharing of clinical data, however, is complicated by technical and regulatory issues. While the models of research data sharing are not adoptable in their exact same form for clinical data, they inspire a similar sharing structure, respecting the conditions under which the data are usable, without limiting accessibility.
We share a large dataset of clinical acute stroke MRIs, associated to demographic and clinical metadata, in alignment with the broad aim of the biomedical community to share FAIR data. Although a challenge for imaging processing, the image heterogeneity is an important feature of the dataset as it guarantees that tools developed using these images can be applied broadly. Providing multimodal image data is another important achievement as it will enable to train and test models that rely in multimodal integration and/or data fusion. The data organization, in BIDS36 recommended format, is compatible, or can be easily converted to, newly developed semantic standards, such as the NeuroImaging Data Model (NIDM)48. This provides critical capability to generate “computable data objects”, that can be readily used by the AI community and are user-friendly organized to improve access to non-expert data analysts. It also makes easy to integrate with other ongoing open science efforts29,43, analytical pipelines (such as in brainlife, https://brainlife.io/about/), application program interfaces (APIs), modules for quality control50,51 and harmonization52,53,54, and indexing and management engines55. These capabilities enable the use of this resource not only for discovery, but also for data synthesis and augmentation56,57,58, and to aid reproducibility and replication studies59,60.
Specifically, the dataset presented here could be used to train, test, or “transfer learning” to algorithms for lesion segmentation, providing highly important metrics for acute treatment, such as the volume of the ischemic core and perfusion deficits. For example, we recently used the images with ischemic strokes from this dataset to developed a public, user-friendly tool to generate “computable data objects”61 (https://www.nitrc.org/projects/ads). We also developed a public, user-friendly tool for ischemic lesion segmentation and quantification39, overcoming limitations of previous algorithms not tested in large numbers of real clinical data62,63. We created the first public digital atlas of brain arterial territories64 (https://www.nitrc.org/projects/arterialatlas), based on the frequency lesion maps of 1,298 of these cases. This dataset could also be used to develop and test algorithms for mapping low resolution images of brains with lesions. This mapping allows the examination of the overlap of the lesion with specific brain structures, like those defined in our arterial territory atlas or others. This enables voxel-based lesion symptom mapping and the automated calculation of relevant scores, as for example, we did using this dataset by automatically estimating ASPECTS65. The association of image annotations and lesion description also enabled us to develop automated image retrieval engines and to generate automated radiological reports66. Furthermore, this dataset is potentially useful as a general training and testing resource for translational research. In fact, we initiate several of those efforts by using this dataset and the tools enabled through it to study laboratory67 and anatomic-functional relations68, to explore bias in clinical measures69, to study populational trends70, and to test hypothesis developed in external, smaller datasets.
The main limitation of this dataset is that it originates from a single center. Although we used data from a certified Comprehensive Stroke Center, whose population reflects the profile of the US national population with stroke, and scans with great technical heterogeneity (collected along 10 years, in eleven scanners, and with dozens of different protocols), a regional bias71 might exist. For instance, our population includes a higher percentage of Black and lower percentage of Hispanic/Latinx and Asian patients than many urban stroke centers. We expect that future sharing and indexing initiatives enable the enrichment of this dataset with data from multiple other centers, worldwide, reducing possible population biases. Nevertheless, this dataset will serve as a valuable resource for training and testing models, particularly those for technical development and for image processing.
Code availability
No custom code was used.
References
Virani, S. S. et al. Heart disease and stroke statistics—2020 update: a report from the american heart association. Circulation 141, e139–e596 (2020).
Gajardo-Vidal, A. et al. How distributed processing produces false negatives in voxel-based lesion- deficit analyses. Neuropsychologia 115, 124–133 (2018).
Lorca-Puls, D. L. et al. The impact of sample size on the reproducibility of voxel-based lesion-deficit mappings. Neuropsychologia 115, 101–111 (2018).
Mah, Y.-H., Husain, M., Rees, G. & Nachev, P. Human brain lesion-deficit inference remapped. Brain 137, 2522–2531 (2014).
Shahid, H. et al. Important considerations in lesion-symptom mapping: Illustrations from studies of word comprehension. Hum. brain mapping 38, 2990–3000 (2017).
Wilson, S. M. Lesion-symptom mapping in the study of spoken language understanding. Lang. Cogn. Neurosci. 32, 891–899 (2017).
Esteva, A. et al. Deep learning-enabled medical computer vision. npj digit. Med 4, 1–9 (2021).
Geirhos, R. et al. Shortcut learning in deep neural networks. Nat. Mach. Intell. 2, 665–673 (2020).
Willemink, M. J. et al. Preparing medical imaging data for machine learning. Radiology 295, 4–15 (2020).
Touvron, H. et al. Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288, (2023).
Armato, S. G. III et al. The lung image database consortium (lidc) and image database resource initiative (idri): a completed reference database of lung nodules on ct scans. Med. physics 38, 915–931 (2011).
Bejnordi, B. E. et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. Jama 318, 2199–2210 (2017).
Halling-Brown, M. D. et al. Optimam mammography image database: a large-scale resource of mammography images and clinical data. Radiol. Artif. Intell. 3, e200103 (2020).
Irvin, J. et al. Chexpert: A large chest radiograph dataset with uncertainty labels and expert comparison. In Proceedings of the AAAI conference on artificial intelligence, vol. 33, 590–597 (2019).
Tschandl, P., Rosendahl, C. & Kittler, H. The ham10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. data 5, 1–9 (2018).
Wang, X. et al. Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly- supervised classification and localization of common thorax diseases. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2097–2106 (2017).
Simpson, A. L. et al. A large annotated medical image dataset for the development and evaluation of segmentation algorithms. arXiv preprint arXiv:1902.09063 (2019).
Zhao, R. et al. fastmri+, clinical pathology annotations for knee and brain fully sampled magnetic resonance imaging data. Sci. Data 9, 152 (2022).
Broderick, J. et al. The greater cincinnati/northern kentucky stroke study: preliminary first-ever and total incidence rates of stroke among blacks. Stroke 29, 415–421 (1998).
D’Agostino, R. B., Wolf, P. A., Belanger, A. J. & Kannel, W. B. Stroke risk profile: adjustment for antihypertensive medication. the framingham study. Stroke 25, 40–43 (1994).
Wintermark, M. et al. Acute stroke imaging research roadmap. Stroke 39, 1621–1628 (2008).
Albers, G. W. et al. A multicenter randomized controlled trial of endovascular therapy following imaging evaluation for ischemic stroke (defuse 3) (2017).
Giese, A.-K. et al. Design and rationale for examining neuroimaging genetics in ischemic stroke: The mri-genie study. Neurol. Genet. 3 (2017).
Nagakane, Y. et al. Epithet: positive result after reanalysis using baseline diffusion-weighted imaging/perfusion-weighted imaging co-registration. Stroke 42, 59–64 (2011).
Sandercock, P., Wardlaw, J., Lindley, R., Whiteley, W. & Cohen, G. Ist-3 stroke trial data available. The Lancet 387, 1904 (2016).
Saver, J., et al. Stent-retriever thrombectomy after intravenous t-pa vs. t-pa alone in stroke [published online april 17, 2015]. N Engl J Med. doi 10.
Thomalla, G. et al. Dwi-flair mismatch for the identification of patients with acute ischaemic stroke within 4· 5 h of symptom onset (pre-flair): a multicentre observational study. The Lancet Neurol. 10, 978–986 (2011).
Wilkinson, M. D. et al. The fair guiding principles for scientific data management and stewardship. Sci. data 3, 1–9 (2016).
Sansone, S.-A. et al. Fairsharing as a community approach to standards, repositories and policies. Nat. biotechnology 37, 358–367 (2019).
Liew, S.-L. et al. A large, open source dataset of stroke anatomical brain images and manual lesion segmentations. Sci. data 5, 1–11 (2018).
Liew, S.-L. et al. A large, curated, open-source stroke neuroimaging dataset to improve lesion segmentation algorithms. Sci. data 9, 320 (2022).
Liew, S.-L. et al. The enigma stroke recovery working group: Big data neuroimaging to study brain–behavior relationships after stroke. Hum. brain mapping (2020).
Bing, Y., Garcia-Gonzalez, D., Voets, N. & Jérusalem, A. Medical imaging based in silico head model for ischaemic stroke simulation. J. mechanical behavior biomedical materials 101, 103442 (2020).
Wang, Y., Juliano, J. M., Liew, S.-L., McKinney, A. M. & Payabvash, S. Stroke atlas of the brain: Voxel-wise density-based clustering of infarct lesions topographic distribution. NeuroImage: Clin. 24, 101981 (2019).
Hernandez Petzsche, M. R. et al. Isles 2022: A multi-center magnetic resonance imaging stroke lesion segmentation dataset. Sci. data 9, 762 (2022).
Gorgolewski, K. J. et al. The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments. Sci. data 3, 1–9 (2016).
Faria, A. V. Annotated Clinical MRIs and Linked Metadata of Patients with Acute Stroke, Baltimore, Maryland, 2009–2019. ICPSR https://doi.org/10.3886/ICPSR38464.v5 (2022).
Wheeler, H. M. et al. The growth rate of early dwi lesions is highly variable and associated with penumbral salvage and clinical outcomes following endovascular reperfusion. Int. J. Stroke 10, 723–729 (2015).
Liu, C.-F. et al. Deep learning-based detection and segmentation of diffusion abnormalities in acute ischemic stroke. Commun. Medicine 1, 1–18 (2021).
Woods, R. P., Grafton, S. T., Holmes, C. J., Cherry, S. R. & Mazziotta, J. C. Automated image registration: I. general methods and intrasubject, intramodality validation. J. computer assisted tomography 22, 139–152 (1998).
Mori, S. et al. Stereotaxic white matter atlas based on diffusion tensor imaging in an icbm template. Neuroimage 40, 570–582 (2008).
Shinohara, R. T. et al. Statistical normalization techniques for magnetic resonance imaging. NeuroIm- age: Clin. 6, 9–19 (2014).
Markiewicz, C. J. et al. The openneuro resource for sharing of neuroscience data. Elife 10, e71774 (2021).
Landis, D. et al. Coins data exchange: An open platform for compiling, curating, and disseminating neuroimaging data. NeuroImage 124, 1084–1088 (2016).
Neu, S. C., Crawford, K. L. & Toga, A. W. Sharing data in the global alzheimer’s association interactive network. Neuroimage 124, 1168–1174 (2016).
Crawford, K. L., Neu, S. C. & Toga, A. W. The image and data archive at the laboratory of neuro imaging. Neuroimage 124, 1080–1083 (2016).
Kennedy, D. N., Haselgrove, C., Riehl, J., Preuss, N. & Buccigrossi, R. The nitrc image repository. NeuroImage 124, 1069–1073 (2016).
Keator, D. B. et al. Towards structured sharing of raw and derived neuroimaging data across existing resources. Neuroimage 82, 647–661 (2013).
Larson, S. D. & Martone, M. Neurolex. org: an online framework for neuroscience knowledge. Front. neuroinformatics 7, 18 (2013).
Klapwijk, E. T., Van De Kamp, F., Van Der Meulen, M., Peters, S. & Wierenga, L. M. Qoala-t: A supervised-learning tool for quality control of freesurfer segmented mri data. Neuroimage 189, 116–129 (2019).
Kim, H. et al. The loni qc system: a semi-automated, web-based and freely-available environment for the comprehensive quality control of neuroimaging data. Front. neuroinformatics 13, 60 (2019).
Ning, L. et al. Cross-scanner and cross-protocol multi-shell diffusion mri data harmonization: Algorithms and results. NeuroImage 221, 117128 (2020).
Garcia-Dias, R. et al. Neuroharmony: A new tool for harmonizing volumetric mri data from unseen scanners. NeuroImage 220 (2020).
Da-Ano, R. et al. Performance comparison of modified combat for harmonization of radiomic features for multicenter studies. Sci. Reports 10, 1–12 (2020).
Halchenko, Y. O. et al. Datalad: distributed system for joint management of code, data, and their relationship. J. Open Source Softw. 6, 3262 (2021).
Dar, S. U. et al. Image synthesis in multi-contrast mri with conditional generative adversarial networks. IEEE transactions on medical imaging 38, 2375–2388 (2019).
Xia, T., Chartsias, A. & Tsaftaris, S. A. Pseudo-healthy synthesis with pathology disentanglement and adversarial learning. Med. Image Analysis 64, 101719 (2020).
Bowles, C. et al. Brain lesion segmentation through image synthesis and outlier detection. NeuroImage: Clin. 16, 643–658 (2017).
Botvinik-Nezer, R. et al. Variability in the analysis of a single neuroimaging dataset by many teams. Nature 582, 84–88 (2020).
Kennedy, D. N. et al. Everything matters: the repronim perspective on reproducible neuroimaging. Front. neuroinformatics 13, 1 (2019).
Faria, A. V. & Liu, C. F. Acute-stroke detection segmentation (ads). [tool/resource]. washington: Nitrc. https://doi.org/10.5281/zenodo.5579390 (2022).
Chen, L., Bentley, P. & Rueckert, D. Fully automatic acute ischemic lesion segmentation in dwi using convolutional neural networks. NeuroImage: Clin. 15, 633–643 (2017).
Zhang, R. et al. Automatic segmentation of acute ischemic stroke from dwi using 3-d fully convolu- tional densenets. IEEE transactions on medical imaging 37, 2149–2160 (2018).
Faria, A. V. & Liu, C. F. Arterial atlas [tool/resource]. washington: Nitrc. https://doi.org/10.25790/bml0cm.109 (2021).
Liu, C.-F. et al. Automatic comprehensive aspects reports in clinical acute stroke mris. Sci. reports 13, 3784 (2023).
Liu, C.-F. et al. Automatic comprehensive radiological reports for clinical acute stroke mris. Commun. Medicine 3, 95 (2023).
Leigh, R. & Faria, A. V. Poor glycemic control is associated with worse blood-brain barrier disruption in ischemic stroke patients (147th Annual Meeting of the American Neurological Association, October 22-25, 2022, Chicago, IL, USA., 2022).
Stein, C. et al. Various tests of left neglect are associated with distinct territories of hypoperfusion in acute stroke. Brain communications 4, fcac064 (2022).
Vitti, E., Kim, G., Stockbridge, M. D., Hillis, A. E. & Faria, A. V. Left Hemisphere Bias of NIH Stroke Scale is Most Severe for Middle Cerebral Artery Strokes. Front. Neurol. 13 (2022).
Kim, G., Vitti, E., Stockbridge, M. D., Hillis, A. E. & Faria, A. V. Association of inferior division mca stroke location with populations with atrial fibrillation incidence. Heliyon 9 (2023).
Howard, V. J. et al. The reasons for geographic and racial differences in stroke study: objectives and design. Neuroepidemiology 25, 135–143 (2005).
Acknowledgements
We thank Sandhya Ramachandran, Xuejie Wang, Isabella Lorence, Akuna Mezu, Mah Noor, Esthefani Garcia, and Jonathan B. A. Assuncao for technical assistance. This research was supported by the National Institute of Deaf and Communication Disorders, NIDCD, through R01 DC05375, R01 DC015466, P50 DC014664 (AEH. AVF) and the National Institute of Biomedical Imaging and Bioengineering, NIBIB, through P41 EB031771 (SM, AVF).
Author information
Authors and Affiliations
Contributions
A.V.F. and C.L. conceived and designed the study, analyzed, and interpreted the data, drafted the work. J.H., X.X., X.L. analyzed the data. R.L., B.J., V.U., A.E.H. acquired the data and revised the draft. S.M. revised the draft.
Corresponding author
Ethics declarations
Competing interests
Susumu Mori owns “AnatomyWorks”. This arrangement is in accordance to the Johns Hopkins University conflict-of-interest policy. The reaming authors have no competing interest to disclose.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liu, CF., Leigh, R., Johnson, B. et al. A large public dataset of annotated clinical MRIs and metadata of patients with acute stroke. Sci Data 10, 548 (2023). https://doi.org/10.1038/s41597-023-02457-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02457-9
- Springer Nature Limited
This article is cited by
-
The stroke outcome optimization project: Acute ischemic strokes from a comprehensive stroke center
Scientific Data (2024)
-
APIS: a paired CT-MRI dataset for ischemic stroke segmentation - methods and challenges
Scientific Reports (2024)