Introduction

The use of cannabis during pregnancy has increased in recent years [1,2,3,4], becoming the most commonly used drug, excluding alcohol and tobacco, among pregnant women in the US [5, 6]; recent estimates suggest an increase from 3.4% in 2017 to 7.0% in 2022 [4, 7] with these figures still rising post COVID-19 pandemic [8]. The exponential increase in frequency is likely to be a consequence of legalisation and decriminalisation of cannabis [9,10,11,12], which may have decreased the public perception of any risk associated with maternal cannabis consumption during pregnancy [13,14,15]. However, the limited empirical research on prenatal cannabis exposure (PCE) has resulted in a lack of clarity and understanding regarding any potential risks and long-term harm to exposed individuals [16,17,18].

Recent epidemiological studies indicate an impact of PCE on neonatal health and neurodevelopment [18]. Specifically, PCE is associated with physical neonatal health metrics such as birth weight, birth timing and infant growth restriction [2, 19,20,21] but also traits related to childhood and adolescent neurodevelopment and brain function, for example, higher-order executive function [22, 23], and psychosocial and neurobehavioural traits such as psychopathy [24] and behavioural development [25]. Furthermore, PCE has observed impacts on Conduct Problem (CP) phenotypes such as externalising behaviours [26], learning disorders, and autism [27], and may increase the risk of neurodevelopmental and psychiatric disorders such as depression, anxiety, attention deficit and psychosis/schizophrenia in adulthood [28]. While these studies are associative, they warrant further investigation to clarify the risk profile of PCE. This is because PCE has the potential to directly impact offspring neurodevelopment; cannabis can cross the placenta [29], and can therefore impact the developing neonatal endocannabinoid system (ECS) [22]. The ECS is integral for correct prenatal neurodevelopment, with roles in neurite outgrowth, migration, and axon growth and guidance [30], as well as synaptic plasticity [31, 32], neuronal specification [33] and immune regulation [34]. Thus, given the central role of the ECS in brain development [22], any interference with this system via PCE may disrupt ECS signalling at critical stages of development, potentially altering structural and functional neurodevelopment [35], brain growth [36], and nervous system functioning in later life [22]. Consequently, because neurodevelopmental disorders and diseases have a complex aetiology, molecular evidence is required to be able to determine whether PCE is implicated in neurodevelopmental outcomes.

Neurodevelopmental phenotypes are often mediated at the molecular level by environmentally-induced genetic and epigenetic change [37,38,39,40,41], such as changes to DNA methylation (an epigenetic modification that is sometimes associated with gene expression levels [42,43,44,45]). Indeed, targeted studies from animal models indicate that PCE can impact DNA methylation in the genomes of exposed offspring, e.g. refs. [46,47,48,49]. Further studies show an impact of cannabis on the methylome of adult cannabis users, at genes involved in neuronal signalling [50, 51]. Additionally, cannabis exposure is associated with differential methylation at genes enriched for cardiogenesis and neurodevelopment in sperm [52, 53]. Overall, large-scale studies of PCE on human populations, while starting to emerge (e.g. refs. [23, 54]) are still scant, and thus the true nature of the relationship between PCE and neurodevelopment is still not fully understood. Consequently, the scientific and medical communities are in agreement that there is an urgent need for more evidence and research into the impacts of prenatal cannabis exposure on infant neurodevelopment.

Thus, to investigate the molecular impact of PCE, here we assessed DNA from two independent longitudinal studies: i) the Avon Longitudinal Study of Parents and Children (ALSPAC, United Kingdom), and; ii) the Christchurch Health and Development Study (CHDS, New Zealand). Using DNA methylation arrays (Illumina Infinium Human Methylation 450 K BeadChip [ALSPAC] and Illumina Infinium MethylationEPIC BeadChip [CHDS]) over the lifecourse (birth, 7 y and 15 y [ALSPAC] and 27 y [CHDS]) we performed epigenome-wide association studies (EWAS) to investigate the effect of PCE on genome-wide DNA methylation patterns, to explore the hypothesis that PCE will be associated with differential DNA methylation at genes and in pathways that have roles in neurodevelopment and neurodevelopmental disease. The findings from this study will contribute to our knowledge around the potential for PCE to influence neurodevelopmental outcomes.

Methods

Avon Longitudinal Study of Parents and Children cohort characteristics

Pregnant women resident in Avon, UK with expected dates of delivery between 1st April 1991 and 31st December 1992 were invited to take part in the study [55, 56]. 20,248 pregnancies were identified as being eligible and the initial number of pregnancies enroled was 14,541. Data is available for 13,988 children who were alive at 1 year of age. A subset of individuals from the Accessible Resource for Integrated Epigenomics Studies (ARIES [57]) at age 0 (N = 858), 7 y (N = 924), and 15 y (N = 922) were assessed for PCE (Table 1). As a longitudinal study, each time point represents the same group of patients, re-sampled at three different times. PCE was characterised based on the self-reported answer to smoked cannabis during pregnancy. Exposure was defined as mothers that responded with ‘every day’ use, 2–4 times per week, <once per week, and once per week. Non-exposed was defined as mothers that responded with ‘not at all’. Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee and the Local Research Ethics Committees. Informed consent for the use of data collected via questionnaires and clinics was obtained from participants following the recommendations of the ALSPAC Ethics and Law Committee at the time. Please note that the study website contains details of all the data that is available through a fully searchable data dictionary and variable search tool (http://www.bristol.ac.uk/alspac/researchers/our-data/). Consent for biological samples has been collected in accordance with the Human Tissue Act (2004).

Table 1 Cohort characteristics for ALSPAC and CHDS.

Christchurch Health and Development Study cohort characteristics

The CHDS includes individuals who have been studied on 24 occasions from birth to age of 40 (N = 904 at age 30), with blood collected at approximately age 27. A subset of N = 98 individuals for whom blood samples were available are included in the current study (Table 1). Of these 98 participants, N = 13 were prenatally exposed to cannabis, either with or without co-exposure to tobacco, with N = 85 serving as unexposed controls (neither prenatal cannabis nor prenatal tobacco exposure). Usage data was derived from self-reports from mothers at their birth interviews (undertaken in 1977). Tobacco consumption was measured via four questions: How many cigarettes per day did you smoke a) before you were pregnant; b) during the first three months of pregnancy; c) during the second three months of pregnancy; and d) during the last three months of pregnancy. The answers for each trimester were summed to create a measure of the average number of cigarettes smoked during each trimester. Cannabis exposure was assessed via the following self-report question: During pregnancy, did you take any of the following drugs? (y/n response option), with option a) cannabis. Mode of cannabis consumption in this cohort was via smoking, for all participants. All aspects of the study were approved by the Southern Health and Disability Ethics Committee, under application number CTB/04/11/234/AM10 “Collection of DNA in the Christchurch Health and Development Study”, and the CHDS ethics approval covering the collection of cannabis use: “16/STH/188/AM03 The Christchurch Health and Development Study 40 Year Follow-up”.

DNA extraction and genome-wide methylation profiling for CHDS

DNA was extracted from whole blood samples using the Kingfisher Flex System (Thermo Scientific, Waltham, MA USA), as per the published protocols. DNA was quantified via NanoDropTM (Thermo Scientific, Waltham, MA USA) and standardised to 100 ng/μl. Equimolar amounts were shipped to the Australian Genomics Research Facility (AGRF, Melbourne, VIC, Australia) for processing via the Infinium® Methylation EPIC BeadChip (Illumina, San Diego, CA USA). The arrays were conducted in groups over four batches (2016, 2017, 2020 and 2022). Analysis was carried out in R statistical software (Version 3.5.2) [58].

Data analysis for CHDS

Quality control checks were performed on the raw data: firstly, sex chromosomes and a total of 90 failed probes (detection P value of <0.01 in at least 50% of samples) were excluded from the analysis. Additionally, CpG sites with adjacent single nucleotide variants or that did not map to a single location in the genome were also excluded [59], removing a total of 195,354 probes from the final CHDS analysis. Functional normalisation was performed and was inspected using beta density distribution plots and multi-dimensional scaling of the 5000 most variable CpG sites. Both slide and array positions were corrected for using ComBat (as implemented by ChAMP [60]) to correct for the batch effects between sampling (four batches). Cell proportions were estimated using the Housman algorithm [61].

DNA extractions and analysis for ALSPAC

DNA samples were extracted from cord blood on delivery, and from peripheral blood samples at two time points in childhood (7 y and 15–17 y) following established methods [62], and methylation detection was undertaken using the Illumina Infinium Human Methylation450 BeadChip. Data was available post-normalisation and with estimated cell proportions with the Houseman algorithm [61] for the peripheral blood and the cord blood cell type reference for cord blood samples [63].

EWAS analysis

To partition the data for epigenome-wide analysis, data were separated into several analysis cohorts, by cohort origin (ALSPAC or CHDS), and by age. Specifically, for ALSPAC data, we undertook EWAS at three time points (Table 1); birth, where N = 30 case participants were identified, comprised of N = 10 exposed to cannabis prenatally, “PCE”, plus N = 20 exposed to cannabis and tobacco prenatally, “PCTE”, vs N = 654 controls (neither prenatal cannabis nor tobacco exposure); at 7 y old, where N = 31 case participants were identified, comprised of N = 9 PCE and N = 22 PCTE, vs. N = 700 controls, and at 15 y where N = 32 case participants were identified, comprised of N = 11 PCE and N = 21 PCTE, vs. N = 701 controls. Overall there was missing data on PCE status at each of the time points (N = 30, cord blood; N = 31, 7 y; N = 32, 15 y) and these individuals were removed from further analysis. For CHDS data, we undertook EWAS using N = 13 individuals exposed to either cannabis only (“PCE”) or cannabis and tobacco (“PCTE”) prenatally vs. N = 85 controls (Table 1).

ALSPAC EWAS analyses were controlled for prenatal tobacco exposure, but this was not possible for CHDS due to small sample sizes. Therefore data for ALSPAC are presented as PCE, corrected for prenatal tobacco exposure, while CHDS data are presented as prenatal cannabis and tobacco exposure, PCTE. To overcome the limitation of a small number of PCE individuals in CHDS, which may prohibit direct comparison between ALSPAC and CHDS datasets, we also present data from EWAS of individuals prenatally exposed to tobacco only (“PTE”, N = 55, Table 1) for CHDS, in order to distinguish between cannabis-specific vs. tobacco-associated differential methylation in the CHDS cohort.

The effects of PCE (ALSAPC) and PCTE (CHDS) exposure on methylation were analysed by linear regression with an empirical Bayes correction (limma) for: prenatal tobacco exposure (for PCE/ALSPAC only) (bivariate), adult tobacco status (bivariate), adult cannabis use (bivariate), sex (bivariate), the first principal component of the estimated cell proportions (CD8+, CD4+, granulocytes, B cells, monocytes, natural killer) as covariates. A linear regression model was then fitted to the data with the Q-Q plots of the residuals used to generate lambda values to assess for over-inflation. A similar linear regression model was used to assess prenatal tobacco exposure (PTE, CHDS only), however, the variable prenatal cannabis exposure (bivariate) was corrected for, along with adult tobacco smoking status, sex, and the first principal component of the cell type proportions. Top tables of differentially methylated CpG sites, which were corrected for multiple testing using Benjamini-Hochberg (BH) false discovery rate (FDR), were generated for all models. Differentially methylated CpG sites that were intergenic were matched to the nearest neighbouring genes in Hg19 using Granges [64]. The package ggplot2 (Version 3.3.2) was used to construct all graphs [65].

Gene ontology

We performed gene ontology enrichment analyses to determine which pathways (biological process, BP; molecular function, MF; cellular component, CC) were significantly enriched in EWAS data, at 0 y, 7 y, and 15–17 y (ALSPAC) and at 27 y (CHDS). The top 5000 differentially methylated loci in response to PCE (ALSPAC) or PCTE (CHDS; nominal P < 0.01) were selected and submitted to the EWAS Toolkit [66]. ALSPAC samples were run against a 450 K array background, and CHDS samples were run against the EPIC/850 K array background.

Results

Genome wide methylation analysis of prenatal cannabis exposure in ALSPAC cohort at 0 y, 7 y, and 15–17 y

Individual EWAS were performed for all three-time points in the ALSPAC cohort for PCE. With Benjamini–Hochberg (BH) false discovery rate (FDR) correction for multiple testing, 104 CpG sites were significantly differentially methylated (P adj < 0.05) at 0 y (cord blood, Table 2a, Supplementary Table 1 and Supplementary Fig. 1), 49 of which were hypomethylated, compared to 55 that were hypermethylated. 36 CpG sites were significantly differentially methylated (P adj < 0.05) at 7 y (whole blood, Table 2b, Supplementary Table 2 and Supplementary Fig. 2), 17 of which were hypomethylated, compared to 19 that were hypermethylated. 552 CpG sites were significantly differentially methylated (P adj < 0.05) at 15–17 y (whole blood, Table 2c, Supplementary Table 3 and Supplementary Fig. 3), 198 of which were hypomethylated, compared to 354 that were hypermethylated. The most significantly differentially methylated CpG sites with an associated gene at each timepoint were in the genes TUBB2B (0 y, P adj = 0.0160), LZTS2 (7 y, P adj = 0.0004) and WAC (15–17 y, P adj = 0.00005). All EWAS were assessed for genomic inflation, with lambda values calculated as: 0 y, 0.768; 7 y, 1.301; 15–17 y, 0.794 (Supplementary Fig. 4a–c).

Table 2 The top 20 differentially methylated CpG sites in response to PCE at 0 y (2a, ALSPAC), 7 y (2b, ALSPAC), 15–17 y (2c, ALSPAC), and PCTE at ~27 y (2d, CHDS).

Overlap of CpG sites between 0 y, 7 y, and 15-17 y associated with prenatal cannabis exposure

EWAS data was investigated for overlap of significantly differentially CpG methylated sites between all time points in the ALSPAC cohort (Fig. 1 and Table 3). Between 0 y and 7 y, one CpG site was shared (cg19141861 COQ5). A total of four CpG sites were shared between 7 y and 15-17 y (cg10170214 LZTS2, cg02850468 NPSR1, cg24635468 NT5E, and cg18576588 CRIP2) and a further two CpG sites shared between 15–17 y and 0 y (cg19201719 DOCK8, cg21836627).

Fig. 1
figure 1

Significantly differentially methylated CpG sites in response to PCE that overlap between each ALSPAC timepoint analysed (0 y, 7 y and 15–17 y). The genes in which the CpG sites reside are listed external to the circles.

Table 3 Significantly differentially methylated CpG sites in response to PCE that overlap between each ALSPAC timepoint analysed (0 y, 7 y and 15–17 y).

Genome-wide DNA methylation analysis of prenatal cannabis and co-exposure with tobacco in the CHDS cohort at 27 y

EWAS was undertaken with the CHDS cohort. With Benjamini-Hochberg (BH) false discovery rate (FDR) correction for multiple testing, 24 CpG sites were significantly associated with either prenatal cannabis (PCE) and cannabis and tobacco (PCTE) exposure (whole blood, Table 2d and Supplementary Fig. 5). All differentially methylated CpG sites were hypomethylated. The most significantly differentially methylated CpG site was in the gene GRIN2D (P adj = 0.00006). The EWAS was assessed for genomic inflation, with lambda calculated as 1.714 (Supplementary Fig. 4d).

Genome-wide DNA methylation of prenatal tobacco only exposure (PTE) in the CHDS cohort at 27 y

To assess the effect of PCE only on DNA methylation, vs. any confounding factors that may be present in co-exposure with tobacco, we performed an additional EWAS on PCE and PCTE individuals, compared to non-exposed controls, with the analysis instead based on prenatal tobacco exposure (PTE), that is, corrected for prenatal cannabis exposure and adult cannabis and tobacco use. A total of two CpG sites both within the gene FRMD4A were significantly differentially methylated (P adj < 0.05), with one additional differentially methylated CpG site in the same not reaching the FDR cut-off for genome-wide significance (Supplementary Table 4 and Supplementary Fig. 6). The EWAS was assessed for genomic inflation, with lambda calculated as 0.838 (Supplementary Fig. 4e)

Overlap of significantly differentially methylated CpG sites between ALSPAC and CHDS

Significant differential methylation at one CpG site (cg01666550) within the gene body of LRP5 (chr11:68181086) was shared between the ALSPAC cohort at age 15–17 y (rank 448 of 552, log fold change −0.0066, P adj = 0.0255) and CHDS cohort at age 27 y (rank 13 of 24, log fold change −0.3487, P adj = 0.0011) in whole blood. Five out of the 24 most significantly differentially methylated CpG sites in the CHDS cohort were probes that were only present on the 850 K array, and were absent from the 450 K array (probes ranked 9, 11, 15, 22 and 22, Table 2d).

Pathway analyses

At 0 y, 7 y, 15 y, and 27 y, gene set enrichment indicated that differential DNA methylation was enriched at loci that are annotated to multiple gene ontology pathways associated with neurodevelopment, neurotransmission, regulation of neural pathways, and neuronal structure, as well as molecular pathways that have associations with neurodevelopmental diseases and disorders (Table 4).

Table 4 Gene ontology enrichment analysis for GO terms in response to PCE (ALSPAC) and PCTE (CHDS).

Discussion

To date, limited research has explored molecular changes in offspring associated with prenatal cannabis exposure (PCE). In this study, we performed epigenome-wide association studies of PCE, and prenatal cannabis and tobacco exposure (PCTE), in two independent cohorts at four time points; 0 y, 7 y, 15-17 y, and ~27 y. Our findings show that both PCE and PCTE are associated with genome-wide significant DNA methylation differences at all time points, in both cohorts, at genes, and in pathways associated with neurodevelopment. These results provide molecular evidence in support of the epidemiological associations between PCE and impacts on neurodevelopment. Further, they provide justification for larger studies in this area and support further conversations around the need for guidelines to discuss a reduction in prenatal cannabis exposure.

Prenatal cannabis exposure and differential DNA methylation in the ALSPAC cohort

To assess the impact of PCE on DNA methylation we undertook EWAS at three time points, including birth (0 y), early childhood (7 y), and adolescence (15–17 y), in the ALSPAC cohort. The top PCE-associated differentially methylated CpG site at 0 y was in the gene TUBB2B. This gene is associated with malformations of cortical development [67], and mutations in TUBB2B are associated with developmental delays [68]. The top CpG site displaying differential methylation at 7 y was in the gene LZTS2. Differential methylation of LZTS2 is associated with cross-generational effects of THC exposure in rat nucleus accumbens [48]. Further, LZTS2 is associated with major depressive disorder [69]. Lastly, the top differentially methylated CpG site at age 15 y was in the gene WAC, which has been linked to severe intellectual disability [70].

We detected varying abundances of differentially methylated loci between early childhood and adolescence in these data, which was not unexpected; it is well established that genome-wide levels of DNA methylation change over time and that stochastic methylation changes occur during the ageing process [71], and this pattern reflected in these data. However, the varying methylation abundance with age also means that identifying consistent changes between childhood, adolescence, and adulthood, where differential methylation is expected to be more prominent [71], based on each individual’s environmental exposures, and related to exposures during development, is challenging. Despite this challenge, in these data, we identified shared differential DNA methylation at individual CpG sites between different time points examined in the ALSPAC cohort. Specifically, between individuals at 7 y and 15 y, differential DNA methylation was shared at 4 CpG sites. These sites were in or near the genes LZTS2 (discussed above), NPSR1, NT5E, and CRIP2. NPSR1 is associated with educational attainment [72] and amyloid beta measurement [73]. NT5E may have a role in excitatory neurotransmission [74]. CRIP2 currently has no known roles in neurodevelopment or neurodevelopmental disorders, however, is associated with heart development [75, 76] which is intriguing as growing evidence suggests that PCE is associated with impacts on heart development [77], and further, adult cannabis use is known to impact DNA methylation in biological pathways associated with cardiomyopathy [78]. Next, we detected significant differential DNA methylation shared between individuals at 0 y and 15 y at 2 CpG sites, in response to PCE. These sites were in the gene DOCK8, and an un-annotated CpG, with probe ID cg21836627. DOCK8 is associated with anxiety [79], opioid dependence [80], and autism spectrum disorder (ASD) [81]. The probe cg21836627 is not within or directly associated with a gene (chr6:30,069,797). However, this CpG site is 877 bp upstream of TRIM31 (chr6:30,070,674-30,080,867), which is associated with cognitive performance/intelligence [82, 83], memory [83], unipolar depression [83, 84], ASD and schizophrenia [85], all of which are epidemiologically associated with PCE. Differential methylation at one site was shared between 0 y and 7 y, and this site was in the gene COQ5. COQ5 is a methyltransferase enzyme [86] that is located in the mitochondria. COQ5 has associations with educational attainment [87], global developmental delay [88], and intellectual disability [89]. Taken together, these findings imply an impact of PCE on DNA methylation at genes with roles in neurodevelopment, across childhood and adolescence.

Given these data indicate an impact on genes involved in neurodevelopment, we undertook gene ontology enrichment analyses to determine whether this pattern was supported at the molecular pathway level. Our data indicated that differential methylation is significantly enriched at pathways associated with neurodevelopment, but also neurotransmission, regulation of neural pathways, and neuronal structure, at 0 y, 7 y, and 15-17 y. The signature of enrichment, in response to prenatal exposure, supports the principle that many neurodevelopmental disorders and diseases have developmental origins, and is also supportive of the epidemiological associations between PCE and neurodevelopment. For example, PCE is associated with consequences for neurodevelopment [25], potentially via disruption of the developing central nervous system by cannabinoid exposure [36, 90], which could impact neuron development and growth [91]. This potential impact on the developing nervous system may be linked to the association between PCE and an increase in psychosis and psychopathology in middle childhood [24, 92], and the greater risk of psychiatric disorders into adolescence [93], including affective symptoms and ADHD [94]. Further, PCE is associated with impacts on memory and behaviour in children and adolescents, and this impact persists into adulthood [26]. Thus, along with the frequent and wide-ranging reviews on the association between PCE and neurodevelopment and behaviour across the lifespan [39, 95], and recent evidence of an impact of PCE on DNA methylation at genes involved in autism in animal models [49], taken together, these data support a role for DNA methylation in the biological response to PCE.

Prenatal cannabis and tobacco exposure and differential DNA methylation in the CHDS cohort

In the CHDS cohort, we detected 24 significantly differentially methylated CpG sites, in individuals exposed to cannabis and tobacco during development, compared to controls. The most significantly differentially methylated sites were in the genes: GRIN2D, which is a glutamate receptor gene that functions in synaptic transmission and is involved in long-term potentiation/learning and memory [96, 97]; ZNF362, which has associations with individual schizophrenia symptom severity [98]; SPATA2, which is involved in fertilisation, but also intelligence [99], attention deficit hyperactivity disorder (ADHD) and ASD [100]; SLC9A3, an Na+/H+ exchanger, classes of which have an emerging role in developmental brain disorders [101], and finally; HDAC4, which has been associated with ADHD, uni- and bipolar depression, ASD and schizophrenia [102], educational attainment [87], as well as a prominent role in Brachydactyly Mental Retardation Syndrome, symptoms of which commonly include cognitive delay [103, 104] and autistic-like behaviours [105].

Gene ontology enrichment analysis for CHDS confirmed that differential methylation of genes involved in neurodevelopment and neurodevelopmental disorders were supported at the pathway level. We identified enrichment for the terms neuroepithelial cell differentiation, neural tube patterning, and neuron projection guidance, indicating enrichment for genes associated with neurogenesis and neural connections. Additionally, differential methylation is enriched at genes associated with corpus collosum development. The corpus collosum is consistently found to be reduced in size in patients with ASD [106] and schizophrenia [107], and is thought to have a role in neurodevelopmental disorders such as ASD and ADHD [108]. Further enriched is the GO term ‘positive regulation of peptidyl-tyrosine autophosphorylation’. This GO term is enriched in studies of ASD [109] and also neurodevelopmental disorders/neuromotor development [110]. Finally, loci within the Notch signalling pathway are enriched in these data. Notch is integral in nervous system development and influences plasticity and learning in the adult central nervous system [111]. Altered Notch signalling has been associated with ASD [112], schizophrenia and bipolar disorder [113]. Thus, taken together, pathway enrichment analyses support the finding that the most significantly differentially methylated loci function in neurodevelopment and neurodevelopmental disorders, as well as provide molecular evidence in support the epidemiological associations of a link between PCE and these disorders.

To understand whether an analysis of cannabis and tobacco exposure (PCTE), in combination with cannabis-only exposure (PCE), was skewing these data in favour of tobacco exposure-driven differential methylation, we assessed the same DNA methylation data and asked the extent of differential DNA methylation in response to prenatal tobacco exposure alone (PTE), corrected for PCE and adult cannabis and tobacco use. This EWAS detected genome-wide significant DNA methylation at two CpG sites, both within the gene Ferm domain containing 4 A (FRMDA4). Previous studies have found FRMDA4 to be differentially methylated in response to prenatal tobacco exposure, with the strongest effects observed in sustained prenatal tobacco smoking [114]. Thus, while we were unable to control for PTE in our CHDS analysis, our EWAS observations of both PCTE and PTE support a distinct pattern of differential DNA methylation that is driven by prenatal exposure to cannabis, in adult offspring.

Overlap in differentially methylated CpG sites between the ALSPAC and CHDS cohorts

One CpG site in the gene LRP5 was differentially methylated in CHDS adults and in the eldest time point (15–17 y) in the ALSPAC cohort. LRP5 has not yet been associated with PCE, and therefore may represent a novel target for PCE. The LRP5 gene codes for low-density lipoprotein receptor-related protein 5, and differential methylation of CpG sites within this gene have previously been associated with passive smoking exposure [115], and implicated in the proposed link between cigarette smoking habit and metabolic syndrome [116]. Moreover, it has associations with individual educational attainment [72] and mathematical ability [87], which supports further investigation of this gene for its role in the molecular link between PCE and offspring neurodevelopment.

Strengths and challenges associated with cross-cohort DNA methylation analyses

It is important to acknowledge the challenges associated with our study. Firstly, while our study identifies DNA methylation changes at loci and pathways that support the epidemiological associations of PCE, at all time points, which reflects the strength of our hypothesis, our study consists of a relatively small number of individuals with prenatal exposure to cannabis, and as such, we must emphasise the need for replication in a more highly powered study cohort. Secondly, using independent data from multiple cohorts, as we have done here, can pose additional challenges, including those associated with attempting to validate differential methylation between samples at different ages, and differences between workflows, including normalisation and quality control steps, where raw data is not available. Further, combining and validating across age points and independent datasets is even more challenging when one considers confounding factors such as tobacco, alcohol, and other drug use; these common confounders lead to heterogeneity in results cross-cohort, and along with sociodemographic factors, makes uniformity across studies hard to achieve [117]. This is pertinent when we consider the genomic inflation values for the 15–17 y time point; the lambda value here indicates a slight inflation. We predict that this is due to the available variables from the ALSPAC study; clinical data was only collected at 7 y, meaning we were unable to correct for personal tobacco smoking status, which is relevant at this time point. We suggest that this might be a possible explanation for the inflation value and the increase in the number of differentially methylated probes observed at this time point, however, without access to more clinical data, we are unable to correct for this in our EWAS model. Therefore, while identifying independent cohorts that reliably and consistently measure all possible confounding variables is not possible or feasible, it nevertheless remains a limitation of studies such as this. We also acknowledge the limitations that surround the necessary use of DNA from blood samples in this study, which we are aware may or may not reflect changes in the brain. Furthermore, due to the aforementioned limitations around study size, it was necessary to analyse PCE in combination with tobacco (PCTE) for one cohort, and we acknowledge that, despite controlling for tobacco exposure, we cannot ignore this as a potential confounder in these data. Nevertheless, while both cannabis and tobacco can result in different changes to the methylome, and while there is a large proportion of PCE individuals who have co-exposure with tobacco, we demonstrated that many significant CpG sites are unique to prenatal cannabis exposure in the CHDS. Moreover, we demonstrate that each dataset, whether PCE or PCTE, is enriched for similar/shared biological pathways, giving us confidence that our data are supportive of a biological role for DNA methylation in the association between PCE and neurodevelopment, and that further investigation in larger cohorts is required.

Conclusion

These data provide hypothesis-building insights into molecular alterations associated with prenatal cannabis exposure. The association between DNA methylation and genes involved in neurodevelopment and neurodevelopmental disorders in the present study may suggest a role for DNA methylation in the link between PCE and neurodevelopmental outcomes in exposed children, as well as provide a suite of reference genes for future investigations into the molecular mechanism of the impact of PCE on individuals. While both of our two independent cohorts contain modest sample numbers of exposed individuals, finding suitable cohorts with defined and comprehensive data on PCE is a difficult task, and the biological relevance of these data to the phenotypes being studied provides justification that PCE should be investigated strategically and specifically, in much larger sample sizes. Given the increasing prevalence of PCE, further research is required to identify the underlying mechanisms that may underlie the effects of PCE on neurodevelopment, in order to start to establish whether PCE is causative for neurodevelopmental impacts. Moreover, these data will help to fill current gaps in our knowledge around how drug exposures during development may lead to lifelong molecular disturbances that predispose an individual to poor neurocognitive outcomes in early childhood, adolescence, and beyond [39].