Exposure of young animals to general anesthesia causes neurodegeneration and lasting behavioral abnormalities; whether these findings translate to children remains unclear. This study used a population-based birth cohort to test the hypothesis that multiple, but not single, exposures to procedures requiring general anesthesia before age 3 yr are associated with adverse neurodevelopmental outcomes.
A retrospective study cohort was assembled from children born in Olmsted County, Minnesota, from 1996 to 2000 (inclusive). Propensity matching selected children exposed and not exposed to general anesthesia before age 3 yr. Outcomes ascertained via medical and school records included learning disabilities, attention-deficit/hyperactivity disorder, and group-administered ability and achievement tests. Analysis methods included proportional hazard regression models and mixed linear models.
For the 116 multiply exposed, 457 singly exposed, and 463 unexposed children analyzed, multiple, but not single, exposures were associated with an increased frequency of both learning disabilities and attention-deficit/hyperactivity disorder (hazard ratio for learning disabilities = 2.17 [95% CI, 1.32 to 3.59], unexposed as reference). Multiple exposures were associated with decreases in both cognitive ability and academic achievement. Single exposures were associated with modest decreases in reading and language achievement but not cognitive ability.
These findings in children anesthetized with modern techniques largely confirm those found in an older birth cohort and provide additional evidence that children with multiple exposures are more likely to develop adverse outcomes related to learning and attention. Although a robust association was observed, these data do not determine whether anesthesia per se is causal.
Exposure of young children to procedures requiring general anesthesia is associated with later development of learning disabilities and attention-deficit/hyperactivity disorder. However, data supporting this association were obtained in an era when older anesthetic agents were used and the current standards of monitoring were not used.
In a more recent population-based birth cohort in which contemporary anesthetic techniques were used, multiple, but not single, exposures before age 3 yr are associated with increased frequency of learning disabilities and attention-deficit/hyperactivity disorder. However, single exposures are associated with deficits in some domains of academic achievement tests.
Although these data confirm previous reports and thus demonstrate a robust association, they do not permit evaluation of whether anesthesia per se is causal.
SUBSTANTIAL preclinical evidence shows that exposure to anesthetics changes the developing brain.1–4 These changes are linked to long-term learning and behavioral deficits in various animal models, including nonhuman primates.1,5–7 To explore whether these findings translate to humans, several studies have investigated the association of receiving procedures requiring general anesthesia with long-term neurodevelopmental outcomes in children. Multiple exposures to procedures requiring general anesthesia affect learning and behavior in most retrospective studies.8–14 Some, but not all, human studies also find an association between single exposures and a variety of outcomes related to learning and behavior.8,12,13,15–24 This heterogeneity of results is perhaps not surprising given the wide range of study designs and outcomes used among these studies. Indeed, if exposure is associated with changes in specific domains of cognition or behavior, results should depend on the outcomes examined.25,26
A series of previous studies based on a birth cohort of children born in Olmsted County, Minnesota, found an association between multiple, but not single, exposures to procedures requiring general anesthesia before ages 2 to 4 yr and subsequent learning disabilities (LDs) and attention- deficit/hyperactivity disorder (ADHD), with multiple exposures associated with approximately a doubling in the incidence of both outcomes.9–11 Multiple, but not single, exposures also impaired performance on school-administered group tests of cognitive ability and academic performance.9 The particular impact of multiple exposures is consistent with emerging data from animal studies.27,28
Nonetheless, these previous birth cohort studies9–11 had several limitations, as extensively discussed at the time of their publication29 and subsequently. As with many frequently cited studies of this issue,18,21,22,30 children in the previous cohort (born 1976 to 1982) were anesthetized before the transition from halothane to sevoflurane and the routine adoption of pulse oximetry and capnography, which does not reflect contemporary anesthesia practice. In addition, the number of exposed children was relatively modest in the original studies, making it difficult to determine whether exposure is associated with a particular pattern of LDs or whether even single exposures may affect some domains of cognitive function. Finally, given the considerable potential limitations of observational studies and the relatively small number of children with the outcomes of interest in the previous studies, it is critically important to confirm or refute the fundamental observations of this first series of studies, which have proved to be one of the drivers of research in this field.
The aim of this study was to test the hypothesis that multiple, but not single, exposures to procedures requiring general anesthesia before age 3 yr (i.e., before the child’s third birthday) are associated with adverse neurodevelopmental outcomes, including LD, ADHD, need for individualized education programs (IEPs) for emotional/behavioral and speech/language disorders, and impaired performance in group-administered ability and achievement tests. This hypothesis was evaluated using a new population-based birth cohort (born between 1996 and 2000) in which children were anesthetized with largely contemporary anesthetic techniques.
Materials and Methods
This study was approved by the Mayo Clinic (Rochester, Minnesota) and Olmsted Medical Center (Rochester, Minnesota) institutional review boards. The parents of all of the children included in this analysis had provided consent for the use of their child’s medical records in research.
Previous articles described the methods used to assemble the study cohort of children used for this analysis.31,32 To summarize, a birth cohort of children born from January 1, 1994, to December 31, 2007, in Olmsted County, Minnesota, was identified. To create the study cohort, children in the birth cohort born from January 1, 1996, to December 31, 2000, were first selected. This time range was chosen to allow sufficient duration of follow-up to ascertain the outcomes of interest and to coincide with widespread clinical replacement of the volatile anesthetic halothane with sevoflurane. Next, children who (1) moved from Olmsted County before their third birthday; (2) died before their fifth birthday; and (3) were not enrolled in the local school district at age 5 yr were excluded, because relevant outcomes were not available.
A propensity-matching strategy was then used to select children who met eligibility criteria for inclusion in the study cohort, using multiple variables to calculate sex-specific propensity scores for receiving single or multiple exposures to general anesthesia (GA) and procedures before age 3 yr. This age was chosen as comparable to the previous work by us and others and to provide a balance between a presumed window of vulnerability and including sufficient numbers of children for statistical analysis. However, it should be acknowledged that evidence to support this particular age as representing a threshold of risk is limited. Based on quintiles of the observed distribution of propensity scores, 50 sex-specific, propensity-matched strata were defined and used to select children for study. After propensity matching, the study cohort consisted of 126 children who had two or more exposures, 466 exposed only once, and 465 children unexposed (n = 1,057 children). Seventy children exposed to GA were excluded from the study cohort because an appropriate propensity-matched control could not be identified. Analyses for potential biases in study cohort selection revealed the following: (1) the characteristics of children born during the study period who were and were not enrolled in the school district were similar; (2) characteristics of children in the study cohort were similar across exposure groups, suggesting successful propensity matching; and (3) characteristics of children exposed to procedures requiring anesthesia who were and were not included in the study cohort were similar.32
Using medical and school records, four learning/behavioral outcomes were sought: (1) LDs, including three subtypes of LDs: reading, mathematic, and written language; (2) ADHD; (3) receipt of an IEP for speech/language or emotional/behavioral disorders; and (4) performance in the group-administered tests of ability and achievement. Using record-linkage services from the Rochester Epidemiology Project,33 we performed a manual review of medical records for each of the 1,057 children from two major medical facilities in Olmsted County, Mayo Clinic and Olmsted Medical Center. Through a contractual agreement with independent school district 535, which serves Rochester, Minnesota, the enrollment status and cumulative school records were also available for review for all of the children. Two of the four outcomes, receipt of an IEP and group-administered ability and achievement test scores, are indexed in school record systems for automated retrieval. LD and ADHD were ascertained through manual review of the school and medical records.
LDs (Reading, Math, and Written).
LD cases were ascertained according to previously described research criteria10,34,35 based on one of two formulas: an intelligence quotient (IQ)–achievement discrepancy formula and a low achievement formula. Children were considered to have an LD if they met research criteria for at least one of the three LD subtypes (reading, written language, and mathematics disabilities) determined by either of the formulas using contemporaneous IQ and achievement scores.
Attention-deficit Hyperactivity Disorder.
ADHD cases were ascertained based on criteria previously described and validated.11,36 The criteria rely on documentation within medical and school records of ADHD diagnoses and questionnaires. Children were identified as ADHD cases if their records included either a clinical diagnosis or positive ADHD questionnaire. ADHD questionnaire results were considered positive only when both parent and teacher questionnaires were positive. The exclusion criteria specified in the Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV), were followed (i.e., ADHD was not present if children had a diagnosis of a psychotic disorder, schizophrenia, severe intellectual disability, or pervasive developmental disorder). DSM-IV criteria for ADHD (with six or more separate entries in the medical or school records that were consistent with DSM-IV criteria) were not used, because previous work in the Olmsted County population indicated that use of these criteria did not alter case ascertainment.11,36
IEPs for Speech/Language and Emotional/Behavioral Disorder.
Group-administered Ability and Achievement Tests.
Since 2001, when the oldest members of the study cohort enrolled in independent school district 535, several group-administered tests have been implemented. Of these, the Stanford/Otis-Lennon School Ability Test (OLSAT) was administered every 2 to 3 yr for the longest period of time (from 2003 to 2013). The Stanford measures academic performance in several domains, whereas the OLSAT measures cognitive abilities. Test results were not available for children attending private schools or who were home schooled, such that 79% of study cohort members had at least one Stanford/OLSAT while enrolled in independent school district 535. The last available Stanford/OLSAT scores were abstracted as a measure of academic performance and cognitive ability. Achievement scores in the subdomains of reading, mathematics, language, and spelling were analyzed and expressed as age-related national percentile ranks using z scores transformed from these ranks.
Children with severe intellectual disability are not at risk for the four learning/behavioral outcomes and were not considered for additional analysis. Identification of these children was based on research criteria,37 defined as meeting at least one of the following criteria: (1) at least one full-scale IQ score of 50 or less at or before 8 yr of age; (2) a diagnosis of severe intellectual disability or similar diagnoses documented in the medical or school records; or (3) documentation in the medical or school records indicating ineligibility of a child for neurodevelopmental tests due to severe cognitive disabilities.
In this propensity-matched study cohort, children exposed to procedures requiring anesthesia were matched with unexposed children based on their propensity for receiving GA, as described previously.32 Each child was at risk from birth until he/she met criteria for LD (including three subtypes of LD), ADHD, or IEPs. For these three outcomes, cumulative frequencies were calculated according to the Kaplan–Meier method, with censoring at the date of emigration, death, last follow-up, or the end of the study on December 31, 2014. Although the original intent was to include an equal number of exposed and unexposed individuals within each propensity-matched strata, this was not achieved for all strata. For the calculation of the Kaplan–Meier estimates, strata-specific weights were used so that the resulting estimates would be reflective of a 1:1 matching of exposed and unexposed individuals. For the prespecified primary analyses, separate stratified proportional hazards regression analyses were performed to assess the association of the number of anesthesia exposures (none, single, and multiple) with LDs, ADHD, or IEPs. In prespecified secondary analyses, stratified proportional hazards regression analyses were performed to assess whether the total duration of anesthesia exposures before age 3 yr (rather than number of exposures) is a risk factor for the aforementioned neurodevelopmental outcomes. For these analyses, separate models were used with cumulative duration of GA quantified as a continuous variable and as a categoric variable using 30-min intervals.
Two post hoc exploratory analyses were also performed with LD and ADHD as outcomes. The first explored whether the total duration or the total number of exposures (or both) was associated with the outcome by including both cumulative duration of GA and the number of exposures in the model. The second examined the same question by restricting analysis to only those children who had similar moderate total durations of exposure to GA (between 61 and 120 min).
National percentile ranking scores in the group-administered achievement tests were transformed to z scores using a probit transformation. The resulting z scores were compared across the three exposure groups (none, single, and multiple) using generalized estimating equations with robust standard error estimates.
The propensity-matching strategy was designed to account for factors such as possible differences in health status between the multiply exposed, singly exposed, and unexposed children. Since all of the individuals within a given propensity strata were not matched exactly on all covariates, two adjusted analyses were performed. One included four covariates that are known to influence the incidence of LDs or ADHD: sex, birth weight, gestational age, and mother’s education. These analyses were also intended to facilitate comparisons with our previous work, because these four factors were also used in our previous analyses.9–11 The other adjusted analysis also included socioeconomic status as a fifth covariate,32,38 because socioeconomic status has also been reported as a risk factor for LDs or ADHD. In addition, in analyses seeking potential factors moderating the effects of exposure, four multivariate logistic regression models were created for each outcome that included all of the significant variables from univariate analyses and the interaction terms between exposure (multiple, single, and none) and sex, gestational age, birth weight, and socioeconomic status, respectively, allowing for assessment of the significance of the interaction terms after adjustment for potential confounders.
Statistical power for the primary analyses assessing differences in the incidence of LDs and ADHD across exposure groups was determined based on the number of events observed for each of these outcomes. Based on the observed number of events for LDs, the smallest hazard ratio (HR) that can be detected with statistical power (two-tailed, α = 0.05) of 80% is 1.7 for the comparison of a single exposure versus no exposure, and 2.1 for the comparison of multiple exposures versus no exposure. For ADHD, the minimum detectable HR is 1.6 for the comparison of a single exposure versus no exposure and 2.1 for the comparison of multiple exposures versus no exposure.
The effect estimates and 95% CIs are presented for comparisons between multiply and singly exposed children groups and the unexposed children. In all cases, two-tailed P values are reported with no adjustment for multiple comparisons. P values less than 0.05 were considered significant. All of the data were analyzed using SAS 9.3 (SAS Institute, Inc., USA).
The propensity-matched study cohort consisted of 1,057 children, of which 21 (2%) were excluded from additional analysis because of severe intellectual disability. These included 10 multiply exposed children, 9 singly exposed children, and 2 unexposed children. Therefore 1,036 children were analyzed, including 116 multiply exposed, 457 singly exposed, and 463 unexposed children.
Anesthesia and Surgery Characteristics
The 573 children exposed to GA underwent 760 procedures before their third birthday (table 1). The American Society of Anesthesiologists physical status was significantly higher among multiply exposed children compared with singly exposed children (P < 0.001), although most (86%) children with multiple exposures were physical status I to II. The age at first exposure was significantly higher in those children with single exposures (P < 0.001), with 68% occurring after 1 yr of age. The mean total duration of anesthetic exposure was approximately threefold higher in multiply exposed children, with 52% of multiply exposed children exposed to greater than 2 h of anesthesia compared with 12% of singly exposed children.
Otorhinolaryngologic procedures accounted for the greatest number of procedures, including the majority of those performed in children greater than 1 yr of age (table 2), with cardiovascular and neurologic procedures accounting for 3%. Most children received sevoflurane and nitrous oxide, with a small minority receiving midazolam premedication or intravenous hypnotics (table 3). During this transitional time in practice, approximately one in six children received a halothane induction, with most of these then receiving isoflurane after induction.
A total of 142 children (13.7%) developed any LD, including 55 (11.9%), 57 (12.5%), and 30 children (25.9%) who were unexposed, singly exposed, and multiply exposed, respectively. The estimated cumulative frequency of any type of LD at 18 yr of age by Kaplan–Meier analysis was 14.1% (95% CI, 10.8 to 18.2%) for unexposed children, 13.5% (95% CI, 10.5 to 17.1%) for singly exposed children, and 29.9% (95% CI, 21.6 to 40.5%) for multiply exposed children (fig. 1A). In both adjusted and unadjusted analyses, multiple, but not single, exposures were significantly associated with an increased frequency of LDs, including the analysis that adjusted for socioeconomic status (table 4). In secondary analyses, the frequency of an LD was also increased with longer cumulative duration of anesthetic exposures expressed as a continuous variable. When exposure duration was analyzed as a categoric variable, durations of 90 to 120 min were significantly associated with LDs in fully adjusted analysis (HR = 1.82; P = 0.039), with a similar magnitude of effect estimated for durations of more than 121 min (HR = 1.62; P = 0.080). Multiple exposures were also associated with increased frequencies of each of the three subtypes of LDs (reading, mathematics, and written language) when analyzed separately (table 5).
In exploratory analyses, when both number and duration of exposures (as a continuous variable) were included in the models, the number of exposures was still a significant factor but not duration (table 4). When analysis was restricted to children with a similar moderate total duration of exposure and controls (61 to 120 min), multiply exposed children had an increased frequency of LD compared with singly exposed children (table 4).
A total of 165 children (15.9%) developed ADHD, including 54 (11.7%), 75 (16.4%), and 36 children (31.0%) who were unexposed, singly exposed, and multiply exposed, respectively. The estimated cumulative frequency of ADHD at 18 yr of age by Kaplan–Meier analysis was 11.7% (95% CI, 9.3 to 14.8%) for unexposed children, 17.7% (95% CI, 14.3 to 21.9%) for singly exposed children, and 33.6% (95% CI, 25.1 to 44.0%) for multiply exposed children (fig. 1B). In both adjusted and unadjusted analyses, multiple, but not single, exposures were significantly associated with an increased frequency of ADHD, including the analysis that adjusted for socioeconomic status (table 6). The frequency of ADHD was also increased, with longer cumulative duration of anesthetic exposures expressed as a continuous variable. When exposure duration was analyzed as a categoric variable, durations of 90 to 120 min were significantly associated with ADHD in fully adjusted analysis, and durations of more than 121 min were significant only in unadjusted analysis.
In exploratory analyses, when both number and duration of exposure were included in the models, the number of exposures was still a significant factor, but not duration (table 6). When analysis was restricted to children with a similar moderate total duration of exposure (61 to 120 min), multiply exposed children had an increased frequency of ADHD compared with singly exposed children (table 6).
Consistent with other literature,39 LD and ADHD frequently co-occurred. Children with both conditions often have more severe symptoms than children with only either condition alone.40,41 Of the 142 children with an LD, 72 (51%) met criteria for both LD and ADHD, whereas 70 met criteria for LD only. Of the 165 children with ADHD, 72 (44%) met criteria for both LD and ADHD, whereas 93 met criteria for ADHD only. The co-occurrence of LD and ADHD was more common among those children who were multiply exposed (among children with either LD or ADHD, 57% of multiply exposed children, 27% of singly exposed children, and 23% of unexposed children had both; P < 0.001, chi-square). This raises the possibility that LD and ADHD associated with multiple anesthesia exposures may be more severe or that these children are at particular risk for the combination of ADHD and LD.
IEPs for Emotional/Behavioral Disorders and Speech/Language.
Ninety-three children (9%) had an IEP for speech/language (SL), and 32 (3%) had an IEP for emotional/behavioral disorder (EBD). Exposures were not associated with the need for an IEP-SL or IEP-EBD (table 7) in either unadjusted or adjusted analyses.
Group-administered Ability and Achievement Tests.
The median (minimum, maximum) age at the time of the last available Stanford/OLSAT test was 14 yr (7, 15 yr) and did not differ significantly across exposure groups. The total cognitive ability score (OLSAT) differed according to exposure status (table 8), with multiple, but not single, exposures associated with a significant decrease (approximately −0.26 SD in the fully adjusted model). Scores in all of the subdomains differed significantly according to exposure status (table 8), with the exception of spelling in the fully adjusted model (P = 0.087). The associations with multiple exposures were significant for all subdomains, with effect sizes ranging from −0.23 to −0.36 SD in the fully adjusted model. Associations with single exposures were significant in the subdomain of reading for all of the models (approximately −0.17 SD) and for language (approximately −0.15 SD) for all but the fully adjusted model (P = 0.054). Single exposures were not significantly associated with the differences in the subdomains of mathematics or spelling.
Sex, gestational age, birth weight, and socioeconomic status did not moderate the association between exposures and LD or ADHD, because when interaction terms were included in the fully adjusted model, they were not significant (table 9). For example, the risk of LD and ADHD associated with exposure did not differ between boys and girls. There were also no significant interaction terms in the analysis of OLSAT scores. For the subdomains of the Stanford Achievement Test, significant interaction terms were obtained for sex and single exposures in mathematics and for multiple exposures according to gestational age for language. However, given the multiple interaction terms sought across multiple outcomes, the significance of these isolated interaction terms is unclear.
Physical Status and Indication for Procedures.
A complete listing of chronic conditions and surgical procedures received by children with multiple exposures to anesthesia is provided as Supplemental Digital Content 1 (http://links.lww.com/ALN/B473). If physical status is used as a crude measure of burden of illness, for children exposed to multiple anesthetics, 66 (66%) of the 100 children classified as physical status I or II did not develop an LD or ADHD, and 8 (50%) of the 16 children classified as physical status III or IV did not develop an LD or ADHD (P = 0.26, Fisher’s exact test; Supplemental Digital Content 2 [http://links.lww.com/ALN/B474], which is a table listing the maximum ASA physical status among exposed children who had an LD and/or ADHD). Of the 42 multiply exposed children who developed LD or ADHD, 23 (55%) were assessed as healthy (physical status I), 11 (26%) were classified as physical status II, and 8 (19%) were classified as physical status III or IV, respectively.
At the request of the peer reviewers for this article, additional post hoc sensitivity analyses were conducted, excluding some children from analysis whose underlying conditions could be associated with specific outcomes. Excluding children who received myringotomies had almost no effect on the relationship between exposure and LD (Supplemental Digital Content 3, http://links.lww.com/ALN/B475, which is a table providing the results of this sensitivity analysis). In other models, children receiving neurologic or cardiovascular surgery (a total of 16 children) were excluded from the analyses of group-administered ability and achievement tests (Supplemental Digital Content 4, http://links.lww.com/ALN/B476, which is a table providing the results of this sensitivity analysis). As a result, the associations between multiple exposures and the OLSAT total battery and the spelling subtest of the Stanford now did not reach statistical significance in the fully adjusted analyses, whereas the association between single exposures and the language subtest of the Stanford in the fully adjusted model was now statistically significant. Excluding children receiving neurologic or cardiovascular surgery had little effect on the relationship between exposure and LD or ADHD (Supplemental Digital Content 5, http://links.lww.com/ALN/B477, which is a table providing the results of this sensitivity analysis).
Outcomes in Children Who Were and Were Not Included in the Study Cohort
Seventy children who were enrolled in independent school district 535 and were exposed to GA (50 singly exposed and 20 multiply exposed) were not included into the study cohort after propensity matching due to the lack of representation of all exposure groups in their sex-specific propensity strata, a potential source of bias. After excluding children who developed severe intellectual disability, 67 children were exposed to GA but were not selected into the propensity-matched study cohort. Comparing those who were and were not included in the study cohort, no differences were observed in the frequencies of LD, ADHD, need for an IEP, or scores in the Stanford/OLSAT test (Supplemental Digital Content 6, http://links.lww.com/ALN/B478, which is a table showing this comparison).
Multiple, but not single, exposures before age 3 yr are associated with increased frequencies of LD and ADHD. Even single exposures were associated with decrements in other domains, including academic achievement. The results from this study cohort (born between 1996 and 2000) are largely consistent with and extend the main results from a previous cohort (born between 1976 and 1982).9–11
Comparison with the Previous Cohort
Although the designs of the current and previous studies are similar, two major differences are notable. First, the current study used a propensity-matched design to select a comparator group of unexposed children. The sampling strata were defined for each sex based on the combination of two propensity scores (single exposure and multiple exposures to anesthesia). In two previous articles,9,11 a single propensity score was calculated for anesthesia exposure and used as a stratification variable or to generate matched unexposed controls using a somewhat different method.9 Thus, the characteristics of the unexposed comparator group differ among studies. The second difference relates to advances in anesthesia practice after 1985, including the following: (1) availability of sevoflurane and isoflurane, which largely replaced halothane; (2) adoption of pulse oximetry and capnography as standard monitors; and (3) increased use of subspecialty-trained pediatric anesthesiologists at Mayo Clinic.
LD and ADHD.
Despite these differences, the hazard ratios of LD/ADHD for single and multiple exposures were similar between the previous and current cohorts9,11 (Supplemental Digital Content 7, http://links.lww.com/ALN/B479, which is a table showing this comparison9,11 ). This finding suggests the robustness of this fundamental observation despite interval changes in anesthetic practice and is consistent with emerging evidence in animal models that, for a given duration of anesthesia, multiple exposures may be associated with greater injury.27,28 As in the previous cohort, the association between single exposures and ADHD approached but did not reach statistical significance,11 which may reflect either a true lack of association or insufficient statistical power to detect this effect size.
The overall frequency of LD was decreased and the frequency of ADHD was increased in the study cohort compared with the previous cohort (from 20 to 14% for LD and from 8 to 16% for ADHD).10,11 Both the absolute frequencies and the trends over time are consistent with national data showing recent declines in LD rates.42 This may in part reflect a 2004 change in federal law regarding the definition of LD, which represents a potential limitation of using LD as an outcome.25 However, the definitions of LD used by the state of Minnesota (and in this study) remained generally consistent over the time of both studies. In contrast, the proportion of children diagnosed with ADHD continues to increase in the United States.43 The fact that these trends are reflected in the results of these two cohorts argues for the consistency and validity of LD and ADHD ascertainment.
Individualized Education Program.
Exposure was not associated with the need for either an IEP-SL or IEP-EBD, suggesting that the more pronounced deficits necessary to generate IEPs in these areas do not contribute to observed increases in LD or ADHD risk. These findings contrast with those in the previous cohort, which revealed an association of multiple exposures with IEP-SL (but not IEP-EBD).9 It is possible that the criteria used by the schools to generate IEPs may have changed.
The current analysis replicated the previous finding that multiple, but not single, exposures were associated with decreased total cognitive scores (mean decrease of 0.38 SD for the adjusted model in the previous cohort9 ). Regarding achievement tests, multiple exposures were associated with significant decreases in only mathematics scores in the previous cohort, which may reflect a limited number of available scores. Unlike the previous cohort, the current analysis found significant decreases in reading and language subdomains even among singly exposed children. This may reflect differences in the group-administered tests (measuring similar domains) used in the assessment of the two cohorts (Stanford/OLSAT in the current and California Achievement Test in the previous cohorts) or the greater numbers of available test scores in the current cohort, enhancing the power to detect differences. Any potential effects of single exposure on these domains were not sufficient to cause formally diagnosed LDs, perhaps because they are not sufficiently severe to affect the proportion of children falling within the lower tail of the achievement distribution.
Comparison with Other Studies
Many learning and behavioral outcomes have been used in other studies.25,26 This multiplicity of outcomes, as well as other differences in experimental design, makes direct comparisons among studies problematic, because the interpretation of studies may depend critically on the particular outcome measure analyzed.26 Nonetheless, most observational studies that specifically analyze multiple exposures find associations with adverse developmental outcomes,8–13 although these effects may be modest. In the most recent, two studies from Canada used the Early Developmental Index, a questionnaire completed by kindergarten teachers, as the outcome.12,13 Both found that exposure was associated with a small but significant reduction in Early Developmental Index scores. When issues of statistical power are taken into account, both also found that the effects of single and multiple exposures were similar. Neither study found evidence of increased risk with exposure at younger ages. Of interest, the predominant Early Developmental Index domains (broadly divided into categories of general/language/cognitive development and well-being/social competence/maturity) affected by exposure differed between the two studies. Although the Early Developmental Index is correlated with overall academic success in the general population,44 its potential relationship to the specific outcomes examined in our study is not known, such that it is difficult to directly compare results.
The literature is less consistent regarding associations with single exposures. Although some studies found impaired performance in a variety of domains,12,13,15–20 others do not,8,21,22 including preliminary results from the only available randomized trial23 and a sibling-matched study using detailed neuropsychologic assessments.24 This heterogeneity may result in part from the wide variety of outcomes studied. In the current analysis, single exposures were associated with reduced scores in assessments of reading and written language, consistent with previous studies using more sensitive measurements of related domains.16,18 On the other hand, three studies of Scandinavian populations investigating national group-administered achievement tests (which may not be comparable with the achievement test subdomains reported in the current study) found either no or very small differences in average achievement scores between exposed and unexposed groups.14,21,22 All found a higher proportion of exposed children who had either failed to attain a test score or performed below a certain percentile of national norms. Because we observed relatively selective impairment of some but not other subdomains, in addition to finding no significant association of single exposures with a measure of ability (the OLSAT), these results may well be consistent, because anesthetic effects on selective subdomains may not be reflected in overall average performance assessed in these other studies.
This study has several limitations common to all observational studies in this area.10,45,46 The most important is that unidentified confounders may affect outcomes. A propensity-matched design was used in an attempt to account for potential confounding resulting from differing health status among children across exposure groups, and children in the three exposure groups were similar in terms of their comorbidity clusters.32 However, this approach may still fail to fully capture relevant confounders. For example, cardiac and neurologic surgery (3% of procedures in this study) can be associated with abnormalities in neurodevelopment.47,48 Sensitivity analyses eliminating these children affected the statistical significance of some domains of ability and achievement testing (in both directions), so that we still cannot exclude confounding effects on these domains; analyses of LDs and ADHD were not affected. Myringotomy was frequent, and it is possible that hearing deficits could contribute to LDs. However, there is little evidence that conditions such as otitis media are associated with later abnormalities in speech and language,49 exposure was not associated with speech and language difficulties requiring an IEP, and sensitivity analysis revealed that these children were not responsible for the observed associations. Even for children with some severe chronic diseases, the burden of illness itself may not have a major impact on cognitive development.50 Other potential limitations include the following: (1) elements of the surgical experience other than anesthesia exposure, such as the surgical stress response, may be responsible for the associations; (2) although most characteristics of Olmsted County residents are similar to the rest of Minnesota, some differ from the U.S. population as a whole33 ; and (3) there was a lack of group test data in 21% of cohort members.
Multiple, but not single, exposures to procedures requiring GA before the age of 3 yr are associated with an increased frequency of LD and ADHD and decreased scores in group-administered tests. Single exposures are associated with impaired performance in some domains measured by the group-administered tests but not others. There was little evidence that any factors moderated observed associations between anesthetic exposure status and outcomes. These findings in children receiving contemporary anesthesia care confirm and extend previous observations in children anesthetized before 1985 and provide additional evidence that children receiving multiple exposures are at increased risk for adverse outcomes related to learning and attention. Although there is a robust association, these data do not demonstrate whether anesthesia per se is causal.
The authors thank Maura Scanlon, M.D. (resident in anesthesiology, Mayo Clinic, Rochester, Minnesota), Javier Morales (medical student, University of Puerto Rico, San Juan, Puerto Rico), and Jake Heier, M.D. (resident in anesthesiology, University of Washington, Seattle, Washington), for their assistance with data abstraction.
Supported by grant No. R01 HD071907 from the Eunice Kennedy Shriver National Institute of Child Health and Human Development, Rockville, Maryland, of the National Institutes of Health, Bethesda, Maryland, and the Rochester Epidemiology Project, supported by grant No. R01 AG034676 from the National Institute on Aging, Bethesda, Maryland, of the National Institutes of Health.
The authors declare no competing interests.