Complexity Heliophysics: A Lived and Living History of Systems and Complexity Science in Heliophysics

McGranaghan, Ryan M.

doi:10.1007/s11214-024-01081-2

Complexity Heliophysics: A Lived and Living History of Systems and Complexity Science in Heliophysics

Open access
Published: 04 July 2024

Volume 220, article number 52, (2024)
Cite this article

Download PDF

You have full access to this open access article

Space Science Reviews Aims and scope Submit manuscript

Complexity Heliophysics: A Lived and Living History of Systems and Complexity Science in Heliophysics

Download PDF

Ryan M. McGranaghan ORCID: orcid.org/0000-0002-9605-0007¹

1406 Accesses
3 Altmetric
Explore all metrics

Abstract

This review examines complexity science in the context of Heliophysics, describing it not as a discipline, but as a paradigm. In the context of Heliophysics, complexity science is the study of a star, interplanetary environment, magnetosphere, upper and terrestrial atmospheres, and planetary surface as interacting subsystems. Complexity science studies entities in a system (e.g., electrons in an atom, planets in a solar system, individuals in a society) and their interactions, and is the nature of what emerges from these interactions. It is a paradigm that employs systems approaches and is inherently multi- and cross-scale. Heliophysics processes span at least 15 orders of magnitude in space and another 15 in time, and its reaches go well beyond our own solar system and Earth’s space environment to touch planetary, exoplanetary, and astrophysical domains. It is an uncommon domain within which to explore complexity science. After first outlining the dimensions of complexity science, the review proceeds in three epochal parts: 1) A pivotal year in the Complexity Heliophysics paradigm: 1996; 2) The transitional years that established foundations of the paradigm (1996-2010); and 3) The emergent literature largely beyond 2010. This review article excavates the lived and living history of complexity science in Heliophysics. It identifies five dimensions of complexity science, some enjoying much scholarship in Heliophysics, others that represent relative gaps in the existing research. The history reveals a grand challenge that confronts Heliophysics, as with most physical sciences, to understand the research intersection between fundamental science (e.g., complexity science) and applied science (e.g., artificial intelligence and machine learning (AI/ML)). A risk science framework is suggested as a way of formulating the grand scientific and societal challenges in a way that AI/ML and complexity science converge. The intention is to provide inspiration, help researchers think more coherently about ideas of complexity science in Heliophysics, and guide future research. It will be instructive to Heliophysics researchers, but also to any reader interested in or hoping to advance the frontier of systems and complexity science.

Why is Complexity Science valuable for reaching the goals of the UN 2030 Agenda?

Article Open access 27 January 2021

Introduction

Systems Science, Cybernetics, and Complexity

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The [21st] century will be the century of complexity. - Stephen Hawking

Heliophysics is ‘a fundamental science discipline that is the study of the very nature of plasmas throughout space, originating with our own Sun and heliosphere and extending to planetary atmospheres and magnetospheres, stellar atmospheres and astrospheres, and interstellar space’ (Cohen et al. 2023). Heliophysics processes span at least 15 orders of magnitude in space (from, for instance, the gyroradii of particles in the solar or Earth atmosphere important to wave-particle and particle-particle interactions at sub-micrometer lengths to the $10\times ^{10}$ sizes solar phenomena can reach or the interplanetary distances over which these processes must be understood) and another 15 in time (from reconnection and collision to solar cycle or longer time scales). The reaches of this science go well beyond our own solar system and Earth’s space environment to touch planetary, exoplanetary, and astrophysical domains. The history of Heliophysics has, like many sciences, been one of specialization–categorizing and separating domains and building understanding within those ever more boundaried systems. The approach has produced remarkable achievement, yet in a century in which:

sensing capabilities are revealing multi-scale (distinct phenomena occurring at different scales) and cross-scale (ways in which interactions occur across scales) behavior (e.g., field-aligned currents across scales (McGranaghan et al. 2017b));
data analysis and computational tools are enabling cross-system and multi-scale research (McGranaghan et al. 2017a) (e.g., combined particle and magnetohydrodynamic (MHD) simulations, Sorathia et al. 2017); and
the practical demands on Heliophysics science are growing (e.g., societal dependence on space, risk to critical infrastructure due to space weather, expansion of humanity into space and the solar system–each in some respect dissolving of the boundary between Heliophysics (fundamental scientific discovery of Sun-planet connection) and humanity (the ways Heliophysics processes and phenomena affect lives and society)),

the Heliophysics community faces the need to shift the paradigm by which it creates new scientific knowledge (Kuhn 1962)–the advent of Complexity Heliophysics. We will capitalize the term in this review to draw attention to the fact that we posit it as a paradigm, a framework of assumptions, principles and methods from which the members of the community work (Kuhn 1962); a kind of generalization that characterizes the next stage of a community’s work (Anderson 1972). Note that we are not inventing the paradigm, merely giving it a name and outlining and connecting the varied research and research avenues that compose it with the intention of providing inspiration, helping researchers think more coherently about these ideas, and guiding future research. One of the world’s premier complexity science institutions, the Santa Fe Institute (SFI), originated from a recognition that trends and challenges in chemistry, biology, and psychology revealed the need for a new paradigm. One of SFI’s founding workshops was titled “A Response to the Challenge of Emerging Syntheses in Science: A New Kind of Research and Teaching Institution” (Pines 2018).

In order to usher in this new paradigm, we must first understand what complexity science is, specifically in the context of Heliophysics research. Complexity is a difficult thing to define. It is often used synonymously with ‘something we do not yet know’ (Cilliers 2000), a placeholder label for a new frontier of scientific knowledge (see Ladyman et al. 2020 for discussion about definitions of complexity). However, this review will adopt a more principled view. We will refer to complexity science as a paradigm, drawing distinction from attempts to describe it as a discipline, ill-suited to what complexity actually is. Indeed complexity is a paradigm of scientific discovery (Hofstadter 1999; Mitchell 2009; Pines 2018; Krakauer 2018; Ladyman et al. 2020). It is fundamentally distinct from ‘complicated.’ It is the study of phenomena that emerge from a collection of interacting objects. To understand a complex system requires a plurality of frameworks and the ability to move more seamlessly between scales of the system (e.g., micro and macro). As such, complexity science spans numerous dimensions. This review reveals those dimensions through examination of a large corpus of Heliohphysics research articles. These dimensions organize this review, so we introduce them upfront along with their context in Heliophysics to provide a kind of scaffolding for the material that follows.

To understand this review readers must be aware of the central motivation: Heliophysics faces a grand challenge to unify basic and applied research. Indeed, this has been and will remain a grand challenge across the sciences (Bush 1945). The challenge in the 21st century and that is made clear in this review is now one of understanding the connections between complexity science and machine learning/artificial intelligence.

1.1 Systems Science and Crossing Scales

Systems science is the philosophy and methods for studying systems, sets of things interconnected in such a way to produce their own pattern of behavior over time (Meadows and Wright 2008). It is a core of complexity thinking, whereby the pattern of behavior is not reducible to the behavior of any of the individual things. Crossing between the scales of the system is the foundation of the complexity science paradigm. Complexity science inherently expands across scale–the interacting objects may be particles in a simulation regime, flux ropes and their individual and connected dynamics on the surface of the Sun, plasma in the magnetotail subject to magnetic and electric fields, or coupling of entire systems like the magnetosphere and ionosphere. Indeed it can be any collection of interconnecting things that give rise to behaviors from their interactions–people in a research group or institution, cells in an organism, anything.

In the context of Heliophysics, complexity science is the study of a star, interplanetary environment, magnetosphere, upper and terrestrial atmospheres, and planetary surface as interacting subsystems. Each of these subsystems can be further broken down into regions (e.g., the auroral region of the upper atmosphere) and all the way down to more elementary components such as electrons and protons. At which scale one chooses to examine the Heliophysics system determines the methods one uses and ultimately one’s understanding of it. For instance, in the magnetosphere one might choose the macro-scale, dictating a magnetohydrodynamic (MHD) method, or the micro-scale, requiring a kinetic method. Indeed, one of the most vexing questions that has obstinately refused answer in Heliophysics has been, “What level do we need to look at the system to understand and predict it?” (Denton et al. 2016; Viall and Borovsky 2020; Borovsky et al. 2020). Complexity science is a paradigm that suggests ways of reconciling the micro and macro scales. It is the collection of methods to understand a system across scales, the smaller scale behavior in connection with the phenomena that emerge from it.

1.2 Emergence, Self-Organization, and Scaling Theory

Inseparable from cross-scale understanding is the concept of emergence. Emergence is the term used to describe phenomena that are ‘more than the sum of their parts’ (Rosas et al. 2020). Emergence is observed in virtually all areas of inquiry, such as how large numbers of individual fish are able to behave dynamically as a school when threatened by a predator (Parrish et al. 2002). In terms of scale, emergence is the occurrence of actions at one scale giving rise to phenomena on another level. The idea that order at some higher order or coarse-grained level of a system arises from a number of interacting sub-systems is called self-organization (Castiglione et al. 2008; Flack 2017; Foster 2011). Self-organization is a powerful toehold in complexity analyses because it reveals that emergence is observable in statistical characteristics of the system (often as power law distributions). Indeed, many fields have searched for a universal underlying generative mechanism for the ubiquitous observation of power laws in nature and self-organization has created excitement as one of the potential mechanisms (Newman 2004). Two caveats apply: 1) it is likely that there are many different mechanisms that produce power laws with different mechanisms applying in different situations; and 2) self-organization is a collective reference to many processes applicable to different systems and unsurprisingly produces behavior that is not always power law (Clauset et al. 2007). However, the regularity of power law or power law-like behavior in nature and the frequent correspondence of these distributions to self-organization make the mechanism (self-organization) and this particular outcome from it (power laws) an important feature for understanding complexity.

Power laws are a departure from assumptions of normality that has governed much traditional scientific and engineering analyses and instead involves heavy tails in the probability distributions. Normal and Gaussian distributions are ‘light-tailed,’ meaning there is an exponential fall off moving into the tail such that the likelihood of extreme events are exponentially bounded. Heavy-tailed distributions, on the other hand, are not exponentially bounded and have heavier tails than the exponential distribution. In this review, we use the general definition of heavy-tailed as any distribution that has a heavier tail than a normal distribution. Like self-organization, power laws are powerful because they imply underlying driving mechanisms that are identical across scales of the system and produce the same statistical signature at all scales. For instance, biological organisms across a remarkable range (e.g., mice to elephants) exhibit power law scaling with a 1/4 slope such that the metabolic rate increases proportionally to the body size raised to a 1/4 power (West 2017). So as body size doubles the metabolic rate, or the rate at which the organism consumes energy, increases only by about 75%. Such scaling relationships could provide fundamental principles governing systems, in this case the physiology and energy requirements of living organisms. Power laws are found across systems, from cells to cities, and there is little doubt that they extend to Heliophysics. Instead, the question Heliophysics has been grappling with is whether they reveal deeper understanding of the processes driving the distribution and how to pull out principles of the physical system that they might point to. This review will cover instances where power laws are found in Heliophysics systems. It is important to note that power laws are not the only distributions characterizing scaling laws in nature, only the most widely discussed (Clauset et al. 2007), so we adopt it as the category through which to examine the literature relating complex behavior to distributions it generates. The similarity across scales that a power law reveals is a property called scale-free or scale-invariant. Scale-invariance means that there is some feature or behavior of a system that does not change if the scale of the variables are multiplied by a common factor (for instance, a branch of a tree looks like the complete tree, only on a different length scale; often associated with self-similarity). It is an observed property and does not strictly imply a single driving mechanism or physical process. However, scale invariance as indicated by a power law and correspondence of the power law slope between those systems has often been the diagnostic to look for a mechanism or physical process that is acting commonly across those systems (i.e., universality in statistical mechanics). Such instances have been found in Heliophysics (Freeman et al. 2000), giving credence to the potential for these techniques to lead to new scientific discovery in space physics.

Returning to the relationship between distributions indicating underlying generative mechanisms in the system and the notion of self-organization, Bak et al. (1987) proposed the concept of self-organized criticality (SOC) to explain power law behavior and the correlation that extends over many orders of magnitude in complex dynamical systems. In the original SOC paradigm of (Bak et al. 1987) the system that inspired it was a pile of sand with grains slowly and continuously added and the event was an avalanche, mimicking a dynamical system with spatially complex patterns. Ultimately, SOC implies that the stability of macro-scale systems depends on the self-organization of local events into scale-invariant dynamical patterns characterized by power law probability distribution functions with certain values of the exponents (Bak et al. 1987). Power law behavior across many astrophysical phenomena made the concept relevant to understanding space physics (Aschwanden 2011), mirroring a similar utility across scientific domains from geophysics (Smyth et al. 2019) to economics (Stanley et al. 2002) to social sciences (Galam 2012). A definition of SOC evolved to the broader physics context is provided by Aschwanden et al. (2016), “SOC is a critical state of a nonlinear energy dissipation system that is slowly and continuously driven towards a critical value of a system-wide instability threshold, producing scale-free, fractal-diffusive, and intermittent avalanches with [power law]-like size distributions.” Important points in this definition are that criticality is broadened to ‘critical point,’ including almost any nonlinear system with a (global) instability threshold and that the system must be self-organizing or self-tuning without an external control parameter.

Consider a magnetospheric substorm. The critical state is the point at which the substorm occurs and the driver is the slow and continuous accumulation of magnetic flux from the solar wind into the magnetosphere that brings the system back to its critical point after each substorm. Tying together the three components of this dimension of Complexity Heliophysics (SOC, power laws, and scaling theory), the SOC system is often recognized by power law distributions of ‘event’ (exceedences of the critical threshold) occurrences and characteristics (e.g., size and repeat time) and a scaling law captures the relationship in the distribution that exhibits relationship across scale. In our example, the substorm is the event, power law distributions are found from empirical data for occurrence and other characteristics of the substorm, and scaling laws are determined that quantify the change of the properties of the substorm with the scale of the substorm.

Though we will address SOC, we point readers to more comprehensive examinations of the topic and its history in space physics (Aschwanden 2011; Aschwanden et al. 2016; Watkins et al. 2015; Sharma et al. 2016; McAteer et al. 2015; Chang et al. 2003). Those excellent developments will free up space in this review for novel development of the topic.

The study of these scale-invariant patterns is generally referred to as scaling theory, a framework that focuses on relationships between scales. The existing body of work around scaling theory offers approaches to connect small and large-scale dynamics or micro and macro states. The identification of these scaling relationships and application of scaling theory in general have been a focus of Complexity Heliophysics research.

Related to scaling theory is the concept of coarse-graining. Coarse-graining is considering a system at a higher or coarser level at which some of the finer scale behavior has been smoothed over. Newton’s laws are a coarse-graining for the physics of motion. At these scales, the laws describe the system sufficiently, though they may break down at finer scales. There are coarse-grained theories that are dynamically and statistically sufficient; aggregations that are as good predictors of their future selves as any more microscopic description is. Tools to explore self-organization and emergence provide details about when these aggregations may exist and when they do not. Such tools include cellular automata (computer simulations that apply a simple mathematical redistribution rule yet, across iterations, produce complex spatio-temporal patterns (Wolfram 2002)), statistical mechanics, network science, and genetic algorithms/evolutionary programming and agent-based simulations (Holland 1975, 1992; Schelling 1971; Axelrod 1997). We discuss network science and collective behavior, a generalization of approaches such as agent-based modeling, further below as a separate, but related, dimension of complexity science.

Coarse-graining is thus incredibly useful because it has led to ways to develop an effective theory, which is a representation that allows one to better predict the system than if the intricacies of a finer scale were considered. For instance, measuring the temperature of a system, a coarse-graining of the aggregate motion of its particles, permits more accurate predictions for a given computational capacity than if each of the individual particles’ speeds and directions were measured (Castiglione et al. 2008; Flack 2017).

1.3 Information and Uncertainty Quantification

Self-organization is most often described in combination with emergence (De Wolf and Holvoet 2005). Nonlinearity is a unifying characteristic between them, which Glansdorff et al. (1973) mathematically showed has the property of auto-catalysis or a positive feedback loop (i.e., small changes, large effect). Feedback loops connect microscale interactions to macroscale behavior, which scaling theory statistically describes. We previously introduced one particular type of scaling, power laws, and one particular mechanism by which they are generated, self-organized criticality (Newman 2004). However, the phenomena of self-organization and emergence are more general aspects of system behavior and complexity science requires more general theory to study them. Emergence is a way that order arises from many interacting parts. To analyze order mathematically, the driving principle of the complexity paradigm, one must begin with information and its counterpart entropy. Information quantifies the amount of dependency or connection between a random variable and itself at a different time or with other variables at the same or different times.

$$ I(E) = -log_{2}(p(E)), $$

(1)

where $E$ is an event and $p(E)$ is the probability of that event. Entropy quantifies the amount of uncertainty involved in the value of a random variable, given by Shannon (1948):

$$ H(X) \equiv - \sum _{x \in \mathcal{X}} P(x)logP(x), $$

(2)

where $X$ is a random variable with probability distribution $P$ over events $x$.

Hidalgo (2015) differentiates information from entropy, “In a physical system, information is the opposite of entropy, as it involves uncommon and highly correlated configurations that are difficult to arrive at.” It is in these uncommon configurations that mathematicians, physicists, and scientists have observed various physical systems. The mathematicians of the early to mid 1900s applied the study of order and disorder to communication systems, creating the field of information theory as the ‘mathematical treatment of the concepts, parameters and rules governing the transmission of messages through communication systems’ (Martignon 2001). These pioneers (e.g., Claude Shannon, Florence Violet McKenzie, Warren Weaver, Alan Turing, Norbert Wiener, to name only a few) became the first information theorists or cyberneticists (Wiener and Collection 1961). It should be noted that women and minorities are often left out of the history of cybernetics, but played integral roles whose influences are still being discovered, from Ada Lovelace’s pioneering work to develop the world’s first complex computer program 100 years before the first computer existed (Plant 1995) to Margaret Mead’s instrumental contributions to the field of cybernetics and new systems thinking (notably memorialized in the influential Macy Conferences alongside another often unrecognized contributor, Janet Freed Lynch (Hayles 1999)), from biologist Ross G. Harrison to the innumerable ways that African culture was studied by anthropologists like Mead and Gregory Bateson and used to scaffold the ideas upon which cybernetics was built (Haraway 1976). Subsequently, the complexity paradigm was built on ideas of information. Ilya Prigogine studied chemical systems via concepts and mathematics of information and entropy and directed his inquiry toward the question of where order comes from Prigogine and Nicolis (1967). A key realization was the importance of far-from-equilibrium systems that could move toward more complex states not otherwise possible through sequences of near-equilibrium transitions. Prigogine called these ‘dissipative structures’ (Prigogine and Lefever 1968; Prigogine and Nicolis 1971), and they became a bridge between self-organization (SOC systems, for instance, are in this far-from-equilibrium state), information, and physical systems as well as source material for philosophical inquiries into the formation of complexity in biological organisms.

Transmission in Shannon’s messages in a communication system and Prigogine’s fluids in a thermodynamic system both provide apt analogs to the transfer of energy through physical systems – and the impact of information theory and uncertainty quantification on physical science has grown in recent years. Heliophysicists have taken the information theoretic approach to the solar-terrestrial system (e.g., Wing and Johnson 2019). Information theory provides rigorous mathematical formalisms to study the nonlinear relationships and feedbacks that characterize complex systems (Thayer 2011), especially because they can go beyond linear correlational analyses, capture nonlinear relationships, and establish causalities. Long has the dilemma of correlation vs. causation been an undercurrent in scientific studies and discussions. A central contribution to the conversation was the Granger causality paradigm (Granger 1969), which formulated causation in a predictive manner, asserting that a time series $X$ Granger causes another $Y$ if past values of $X$ provide statistically significant information about future values of $Y$. A core assumption in Granger’s work that the cause has unique information about the future values of its effect is grounded in the notion of directed information, an information theoretic measure. Many complementary concepts now accompany Granger causality, most of which share a grounding in causality being studied based on the number of bits of information that one process provides about another, or information theory. Information theory is a rich field with many measures that are based on probability distribution functions ($pdf$s) and can therefore capture nonlinear relationships between variables (e.g., transfer entropy). Systems sciences recognize the limitations of correlation and regression-based methods for complex systems, moving toward causal inference frameworks based in ideas and formalisms from information theory. This is notably evident in Earth Systems Science (Runge et al. 2019). Space physics, too, has been enmeshed in the correlation vs. causation dilemma and though correlation analyses remain common and in some cases useful, there is a growing body of literature around the application of information theory. We review that literature in Sect. 6.3. Researchers, in space physics as across fields, have found that information theory can describe the structures and signatures of order against the random, entropic background on which they act. Information theory thus provided an entry point into the complexity science paradigm.

Information theory is inherently probabilistic. The equations deal with random variables and probability distributions. As apparent in the connection to entropy, implicit in information theoretical approaches is a quantification of uncertainty. Indeed the complexity paradigm requires an acknowledgement of uncertainty and uncertainty quantification becomes important within it. Progress toward information theoretic approaches and uncertainty quantification in Heliophysics leads to the advent of risk science and resilience studies as bridges between deterministic physics-based methods and empirical data-driven approaches (Camporeale 2019). The intersection or reconciliation of these two is a grand challenge for Heliophysics and one for which past work suggests the complexity paradigm may provide new possibilities for progress (see Sect. 7.1).

1.4 Networks, Network Science, and Collective Behavior

Information leads to another central dimension of complexity science: networks.

If the complexity science paradigm is about understanding the emergence of patterns from the interactions of their parts, then networks are its specimens and network science its toolkit. A network is simply a collection of entities, or nodes, and their relationships, or edges (see Fig. 1. For example, in a social network the nodes are people and the edges whether they know or are friends with one another.

Networks, also called graphs, permit the representation of a system in a way that captures more of the complexity than, say, a rigid spatial grid representation could. As the network structure is remarkably representative of the natural world (Kauffman 1993), thinking of a system in this way can lead to new and useful insights (Newman 2010).

The 21st century has witnessed the advent of new theoretical tools to extract knowledge from networks of many different kinds (Torres et al. 2021, and references therein). Graph theory, dating to the 18th century, was used to represent networks and as the scale of these networks grew (e.g., internet-scale data that are inherently networked) new solutions were created for the computational and interpretive challenges. New metrics for understanding the networks at different levels were created, including various measures of centrality (Newman 2010, Chap. 7) and community detection (Porter et al. 2009). From the mid-1900s into the 2000s, common graph structures were discovered and models developed to generate them such as the random graph and the Erdös-Rényi model (Erdos and Rényi 1984), the small-world graph and the Watts and Strogatz model (Watts and Strogatz 1998), and the scale-free graph and the Barab’asi-Albert model (Barabási 1999). Even more recently there have been advances in understanding how networks change over time, network dynamics, and spread across networks (Valente 1995; Pastor-Satorras and Vespignani 2001; González et al. 2008; Dutta et al. 2013), notable in recent years for their use in epidemiology and the spread of disease. The availability of new tools was grounds for a concomitant dawn of network science in Heliophysics. As an objective of this review is to focus on areas of the complexity paradigm that have received less development, special emphasis is placed on networks and network science in Heliophysics.

While there is a productive and generative body of network science research in the solar-terrestrial system (e.g., Dods et al. 2015; McGranaghan et al. 2017c), the related topic of collective behavior has received relatively little attention. Collective behavior is a term used to describe approaches to understanding emergent phenomenon particularly through representing the system as a network. From the function of parts (nodes) together with their interactions (edges) collective, or community, behavior emerges (Radicchi et al. 2003; Porter et al. 2009; Fortunato 2009). Collective behavior has progressed from a description of phenomena to a framework for understanding the mechanisms by which collective action emerges (Bak-Coleman et al. 2021). The research that composes this review suggests that areas ripe for future analyses may be identified in the language and discipline of collective behavior, especially in its treatment of networks. The areas include both the study of the physical solar-terrestrial system and the social networks or communities of Heliophysics that study it. Thus, the implications are not only for fundamental science but also for how we do science.

Ultimately, one cannot do modern science without information theory and now networks. These form pillars of complexity science that we will use to explore the history of Complexity Heliophysics.

1.5 Risk Science and Resilience

New frameworks are required to handle uncertainty and embody the complexity paradigm. This is acutely true in Heliophysics, which has an existential counterpart in the societal impacts of the solar-terrestrial connection known as space weather (Schrijver et al. 2015). In order to translate the science of Heliophysics into actionable knowledge for space weather, the complexity paradigm dictates a risk science approach and an emphasis on resilience (Scheffer et al. 2001; Sobel et al. 2014; de Bruijn et al. 2017; Angeler et al. 2018). Risk science is the set of approaches and research areas that bridge from the fundamental Heliophysics understanding to concrete and quantitative impacts that inform decision-making. In practice, it is the layer between Heliophysics knowledge and applications that encompasses the dimensions listed above (e.g., systems science, information and uncertainty, network science) and their downstream impacts, permitting a quantification of risk, defined as consequence times likelihood. Once we quantify risks, we can then consider the resilience of our systems, whether that system is a crewed spacecraft, the power grid, or the magnetosphere itself. Resilience emphasizes discovering mechanisms of a given system that maintain functionality under changing or uncertain environments (Flack and Mitchell 2021). Together, risk science and resilience outline a framework, what we will call a risk and resiliency framework, that defines a future research agenda for Heliophysics and space physics capable of responding to the grand challenge articulated above: namely navigating the intersection between basic science (e.g., complexity science) and applied science (e.g., predictive models like those from machine learning). In a risk and resilience framework a system is treated as complex and can be defined by whether or not it can accommodate changes and reorganize itself while maintaining the crucial attributes that give the system its unique characteristics (Scheffer et al. 2001). Risk and resilience offer ways that data-driven information can be incorporated with complex systems understanding and decisions made amidst uncertainty (de Bruijn et al. 2017). How do we define risk and natural hazards (the physical phenomena that create risk) and how might understanding Heliophysics and space weather in these terms illuminate a path toward a society resilient to their vicissitudes? This review pulls together the existing literature and knowledge to discuss this question. We draw from important sister disciplines, namely Earth Science, terrestrial weather, natural hazards, risk studies (Burgess et al. 2016), and disaster risk reduction (Wisner et al. 2011), to understand risk science (Burgess et al. 2016). Because the idea of Heliophysics and space weather as risk sciences is relatively novel in the published research, our treatment of the topic must be somewhat speculative and subjective. However, we ground those perspectives in the support that does exist both within and tangential to space physics.

Although the volume of work in that treats the solar-terrestrial connection as a complex system and casts the problem as one of risk quantification and resilience-building (Valdivia et al. 2005; Jonas et al. 2016; Green et al. 2016; Eastwood et al. 2017, 2018; Oughton et al. 2019; McGranaghan et al. 2020, 2022) is small, it is growing, and the changes we observe in the literature motivate the need to articulate a framework for risk science and resilience. Therefore, we conclude this review with a look at just a few of the works that have considered Heliophysics and space weather as a risk science and examine the impact they have on societal infrastructure and life on Earth (resilience), providing scaffolding for creating a framework for risk science and resilience within which actions can be determined and decisions made amidst, sometimes extreme, uncertainty. The attempt to review the existing works and to abstract a framework for risk and resilience offers guidance on the topic we have identified is central to Heliophysics and space physics in the 21st century: bridging research and operations; determining the relationship between basic and applied research. The review culminates in a discussion of this grand challenge in Sect. 7.1.

1.6 Approach and Roadmap for This Review

With the paradigm shift comes new capacities to understand the system. Heliophysics has embraced some of the dimensions, however, a number of them remain relatively unexplored. Among the nearly 400 articles manually reviewed and cited in the bibliography, and corroborated by the order of magnitude more articles included in a corpus automatically generated from natural language processing (NLP), uneven coverage of the five dimensions introduced above was discovered. ‘Emergence, self-organization, and scaling theory’ (albeit with less emphasis on emergence) enjoyed the best coverage in Heliophysics, followed by the more general category of ‘Systems science and crossing scales.’ ‘Information and Uncertainty quantification’ is slightly more prevalent than ‘Networks, network science, and collective behavior.’ ‘Risk science and resilience’ is significantly the least well-represented. The coverage in this review inevitably reflects this representation, with the lack of coverage contributing to what is identified as gaps potentially deserving of special focus. Where other review articles have already covered a topic, we choose not to reify that material but rather to point to the appropriate review.

The purpose of this review is to apprehend the development of the paradigm and to establish the historical trajectory for the tools of complexity science within Heliophysics. From this history, important trends emerge that will guide researchers (early and senior) and those with the responsibility to direct research resources toward directions that may be more capable of responding to the problemscape of Heliophysics in the 21st century. The structure of the review matches these goals: Sects. 3-4 review the work that sets the stage for Complexity Heliophysics, taking 1996 as a pivotal year; Sect. 5 covers transitional years (1996-2010) when the foundation for the paradigm was being established in earnest; and Sect. 6 broadens the scope of relevant literature to identify topics and trends for which Heliophysics exploration is relatively nascent or that we have not yet explored. This history is chronicled to substantiate the grand challenge confronting Heliophysics and space physics, which we make explicit in Sect. 7.1. Finally, we attempt to synthesize the history and the grand challenge into a coherent response and recommendation for the future for Heliophysics, space physics, and space weather in Sect. 7.2. Sections 7.1 and 7.2 attempt to articulate the challenges and future directions for Heliophysics. Such discussion about the interpretation of the current state of a field and about what lies ahead will be necessarily subjective. Therefore, those sections have subjective elements. However, we have grounded those opinions in literature and evidence to the extent possible.

First, a word about the philosophy of history that we adopt. The approach is one that did not assume the important dimensions of complexity a priori, but rather reviewed the literature liberally for those dimensions to emerge from the existing work. Second, we acknowledge the importance of positionality, or where one is located in relation to their various social identities, in constructing any history. It is unavoidable to have a bias toward Heliophysics, however to address a paradigm as encompassing as complexity, one needs to step beyond the unidisciplinary perspective and bring in works traditionally considered beyond our field. Indeed this is one aspect that distinguishes this review from others on similar or related topics (e.g., Aschwanden et al. 2016; Balasis et al. 2023).

The effect is a relatively uneven coverage of the dimensions of complexity and a reference list that is much wider than Heliophysics. We believe this paints an accurate picture of Complexity Heliophysics and provides the breadth that justifies our extrapolations in the final sections of the review.

The objectives of this work are:

1.
To define the paradigm of Complexity Heliophysics in the context of the seminal works that compose it, seen in a new converging light;
2.
To detail the network of complexity studies in Heliophysics, setting the stage for the needed research in the 21st century. The corpus of cited text will be uncommon to most Heliophysics research papers (e.g., pulling liberally from areas outside of the traditional scope of Heliophyscis research articles) and a unique resource in and of itself (e.g., serving as a hub for the network of research that all readers can use in their exploration of Complexity Heliophysics); and
3.
To lay a foundation for how complexity science may help address outstanding questions in Heliophysics science, including the intersection of fundamental and applied research and the use of artificial intelligence and machine learning (AI/ML).

A function of this review is to enable further investigation of the research that was compiled and explored and the artifacts from it. Thus, we provide those resources in a navigable way that lends itself to further exploration. Artifacts that we provide include:

1.
This review article;
2.
A glossary of terms (the bedrock of information search, integration, and automated analyses (Pomerantz 2015)) that define complexity Heliophysics; and
3.
A new corpus of Complexity Heliophysics compiled using NLP where the included papers have been filtered based on Heliophysics or Heliophysics-adjacent journals and by matching terms in the papers to those in the new glossary.

The automated corpus contains roughly 3000 articles, which augment and grow by an order of magnitude the 400 that were manually reviewed. Given the proliferation of scientific publications, outstripping the capacity of any individual researcher or even any small group, NLP is required for comprehensive treatment or knowledge of any subject.

All resources are provided in a Github repository for this publication.^{Footnote 1} Something we would like to see come from this work is a more collaborative examination of the corpus of articles (the collection of documents manually compiled in the references of this review along with those automatically generated through NLP methods (see Appendix C)). One way to accomplish this is to create a database of the papers in the corpus (our references list plus the articles compiled from NLP analysis of Heliophysics literature) on a platform that allows the community to add margin notes, annotate, and hold conversations around the papers. This would allow the insights researchers generated when reading published works to build on one another and for the conversations around those works to evolve. Inspiration can be found in the Fermat’s Library.^{Footnote 2} Altogether, the resources provided with this review constitute a library of Complexity Heliophysics in the ethos of libraries as cultural technologies^{Footnote 3} and sacred places (Price 2019).

Previous works have capably reviewed complexity science up to ∼1996 (Klimas et al. 1996). We do not attempt to restate those works here, and instead take the Klimas review as our starting point. This enables us to give more attention to the voluminous body of work that has been created since. Parts of the discussion will, of course, reach to works prior to 1996.

We augment the literature review and synthesis with a more perspective-based portion (Sects. 6 and 7.1) where we attempt to describe trends perceived through creating this review and to identify key issues confronting Complexity Heliophysics.

We acknowledge that some readers will not incorrectly read parts of this contribution as opinion; however, we have tried to support our views by quotes and references compiled across more-than-Heliophysics publication and knowledge wherever possible.

Finally, like any review or synthesis, this is incomplete. It is not the goal of this work to review exhaustively every paper that is related to complexity science in Heliophysics, but rather to present a useful selection, deliberately chosen to reveal the story of the complexity science paradigm in Heliophysics, and to illuminate generative areas for future thinking and research. Our purpose is to create a strong foundation to support future research, a place to build on for the inspired reader and community to respond to its inadequacies. The first step in scientific change is getting people into a space where they can acknowledge that there are alternatives to the ways that it does science at present and opening a discourse about what those alternatives are. Then you begin to build social movements and actions around those alternatives in order to achieve them. That is the hope for this review.

1.6.1 The Use of Natural Language Processing (NLP) in This Review

Given the volume and breadth of the information available to a Heliophysicist, a problem exacerbated by the pace of scientific research and publication, traditional approaches to search and discovery as well as ingestion of new materials (e.g., largely manual) will need to be augmented by new tools. Perhaps most pressing is the need to create and adopt mature natural language processing (NLP) tools to help one search through, organize, and summarize the vast literature. NLP refers to interactions between computers and human language and is often used to refer to the programming of computers to process, analyze, and respond to large amounts of natural language data. We have employed these techniques in this review. As far as we are aware, the use of NLP to augment the review is a novel element and further distinguishes it from related contemporary works.

As a supplementary piece that intends to make this a living contribution to the state of knowledge, we provide a corpus that was generated by natural language processing methods of 33 journals (those within the NASA Astrophysics Data System (ADS) deemed most important to Heliophysics) that any reader can use freely to determine other trends not addressed here. Across the 33 journals, we compiled a corpus of nearly 125 thousand articles with their authors and abstracts. After matching words in the title and abstract with terms in our complexity glossary, we arrived at a Complexity Heliophysics corpus of roughly three thousand documents, two orders of magnitude larger than a typical journal article bibliography. The details of how the corpus was generated can be found in Appendix C. It augments the bibliography, a corpus in itself, of works directly cited in this review. The author acknowledges a wealth of knowledge much greater than the papers directly cited in this review contributed to the writing. Resources related to the automated corpus generation and results are provided in an accompanying Github repository for this review.^{Footnote 4}

We suggest that such automated corpora could perhaps even become standard for future reviews. The one provided herein should be considered a resource that complements the extensive references cited in the body of this review and contains high potential for discovering trends and knowledge about Complexity Heliophysics. It is important to note that the manual and automated corpora are not disjoint nor is the manual corpus strictly a subset of the automated corpus. Many references are shared across them, lending validation to the process of generating the automated set, but there are many references in the manual set that are not included in the automated one. This points to the flexibility of the scientist-driven discovery process, pulling in relevant references and material that might be more distant or irregularly connected to the research at hand than the necessarily more rigid automated process. This review, in particular, read widely in gathering material, many connections of which an automated approach would likely not have captured. The point is there must be an intersection of manual and automated gathering of resources, the manual approach benefiting from flexibility and capacity to range widely and be discerning and the automated approach benefiting from the volume of resources it can examine.

The augmented way that we have approached this living review illuminates a trend in Heliophysics research and one point of this manuscript is to demonstrate the hybrid manual-automated approach. We discuss this trend in research below in Sect. 6.5.

Readers interested in the history of complexity science in Heliophysics will enjoy beginning with Sect. 3. Those who might be interested in a more quantitative analysis might begin with the importance of NLP (Sect. 6.5) and our provided corpus, where they can emerge their own conclusions and trends. Those with an interest in solving the key open questions and challenges will find most value in Sects. 6 and 7.1.

2 Key Definitions

Any conversation or published work situated sufficiently in spaces between established fields requires a period of adjusting or coming to a more shared language. We provide a set of definitions that are important to the review that follows as a way to create common language.

artificial intelligence the theory and development of computer programs able to perform tasks normally thought to require human intelligence (McCarthy et al. 2006).
coarse-graining considering a system at a higher or coarser level at which some of the finer scale behavior has been smoothed over
collective intelligence the study of collective behavior, that is adaptive, wise, or clever structures and behaviors by groups, in physical, biological, social, and many engineered systems (Flack et al. 2022)
corpus a collection of documents
disaster the state of a natural phenomenon resulting in major consequences for society (cf., ‘natural hazard’ and ‘risk’)
emergence the term used to describe phenomenon that are ‘more than the sum of their parts’
feedback a loop of interactions across a system; in the computational fields, feedbacks are defined by outputs of a process being put back into an input of the same process
graph/network a collection of entities (nodes or vertices) and their relationships (edges)
information Information quantifies the amount of dependency or connection between a random variable and itself at a different time or with other variables at the same or different times. Its counterpart, entropy, quantifies the amount of micro-states involved in the value of a random variable
machine learning the study of computer algorithms that allow computer programs to automatically improve through experience (Mitchell 1997). ML is a sub-field of artificial intelligence.
natural hazard potentially consequential phenomenon of nature outside of human control (cf., ‘disaster’ and ‘risk’)
natural language processing (NLP) programming of computers to process, analyze, and respond to large amounts of natural language data
named entity recognition (NER) subtask of NLP that seeks to locate and classify named entities mentioned in unstructured text
resilience the property of a system to accommodate changes and reorganize itself while maintaining the crucial attributes that give the system its unique characteristics (Scheffer et al. 2001)
risk the combination of the probability of occurrence of a natural phenomenon and the magnitude of the consequences (likelihood multiplied by consequence; cf., ‘disaster’ and ‘natural hazard’)
sandpile cellular automata model a model based on adding sand to a pile and observing and quantifying the results, specifically the avalanches, that occur due to simple rules defining the evolution (Bak et al. 1987). It is a specific example of an automaton that is a regular grid of cells, each with a finite number of possible states, and a fixed rule governing its state at the next time step given its current state and the states of neighboring cells (Wolfram 2002).
scale-free/scale invariant property of a system in which there is similarity across scales, the same structure is observed regardless of the scale at which the system is observed
self-organized criticality (SOC) Bak et al. (1987) proposed the concept of self-organized criticality (SOC) to explain the power law, or scale-invariant, correlation extending over many decades in complex dynamical systems. It is the observation that, as articulated by Consolini (2002), “...some slowly-driven, dissipative, extended, dynamical systems can naturally exhibit a spontaneous organization towards a sort of dynamical critical point. The critical state does not depend [on] the initial conditions, does not require a fine tuning, and behaves as an attractor for the dynamics. All these systems are characterized by an intermittent dynamics, 1/f noise, and by a threshold dynamics, i.e. a local stepwise instability that occurs when the local field exceeds some critical value.” SOC dynamics can be viewed as a sort of dynamical transition between metastable configurations near a critical point and is an explanation of the ubiquity of 1/f noise
system A group of interacting or interrelated elements that act according to a set of rules to form a unified whole (Merriam-Webster 2023). A systems perspective is at the center of complexity science.

3 Setting the Stage for Complexity Heliophysics: From 1996

The world of Complexity Heliophysics touches many areas. We will focus this review on areas that are important and have received less development, including the critical linkages between the papers. For instance, we will address the topic of self-organized criticality, but defer a comprehensive examination of the topic and its history in space physics to Aschwanden et al. (2016) and Chang et al. (2003), allowing space here for further development of complexity in Heliophysics.

As mentioned in the introduction, we begin with the review article by Klimas et al. (1996), which could be interpreted as a turning point. They provide the relevant framing, “Earth’s magnetosphere responds to the time-varying solar wind in an organized and repeatable fashion. Evidence has accumulated indicating that this organized evolution is a manifestation of low-dimensional magnetospheric dynamics. It appears that over the largest spatial scales and over substorm timescales and beyond, a relatively small number of magnetospheric state variables dominate the evolution. The identities of these variables are not known at present, and very little of the dynamical system that governs their evolution is understood. Determining these variables and understanding the related dynamics are the primary goals of this research. If these goals are reached, then a spatio-temporal framework will result within which reside the complex phenomena that are collectively called geomagnetic activity.”

Klimas et al., details the transition over the preceding several decades from an era of linear correlative studies to the beginning of a new era of nonlinear dynamical studies. Why? Because the linear filters of Bargatze et al. (1985) across varying solar wind activity demonstrated clear nonlinear behavior through distinct peaks in the time lags of activity. “Generally, there is a peak in the filters at lag time 20-30 min showing that there is always a response in electrojet activity to solar wind activity 20-30 min earlier; Bargatze et al. attributed this peak to the directly driven magnetospheric response (Perreault and Akasofu 1978; Akasofu 1979, 1980). There is a second peak at lag time one hour that is most evident for the moderate activity filters; this peak was attributed to the unloading magnetospheric response (McPherron 1970; McPherron et al. 1973; Hones 1979; Baker et al. 1979, 1981a,b)”. The Bargatze work made clear the direct and indirect (directly-driven and unloading) modes of the magnetosphere (Kamide and Kokubun 1996). Klimas et al., begin from this background and the implication that nonlinear explication of the solar wind-magnetosphere coupling requires nonlinear treatment. A key contribution of their review is a convergence of work to address the directly-driven and unloading modes of the solar wind-magnetosphere system and the corresponding evolution of the methods from linear correlative studies to nonlinear dynamical studies. It is an excellent starting point to understand the Complexity Heliophysics paradigm.

The central question in the review was whether it can it be shown that the magnetospheric dynamics, represented by systems of differential equations, are effectively represented by a low-dimensional dynamical system and, if so, what is the nature of that dynamical system? The response to this question is traced through three approaches, interleaved in time: autonomous time series, analog (input-output) models, and computationally mature input-output models. The self-described summary of the work is a review of approaches to “find a low-dimensional analogue model of the magnetospheric dynamics derived directly from data and interpreted in terms of magnetospheric physics.” It laid out the existing state-of-the-practice toward what was a major goal of the magnetospheric community since early in the formation of Heliophysics as a discipline: to find a low-dimensional analogue model of the magnetospheric dynamics.

The first observation from the collection of articles in the Klimas review is the prevalence of using the auroral electroject index (AE and the auroral upper and auroral lower indices, AU and AL, that constitute it) (Davis and Sugiura 1966a) to be the measurable variable to reconstruct the magnetosphere. In fact, the standard or benchmark dataset from most of the work addressed is a set of 34 events for which solar wind data are collated with the AL index (Bargatze et al. 1985). Appendix D contains a list of important datasets, and their original appearance in the literature, that appear across this review to aid readers who wish to compile datasets and explore data-driven research across the datasets that have factored importantly in the Complexity Heliophysics paradigm.

3.1 Autonomous Time Series

Autonomous approaches make the assumption that the evolution of the system representation (the state vector) is solely a function of the internal dynamics and independent of external factors. They are concerned with the question of whether or not the evolution of the magnetosphere is organized such that a few variables alone can describe its evolution. Studies attempt to determine whether the dissipative magnetosphere dynamically evolves into a low-dimensional ‘attractor’ state and then attempts to describe the attractor to develop a model of the organized physical state. A measure of the attractor dynamics is the correlation dimension. The correlation dimension considers the set of state variables, or the phase space, of the system. To calculate the correlation dimension an arbitrary measurement from those variables is chosen and compared to other measurements in the neighborhood of the chosen one, defined by a sphere of radius $r$. The number of measured state vectors within the neighborhood are counted ($C_{r}$) and the variation of this number as $r\rightarrow 0$ is determined. The importance of the correlation dimension, $D_{cr}$, is that if $C(r) \propto r^{D_{cr}}$ as $r\rightarrow 0$, then $D_{cr}$ is an estimate of the attractor dimension near the chosen point. For an entire system, an average correlation dimension can be found by averaging the correlation dimension from each chosen measurement over all of the measured points. For a simple closed loop phase space, purely periodic system, in three dimensions (three state variables), $D_{cr}=1$. In a chaotic system, the attractor phase space is made up of many folded and closely packed layers and $D_{cr}$ falls between 2 and 3. A non-integer correlation dimension is a feature of ‘strange’ attractors (Ruelle 1980). The dimension of the attractor is an important estimate of the physical system that gives rise to it. In the periodic case, the dimension is one, indicating that any two of the variables can be written in terms of the third and that the system is one-dimensional. Between two and three, the correlation dimension indicates the minimum phase space dimension that supports the attractor is three, and that is the physical dimension of the system.

Across the literature a rather wide range of correlation dimensions between 2.4-4 have been found: from 3.6 in the first application of the technique to magnetospheric dynamics (Vassiliadis et al. 1990) to a set of papers that found low correlation dimensions (Roberts et al. 1991; Shan et al. 1991b,a; Roberts 1991; Pavlos et al. 1992; Sharma et al. 1993). There were numerous critiques of the correlation dimension technique to determine the magnetospheric dimension, including the requirement of large databases of measurements and lack of knowledge of what volume would be sufficient, spurious periodicities in the databases as have been identified in the Bargatze et al. (1985) data, potential presence of background or superimposed trends in the data across activity levels and unrelated to the magnetospheric dynamics, and sensitivity to delay times chosen for the analysis. Among the limitations that exist across attempts to understand the complex magnetosphere and geospace environment is the inability to measure all of the state variables or even to define them a priori. The solution has been to substitute for the unobservable variables functions of measurable ones to constitute the state vector. Among the most readily available has been the AE/AL/AU indices and the best available datasets have been those that align the solar wind drivers with the AE indices’ responses. Limitations in the use of these reconstructed variables permeate the history of complexity Heliophysics studies of the magnetosphere.

A far-reaching critique of the approaches reviewed was the fact that the AL index time series is a colored noise output of a high-dimensional stochastic process, which would result in a misleading low correlation dimension when those data are used to proxy the complex magnetosphere (Shan et al. 1991b; Takalo et al. 1993, 1994). Colored noise is stochastic noise that, like a chaotic time signal, exhibits a power law spectrum $f^{\alpha}$ (Klimas et al. 1996). In the case of colored noise, the correlation dimension is actually a measure of the fractal dimension of the system and is unrelated to the existence of an attractor. Numerous studies examined the likelihood that the AE/AL index time series are generated by a stochastic signal, with varying conclusions. Though useful methods arose from these investigations (e.g., self-affinity (Osborne and Provenzale 1989), autocorrelation analyses (Theiler 1986), singular spectrum analyses (Sharma et al. 1993), the use of surrogate data (Theiler et al. 1992)), no consensus was established.

The conclusion from the autonomous method studies was rather that no conclusive statement could be made about the low dimensionality of the magnetosphere–contradictory results pervade the literature. The lack of resolution of the central question of the variables that govern the magnetosphere and its evolution led to the consideration of magnetosphere by nonlinear input-output methods. From Takalo et al. (1994):

[The] magnetosphere is not an autonomous system but is continuously driven by the stochastic solar wind. It is therefore possible that, even if the magnetosphere were a low-dimensional chaotic system, we might not find it by studying just one time series. This is because the magnetosphere may not have time enough to converge to the possible attractor for times long enough to produce data with a detectable number of close returns of the trajectory. For this reason it has been suggested that the magnetosphere should be described as a nonlinear input-output system (Prichard and Price 1992; Price and Prichard 1993).

3.2 Input-Output Models: Analogue Models

Autonomous methods assume that the system evolves solely due to internal dynamics. Loading rates or external forcing may be present, but it is taken to be a constant and a parameter of the system rather than a dynamical variable. In the autonomous methods there is a common approach to use a series expansion of a function to represent dynamics at a certain point in space and time and to discard the higher-order terms. The shortcomings of the autonomous methods raised the question of the dynamics that reside in these higher-order terms. Subsequent to the controversies and conflicting conclusions of many autonomous system approaches and studies, input-output approaches began to play a central role in the study of the magnetospheric dynamics. Input-output (I-O) approaches use both the input to the system and its output response to determine the coupling characteristics. The Klimas et al. review divides models into two categories: 1) analogue and 2) more recent nonlinear and computationally intensive approaches. Analogue modeling makes assumptions about the magnetospheric coupling characteristics (determining them from ‘preconceived notions of the processes that constitute geomagnetic activity’) whereas the latter I-O methods reviewed determine them directly from data. The output of the latter I-O methods is a so-called phenomenological dynamical system, one that has been deduced from time series analyses of the input and output data rather than through analogue modeling. The two approaches reveal a debate central to science and that foreshadows the key challenge discussion that is introduced at the conclusion of this review (see Sect. 7.1) about the relationship between theory- and data-driven theories of discovery.

Analogue approaches to magnetospheric dynamics assume that the magnetosphere is a coherently driven system that evolves in an manner that can be modeled by a low dimensional dynamical system. The analogue models reviewed by Klimas et al., share the assumption that the magnetospheric dynamics are low dimensional.

To constitute the input and output time series, the analogue models shared the approach of using one of the best available observables thought to be related to the magnetospheric dynamics: the electrojet indices (Davis and Sugiura 1966b). The AU index is a measure of the eastward electrojet driven by reconnection dynamics in the dayside magnetosphere. The AL index, alternatively, is a measure of the westward electrojet that is connected to high-latitude magnetopause reconnection in the magnetotail. In using the electrojets as measures of the response to solar wind energy input, the assumption is that the currents flow as a result of appropriate conductances and electric fields mapped from the magnetosphere to the ionosphere through Alfvèn waves along intermediary field lines.

Goertz et al. (1993) developed a model of only the directly-driven magnetosphere (responses of the magnetosphere as a result of direct energy input from the solar wind rather than unloading responses that occur at longer time lags due to magnetotail activity (Akasofu 1979)) using a system of three nonlinear ordinary first-order differential equations with six independent constant parameters:

$$ \tau _{AD} \frac{d AU_{d}(t)}{dt} + AU_{d}(t) = A_{1} E_{mpt_{max}}(t), $$

(3)

where $\tau _{AD}$ and $A_{1}$ are constant parameters, $AU_{d}(t)$ is the eastward electrojet for the directly-driven response ($d$), and $E_{mpt_{max}}(t)$ is the maximum reconnection electric field at the magnetopause. A similar form is assumed for the westward electrojet and relationship to the electric field in the central plasma sheet, with appropriate adjustments in the parameters. ISEE 3 solar wind data were used to compute the electric field at the magnetopause, the input to the model, and measured AE (composed of the AU and AL indices) was compared with predicted AE over May 18-19, 1979. A cross correlation of 0.9 was achieved (see Fig. 7 from the original publication, the central result of the paper, reproduced here in Fig. 2 with permissions).

Their model is strictly of the directly-driven response to the solar wind, and the agreement indicates little dependence on the unloading response (McPherron 1970). There were periods of marked disagreement, attributed in large part to differences in the AL index, though these were isolated in time as a result of non-adiabatic phenomena so that the overall correlation remained high. Absent a term for these non-adiabatic dynamics, the model has been shown to perform poorly on different data intervals as pointed out by McPherron and Rostoker (1993).

Klimas et al. (1992, 1994) produced a model that included analogues to both the directly-driven and unloading magnetospheric responses to the solar wind. Their model is a Faraday Loop for the time-dependent magnetotail convection with a superposed loading-unloading cycle. The model is built on the concept that if the loading rate on the dayside magnetosphere is above some level, then the unloading of the accreted energy through Earthward convection is insufficient to balance it and a substorm begins to grow. If loading continues, a critical point is surpassed whereafter explosive unloading follows. The resultant three-dimensional Faraday Loop Model (FLM) consists of dynamic variables for the cross-tail electric field, the flux content of the lobe, and a quantity dependent on the size and orientation of the tail. Their observable for comparison was again the electrojet indices, and they mapped the cross-tail electric field from their model to the westward electrojet in a simple way. After this study, it was implicit in the magnetosphere community that any dynamical model must account for both modes: directly-driven and unloading. Kilmas et al., cite the importance of the FLM as being a combination of numerous important statistical properties of geomagnetic activity: isolated substorms, steady loading, and time-dependent loading.

A number of studies have used the FLM model. Klimas et al. (1994) showed that under steady loading the FLM model evolves to a periodic attractor, with a period of substorm recurrence of roughly one hour and that the period was independent of driver strength. Farrugia et al. (1993) examined time scales of substorm occurrence for the passage of slowly varying magnetic clouds to understand substorm recurrence under various steady loading rates. They found that substorm recurrence ranged between 25 and 150 minutes, but with an average rate of approximately one substorm every 55 minutes (unloading recurrence rate). For this range of steady driving strengths the FLM model varies little from the 55 minute average unloading recurrence rate. Klimas et al. (1994) use of the FLM model seems to explain the Farrugia et al. (1993) observations: 1) the nearly invariant unloading recurrence rate under steady driving and 2) the distribution about one hour of recurrence rates during extended periods of loading. The results have been extended to time-dependent driving, where a Poisson distribution of recurrence rates around the most probable value of one hour was found. Note that the Poisson distribution expresses the probability of a given number of events occurring in a fixed interval of time given a known constant mean rate and independently of the time since the last event. The latter assumption has received focus in substorm research in the intervening decades under various names, among them substorm recurrence rates (Borovsky and Yakymenko 2017), waiting times (Freeman and Morley 2004), and intersubstorm times (Liou et al. 2018). Indeed waiting time statistics is used to identify Poisson random processes, self-organized criticality, intermittent turbulence, finite system size effects, or clusterization (Aschwanden and McTiernan 2010; Chapman et al. 1998, 1999).

As a side note, Farrugia et al. (1993) showed that both the epsilon parameter (Perreault and Akasofu 1978; Akasofu 1981) and the $VB_{s}$ (Baker et al. 1986) to be the most suitable measures of the rate at which energy is loaded into the magnetosphere by the solar wind. Developing some proxy of this loading rate is required to study the magnetospheric responses and has been the focus of much subsequent study (e.g., McPherron et al. 2015; Borovsky 2013; Newell et al. 2007). Lockwood (2022) performed an analysis of coupling functions and established best practices in their derivation and guidance on their limitations. They find the analysis of the persistence of solar wind parameters defines how best to compile a coupling function. Further, they comment on the best metrics for testing the capability of a coupling function, revealing shortcomings in correlation as a useful metric for some applications. Finally, they provide two criteria that coupling functions must describe to quantify integrated effects: 1) the large-event tail; and 2) the core of activity distributions. Improved representation of solar wind energy input to the magnetosphere, both in the form of solar wind coupling functions and multiple signatures of energy input, have led to improved understanding of substorm evolution (Haiducek et al. 2019) and even to improved prediction of their occurrence and intensity (McPherron et al. 2015).

Bargatze et al. (1985) calculated linear prediction filters (LPFs) to relate driving ($VB_{S}$) to response (AL index) for a number of intervals during a set of 1-2 day intervals. Their dataset is an important one for Complexity Heliophysics and is described in Appendix D. They found a peak in these filters at ∼20 minutes and ∼one hour, which were ascribed to the directly-driven and unloading magnetospheric modes. Power spectra were created comparing FLM-modeled and measured AL index values to $VB_{S}$ for an interval from the Bargatze data set. They are reproduced from Klimas et al. (1996) in Fig. 3 given their importance to subsequent work on power law relationships across the Heliophysics system.

Figure 3a shows a break in the measured and modeled power law spectra at high frequencies, due to the fractal nature of the AL index at high frequencies. In Fig. 3b the agreement has been recovered when the measured AL index has been smoothed over a 17.5 minute running average. The power law scaling is abundant in and indeed a hallmark of complex systems, indicating the presence of some underlying mechanism that creates self-similar, scale-invariant behavior. Klimas et al. (1996) marks an important recognition of power laws and their significance in Complexity Heliophysics.

3.3 Input-Output Models: Computationally Mature Input-Output Models

The progress and shortcomings of the linear prediction filters pointed to the next step in I-O modeling: capturing nonlinearities. The Klimas review provided an early look at two methods to specify nonlinearity: neural networks and local-linear prediction. Their advantage is that they are data-driven, meaning they determine relationships directly from the data rather than, as in the analogue approach to I-O modeling, from ‘preconceived notions of the processes that constitute geomagnetic activity.’ In other words, they do not make a priori assumptions and instead allow the data to determine the conclusions. With the data-driven approach comes the challenge of physical interpretation of the derived relationships. Interpretability (making physical meaning) of magnetospheric models was, of course, already important even in relatively explainable simple models, but the awareness was exacerbated as data-driven approaches used models with more parameters, more complexity, and became more difficult to explain. Will increase representativeness, interpretability often suffers. The tradeoff of representativeness and interpretability between data- and physics-driven approaches has only intensified in Heliophysics in the 21st century. A few comments related to the approaches that proved most successful of characterizing the magnetospheric behavior, local linear prediction methods and a subset from the field of ML known as neural networks, from the Klimas review articulate the tension:

“To understand the physical content of this“model,” it is necessary to understand the global structure of the nonlinear coupling surface. In this case, to extract the physical content of the local-linear predictor, it will be necessary to reconstruct the coupling surface from the many local approximations to it that are already available.”
“...it does appear that further research into extracting the physical content of the network is warranted.”
(Hernandez et al. 1993) “It is often said that neural networks yield no usable information on the physics of the system that they model. However, it appears that this prejudice may not be correct.”

Compiling these comments has proven prescient of a deeper discussion that has emerged and is being shaped in our more modern era of computation and artificial intelligence (AI). Klimas et al., summarize this trend with the statement, “It is anticipated that in the future, a combined approach involving both analogue [physics-based] modeling and input-output data analysis will prove most effective for understanding the magnetospheric dynamics that couples solar wind input to geomagnetic activity output.”

The early work on neural networks and local-linear predictions focused on geomagnetic activity prediction. Hernandez et al. (1993) created a neural network prediction of the electrojet index, using the Bargatze et al. (1985) $VB_{S}$-AL database. They created a state-space reconstruction (SSR) network, using input solar wind information and past output of the model (e.g., previously predicted AL index values) to predict future AL index values. As a point of comparison, they also created a nonlinear prediction filter (NPF) in which the predicted AL index values are produced based solely on solar wind input information. They determined that the SSR outperformed the NPF and that the neural networks performance were dependent on hyperparameters such as the activation function used. A severe limitation of the nets were the inability to predict large values of AL at all.

Several works advanced the concept of a local-linear predictor (LLP) (Farmer and Sidorowichl 1989; Casdagli 1992; Weigend and Gershenfeld 1994). LLP is an extension of LPF to capture nonlinear coupling between the input and output. Figure 4 shows the relationship.

The LLP addresses the situation where the input and output vary over a small range of values. As that variation increases, or as the curve is more nonlinear, the LPF approximation fails. The LLP fixes the point on the nonlinear coupling curve around which input-output data samples will be used to reconstruct the relationship. The process is one of defining the viable neighborhood. The approximation is valid only within the neighborhood and thus the process must be repeated at each step in time to predict the next step. Applications to the magnetosphere discovered an important extension that both the past and present inputs to the magnetosphere and its geomagnetic response need to be used as inputs to predict future outputs. The size of the neighborhood in the method reflects the degree of nonlinearity in the system, with the relevant neighborhood decreasing as the nonlinearity increases.

Price and Prichard (1993) used the approach to examine the nonlinearity of the magnetosphere, finding only weak evidence. Vassiliadis et al. (1995) is perhaps the seminal early work using LLP. Using the Bargatze et al. (1985) dataset, they found that the best fit for an LLP is low-dimensional and a local fit to a nonlinear predictor.

To apply the method, they build a large database of instances of the I-O data and use a pattern-matching approach to find the best values in each neighborhood of the case in question, varying the size of the neighborhood to find a minimum in the prediction error. As the nonlinearity of the I-O relationship increases, a smaller neighborhood for the LLP improves the prediction. Several important assumptions accompany the LLP method:

It is assumed that there is a set of variables, small in number, that adequately specifies the global state of the magnetosphere;
Although not all of the variables that specify the global state of the magnetosphere may be measurable, it is assumed that an equivalent state can be reconstructed from those that can be measured;
The next time step, $X_{n+1} = F(X_{n}, U_{n})$, is differentiable, where $F_{n}$ represents the magnetospheric dynamics that relates previous input and output to future output.

They suggest that the low dimensionality and nonlinearity of their LLP are characteristics of the physical magnetosphere. They provide strong evidence for nonlinearity and low dimensionality, and the results indicate that the evidence is persistent over many different local phase space neighborhoods (i.,e., over many different solar wind and magnetospheric conditions). Their work is a foundation for geomagnetic activity prediction, using past to present geomagnetic activity indicators like the AL index to predict the indicator’s evolution into the future (e.g., Lundstedt and Wintoft 1994; Topliff et al. 2020). It is now also apparent that the Vassiliadis et al. work was a predecessor of modern data-mining approaches using larger data sets (e.g., Stephens et al. 2019). Finally, LLPs suggest that, ultimately, a model of the magnetospheric dynamics can be low-dimensional, nonlinear, phenomenological.

The Klimas review summarized and may have helped sparked deeper and wider exploration of characterizing the magnetosphere as a low dimensional system. One interesting trajectory through the subsequent literature has been toward ML methods to characterize the magnetosphere: 1) specification of a magnetospheric state vector: “the state of the magnetosphere, resulting from continuous but variable forcing of the solar wind and the interplanetary magnetic field (IMF), can be empirically specified by a magnetospheric state vector,consisting of a set of hourly-averaged magnetospheric driver and response parameters.” (Fung and Shao 2008); 2) including vector correlations between the solar wind and magnetospheric state vector (Borovsky and Denton 2018); and 3) toward the advent of high-dimensional machine learning (ML) models of the magnetosphere and its coupling to the solar wind upstream and the ionosphere downstream (Bortnik et al. 2016; Chu et al. 2017; Maimaiti et al. 2019; McGranaghan et al. 2021c). We will return to ML in Sect. 7.1.

3.4 Important Themes Through 1996 That Set the Stage for Complexity Heliophysics

The studies reviewed represent launching off points for the Complexity Heliophysics paradigm that will be useful to readers (and in some cases supported by quotes from the Klimas article or others cited therein for convenience and context):

Dimensionality of the magnetosphere and self-organized criticality;
Relative success of input-output methods whereas autonomous systems methods were inconclusive;
Local linear predictions (Vassiliadis et al. 1995) as a foundation for geomagnetic activity prediction (uses past to present geomagnetic activity indicators like AL to predict the indicator’s evolution into the future);
The wide-reaching utility of neural networks. “Little seems to have been done using neural networks to predict substorm indicators such as the electrojet indices. Nevertheless, Hernandez et al. (1993) indicate that this is a promising direction for prediction. Further, in view of [the Klimas review] concerning the equivalence of neural network internal parameters to Volterra kernels, it does not appear that further research into extracting the physical content of a network is warranted.”;
Explainable AI (e.g., this review already raised the question of interpretability of the input-output models that proved most successful of characterizing the magnetospheric behavior (primarily neural networks and local linear prediction methods). These questions reach into modernity;
Converging the autonomous and the local linear prediction filter methods (gaining the benefits from each: interpretability and the success of the data-driven approach). “It is anticipated that in the future these local-linear predictor models will be studied carefully with the goal of organizing these bits and pieces into a global nonlinear predictor model. It may be advantageous to cast these predictor models as analogue models in order to maximize their physical interpretation.”; and
The assessments of the computationally mature input-output models point to a component of a risk formulation for Heliophysics models: By perturbing the input and witnessing the change one can create a measure of the sensitivity and thus a component of the resilience of the prediction technique.

Additionally, the Klimas review has numerous connections to the dimensions of complexity science discussed in the introduction and reveals Heliophysics-specific items that will be thematic, including: self-organized criticality of the magnetosphere, system dimensionality, the variability of the solar wind, bi- or multi-modal behavior of the magnetosphere/geospace, implications of various observables or metrics for analyzing the system (e.g., the sufficiency or insufficiency of the auroral electrojet indices), nonlinear I-O modeling such as neural networks, and representativeness vs. interpretability of physics-based and data-driven modeling approaches.

4 Emergence of the Connection Between Self-Organized Criticality and the Magnetosphere

The concept of self-organized criticality (SOC) provided Klimas et al. with an entry point for assessing the complexity science paradigm in Heliophysics. It’s purpose is similar here, yielding a foundational concept from which we branch to other dimensions of complexity that are important to the history of Heliophysics, including power laws and scaling theory (West 2017), fractality (Song et al. 2006), network science (Newman 2010), emergence (Holland 2000), coupling between domains of the solar-terrestrial system, observational considerations, and machine learning. SOC has been well chronicled (Aschwanden 2011; Aschwanden et al. 2016; McAteer et al. 2015; Sharma et al. 2016; Aschwanden 2019) and we will not attempt to recapitulate those excellent reviews, but will provide the necessary background for this review to be self-sufficient and point readers to the most valuable resources to discover SOC research in Heliophysics in more depth.

Bak et al. (1987) provided the world with the concept of SOC, an explanation for the ubiquitous $1/f$ power spectra characterized by a power law function $P(\nu ) \propto \nu $ that holds for any power spectra that is not purely random white noise ($P(\nu ) \propto \nu ^{0}$). The significance is that white noise represents traditional random processes with uncorrelated fluctuations, and the $1/f$ spectra is something else–an indication of non-random structures with long-range correlations in a time series. Bak used a sandpile as the example. Imagine that grains of sand are dropped onto a table. A cone-shaped pile will build up until a grain causes an avalanche. The longer a pile avoids an avalanche the larger the avalanche will be when it occurs. Bak attempted to determine the conditions under which these avalanches occur. He found that avalanches were unpredictable, dependent on the interactions between the individual grains of sand. Eventually the pile reaches a critical point at which the pile transforms into something more complex and properties emerge that are not part of the individual grains themselves. The tendency of the pile toward this critical state (self-organized criticality) was a new way of viewing nature (Bak 1997) – out of balance, but in a poised state.

Bak’s avalanches are the non-random time structures represented by the $P(\nu ) \propto \nu ^{0}$ function. That seminal work sparked a new avenue of research, beginning first with numerical simulations of avalanches and cellular automata, iterative applications of a simple mathematical redistribution rule that yields complex spatio-temporal patterns (Wolfram 2002), and subsequently to widespread application in the physical sciences. Charbonneau et al. (2001) provides a review of cellular automata in the field of Solar Physics.

Nearly ten years after and a continent away from where Klimas provided a backdrop for Complexity Heliophysics in large part beginning with the concept of the magnetosphere as a self-organized system, Valdivia et al. (2005) offered a synthesis of the field’s thinking that self-organization is an explanation for magnetospheric behavior. They addressed this history of the magnetosphere as a self-organized critical system, beginning with the premise that a complex system is one that is characterized by multiscale spatio-temporal (S-T) behavior. Their central question was how could the magnetosphere exhibit complexity at small-scales, but coherent and repetitive behavior at global scales? The canonical example is how there could exist self-similar turbulent behavior in the plasma sheet simultaneously with repeatable and coherent substorm phenomena. A key question in their history is what the threshold state between the predictable and unpredictable might be. Self-organized criticality provided the answer.

Valdivia et al., brought the SOC equations into the magnetospheric context. Taking the AL index as an indicator of global magnetospheric energy dissipation, they describe three characteristics used to indicate self-organization: $\Delta E$ (energy dissipation; area under the AL curve), $dt$ (time duration), and $\Delta \theta $ (time separation between events). Figure 5 details each characteristic in AL index data. Their distributions cover two years of AL time series data between 1986 and 1988, taking the threshold of −100 nT to define an event (for a total of N = 10,365 events). The statistics show clear power laws in all three characteristics.

Finding evidence for a self-organized state, they posit that it may provide a key for understanding substorm onset and derive a 1-D general dynamical model for the magnetic field in the diffusion region of the magnetotail to attempt to explain magnetospheric observations. Using their simple model they compile event statistics for the collective effects of many interacting instability sites (assuming a simple parameterization of the dissipation, derived from observations and data analysis), in a manner not dissimilar from microphysics and particle kinetics simulations. They identified ranges, a phase diagram of sorts, for the $(\mathbf{U} \times \mathbf{B})_{y}$ term and a certain range in which robust critical behavior occurs. Their model cannot describe the details of the microphysics of the magnetotail, yet it serves to indicate that ‘the statistical behavior of many complex distributed systems is more a property of their self-organized state, if it is achieved, than the details of the physical processes that allow such state. This is a general characteristic of systems that are close to criticality where many systems belong to the same universality class, suggesting that it is probable that the statistics of substorms, pseudobreakups, and even the evolutions of the growth and expansion phases, are unrelated to the details of the dissipation process (Shay et al. 1998) other than that dissipation allows for the establishment of a self-organized state.’

Subsequently, a quite comprehensive review of SOC as applied to solar physics and astrophysics was created during two week-long meetings at the International Space Sciences Institute (ISSI) (Aschwanden et al. 2016). They reviewed self-organized criticality across these fields from 1989-2014, highlighting trends, open questions, and future challenges.

The importance of SOC systems is punctuated by its application across domains, including: ecology (Kauffman and Johnsen 1991; Halley 1996; Milne 1998), evolutionary biology (Langton et al. 1991; Kauffman 1993; Sneppen et al. 1995; Holland 1995), geology (Bak and Tang 1989), cognitive science (Plenz et al. 2021), computer science (Wolfram 2002), the social sciences (Axelrod 1997; Miller and Page 2009; Newman et al. 2002), economics and finance (Stanley et al. 2002), political science (Brunk 2001). These diverse systems share common features that are linked through SOC: driven, dissipative, and far from equilibrium and releasing energy in a bursty intermittent manner on multiple scales with numerous routes to instability that lead to the energy release and reconfiguration (Watkins et al. 2015). The importance to this review is that SOC systems are inextricably connected to the statistics of nonlinear processes, which is signaled by power law-like size distributions. The exposition of SOC in Heliophysics was a hallmark of the complexity paradigm in the field in the years after 1996.

5 Beyond 1996: Complexity Heliophysics

5.1 Power Laws in Heliophysics

To set the stage for Heliophysicists’ adoption of the concept of SOC, we begin with the origin of the idea itself. Bak et al. (1987) documented peculiar features of the sandpile cellular automata in the discovery of the concept of self-organized criticality. The peculiarity is that the system responds to external perturbations by dissipating the stored energy in an avalanche, where the size of the avalanche is described by a power law distribution with 1/$f$ noise. Nature seems to love power laws – they appear widely in physics, biology, earth and planetary sciences, economics and finance, computer science, demography and the social sciences (Newman 2004). The origins of power law behavior, the mechanisms that can cause it, have been and remain a point of debate in the scientific community. Power law behavior in investigations of the solar wind-magnetosphere system have been taken to be strong indicators of complexity and SOC in the magnetosphere’s dynamic evolution. It is important to mention that appropriate detection of a power law distribution is non-trivial and care should be taken with the methods and power law interpretation applied to empirical data (Clauset et al. 2007).

Tsurutani et al. (1990) examined the power spectra of the AE index and the interplanetary magnetic field (IMF) north-south component ($B_{Z}$) over frequencies between 17 minutes and 28 hours using five minute averages between 1978-1980 (Fig. 6).

Notably, the power spectrum revealed a peak at 24 hours and a break in the spectrum at ∼ five hours. The spectra on either side of the break are fit using power laws with $f^{-1.00}$ at lower frequencies and $f^{2.2}$ at higher frequencies. The spectral break was found to be independent of the choice of data interval and averaging length. Overlapping the AE dataset that was used to create Fig. 6(left), they computed the power spectrum for the IMF $B_{Z}$ component, finding an unbroken power law that roughly follows a $f^{-1.42}$ slope (see Fig. 6(right)). Thus, the break in the AE index behavior at around 4.7-5.2 hours was not explained by the solar wind IMF $B_{Z}$. They showed the ratio of the power of AE to the power of the IMF $B_{Z}$ as a function of frequency and found a clear break at ∼4.6 hours, below which the ratio is independent of frequency and above which the ratio decreases at a rate of $f^{-0.5}$. The ∼5-hour break in the AE power spectrum is longer than substorm time scales of around 30 minutes for expansion phases and a couple of hours for total length. However, their results indicated no preferred period of $B_{Z}$ (no break in the power spectrum) and no preferred period of substorms (no break in the spectrum below roughly five hours). The authors suggest a number of potential explanations as to why the AE index power diminishes with increasing $B_{Z}$, citing saturation mechanisms, but do not draw firm conclusions.

Research to resolve the question of whether the magnetosphere is an SOC system was reinvigorated, perhaps as a result of the Klimas review, employing power law techniques as the prominent mechanism of investigation.

Consolini (1997) (and later extending the statistics in Consolini 2002) used the AE index and power law distributions to attempt to explain the intermittent nature of the magnetospheric dynamics (e.g., the rapid fluctuations in the AE index even in periods when the solar wind parameters were relatively smooth). They evaluated the distribution function for the intermittent burst behavior of the index. Defining the quiet time AE index level as $L_{AE} = \left [45\pm 15\right ]$ nT, they derived the strength of a burst as $s=\int _{\Omega} \left (AE(t)-L_{AE}\right )dt$, where $\Omega $ is the time interval over which the AE index is greater than the quiet time level. The distribution ($D(s)$) for a period of AE data covering around 3000 burst events was found to follow the power law form $s^{-\tau}$ with $\tau \sim 1$ over more than four decades ($10^{1} - 10^{5}$ nT⋅minute). They suggested an interpretation as an absence of characteristic length or time for the magnetospheric system. Their result related the magnetosphere to the clear demonstration that SOC systems exhibit a spontaneous organization towards something like a dynamical equilibrium. They extended the analysis to study the second important component of these systems that there are often various scaling regimes when the system is not quasistatically driven. To determine the relevance of the magnetosphere as an SOC system they explored the presence of a $1/f$ regime. They constructed the distribution of the full power spectral density (PSD) of the AE index, corroborating Tsurutani et al.’s results that the PSD divides into two power law regimes ($1/f$ and $1/f^{1.89}$) with a spectral break around $5.5\times 10^{-5}$. The low frequency regime represents the random superposition of single burst events while the high frequency regime is the result of interaction among the bursts and is the regime of the SOC state of the magnetosphere. The high frequency regime is associated with minutes to hours time scales of magnetosphere dynamics, which are those associated to magnetic storms and substorms.

Consolini (2002) reifies many of the points made by Consolini (1997), recentering the questions of whether the magnetosphere is an SOC system, what evidence there is, and what can be measured to resolve the hypothesis. They note that magnetohydrodynamic (MHD) modeling is incapable of describing the highly intermittent and multifractal character of the magnetospheric dynamics during magnetic substorms and storms. As a result of the lack of a physical model through which to study these dynamics, there has been perhaps an over-reliance on the most readily available information: the AE index. They well-capture the background on this reliance. However, the index remained the best available diagnostic, and they used it to extend the statistics of Consolini (1997) examining the burst size power and burst lifetime distributions of AE, with the goal to discriminate the impulsive dissipative events in the AE index from the enhancements that result due to convection (Kamide et al. (1999) directly-driven and unloading modes of the magnetosphere). They found power law scaling in the power distribution functions of the form $D(x) \approx x^{\tau}$ with exponents 1.0 $\leq \tau \leq $ 1.5 ($\tau = [1.35 \pm 0.06]$, and $\tau = [1.5 \pm 0.1]$ for the burst size and lifetime distributions, respectively). These relationships were shown to hold over four and two decades, respectively, falling off only once the magnitude of the burst size or lifetime exceed points where the method is likely able to distinguish between the impulsive unloading and the convective dynamical modes of the AE index. The main result of the analyses is that AE index time behavior seems to be the possible occurrence of criticality in the Earth’s magnetospheric dynamics, again meaning that no characteristic scale or time represents the magnetotail dynamics and instead scale-free behavior is exhibited. These findings support previous ones (e.g., Tsurutani et al. 1990) that the magnetotail may be an open, dissipative dynamical system at a critical state.

Further, their analysis of the power spectral density (PSD) function of the AE index data revealed two distinct regions characterized by scaling exponents of −2 at high frequencies and −1 at low frequencies with a spectral break at f∼70 μHz. To clarify the origin of the 1/f-noise region at lower frequencies, the authors explore the relationship between the AE index and simultaneous solar wind parameters, attempting to unravel the how the solar wind behavior may be driving the magnetospheric response from the magnetosphere itself as an SOC system. The work to understand the relationship between the solar wind parameters and the auroral indices is more conclusively taken up in Freeman et al. (2000), reviewed below. A final word of significance from this study. They issued a key warning that is prescient of future directions in Complexity Heliophysics: “to find scale-free distribution functions does not mean that the system is in a self-organized critical state. As a matter of fact, while SOC systems display scale-free distribution functions, many other physical mechanisms may produce scale-invariant distributions. In order to address this issue, we must investigate in great detail the physical mechanisms of this scale-free avalanche process in the magnetotail dynamics.”

Tsurutani and Consolini et al. provided strong evidence through broken power law forms of the power spectrum of AE and then its burst lifetime and size distribution that the magnetosphere behaves as a system near its critical point.

If the magnetosphere is a system near its critical point, it challenges the ability to predict its evolution as those dynamics are random. However, a system near its critical point may be confined to a sub-space characterized by a few dimensions and therefore could be well represented by a few parameters. Chang (1999) review the concepts and mathematical techniques for examining the deterministic chaos of low-dimensional nonlinear systems with fractal characteristics for the magtnetosphere. Chang begins from the idea that systems near critical configurations may exhibit low dimensionality: a dynamical system connected to a reduced number of relevant parameters. In the Heliophysics context this is a possible framework for the explanation of bursty bulk flows, low-dimensionality, and power law magnetic field spectra in the magnetotail, postulating the magnetotail to be an open, dissipative dynamical system near “forced- and/or self-organized criticality” (FSOC) Consolini (2002). Readers will recognize their postulate–it is a foundation of the arguments of Valdivia et al. (2005) (reviewed at the end of Sect. 4 above) in describing the magnetosphere using SOC. The magnetotail plasma being near the point of criticality and causing a substorm onset is like a fluid being at the critical point for equilibrium liquid/gas phase transitions–both are FSOC systems. They first emphasize that the magnetosphere is inherently multiscale and place focus on the mathematical tools to address the interplay of the kinetic, intermediate, and magnetohydrodynamic (MHD) scale fluctuations. In describing the merging of coherent magnetic structures in the magnetotail, they were perhaps the first to suggest that this process is the explanation of “bursty bulk flows,” a topic that remains an active area of research and more recent work seems to corroborate (Gabrielse et al. 2014; Wiltberger et al. 2015; Merkin et al. 2019). Of further note is that they open the possibility that the magnetic reconnection events leading to bursty bulk flows likely come from many if not all of the suggested microscopic instability mechanisms such as the collisionless tearing instability, or the cross-field two-stream instability. In such nonlinear systems there are likely overlapping and interacting mechanisms driving observed behavior and suggest multiple explanations rather than attribution to a single cause. Complexity thinking is open to multiple explanations over the traditional central and singular explanation.

The Chang paper is a good introduction to the nontraditional mathematical techniques of dynamic merging of coherent structures, nonclassical nonlinear instability, path integrals, the theory of the renormalization-group, low-dimensional chaos, self-similarity and scaling, fractals, coarse-grained helicity and symmetry breaking and an excellent complement to this review. An important contribution is the introduction of powerful techniques for quantitatively and computationally studying dynamical systems far from equilibrium, such as the renormalization-group transformation procedure. Each of the ideas in the paper play an important role in Complexity Heliophysics. Coarse-graining (the representation of a physical system in which some of the fine-grained structure has been smoothed over without introducing external details, or in other words remaining true to the microscopic details^{Footnote 5}), especially, emerges from this review as a central concept in Complexity Heliophysics. A rigorous understanding of coarse-graining is important to build compact representations of a system and thus to bridge the gap from complexity research in Heliophysics to decision-making based on the knowledge, which we further develop in Sect. 6.

Subsequent to the knowledge of broken power laws in the auroral indices and the enumeration of the techniques to study them, particularly in defining measures of energy bursts (Consolini 2002), were numerous studies that identified additional characteristics or anomalies in the distributions. Power laws still governed the distributions of burst magnitude and duration, but Consolini (in submitted work in the year 1999 that was not published entitled “Avalanches, scaling and 1/f noise in magnetospheric dynamics”) identified small ‘bumps’ with characteristic values of magnitude and duration and suggested that a better fit to these distributions was a power law with an exponential cut-off plus a lognormal distribution. Freeman et al. (2000), studying AU and |AL| data 1978-1988 paired with additional observations from the WIND satellite’s (Szabo 2014) particle and magnetic field instruments between January 1995 and December 1998, showed a power law component of the burst lifetime distribution ($P(T)$) in two measures of solar wind-magnetosphere coupling, Akasofu’s epsilon and velocity∗southward IMF ($vB_{s}$), but absent a bump. This close correspondence is illustrated in Fig. 7, comparing curves for AU, |AL|, $vB_{s}$, and $\epsilon $.

They found that AU and |AL| distributions were fit well by power law with exponential cut-off plus lognormal distributions, and that there was no evidence of the log normal component in the solar wind variables ($vB_{s}$ and $\epsilon $). They examined each component of the AU and |AL| distributions and described the physical implication. First for the power law component: The power law exponent of the solar wind variables did not significantly differ from those of the AU and |AL| indices. This similarity of the input component (solar wind) to the output component (AU and |AL|) points to the system being ‘directly driven’ (quasilinear relationship) by the solar wind at short (∼20 minutes) time lags. The similarity between AU and |AL| points to the fact that this component of the magnetospheric output acts throughout the auroral oval because the AU and AL indices rely on contributing magnetometers from different local times. Finally, because of similarity to the solar wind and the global distribution of these relationships, this power law burst lifetime component in the AE indices may be attributable to the Disturbance Polar type 2 (DP2) convection electrojets (Nishida et al. 1966). Next, the lognormal component: It was most prominent in the |AL| index for which the contributing magnetometers are concentrated in the post-midnight sector and acted over the characteristic magnetospheric timescale of 2-5 hours. Both were considered evidence that the lognormal lifetime component is the substorm unloading component associated with the DP1 electrojet, the ‘unloading’ current system (Obayashi and Nishida 1968). This component is not scale-free, but rather does have a characteristic timescale (2-5 hours).

The work of Freeman et al. extended the exploration of the connection between the solar wind driver and magnetospheric response distributions (i.e., relating the spectral density of the output of the magnetosphere (e.g., AU and AE) to the input/driver (e.g., $vB_{s}$)) as a means to understand the magnetosphere’s dynamical nature. A key conclusion was that the scale-free burst lifetime of AE is not conclusive evidence that the magnetosphere is an SOC system, and that additional observations are needed. This motivated work that utilized network analysis and imagery data to attempt to unravel what could not be unraveled from the time series alone. Their key comment that would lead to seminal work in the following years is, “...whilst scale-free behaviour in the system output is a feature of SOC systems, recent SOC models have been developed in which the scale-free behaviour is in the local or internal system output and not in the global or system wide output measured by the AE indices (Chapman et al. 1998). Thus in order to assess whether these models are an appropriate description of the magnetosphere, attention should turn to other observables that include spatially localised as well as global phenomena.”

Chapman and Watkins (2000) took up the uncertainty in many of the previous authors’ minds about the inability of the AE index to unambiguously determine the type of dynamical system that characterizes the magnetospheric behavior. They identify three main classes of ‘SOC’ system relevant to the magnetosphere: 1) the original definition of Bak et al.; 2) forced SOC (F-SOC, Chang 1992a); and 3) a phenomenological definition based on observation of some or all of a set of possible SOC diagnostics (e.g., bursty time series, 1/f power spectra, avalanche distributions). Responding to previous observational evidence and inquiry, they ask how one can reconcile the low dimensionality observed in the magnetosphere with the observed robust bursty evolution. Key is the knowledge that systems at criticality can also be low dimensional, and that therefore SOC as well as competing explanations for these systems must be distinguished by other means than dimensionality. For instance, avalanche models (strictly SOC models) must have fixed points around which the low dimensionality is observed. They argue that the complication stems from distinguishing SOC from SOC-like, “it is critical to understand to what extent measures of the system dynamics such as auroral indices also measure the solar wind driver directly and hence to quantify their appropriateness for such studies.” Systems that appear to exhibit SOC through some parameters used to proxy the state (e.g., the AE indices) may not be able to distinguish between a system with an internal attractor (SOC) from those that are driven to some state (F-SOC or SOC-like). The authors attempt to establish whether the idealized SOC state is in fact needed to account for the observed burstiness, self similarity and low dimensionality associated with magnetospheric dynamics.

In exploring this distinction they elucidate the descriptions available for turbulent and other high-dimensional systems related to the different classes of SOC description. These are numerical models available to study avalanching and intermittency, including avalanche models and Coupled Map Lattice (CML) (Kaneko 1993), which is an approach that consists of decomposing the processes underlying the phenomena of interest into potentially nonlinear independent components (e.g., convection, diffusion), and then reducing each of these to simple parallel dynamics on a lattice. They present one result in an attempt to explain how systems at criticality can also be low dimensional, an attempt to understand the fact that observational evidence exists for low dimensionality in the dynamic magnetosphere while bursty evolution is also robust. Importantly, avalanche (sandpile) models, “...have robust emergent phenomenology that produces bursty time evolution with power law burst statistics as required but these systems are by construction high dimensional, in the same sense as CML. If in addition these systems exhibit fixed points, then close to the fixed points, that is, close to criticality, the behavior is low dimensional.” So for avalanche models to explain the magnetospheric observations they must have fixed points. They then demonstrate that avalanche models can be altered to exhibit low-dimensional behavior by introducing a ‘fluidisation parameter,’ $L_{f}$, which is a fixed distance behind the leading edge of an avalanche that is flattened back, effectively moving the system away from a repulsive fixed point. Figure 8 is a reproduction of their Fig. 1 that reveals behavior of an avalanche model with varying $L_{f}$. Their model is one in which sand is redistributed when a critical gradient is exceeded locally. Redistribution in their model occurs across all sites within an ongoing avalanche by construction. The fluidisation parameter has the effect of flattening back the sand behind the leading edge of an ongoing avalanche for a fixed distance, $L_{f}$. They illustrate the central point that, under certain conditions, an originally high-dimensional sandpile model can exhibit low dimensional dynamics.

When $L_{f}$ is on the order of the system size, the behavior is that of the sandpile model–evolution is bursty and burst statistics are power law (Fig. 8a). Reducing $L_{f}$ significantly, the evolution becomes quasiregular, exhibiting a distinct loading-unloading cycle (Fig. 8b). Statistics in this case are power law only over a restricted range. Thus, by changing certain conditions of the avalanche model, the high-dimensional sandpile model can exhibit low-dimensional dynamics. The implication for systems, the magnetosphere for instance, is that low dimensionality can signify a system close to criticality or certain classes of avalanching systems whose specific parameters produce either intermittent, or quasiregular, time evolution.

The complication of distinguishing SOC from SOC-like reveals the importance of understanding to what extent measures of the system dynamics such as auroral indices also measure the solar wind driver directly. Systems that appear to exhibit SOC through e.g., the AE indices as proxies for the state may not be able to distinguish between a system with an internal attractor (SOC) from those that are driven to some state (Forced-SOC or SOC-like). Chapman et al. ultimately conclude that auroral indices are not effective at distinguishing the internal dynamics of the magnetosphere from that of the intermittent solar wind driver. The statement from the article that resounds across Complexity Heliophysics is, “Of principal concern in the magnetosphere is the variability of the driver and the extent to which any given observable yields the output of the system, the system’s internal dynamics, or a mix of these with the driver superimposed.” Raising the implications of this paper to a broader level, the authors write that dealing with real observations exhibits complications that one must be aware of in studying the phenomenology of SOC for a given dynamical system (e.g., the magnetosphere).

Encompassing works surrounding the beginning of the new millennium, Consolini (2002) called SOC a new paradigm for magnetospheric understanding, implicitly naming power law analyses a vital diagnostic.

Thus, numerous studies identify power law behavior as evidence of self-organized criticality in the internal magnetospheric dynamics (especially in connection with phenomena like substorms based on the AE index spectrum), though ‘the observed characteristics of the spectrum are also amenable to alternative interpretations’ (Chang 1999), as will be seen in Lui et al. (2000) and Uritsky and Pudovkin (1998). The openness of interpretation, the advent of computational capability, and the availability of observations from new missions and sensors spurred investigation of potentially richer and less ambiguous data, such as imagery.

Riley (2012) explored distributions for phenomena occurring across the solar-terrestrial system: solar flares, speed of coronal mass ejections, Dst index, and > 30 MeV proton fluences as inferred from nitrate records, for the purpose of estimating the likelihood of rare extreme events. Their method is that of extrapolation, assuming that the range over which the events are well observed can be reliably extended to regimes where they are rarely, if ever, observed. They argue that power laws represent the phenomena studied and from which extrapolation of probabilities is trivial. Their empirical data meet the criteria they spell out for power law distributions and they are able to estimate likelihoods of space weather events not observed in the space age in the next decade.

5.2 From Time Series to Imagery

Like Riley (2012), some studies have looked beyond the AE indices and their intrinsic limitations for observables/data to understand the magnetospheric, and ionospheric, complexity. A predominant source of observation for magnetospheric output is auroral optical activity or imagery. Imagery is a more direct measure of energy output from the magnetosphere than ground-based indices (Lui et al. 2000). The intensity, color, and location of the aurora contain information about the magetospheric particles that cause them. The size, shape, and extent of the auroral region enables inference about the size and shape of the magnetosphere and the fluctuations in the solar wind driver. These data provide capabilities that indices or in-situ observations do not, e.g., observing over large spatial areas of the high-latitudes, but also require additional care in preparing and interpreting the data as we will see. There is rich and wide literature on the use of auroral imagery to study the magnetosphere and geospace. We will focus on those studies that have used these data to inquire about the nature of magnetospheric dynamics specifically.

Lui et al. (2000) is one of the early examples of examining auroral imagery data to infer the complex adaptive behavior of the magnetosphere. Motivating their study was the seeming distinct behavior of internal high-dimensional plasmasheet dynamics (e.g., burstiness) that drive small-scale auroral structures and global dynamics (storms and substorms) and the fact that the causal dynamics of the two modes are not understood. A number of studies demonstrated that these modes are inherently connected (Chang 1992a,b, 1998, 1999; Consolini and Chang 2001; Klimas et al. 2000a). Principle among them and already discussed above is Chang (1999) who discussed forced self-organized criticality whereby global dynamics can be a consequence of high-dimensional SOC behavior and the high-dimensional plasmasheet behavior (driving small-scale aurora) can be an artifact of the global loading-unloading magnetospheric dynamics.

Lui et al. attempted to use auroral imagery to monitor the total energy output of the magnetosphere across scales (small auroral arc-like, and global). They produced the first probability distributions of the power and spatial size of the magnetospheric energy output across scales from auroral emission regions. To determine the distribution of power and sizes of these auroral activity region they identified individual auroral blobs in the ultraviolet (UV) imager on the Polar spacecraft (Torr et al. 1995), calculated the power in the intensity of the blob in the image (Germany et al. 1997) and the area of each blob, and compiled these values into statistics. Each image was manually inspected and classified as substorm or non-substorm to separately examine the statistics for these global and non-global cases. The distributions are shown in Fig. 9 (Fig. 3 reproduced from Lui et al. 2000).

Quiet time distributions (left column in Fig. 9) display power law behavior across roughly four decades of dissipation size and power. Similar power laws (with slopes matching those of the quiet times within uncertainties) are found for the substorm intervals, but a peak above $\sim 10^{5}\text{ km}^{2}$ and $\sim 5\times 10^{8}$ Watt exists in the size and power, respectively. They interpret these power law regions to mean that there is an ever-present component of auroral activity which exhibits the scale-free behavior of an avalanche system and that this behavior exists regardless of the presence or absence of substorms. They interpret the behavior that is independent of the level of activity as ‘bursty, internal (localized) relaxations of the system.’ On the other hand, the peaks noticed in the substorm intervals, but not in quiet times, are interpreted as global reconfigurations. Put another way, their interpretation was that there was a scale-free (power law) component of auroral activity that was always present (i.e., regardless of whether or not there is substorm activity), but that global events during substorm intervals superimpose on the scale-free behavior well-defined peaks in emitted power and size of emission regions. Their ultimate conclusion was that the magnetosphere acts as an avalanche system.

There remained questions about the peaks found by Lui et al. Prominent among them was whether the avalanching magnetospheric system could exhibit power law (scale-free) behavior in the energy due to internal relaxations/burstiness while having a characteristic mean in the energy released with global reconfiguration that scales with the system size (e.g., the global extent of magnetospheric activity) (Chapman et al. 1998). Uritsky et al. (2002) took up those questions and the idea that, “understanding the complexity in the magnetospheric behavior associated with critical phenomena appears to be necessary for a correct description of geomagnetic activity as a response to the solar wind driver,” suspecting that the absence of the temporal dimension might have contributed to the strange peaks. They extended Lui et al.’s method from a static spatial analysis to a spatio-temporal analysis using the same UV Imager data from Polar. This spatio-temporal representation was found to be vital to describe SOC dynamics in a strongly driven system (Watkins et al. 2015). It begs the question of why the spatio-temporal perspective is necessary? In avalanche models the main characteristics of an avalanche are its size and energy. These characteristics are found by integrating over both spatial and temporal coordinates. While in theoretical or laboratory avalanche experiments the driving can be closely controlled, this is not true of forcing in the real world. With a natural and uncontrollable source, the task to verify power law statistics requires more elaborate techniques to identify individual events. They cite two limiting cases permitting the use of laboratory SOC inference techniques (i.e., separating temporal and spatial domains) to physical systems: 1) the avalanching event lifetime is much shorter than the sampling time of the dataset and spatial propagation characteristics of the event are well known, then a purely spatial analysis is warranted; and 2) in the absence of spatial information but one can be certain that there are not multiple avalanches evolving simultaneously, then avalanche distributions can be calculated from time series of the output characteristics of the system. Neither case is true for the magnetosphere and auroral emissions. For instance, the low driving rate condition in effect requires that only one reconnection site in the plasmasheet be active at any one time given that the frequency of reconnection in the plasmasheet is high relative to the driving of the magnetosphere. However, it is well-established that there are often multiple reconnection sites over an extended spatial region (Angelopoulos et al. 1999). This leads to a key statement of the text, “Therefore, the low driving rate assumption is not satisfied in the magnetosphere and so the results of previously reported time series analyses related to the hypothesis of SOC in the magnetosphere (Consolini 1997; Takalo et al. 1999; Freeman et al. 2000; Uritsky et al. 2001) are insufficient for obtaining correct avalanche distributions in terms of a rigorous SOC approach. Moreover, since the lifetime of many auroral activations is longer than the sampling time of the Polar UVI image series, the static spatial analysis reported by Lui et al. (2000) is also inappropriate.”

The key to their spatio-temporal approach is to identify and distinguish multiple simultaneous avalanches (auroral emisions) and calculate their individual properties (Hwa 1992). They analyzed more than 30,000 Polar UVI images from January–February 1997 and January–February 1998 and largely underwent a similar preprocessing as the Lui et al. data. The sampling time of the images was 184 seconds, also including a period where the sampling occurred at a higher rate (37 seconds). They compiled statistics for active auroral regions that persisted longer than the sampling time and thus across two or more consecutive images, tracking that region through the images up to five hours. They treated events that split but had a unique source as a single event, and merged events with spatially distinct sources as separate events, following Becker et al. (1995). For each event, they calculated lifetime T, integrated size S, and integrated energy E as well as maximum active surface area A and maximum energy deposition rate W. Figure 10 reproduces their principal result.

The central finding is that, across UV images and a variety of solar wind conditions, they found no characteristic time, size, or energy scales within the entire available range of studied parameters. The auroral events exhibited well-defined power law statistics over a broad range of scales. They did not find the distribution peaks that Lui et al. reported, concluding that the peaks are an artifact of an incomplete avalanche detection methodology that missed the temporal component. Thus, auroral emissions exhibit statistical properties of avalanches in SOC models. In relating the magnetosphere dynamics through auroral observation and the behavior they observed, they were able to draw analogies to the avalanche model itself: auroral activations are the avalanches while reconnection events in the plasmasheet are the internal dissipation of SOC models.

In addition to addressing methodological issues that may have produced the peaks in Lui et al., Uritsky et al. contributed several other key findings, including:

The power law distribution, and thus SOC-like behavior, was exhibited over many orders of magnitude for the duration, power, and size of auroral activations and therefore the magnetosphere behaved as an SOC system across wide ranges of geomagnetic activity. It is worth noting that not all Heliophysics research claiming power law distributions cover the same observational range as Uritsky et al., exhibiting this kind of relationship over only a small number of orders of magnitude;
The dynamics comprising all levels of magnetospheric activity remains scale invariant; and
The magnetosphere as represented by the spatio-temporal evolution of auroral emissions operates in a self-organized state. The dynamics of auroral perturbations corresponds well to avalanche dynamics at criticality.

The lasting discovery is that one can expect cross-scale coupling effects to play a significant, if not crucial, role in the development of geomagnetic disturbances and that large-scale properties of the magnetotail plasma sheet depend critically on the statistical hierarchy of small- and intermediate-scale perturbations associated with sporadic localized magnetic reconnections, current sheet disruptions, and other localized plasma instabilities (Klimas et al. 2000b) A question raised by this work for the community is whether statistics of mesoscale magnetosphere simulations match observed statistics from the SOC paradigm.

A general comment can be made from the Lui-to-Uritsky development: the spatio-temporal domain is required for identifying SOC dynamics in a strongly driven system. To make appropriate comparison, identification of the auroral activations had to be conducted in spatio-temporal space. Uritsky et al. found that the spatial-only analysis of Lui et al. produced ‘bumps’ in the power law distribution that were entirely methodological and were incorrectly interpreted to be unique behavior during periods of high perturbation.

In order to truly understand the system, data across the full spectrum of system activity were critical.

Following the demonstration of the importance of the spatio-temporal perspective in concert with the advent of imaging platforms, Complexity Heliophysics began to use imagery data more regularly, a trend that persists into the 2020s and is expected to continue (Kozelov et al. 2004; Uritsky et al. 2007; Golovchanskaya et al. 2008; Klimas et al. 2010; Aschwanden et al. 2014; Longden et al. 2014).

Indeed the NASA mission intended to resolve longstanding fundamental questions about the nature of substorms, the Time History of Events and Macroscale Interactions during Substorms (THEMIS), included All Sky Imagers (ASIs) in the ground observatories that accompanied the magnetospheric spacecraft (Donovan et al. 2006). Imagery, as well as multi-modal observational systems, will continue to be a vital component of unraveling the complexity of the solar wind-magnetosphere-ionosphere system, and the capabilities of these systems will grow (e.g., see the University of Calgary’s Transition Region Explorer (TREx) sensor web (Spanswick et al. 2018)).

Of course, imagery data have relatively shorter histories and pose their own processing and analysis challenges, so time series analyses remain important sources of new complexity studies (Chapman et al. 2004; Consolini et al. 2008; Borovsky and Osmane 2019; Meng and Verkhoglyadova 2021). Prominent among the continuing body of research are results related to the complexity of the geospace system made possible by the Swarm mission, which we do not provide a detailed review of in this manuscript but point readers to several important works (de Michelis et al. 2015; Papadimitriou et al. 2020, and references therein). As far as processing challenges, more recently groups have been bringing tools from artificial intelligence and machine learning (AI/ML) to bear (on auroral imagery Syrjäsuo and Donovan 2002, 2004; Clausen and Nickisch 2018; Kvammen et al. 2020; Nanjo et al. 2022 and solar imagery Galvez et al. 2019; Armstrong and Fletcher 2019; Upendran et al. 2020; Brown et al. 2022). Section 7.1 discusses the intersection between complexity science and AI/ML, positioning it as a key challenge for 21st century Heliophysics and indeed all of science.

6 Emerging Literature: Topics and Trends

The following section examines emerging literature (largely drawn from publications after the year 2010) and extracts topics and trends. These perceived topics and trends are organized by section. This section is a departure from the previous ones in that along with the literature review, subjective assessment of areas that might be important to Heliophysics in the coming years are provided. These could be interpreted as predictions and should thus be treated with a degree of uncertainty or openness to interpretation. We also draw extensively from complexity science literature outside of the field of Heliophysics to establish trends.

6.1 Metrics and Diagnostics of Complexity

The first observation is that the complexity paradigm in Heliophysics is shared across disciplines (physics, in general, biology, social sciences, etc.), and understanding the topics common to each of these versions of the complexity paradigm informs the future research avenues for each of them. The first has to do with how we quantify and make legible complexity. The means of ‘metric-ing’ complexity will be a good transition into this section as it will call back to themes of the literature review above and point to trends we identify below.

There is no single metric of complexity, just as there is no single metric suitable to understand the capability of a model. Lloyd (2001) gives three dimensions along which to measure complexity:

How hard is it to describe?
How hard is it to create?
What is its degree of organization?

Many metrics have been proposed for assessing and quantifying a complex system (see Lloyd 2001 for an informative, but non-exhaustive enumeration), and the list continues to grow. A complex system is by definition multi-faceted. No single measure could describe it adequately. Complexity pioneer and Nobel laureate physicist, Murray Gell-Mann, noted as much “A variety of different measures would be required to capture all our intuitive ideas about what is meant by complexity” Gell-Mann (1995). Complexity is a collection of features, not a single phenomenon (Ladyman et al. 2020). Therefore, a science of complexity must understand the various metrics available to quantify aspects of complexity, their capabilities and shortcomings. We have already encountered numerous measures of complexity in the literature review above, namely self-organization and power laws. Here we will review several other measures, filtered for inclusion based on their relevance to Heliophysics. The literature suggests that the degree of organization dimension perhaps dominates in the domain. We will not address the more computational/computer science metrics such as logical depth and algorithmic complexity. Mitchell (2009) (Chap. 7) provides a more general development of the topic of complexity metrics.

The most basic complexity metric is numerosity, or counting entities and interactions between them. Numerosity is common to all science. Next are measures of order/disorder in a system. Disorder is mathematically represented through probability distributions and their measures of dispersion such as variance. Related to these measures of disorder and one of the important core measures of complexity science is Shannon entropy, measuring the amount of uncertainty in a probability distribution (Shannon 1948):

$$ H(X) \equiv - \sum _{x \in \mathcal{X}} P(x)logP(x), $$

(4)

where $X$ is a random variable with probability distribution $P$ over events $x$. Shannon entropy quantifies the difficulty of predicting an actual outcome given possible outcomes ($x$) or, in a temporal context, predicting future outcomes given past events ($x$). Shannon entropy is inextricable from uncertainty quantification. In some domains, ‘diversity’ is used as opposed to disorder. Shannon entropy is a part of most measures of diversity in these domains (e.g., ecology) (Page 2011). Numerosity and entropy allude to statistical physics, more generally, as an approach to quantifying complexity (López-Ruiz et al. 1995; Sethna 2021).

The next feature of complex systems that requires measurement is feedback. Feedbacks are loops of interactions across a system. Feedbacks are created when a change in a component of a system affects the rate of change of that same component (Meadows and Wright 2008). There is no measure of feedback. However, feedbacks support the persistence or disappearance of a behavior over time such that their signatures are present in the system. In the computational fields, feedbacks are defined by outputs of a process being put back into an input of the same process. Feedbacks reveal themselves in physical systems through patterns, structures, and nonlinearities such that measures to quantify the effects of feedback are those of structure formation and nonlinearity. Fractals themselves, well studied in Heliophysics as indicated by the volume of publications into their statistical signature–the power law–in the solar-terrestrial system, are a result of repeating a simple process over and over in a feedback loop.

A widely used computational tool for studying feedback is the agent-based model, a simulation of collections of ‘agents’ of entities of the system in which the agents follow certain rules for interactions and their evolution is studied. The closest example in Heliophysics are test particle simulations (e.g., Sorathia et al. 2017). Attempts to understand the societal impact of Heliophysics research, i.e., space weather, might be an arena for agent-based models (ABMs) of human behavior that might help understand preparedness to respond to space weather storms, especially in the context of potential compounding effects such as terrestrial weather or related system failure (McGranaghan et al. 2022). This would be one way to integrate Heliophysics systems models with human behavior models. ABMs, capaciously defined, is an intriguing future direction for Heliophysics and space weather sciences.

The results of ABMs are multidimensional data characterized by interactions between agents. Their structure is inherently a graph or a network. Therefore, making sense of those data depends on graph theory, which enjoy well-defined and mathematically rigorous formalisms. Section 6.4 introduces graphs and discusses important quantitative measures to understand them. In short, graph theory provides a way to study the geometry and evolution of a network through centrality measures, community structure, and modularity (Newman 2010). These measures offer deeper insight into a system and should be considered core metrics of complexity.

Finally, among the most essential ideas in complexity science is self-organization, that order can spontaneously arise from many uncoordinated interactions (Ladyman et al. 2020). Self-organization is measured through the order of the system. This is perhaps the measure best explored in Heliophysics to this point. Correlation measures, from linear Pearson correlation to covariance to information theoretic calculations like mutual information and transfer entropy, reveal order in a system. Nonlinear order is often studied with power law relationships.

A way of measuring complex systems that has not been as widely explored is robustness and resilience. Robustness refers to the ability of a system to maintain its structure or function in the presence of perturbation. Sans the requirement that structure is maintained, resilience refers to the property of a system to accommodate changes and reorganize itself while maintaining the crucial attributes that give the system its unique characteristics (Scheffer et al. 2001). Tools to study robustness and resilience include dynamical systems theory and theories of phase transitions (Ladyman et al. 2020), such as stability analysis (Demirel and Gerbaud 2019), critical slowing down (Scheffer et al. 2009), and tipping points (Scheffer 2009). Resilience offers a way that decisions can be made based on complex systems understanding, perhaps permitting a new framework for bridging research and operations in Heliophysics and Space Weather.

The measures identified above have in some manner been used in Heliophysics. However, the metrology of complexity is a young field, and therefore rapidly changing (Wood 1986; Lloyd 2001; Zurek 1990; Gregersen 2002; Mitchell 2009; Krakauer 2018). Paying attention to the advent of complexity metrics might lead our research to tools for better representing the phenomena we observe.

The lesson from this brief enumeration of measures of complexity is that the multi-feature nature of Heliophysics now requires multi-faceted approaches to measurement and evaluation. The geospace community more recently has recognized this, advocating (Liemohn et al. 2018, 2021) and providing frameworks (McGranaghan et al. 2021c) for more robust evaluation of our models across numerous metrics and levels. This review suggests that a similar approach must be taken for future work to quantitatively evaluate complexity in the system.

6.2 Coarse-Graining

Chang and Wu (2007) describe ‘coherent structures’ as a characteristic of dynamical complexity, phenomena that result from the nonlinear interactions of their constituent parts and are dramatically different than the behavior of those parts. The truism alluded to is that the whole is more than the sum of the parts. What the authors describe explicitly is present across the complexity Heliophysics literature, albeit often implicit and unnamed: there is a process of ‘coarse-graining’ to identify the relevant macrostates of a system. Flack (2017) defines a coarse-grained description of a system as one in which some of the fine microscopic behavior of a physical system has been smoothed over. She emphasizes that this is a principled smoothing, not arbitrarily reducing the granularity, but instead based on whether information remains important to a descriptive or predictive task at hand and does not introduce outside information. She writes that this is a ‘lossy, but true’ process. Coarse-graining is a process of integrating over parts, results in a compact representation of a system, and provides the basis for an effective theory. The preeminent example is temperature: a macroscopic description of a fluid that is an integration of microscopic behavior of particles. Averages are but one method of coarse-graining and there are many more, some much more complicated. The concept of coarse-graining is relevant across domains. In physics, renormalization theory is one remarkable example (Gell-Mann and Low 1954). As early as the 1970s in biology, coarse-graining has been used in molecular modeling of biomolecules (Levitt and Warshel 1975). Even in art it has a long history in drawing attention to the scale at which one witnesses the world. The artist Piet Mondrian painted series of representations of trees, displaying the trees as increasingly geometric and abstract until the tree itself could scarcely be recognized (Coppes and Jansen 2022). His work, like representations in science, explore the inherent patterning and ordering of nature. The prevalence of coarse-graining in science and society suggests an essential role of this process and that a review of complexity in a domain of science should address.

Despite the utility of macroscopic effective theories like thermodynamics and statistical physics, there exists long-standing debate in physics and biology about the right level at which to describe a system, or in how far ‘down’ we need to go. Associated with this debate are questions related to mappings between microscopic to macroscopic states. Though much of this debate has been staged in fundamental physics and biological domains, it speaks to unresolved issues in Heliophysics (Lui 2001; Denton et al. 2016; Denton 2021; Viall and Borovsky 2020), conversations punctuated by a perceived need to reconcile particle-level behavior with system-level phenomena and multiscale quandaries. The debate has shown up in contrasting modeling approaches, e.g., particle-in-cell and magnetohydrodynamic ways of describing the solar and magnetospheric systems. On the multiscale understanding side, questions abound about the ‘right’ level at which to look at the system and how to study the relationships between scales. The literature suggests that what is needed are hybrid models (e.g., note progress made in unifying particle and magnetohydrodynamic modeling (Sorathia et al. 2017)) and new methods for conducting multiscale analyses (e.g., explore the advances made in comparing scales and studying relationships between scales (McGranaghan et al. 2017b; Nishimura et al. 2021; Consolini et al. 2021; Nishimura et al. 2022)). It is quite possible that multiscale understanding and ideas about the right level at which to represent the Heliophysics system will bear on the outstanding desire to bridge the gap between research and operations (McGranaghan 2022). Research and operations function at different scales. Where research may need to look at the finest scales technologically possible to advance the boundaries of knowledge, operations requires an efficient representation of the adequate knowledge to make a decision. In some sense, the research to operations (and operations to research) gap is the problem of developing an effective theory or a compact representation that permits moving between scales. Presaging a discussion that concludes this review, this is the same tension that exists between scientific understanding and prediction. These separate regimes of knowledge discovery and science dictate concomitant separate regimes of representation. In Sect. 7.1 we center this discussion in terms of fundamental science vs. prediction-oriented science (e.g., basic science vs. applied science; physics-based modeling vs. artificial intelligence/machine learning) and suggest this is inextricable from the future of Heliophysics research.

In the sense that coarse-graining is capturing underyling structure and pattern in complex systems, two forms of coarse-graining are particularly important to Heliohpysics: information theory and network science. We discuss them in sequence next.

6.3 Disentangling Drivers and Parameters Amidst Nonlinearities: Information Theory

Like coarse-graining, information theory is often used to simplify a complex system, to understand the more parsimonious description of its functioning. Indeed, in the context of entropy, the two concepts are quite similar. There is a growing body of research within Heliophysics that suggests that information theory is a useful form of coarse-graining for our field.

When a system’s drivers are nonlinearly correlated, and the parameters of the system that they effect are numerous, it is a challenge to untangle their relative effects. There are an immense number of frameworks with which to find and investigate causality relations among different time series (Runge et al. 2019); information theory has proven powerful for the detection of causal information flows in complex systems. Information theory provides a mathematical framework for quantifying nonlinear flow of information from drivers to system parameters and between system parameters. It assumes that a given domain of interest can be described by coupled subsystems that interact (exchange information) with one another. Information theoretic approaches then use a number of possible measures to extract the direction of the information flow and to infer causality.

Recent literature has revealed promise for information theory to provide observational constraints that can help guide the development of the theories and physics-based models and for feature selection to create more accurate data-driven models (Wing and Johnson 2019).

Stumpo et al. (2020) discusses a method for quantifying the strength and direction of the coupling between the solar wind and the magnetosphere-ionosphere system. The authors introduce a new measure of information transfer to the solar wind-magnetosphere-ionosphere domain called the Transfer Entropy (TE), which is useful for the nonlinear analysis of the relationship between two time series. TE is based on transition probabilities between two random processes, $X$ and $Y$, obtained by inserting the Markov condition into the conditioned Kullback-Leibler distance such that the information flow from $X$ to $Y$ is accounted for. TE is a quantitative way of determining if the past history of $X$ is predictive of the future $Y$. TE also provides a means to distinguish bidirectional information flow, thus providing evidence about feedback processes, mentioned above as a difficult thing to measure in complex systems. They show that TE is a useful measure for capturing relationships between the solar wind and magnetosphere. They showed a strong information transfer from the vertical component of the interplanetary magnetic field $B_{Z}$ into the geomagnetic indices, with time delays of about 30 to 60 minutes. Further, they inferred that substorms drive geomagnetic storms from a strong observed information flow from the AE index into the SYM-H index (analogous to the Dst index).

Similarly, Manshour et al. (2021) were interested in measures from information theory that could illuminate the relationship between the solar wind and magnetosphere-ionosphere system. Their measure was Granger causality. They found a tighter temporal relationship between $B_{Z}$ and the AE index, a delay of only 10 minutes, and a 30 minutes delay with SYM-H commensurate with Stumpo et al. (2020). They, however, did not find any relationship between AE and Sym-H, casting uncertainty on the claim that substorms might drive geomagnetic storms.

In addition to solar wind-magnetosphere coupling applications, two areas of Heliophysics have demonstrated the physical discovery potential of information theory: radiation belt dynamics (Wing et al. 2016) and solar cycle dynamics Wing et al. (2018). Researchers in the decade following 2010 have been applied the theory to understand the influence and the timing of the solar activity on the near-Earth environment, extending its utility to the space weather domain (Materassi et al. 2011).

A caution and a difficulty of information theoretic analyses is that they require robust support in the available data. Care must be exercised in assessing the data in numerous ways, including: 1) data sufficiency: the available data must be statistically representative of the system, sampling the full space; 2) relevant variables: the data must include all relevant variables that affect the transmission and reception of information in the system; and data diversity: the data must be diverse enough to capture the full range of possible transmission scenarios and conditions, which often requires integrating data from multiple platforms and sources.

Information theory is promising, but realizing its potential suggests that Heliophysicists develop or strengthen certain literacies such as probability, statistics, noise quantification, and systems science.

6.4 Network Science: A Future for Heliophysics and Space Weather

Networks surround us. Network structure, or a high number of dynamic interacting units, characterizes much of our society, from molecules that constitute biological organisms to our communication infrastructure like the internet to the power grid. Networks even describe the structure of how we interact with one another–social networks. We have been grappling with the prevalence of networks in our society for many years (Milgram 1967; Granovetter 1973; Erdos and Rényi 1984) and more recently recognizing their capacity to represent the complex world around us (Boccaletti et al. 2006; Watts and Strogatz 1998; Albert and Barabási 2002; Newman 2010).

The reason for this is that networks are how complex systems are represented (Torres et al. 2021, and references therein). Networks are the lingua franca of complexity, so to speak. The profound importance of this discovery is that network analysis rests on a well-defined domain of discrete mathematics known as graph theory (graph, in this context, is synonymous with network), dating back to Leonhard Euler’s solution to the Königsberg bridge problem (Biggs et al. 1986). Thus, if one can figure out how to encode a system as a network, then a robust domain of mathematics can be applied to study it and discovering important properties such as community structures, key nodes in the flow of information in the network, and the type of network that reveals properties of its functioning (e.g., random networks (Erdos and Rényi 1984), small-world networks (Watts and Strogatz 1998), and scale-free networks (Barabási 1999)).

First, a very brief primer on graphs and networks. We will hereafter use the term network to represent both–indeed they are synonymous, with the only difference being that some communities prefer graph (e.g., mathematics) while others prefer network (e.g., social sciences). Networks show connections between things. The ‘things’ are called nodes or vertices and can represent people as in a social network or any other definition of the entity for a domain. The connections are called edges and they represent a relationship between nodes. In a social network the vertices might indicate if two people know one another. The quantitative means that one chooses to define a vertex is central to the construction of the network.

Another key concept is the adjacency matrix. The adjacency matrix is a square matrix with rows and columns corresponding to every node in the network. The corresponding matrix element will be either 1 or 0 according to whether those nodes are connected or not. This is one form of visualizing or understanding the network. Other means to derive important sub-networks (collections of a few of the nodes of the full network and their connections) and aggregations of the network are available and important given the complexity that most real-world networks display. The purpose of these aggregations are to permit an understanding of the network topology or geometry.

Torres et al. (2021) provide a review of methods to encode a system as a network. Graph theory applied to complex systems, possessing irregular structure and dynamically evolving over time, gave rise to the term ‘complex network analysis.’ In this review, we use the parsimonious ‘network analysis’ to encompass network studies of any sort. The methods for encoding and subsequently analyzing systems as networks have been empirically, if nonlinearly and cobbled together from across disciplines, determined and were accelerated in the 20th century. The 1920s witnessed a sophistication of social network analysis–networks of relationships among social entities (e.g., trade between nations, communication between members of a group) (Milgram 1967). It has found more recent application in biological, engineering, and geophysical systems (Tsonis et al. 2006; Donges et al. 2009; Steinhaeuser et al. 2011; Malik et al. 2011).

The power of the network representation is that understanding and approaches from the mathematical field of graph theory can be used to explore and understand the networks. Indeed network measures exist to define the geometry and evolution of the network. Responding to Sect. 6.1, network measures are being used as new metrics for complexity, in effect means to aggregate the system in less lossy ways (e.g., better coarse-graining). One could consider three levels of measuring a network: 1) micro: the level of nodes and edges; 2) macro: distributions aggregating quantities; and 3) mesoscale: a large class in between (Porter et al. 2009).

At the micro level, one considers the nodes and edges individually or in small groups. A node’s degree is the number of edges it has. At the macro level, one aggregates the entire network and studies statistical distributions that attempt to describe it. Common macro measures include diameter (the length of the longest geodesic path between any pair of nodes in the network for which a path actually exists), average path length (average shortest path between all pairs of nodes in the network), degree distribution (the frequency distribution of the degree of the network nodes), and clustering coefficient (average probability that two neighbors of a vertex are themselves neighbors). Together, these are ways of understanding the geometry of a network, which is the complement of it’s size, connectivity, efficiency, and homogeneity/heterogeneity. The characteristics of the degree distribution, including its higher order moments, are a fundamental way that networks of different types and behavior are distinguished and it conveys information about the functioning of the system.

The mesoscale level covers everything in between. A topic that has received much attention in the network science literature is centrality, which is the attempt to quantify the most important or central nodes in a network. There are numerous different ways to think about and thus to calculate centrality, some quite simple like degree centrality (looking at the degree of a node with respect to others in the network), and others more involved and considering neighborhoods around a node such as eigenvector centrality, betweenness centrality, and closeness centrality (Newman 2010). Another mesoscale structure that receives much attention is the community (a group of nodes relatively densely connected to each other but sparsely connected to other dense groups in the network (Fortunato 2009; Porter et al. 2009)). Identifying communities in networks has significant implications for making discoveries about systems, though there is no consensus on technique for detecting them. It is perhaps in the mesoscale where undiscovered insight into Heliophysics systems and processes lays. Indeed, this is reflected in the Heliophysics network science literature reviewed below.

The ability to capture multiscale relationships and behavior using a network structure or representation provides an exciting opportunity to improve multiscale understanding of the Heliophysics system (McGranaghan et al. 2017c).

Beginning in the mid-2010s a few pioneering works began to realize the potential of network analysis for space physics, Heliophysics, and space weather applications. Traditional approaches to systems analyses in Heliophysics and space weather have attempted to track energy flow from the Sun to the Earth or other planet through a collection of time series. This approach does not generalize to multi-event studies nor to extracting statistics. Networks liberate the systems approach from the limitations inherent in this approach. They allow one to track energy flow and dynamic changes across a system for multiple events in a principled manner and to quantify them using well-established measures from graph theory and network analysis. Not only do network approaches reveal new parameters by which to understand the system, they enable quantifying the likelihood of those measures that can become the basis of a risk quantification system (Simpson et al. 2021) and ultimately understanding how to create technologies and society resilient to the threats of space weather. Thus, we believe it important to provide a brief review of those works here. We anticipate that this field will evolve rapidly in the coming years and intend to provide a basis for researchers to become oriented and trace some of the important history here.

Dods et al. (2015) applied network analysis to >200 distributed ground-based magnetometers that are indexed in the Super Magnetometer Initiative (SuperMAG) (Gjerloev 2009). Already a debate in the Heliohpysics community whether ground-based indices such as the AE and Disturbance storm-time (Dst), themselves aggregates of small numbers of ground-based magnetometers, were capable measures for understanding magnetospheric dynamics, Dods et al. studied whether a network representation of ground-based magnetometer data can quantitatively extend our qualitative understanding of magnetospheric substorms, creating the first application of network analysis to SuperMAG data and one of the early applications to space weather in general. The observations are vector magnetometer time series data at 1 minute cadence from the SuperMAG database (all stations from the northern hemisphere). Canonical correlation (Brillinger 2001) was used to establish similarity between the pair of vector time series as a function of time. To construct the network, they defined magnetometers as nodes and correlation above a threshold between the vector magnetometer time series from pairs of stations within a running time window as edges. Figure 11 reproduces their visual explanation of the network construction. They investigated four substorms. For each event, they form dynamical networks of connected stations in magnetic local time (MLT)-MLAT space in the Northern Hemisphere.

Importantly, the correlation threshold between any two stations might be different, so they created a global threshold that effectively normalizes the likelihood of being connected to the network. This is an important step of network analyses for Heliophysics and Space Weather applications where observations are distributed and baselines are commonly distinct.

They are able to then construct networks for any given event. Networks are reconstructed in the paper for four selected substorm events (defined according to Gjerloev and Hoffman 2014) and one steady magnetospheric convection (SMC) event. From each network dimensionless parameters were obtained that quantitatively characterize the network and by extension, the spatio-temporal dynamics of the substorm under observation. They found several typical signatures of the isolated substorm:

Before onset, the network has few connections;
Connectivity rapidly and clearly responds to the onset, characterized by high-latitude connections, but not without low- and cross-latitude connections;
In the recovery phase, connection structure switches from high-latitude dominant to low-latitude dominant; and
The normalized total number of connections and the average geodesic connection distance (physical distance) of the substorm period networks are greater than those for the SMC event and much greater than during quiet times.

Thus, they discovered that network responses to substorms, SMCs, and quiet times are quantitatively distinct. They conclude also that their technique may have applicability to other magnetospheric phenomena.

That supposition was examined by Dods et al. (2017), who apply the same methodology to the response of the quiet time (no substorms or storms) large-scale ionospheric transient equivalent currents to north-south and south-north IMF turnings. The calculation of the correlations between station pairs is identical to Dods et al. (2015) but they also map the network onto a regular grid and aggregate the network responses over more than 350 events (between 1998–2004) to obtain an averaged response as a function of geomagnetic location (MLT-MLAT) and of the time delay since the occurrence of the IMF north-south and south-north turnings.

For both north-south and south-north IMF turnings they examined short-range (station-pair connections with geodesic separation <4000 km) and long-range (>4000 km) connections. Their results indicate magnetometer correlation network responses are distinct for north-south and south-north turnings and the sign of IMF B$_{\textrm{Y}}$ component. Demonstrating the potential of network analysis, they provided new information on two competing concepts for reconfiguration of ionospheric currents in response to a change in the north-south component of the IMF: a fast initiation of the transient currents associated with ubiquitous and near-simultaneous response at the high-latitudes (e.g., Ridley et al. 1997) and a gradual reconfiguration spreading from an initial response on the dayside followed more gradually by the nightside (e.g., Lockwood et al. 1986). The network response shows near-simultaneous responses (∼8-10 minutes) between magnetopause impact and network response, consistent with Ridley et al. (1998), and references therein. They discuss tentative evidence for a two-step process: fast initiation of change in ionospheric equivalent currents between day and night followed by a more gradual reconfiguration that first appears on the dayside and then the nigthside.

Using spatial maps of the edges of the network, they found that turnings are associated with increases in connectivity (correlation) in the areas known to be associated with the two-cell convection pattern (Dungey 1961). Ultimately, this was one of the first studies to reveal that dynamic correlation networks can characterize the spatio-temporal ionospheric response observed in large numbers of ground-based magnetometers.

Orr et al. (2019) advanced the analysis of the spatio-temporal evolution of substorm ionospheric current systems in SuperMAG data with networks by introducing lags in the canonical cross correlation. Considering lags, rather than zero lag correlation as Dods et al. had done, permitted construction of a directed network that captured not only the formation of coherent patterns observed by magnetometers but also the direction of information propagation of those coherent structures. They used the directed networks to test different proposed methods for how the ionospheric current system evolves during a substorm.

To assess the direction of propagation, they divide the nightside auroral ionosphere (18 MLT to 6 MLT, passing through midnight; 60-75^∘ MLAT) into three zones of six hours MLT, a typical extent of the substorm current wedge (SCW) (Gjerloev et al. 2007).

They conclude that the magnetic perturbations are consistent with the SCW formation during substorm onset and westward expansion into a coherent current system in the premidnight (MLT) sector (see their Fig. 2). Subsequently, a coherent correlation pattern emerges that spans the entire nightside ionosphere.

Orr et al. (2021b) took the analysis a step further, taking advantage of a wider set of network science analysis techniques to understand the properties of a network. Namely, they studied community structure in the SuperMAG networks in which a community is defined consistently with Porter et al. (2009): an area of a network more densely connected to one another than to the rest of the network. They detect communities in SuperMAG networks across 41 isolated substorms with 1-min resolution data. Primary results are illustrated in Fig. 12.

In the networks, multiple discrete current systems exist prior to onset (see Fig. 12a-d) and progressively transition into a coherent SCW (see Fig. 12f-h), notably a transition to a coherent large-scale spatially extended structure rather than flux accumulation of incoherent small-scale wedgelets. The same pattern is observed across numerous algorithms for community detection. Thus, the SCW is a characteristic part of substorm evolution, potential resolution for long and ongoing controversy in substorm science. The spatially extended communities they observed cannot be obtained by having many, small, spatially localized wedgelets, which are internally correlated, but lack cross correlation with each other.

Of immense societal relevance for ground-based magnetometer observations of space weather activity are the corresponding potential hazard to grounded technology like the power grid. The threat to the power grid is quantified by geomagnetically induced currents (GIC). A pair of studies have explored network analysis for GICs, both providing insight into the GIC hazard and revealing network analysis as important for bridging from Heliophysics insight to space weather risk assessment and societal relevance. Hughes et al. (2022) produced networks connecting SuperMAG magnetometers to newly available GIC data collected by power utilities through the Electric Power Research Institute (EPRI) SUNBURST project. They calculate probability multipliers for all pairs of magnetometer-GIC sensors, information that would be useful to using magnetometer observations to determine risk the power grid. Overall, there is a factor of 1.83 increase in the GIC increase given magnetometer changes. On a sensor-to-sensor comparison, however, the magnetometers that provide the most information for a given GIC sensor are often not those in closest geographic proximity, meaning networks reveal non-intuitive relationships.

Orr et al. (2021a) used SuperMAG ground magnetic perturbation measurements as input to a model of the high-voltage (HV) power grid in the United Kingdom (UK), which output GICs at the grid transformers. They quantified the spatio-temporal response of the GICs in a manner similar to Orr et al. (2019). A number of conclusions were drawn, including:

the entire physical power grid is spanned by coherent connections with long-range correlations at intense storm times;
the GIC networks are not a simple response to the rate of change of the magnetic field;
during storms, networks have intermittent quiescent periods in which distinct sub-networks form; and
GIC networks are distinct from the physical networks of the HV grid, exhibiting characteristics unlike the exponential and small-world nature of the physical grid.

Their work offers a direct connection to space weather risk assessment: “The GIC response networks that we have determined here have significantly different properties to that of the physical HV grid. This is important since it implies that previous studies that focus on stability of the physical grid to the failure of individual network connections may not fully inform the assessment of space weather risk.”

Concurrently and drawing inspiration from Dods and Orr et al.’s revelations about the utility of networks in geospace science, McGranaghan et al. (2017c) applied network techniques to another important data set: total electron content (TEC). Global, high-latitude response of TEC is the result of numerous complex geospatial processes, each with unique spatial and temporal scales (Mendillo and Klobuchar 2006; Shim 2009; Emardson et al. 2013). Despite being rich with information about the Earth’s space environment, their characteristics at high latitudes are not well understood, and the complex nature of the processes in this regime requires innovative and sophisticated approaches to (1) understand the information content of these data and (2) gain the most scientific utility from them. These were motivation to attempt to understand the spatio-temporal characteristics of TEC in the high-latitude regime. In their application, nodes were defined by the magnetic coordinate system grid points (physical locations) and edges by spatio-temporal correlations exceeding a threshold between them. It was the first application of such techniques to TEC data. Their data are hourly averages of TEC data compiled from the worldwide system of ground-based GPS receivers binned into a geographic 1^∘ latitude × 1^∘ longitude grid (rebinned in this work into equal area magnetic coordinates). Data from winter and summer seasons in 2016 were used. Data were studied separately for the northern and southern hemispheres and all data were separated into interplanetary magnetic field (IMF) clock angle bins, the angle between geocentric solar magnetic (GSM) north and the projection of the IMF vector onto the GSM Y-Z plane, to determine dependence on the solar wind forcing. Figure 13 visually details the network construction steps.

Using predominately the network measures of degree centrality, median geodesic separation distance, and local clustering coefficient, their analyses suggested that the Northern Hemisphere exhibits correlations over shorter distances but is more spatially uniform while the Southern Hemisphere correlations typically extend over larger distances but that the hemisphere as a whole is more spatially fragmented. Their resultant maps indicate the scale sizes important to characterize the ionosphere during geomagnetic activity depend on season and hemisphere. The proof of concept study was exciting and pinpoints several ideas for follow-on inquiry.

These studies provide a framework through which to analyze the complex magnetosphere-ionosphere-thermosphere system free of the limiting assumption that phenomena can be described by interpolating distributed observations onto a grid. As Dods et al. (2017) write network analysis enables the characterization of, “... the spatio-temporal correlation pattern for [any] event directly from the spatially nonuniform original observations, then aggregate many such patterns onto a single grid to give a complete spatial coverage.”

How does network analysis relate to risk and resiliency? Risk is probability times likelihood. Risk is a distribution. In the terms of the insurance industry, hazard is actual cost incurred from a risk. To effectively quantify risk for space weather we must use the most informative parameters and calculate their likelihoods. The important parameters will be hazard-specific (e.g., the relevant parameters for power grid risk will be different from those for communications systems risk). Traditionally, and as a result of observational limitations, we have relied on indices like Dst and AE as the important parameters. Yet we know them to contain inadequacies (e.g., see Sect. 3 and the discussion of the use of AE to represent the magnetosphere). Network measures are potentially much more directly related to the physical phenomena relevant to a given risk and thus an exciting pathway to better risk quantification, upon which resilience is built.

Finding new applications for Heliohpysics is a trend that continues, particularly in connecting fundamental research with applied outcomes as evidenced by Orr et al. (2021a), Hughes et al. (2022).

6.5 The Role of Natural Language Processing

Natural Language Processing (NLP) is concerned with programming computers to process, analyze, and respond to large amounts of natural language data. The importance of augmenting human research and activities with automated analysis of text is now undeniable, given the sheer volume of relevant scientific literature. It is beyond the capacity of an individual researcher to understand all of the relevant scientific information about a topic. The growth rate in scientific literature is exacerbating and compounding the problem (Bornmann et al. 2020). For inter- or trans-disciplinary science, the kind that complexity demands, the amount of relevant information grows exponentially simply through the need to incorporate more than one domain’s knowledge. To some extent the history of disciplinary science has been to put up boundaries on the science one attempts to answer as a means of reducing the seemingly infinite amount of information that must be considered. Complexity science exposes those boundaries as artificial. Therefore, it requires new methods for handling more voluminous information. This review has already demonstrated some of those methods for data analysis, but NLP is vital for textual analyses.

There are many applications of NLP that have already or are poised to have an impact on scientific research and the process of science. Common tasks include, each applied to a given piece of natural language (e.g., a publication):

Named entity recognition (NER): identify and locate entities in unstructured text such as person, organization, location;
Information retrieval: searching for information contained in a document;
Keyword generation: extract or identify the most relevant words, phrases, or ideas in a document;
Summarization;
Sentiment analysis: identify the affective state and the subjective foundation of a piece of text; and
Question and answer: return an answer to a natural language question a user poses based on a document or collection of documents.

Many of these ‘downstream tasks’ (Bommasani et al. 2021) rely on a language model. A language model is a model that assigns a probability to a sequence of words (Jurafsky and Martin 2000). Language models take a collection of words and conditioned on that information assign the probability of another word or collection of words. For a simple example, perhaps you want to predict the next word in a sentence given the ones that precede it. A language model can fulfill that task. However, this basic functionality can extend to much more complicated tasks such as taking an entire document and predicting descriptive keywords or creating a summary.

For a number of reasons modern language models have been changing rapidly. First, the growth of textual data on the internet is growing exponentially, providing vast volumes of data for training these models, which typically have millions of parameters that need to be determined and require heretofore unimaginable volumes of data to constrain. Second, computational power is making it possible to process those data. Finally, AI research produced new modeling approaches such as recurrent neural networks (Rumelhart et al. 1986), transformer architectures (Vaswani et al. 2017), and self-supervised learning (e.g., Baevski et al. 2020). Together, these influencing factors produced step changes in performance of language models on many tasks (Tamkin et al. 2021). Improvements in capability coupled with wider awareness owing to much greater accessibility of language models through services like ChatGPT,^{Footnote 6} early 2023 has become a cultural moment for NLP and language models (e.g., Klein 2023).

In 2018 Google developed the Bidirectional Encoder Representations from Transformers (BERT) language model (Devlin et al. 2019). Judging the general language models incapable of deeply contextual scientific support, scientists subsequently recognized the opportunity to tailor the baseline models like BERT for their domains or narrower applications. The first result was SciBERT, a fine-tuning of the BERT model that focused on scientific papers from the Semantic Scholar corpus^{Footnote 7} (Beltagy et al. 2019). SciBERT is part of a growing list of models that adapt BERT to specific domains and tasks, for which perhaps the most relevant to this review is that created for the astrophysics and astronomy domain, astroBERT (Grèzes et al. 2021). One of the most sophisticated examples, likely due to the availability of large volumes of training data in the biology and biomedical domain and a more sophisticated baseline language model from a family of models known as generative pre-trained transformers (GPT) (Radford and Narasimhan 2018), Bio-GPT has demonstrated improvement across six biomedical NLP tasks (Luo et al. 2022). Progress in Heliophysics is nascent ^{Footnote 8} and most of the NLP work being done is on downstream tasks using existing language models rather than training our own. Despite not yet realizing the potential that some have stated exists for these domain-specific models, the concept of foundation models (Bommasani et al. 2021) remains enticing. Capable language models for Heliophysics research are far from settled and available, yet may be a core component of searching and discovering vast amounts of literature and research artifacts available.

Encompassing the diverse disciplinary integration needed for complex systems analyses will require sophisticated NLP capabilities, including perhaps entirely new approaches to building language models. This review combined traditional literature review processes with NLP to create a hybrid review article. The NLP used was relatively simple, mining the NASA Astrophysics Data System (ADS) for articles, two-fold filtering based first on selected Heliophysics journals and second on a manually-created Complexity Heliophysics glossary, but the approach created a much larger corpus and discovered articles that were not included in the hand-selected corpus for this review. Thus, the hybrid approach produced a richer coverage of the literature. Appendix C describes the NLP approach.

It is useful to future Complexity Heliophysics work to chronicle briefly the outcomes and open questions from the application of NLP to augment this review. A more detailed taxonomy of uses for NLP in scientific research has emerged. For encoder-like language models, tasks fall generally under five categories:

1.
Question answering;
2.
Text classification;
3.
Semantic equivalence;
4.
Named entity extraction; and
5.
Knowledge extraction.

For generative models, language models support tasks related to conversational artificial intelligence, conversion of data to text, and text summarization (Ramasubramanian et al. 2020) (and personal correspondence: Muthukumaran Ramasubramanian; March 2023).

There are also many open questions. A few of importance to Heliophysics are: 1) given that Heliophysics is inherently a systems science, how might we extract relationships from scientific literature that guide us to incorporate new knowledge in our science?; 2) what role might NLP play in improving our information search and discovery processes?; 3) to what extent can NLP support efforts to better represent Heliophysics knowledge in human- and machine-readable ways (e.g., support building semantic technologies and capabilities) (Biffl and Sabou 2016)?

The advent of large language models (LLMs) and the discussion of their use in scientific research is pointing to a larger conversation that is unfolding in the future of science: what is the intersection of complexity science with AI/ML? We deliberate on this question in Sect. 7.1 with the hope of seeding important conversations Heliophysicists need to have, placing it in the context of conversations all scientists are grappling with.

6.6 Areas of Complexity Science That Have Not Yet Been Widely Explored in Heliophysics

There are tools viewed by the complexity science community as necessary to understand complexity (Krakauer 2019; Hobson et al. 2018) that have not yet been widely employed in Heliophysics. Two areas conclude this section on emerging literature.

First, agent-based modeling (ABM). ABM is a model that simulates the actions and interactions of agents, most traditionally autonomous, individual elements with properties and capable of actions. ABMs have been used extensively in the social sciences, biology, and ecology (Niazi and Hussain 2011) and their efficacy owes to their combination of elements of game theory, sociology, evolutionary programming, and emergence. Their operating principle is to give agents relatively simple operating rules, simulate their interaction, and study the emergent collective phenomena. Though ABM is often associated with modeling behavior of living organisms, perhaps more important to Heliophysics it is also a technique for modeling the behavior and interactions of things such as particles in the magnetosphere. Particle simulations, therefore, can be understood as a form of agent-based model and the connection might allow computational approaches to Heliophysics learn from a rich domain of research. There may also be application agent-based modeling for human responses to events such as space weather storms, though outside of ‘simulation game’ activities (McGranaghan et al. 2022) this is virtually unexplored. However, understanding human behavior in disaster situations is an important component to quantifying risk and establishing resiliency in our societal systems.

Second, collective intelligence. If ABMs are the tools, collective intelligence is the study of their output data. It is a nascent field of study to understand collective behavior, that is adaptive, wise, or clever structures and behaviors by groups, in physical, biological, social, and many engineered systems (Flack et al. 2022). Discovery in disciplines as diverse as biology and ecology to psychology and economics point to cross-disciplinary utility. At present there is exists essentially no conversation about the use of methods and techniques from collective intelligence. We urge researchers to think capaciously about how collective intelligence might impact Heliophysics, perhaps even providing new solutions to long-standing questions. Two areas, in particular, might be fruitful: 1) interpreting particle simulations and 2) studying responses to natural hazards as a way of more accurately predicting societal impacts of extreme space weather events. Anticipating the final section of this paper, there is a growing literature that exists at the intersection of collective and artificial intelligence (Berditchevskaia et al. 2022), a trend indicative of most sciences.

7 Frontiers of Inquiry and Investigation Emerging from Complexity Science

This review has covered much ground. What follows is a synthesis of the discovery and insights from the history of complexity science in Heliophysics and space physics into a path forward for its research and its community. The path involves three elements:

Articulating a key challenge for Complexity Heliophysics that is shared with numerous scientific domains (Sect. 7.1);
Defining a scientific framework capable of responding to it (Sect. 7.2.1); and
Discussing the socio-cultural dimension that must be addressed (Sect. 7.3).

The sections that follow will be necessarily more subjective and perspective-leaning that the previous ones. This is deliberate. However, those perspectives are grounded in literature and scholarly artifacts. Because these are ‘frontiers,’ coverage within Heliophysics and space physics is as yet inadequate and we must borrow liberally from sister and even more distant domains. The intention for the following material is that it is not only provocative, but suggestive of generative new thinking and inquiry.

7.1 Key Challenge for 21st Century Science: The Intersection of Complexity and Artificial Intelligence and a Framework to Explore It

Artificial intelligence (AI) and machine learning (ML) are not new. They are traced as fields of study to the 1950s, when Alan Turing worked on the concept of intelligent machines and how to create them (Turing 1950) and the conference that many credit with coining the term artificial intelligence was held (McCarthy et al. 2006); and their origins as areas of thought predate even that by many decades (in science fiction writing (Asimov 1942) as in philosophy and the mechanical manipulation of symbols (Descartes 1968)). In the 1950s, hyperbolic beliefs about nearness to genuine artificial intelligence led to an ‘AI winter’ when those hopes failed to materialize and lasted from the 1970s to the 2000s. However, in the past several decades the rise of internet-scale data, computing power consistent with a doubling in the number of transistors in an integrated circuit every two years (Moore’s Law (Moore 1998)), and improvement in algorithms have brought a renewed zeal and concomitant advance to AI/ML. A quick note that ML is a sub-domain of AI where AI is the ability to accomplish complex goals (Tegmark 2017) while ML is leveraging data to improve computer performance on some task or set of tasks (Mitchell 1997). Since existing studies in Heliophysics do not truly address AI, but the broader concept is important in this section, we will adopt the shorthand AI/ML to refer to the full set of methods.

Heliophysics and space weather have been a part of the passionate exploration, adopting and applying AI/ML advances coming out of industry. Camporeale (2019) review the state of AI/ML in Heliophysics research, including prominent applications in forecasting: geomagnetic indices, relativistic electrons at geosynchronous orbits, solar flares occurrence, coronal mass ejection propagation time, and solar wind speed. Their synopsis of the field led them to conclude that there is a need to shift forecasting in Heliophysics to probabilistic approaches centered on the reliable assessment of uncertainties, and the combination of physics-based and machine learning approaches. Their discussion echoes a long-standing conversation that has been staged across the sciences (e.g., Mazzocchi 2015) and indeed across culture more broadly, becoming a common subject of popular science writing, science fiction literature, and futurism (e.g., Anderson 2008; Chiang 2000; Kelly 2016; Finn 2017; Ottino and Mau 2022). The two sides of this debate have been given different names over time: hypothesis-driven vs. empiricism, deductive vs. inductive reasoning, theory-driven vs. data-driven.

Our review, too, leads us to the need to find common ground between these poles of approach to growing scientific knowledge, now perhaps with the language of AI/ML vs. complexity science. Words create worlds (Heschel and Heschel 1989) and it deserves some inquiry into what new perspectives these new words for the old debate may create.

McGranaghan et al. (2021a) and McGranaghan et al. (2017a) taxonomized AI/ML as a part of the broader field of data science, defining the latter it as scalable architectural approaches, techniques, software and algorithms which alter the paradigm by which data are collected, managed and analyzed and communicated. They point to a similar need to integrate knowledge of the physical domain with data-driven approaches.

The trend in the literature around AI/ML and data science in Heliophysics is clear: the future of Heliophysics must explore the intersection between data-driven approaches with theory-driven science (Karpatne et al. 2016; Pankratius et al. 2016). Klimas et al. (1996), where we began this review, points to this key challenge for Complexity Heliophysics: converging the autonomous and the local linear prediction filter methods, merging the benefits of interpretability with the success of data-driven approaches. Klimas et al. (1996) writes, “It is anticipated that in the future these local-linear predictor models will be studied carefully with the goal of organizing these bits and pieces into a global nonlinear predictor model. It may be advantageous to cast these predictor models as analogue models in order to maximize their physical interpretation.” Klimas et al. are pointing to a reconciliation that is at the heart of a debate that spans the sciences in the 21st century: between first principles, physics-based models and data-driven, artificial intelligence or machine learning AI/ML algorithms. They represent different reasoning approaches: inductive and deductive, and correspond to different capabilities to explain the result found. Physics-based models are inherently explainable–the behavior arises from understood laws stitched together in traceable logical reason. AI/ML models discover patterns directly from data, but are less clearly interpretable. This review has chronicled the fact that complexity introduces, sometimes extreme, uncertainty to physics-based understanding and precludes predictability (Wolfram 2002). However, the advent of AI/ML (and other data-mining approaches) and the requisite computation has produced, in some cases, capabilities to represent complicated systems more accurately than physics-based equations (Camporeale 2019; Stephens et al. 2019). These models may be capturing some as yet unknown physical properties. This quality is similar to the way that power laws in complexity science capture some underlying mechanism that acts across scales (West 2017). Reconciling the first principles with data-driven approaches, physics and complexity with artificial intelligence, is a grand challenge for the 21st century.

Indeed, the consilience of physics with data-driven approaches is important to all areas of inquiry (Wilson 1998). Domains tend to embrace this call to integrate the two when the fundamental and applied components of their science come into contact: understanding pathologies of diseases in medicine vs. predicting if and when a disease will occur; understanding the Earth system vs. predicting natural disasters; understanding the solar-terrestrial system (Heliophysics) vs. predicting its consequences for our technological systems (space weather). In science there is a ‘tangled relationship between prediction and understanding’ (Krakauer 2020).

7.2 Space Weather as a Risk Science

Recasting the debate about theory- and data-driven science as a tension between AI/ML and complexity science, the body of literature in this review suggests that risk science coupled with an emphasis on resilience can provide a new framework. Definitions of commonly confused terms help approach the subjects of risk science and resilience.

Risk is likelihood of occurrence of a natural hazard multiplied by the consequence of that. It is therefore distinct from a natural hazard, which refers to the physical phenomenon, and disaster, which is the particular occurrence of a natural hazard that results in major consequences. If risk is the focus of a science that spans physical understanding to decision-making, resilience is the applied goal of that science. Resilience is the capacity of a system to recover from a disturbance (Lent 2017). Risk requires specification or prediction of events; resilience requires understanding the systemic impacts of those events, including the physical system along with the elements of preparing for and responding to the risk (change anticipation, exposure, mitigation, damage minimization). Together, they are a way of formulating grand scientific and societal challenges in a way that AI/ML and complexity science converge.

Other fields have demonstrated this convergence: Dynamical Systems: Fischer et al. (2022); Climate: Hultman et al. (2010); Ecology: Walker et al. (2004), Gunderson (2000); Socio-Ecological Systems: Carpenter et al. (2001); Disaster Research: Paton et al. (2000); Medical Anthropology: Panter-Brick (2014); Health: Promislow et al. (2022); among others: see Bhamra et al. (2011) for a review of the concept of resilience across disciplines.

7.2.1 Risk Science

In the context of climate science, Sobel et al. (2014) and Sobel (2022) argued for the necessity and the great intellectual opportunity of creating a discipline of climate risk science. They qualified such a science as being a layer between modeling and applications, requiring probabilistic approaches, and being related to adaptation. The similarities to the dimensions of complexity science and the grand challenge articulated in this review are striking and suggestive of how we might affect the trajectory of space weather research.

The innovation is to study space weather as a risk science and develop a framework for evaluating and quantifying space weather risk. It is important that space weather follow the examples of other natural hazards in adopting a risk formulation not only to overcome disconnects between science and decisions for space weather itself, but also for eventual incorporation into multi-hazard risk studies (e.g., understanding the power grid when space weather acts contemporaneously with terrestrial weather changes), permitting multi-hazard, compounding hazard, cascading hazard, and systemic risk research (Helbing 2013).

To develop a risk science framework, we can and should begin with the fields of risk studies (Burgess et al. 2016) and disaster risk reduction (DRR) (Wisner et al. 2011). Drawing on the framework developed for DRR in Wisner et al. (2011), there are five elements:

Environment: the system under study, including its overlaps and interconnections to other systems;
Hazards characteristics: understanding the details of a natural hazard (and if a multi-hazard analysis, then the collective, co-occurrence, connected details of the hazards) such as location, intensity, frequency, probability as well as quantifying the uncertainty of statistically averaging;
Vulnerability: differential impact from a natural hazard;
Capacity: resources and assets available to resist, cope with, and recover from natural hazards (Wisner et al. 2004); and
Exposure: the situation of people, infrastructure, housing, production capacities and other tangible human assets located in hazard-prone areas.^{Footnote 9}

There is a sixth element that we will not address in this review: recovery. Figure 14 relates each of the five elements to space weather.

The framework is built on three important principles: 1) Consideration of the holistic Sun-to-society system; 2) Quantification of the uncertainty that arises from coarse-graining and statistical simplification (McGranaghan 2022); and 3) relating space weather hazard to societal impact.

Much of this review has already been focused on efforts to quantify the characteristics of Heliophysics phenomena (e.g., burst statistics and scaling relationships for geomagnetic storms and substorms, as well as a not insignificant body of research on quantifying probabilities for extreme space weather events that we have not reviewed (Jonas et al. 2018, and references therein)). Here we will review articles that address the final three elements of the framework, to the extent that such research exists.

Schrijver et al. (2015) adopted an economic perspective to understand the relative potential impacts of extreme events and less extreme, more frequent events. He determined that societal impacts of both common severe and of rare extreme space weather are substantial, concluding that quantifying the characteristics of both kinds of event are vital to preparing for them, including creating mitigation strategies where possible. While the analysis of Schrijver et al. (2015) was for general space weather disturbance, a majority of risk studies for space weather have been conducted around the impact to the power grid.

Oughton et al. (2017) explored the potential costs associated with failure in the electricity transmission infrastructure in the United Kingdom due to extreme space weather, focusing on daily economic loss, exploring both direct and indirect consequences, and commenting on the implication for cost-benefit analyses of space weather forecasting and mitigation investment. Their key contributions include creating a foundation for the quantification of the economic impact of space weather, identifying the structural relationships that tie space weather impacts to the power grid to supply chains, and methodology to connect regional and national and direct and indirect impacts. Key among the results was the finding that the direct economic cost incurred from disruption to electricity was only a fraction of the total cost for the space weather scenarios explored. Space weather impacts are not merely of the direct kind, but systemic.

Oughton et al. (2019) in many ways culminates these threads of space weather as a risk science in the context of the power grid, establishing a framework for risk assessment. Their framework includes quantifying general geophysical risk, asset vulnerability, and the network structure of critical infrastructure systems. Figure 15 is a reproduction of their three-part risk assessment framework. In the description of the figure we have related their work to the language we develop in this review to aid in mapping between frameworks, which are indeed quite close.

Concurrently, Eastwood et al. (2018) explored a similar framework, also focusing on the impact of substorms on the power grid. They delineate the components of economic impact (to any hazard, not limited to space weather): spatial and temporal extent, vulnerability of the technologies/infrastructure, extent of mitigation, capacity to maintain production and support consumption across firms and consumers. Like Oughton et al. (2019) they map the physical phenomena to the impact through the intensity of the space weather event, resilience of the power grid, capability of the forecast, and socioeconomic models. The differences between these two similar exploratory studies to create risk assessment frameworks are illustrative of the open research questions in space weather risk science. They include the determination of the likelihoods of extreme storms, incorporation of forecast information, evaluation of the vulnerabilities of the infrastructure, details of recovery times, and socioeconomic models. A key challenge both studies articulate is the inhomogeneity of available information across the globe, making global impact calculations difficult.

Across space weather more broadly, Eastwood et al. (2017) assessed the existing knowledge available to quantify the economic impact. Their survey included the phenomena that represent space weather hazards (environment), the existing research to calculate occurrence and intensity statistics (hazards characteristics), and documented impacts across industries affected (vulnerability). Thus, their work aligns to the framework that we suggested above. Where it fell short of assessing the complete risk science framework reveals important research gaps to fill. It also points to structural issues in Heliophysics research where some of these gaps exist because of a lack of collaboration between the involved communities and others due to unavailability of vital data. There is clearly much that needs to be learned to make space weather a risk science.

A theme in this literature is that of interconnections, of disciplines, of physical phenomena to socioeconomic impacts, of spatial regions, and of critical infrastructure; it reiterates the complexity paradigm as the needed response, and risk science as a means to create general systemic understanding. A type of risk not often addressed in the literature, perhaps in large part due to the sheer challenge associated with it, is interconnected risk–risk due to externalities acting co-temporally and/or co-spatially to space weather such as extreme terrestrial weather.

If space weather is approached as a risk science, the domain could share a common framework with other natural hazards such as extreme terrestrial weather, hurricanes, earthquakes, wildfires, and floods.

Given the potential link that risk science provides between physical understanding and data-driven specification and prediction, it is important to understand the ways that risk is studied and used as a framework for research in various scientific disciplines. That is left as a call for Heliophysicists to understand approaches of ‘sister’ disciplines and how they might be impactful to Heliophysics research. In concluding this review, we guide that effort by relating a few of the ways the risk science framework might relate to and be used in Heliophysics. They fall generally into three categories in order from general to specific: 1) systemic risk; 2) resilience research; and 3) critical transitions.

7.2.2 Systemic Risk

Existing understanding of natural hazards is predominantly from the perspective of a single hazard in isolation. Yet the behavior of systems is not observable by considering each part in isolation. In short, emergence only occurs in the interconnected systems. Buldyrev et al. (2009) laid out a framework to study critical transitions (what they call catastrophic failures because of their focus on interdependent power grid and internet networks implicated in a 2003 blackout) in interdependent networks. They present details of the critical fraction of nodes that, on removal, will lead to a failure cascade and to a complete fragmentation of the interdependent power grid and internet networks. They study the functional integrity of the composite network using the largest connected set of nodes, $G$, in the system as a proxy. Network nodes are progressively removed and the effect on $G$ observed. The general finding is that interdependent networks exhibit not only a smaller critical threshold than isolated networks, leading to different levels of disruption, but also a different nature of abrupt ‘first-order’ transitions in system breakdown (see Fig. 16) (Vespignani 2010). There is a relationship between failure of nodes in one network and failure of nodes in the other. Buldyrev et al.’s results can be interpreted as general, exemplifying complexities and fragilities arising from network interdependencies. Relating the findings to the study of natural hazards, it is quite possible that understanding hazards in isolation will be inadequate to understanding the system and its resilience. Extreme events may be a result of the inherent interdependent system dynamics rather than of unexpected individual external events.

Helbing (2013), too, points to limitations in uni-hazard/uni-disciplinary analyses. They envision a ‘Global Systems Science’ that begins from the recognition that systemic failures and extreme events are consequences of highly interconnected systems and networked risks due both to natural connections between physical systems and connections created by the human and built world.

The Heliophysics community has been an active participant in shaping systemic risk research. Heliophysicists helped lead the geophysical monograph Extreme Events and Natural Hazards: The Complexity Perspective (Sharma et al. 2012), whose contributions collectively established the set of techniques (drawn from complexity science) and state of research at the time of writing for studying the extremes of natural hazards encompassed by the upper tail of the probability distribution. That monograph understands natural hazards as multidisciplinary phenomena, requiring concomitant multidisciplinary research, and recognizes that risks are only appropriately quantified if rare events across domains are coupled. They introduce the collection, “Like most of the major scientific challenges in the Earth and space sciences, there is increasing recognition that an integrated approach involving multiple disciplines will be needed to advance the science underlying extreme events that lead to natural hazards...The distributed nature of the components and the strong interaction among them is [a] feature common to systems exhibiting extreme events.”

The conclusions arrived at from this review’s examination of the history of complexity Heliophysics, those from Helbing (2013) in establishing ‘Global Systems Science,’ and the contributions to Sharma et al. (2012) are too similar to ignore. With consensus guidance on what needs to be done, it seems time for adoption across the research community from policy-makers to research scientists. The following two sub-sections are areas that offer pathways into systemic risk research.

7.2.3 An Emphasis on Resilience

Systemic risk enables the study of resilience, the ability of a system to maintain specific functions in the face of change (Scheffer et al. 2001; Baggio et al. 2015).

Returning to Vespignani (2010), the key finding is that understanding resilience, and ultimately designing more resilient systems, requires consideration of the interconnected system, the mutually dependent network properties. Indeed, the notion of resilience has been a powerful way into risk and systems science.

Resilience is a systemic phenomenon. At the highest level resilience is defined as a system’s accommodation of changes and reorganization of itself while maintaining the crucial attributes that give the system its unique characteristics (Scheffer et al. 2009, 2018).

Despite evidence to its importance and vast literature about it, there exists no consensus framework for understanding and managing resilience, a fact not unrelated to the challenge of quantifying resilience (Scheffer et al. 2018). Yet, a set of generic dynamic indicators of systems near critical transitions observable in time series data (see Sect. 7.2.4 below) has emerged that, alongside proliferation of time series data across domains and indications of corresponding spatial indicators of resilience (Dakos et al. 2010), enable new progress to understand resilience.

The literature on resilience is wide and multidisciplinary. Immediately meaningful in this review are seven principles of resilience that, although written for social-ecological systems, share a common basis with Heliophysics and Space Weather as a risk science in the need to build capacity to deal with unexpected change:

1.
maintain diversity and redundancy;
2.
manage connectivity;
3.
manage slow variables and feedbacks;
4.
foster complex adaptive systems thinking;
5.
encourage learning;
6.
broaden participation; and
7.
promote polycentric governance systems.

An emphasis on resilience speaks to understanding sought by Complexity Heliophysics and Space Weather as a risk science:

Both predictive and responsive: Resilience acknowledges that all responses are dual, including both the pre-emptive actions possible and those that must be responsive to the existing conditions). One could think about these dual responses as the pre-emptive actions accommodating the more or less deterministic signals while the responsive actions are those taken in the face of uncertainty and are inherently unpredictable.
Translational: Resilience requires translation between the science and the responses available (thus requiring understanding of system capability and capacity);
Multi-level: Resilience also requires a multi-level understanding of the system and the different responses for each level (Sober and Wilson 2009). For instance, a power grid operator monitoring the grid in Washington, DC and an individual at the Department of Energy tasked with the health of the country’s grid as a whole will have distinct responses to a National Oceanic and Atmospheric Administration (NOAA) warning;
Interdependent: Resilience connects a system to the external systems that may amplify or attenuate a particular effect (Levin et al. 2021), which reveals the final component;
Semantic: To understand interconnections, relationships between domains must become first class citizens to enable agents (whether a human or an intelligent machine) to navigate between them and integrate information from across them (Narock and Fox 2012; Bentley et al. 2011; Shimizu et al. 2020). This requires understanding information flow, in the physical, socio-cultural, and technological senses (McGranaghan et al. 2022).

These are sites of future research that must draw from both the complexity paradigm and AI/ML.

7.2.4 Critical Transitions

Resilience is complicated by and inextricable from the notion of critical transitions, or tipping-points. Tipping-points that represent critical transitions are points at which a dynamical system abruptly shifts from one state to another. In dynamical systems language, these are points at which small changes in a parameter can lead to qualitative change in the behavior of the system, or bifurcation points (Strogatz 2018). Under enhanced resilience, a system is more likely to accommodate changes without undergoing a critical transition into a qualitatively different state. Critical transitions are notoriously challenging to predict. However, exciting research points to the existence of generic properties (and the way that they can be measured) for systems near critical transitions (Scheffer et al. 2009):

Critical slowing down (Wissel 2004): As the system approaches such critical points, it becomes increasingly slow in recovering from small perturbations. In these regimes, the dynamical system will show an increase in lag-1 autocorrelation and increased variance in the pattern of fluctuations as the recovery rate from a small perturbation is reduced;
Skewness and flickering before transitions: Asymmetry in fluctuations increase near a critical transition. Flickering is indicated in the frequency distribution of states as increased variance and skewness as well as bimodality (Carpenter and Brock 2006); and
Increased spatial coherence: An analog to critical slowing down for spatial data, it has been shown that systems nearing critical transitions exhibit increased spatial correlation (Dakos et al. 2010). In systems consisting of numerous coupled units, slowing down near a critical transition will equalize differences between the units as each will tend to take the state of the units to which it is connected.

Scheffer et al. (2018) calls these generic properties “dynamic indicators of resilience.”

There are numerous types of bifurcation that each represent a critical transition. Figure 17 shows examples of fold, (supercritical) Hopf, and transcritical bifurcation.

The type of bifurcation may be associated with different dynamics around the critical transition. For instance, system that undergoes a fold bifurcation exhibits an abrupt transition to a very different state whereas a transcritical bifurcation usually causes a smooth transition and a Hopf bifurcation can lead the system into a state of oscillatory behavior (Bury et al. 2021). Despite the different dynamical behavior, as noted above the general properties preceding critical transitions can be associated with a wide variety of transition in complex systems (Scheffer et al. 2009) and are exciting in their cross-disciplinary impact.

Universality of these ideas is made apparent in the range of fields in which they have been studied and led to new understanding. In sociology, one of the most cited papers is Mark Granovetter’s ‘Threshold Models of Collective Behavior’ (Granovetter 1978), whose impact created a new domain of analyses (‘threshold models’) and that rely on critical transition theory to interpret model results. Similar developments have been observed in medicine (Litt et al. 2001), finance (Kambhu et al. 2007), and climate (Lenton et al. 2008)), among numerous others. The similarity of the problems in those disciplines to the challenges in Heliophysics and various stellar-planetary systems and data (e.g., Schunk et al. 2021; Srivastava et al. 2021; Palmerio et al. 2022) suggest that the science of risk and critical transitions may have important utility to future leaps in our understanding. Application of these theories in Heliophysics is nascent, but may represent an important frontier.

7.3 Convergence Research

Achieving these ambitious research directions will be a challenge.

The complexity paradigm, risk, and resilience are all bridging concepts (Baggio et al. 2015; Burgess et al. 2016). They create connections between domains, between areas of scholarship, between science and society that would otherwise not exist (Granovetter 1973; Allen 2023). To engage in them is to understand that the work needed is not only technical, but social and cultural, too.

Pioneering domains like bioengineering (Nersessian 2022) are scientific forebears in the creation of resilience research that treat the system as complex and integrate data-driven analyses. Their examples reveal that an emphasis on resilience and a risk science framework facilitate transdisciplinary approaches (Promislow et al. 2022) (here a distinction is made between transdisciplinary and interdisciplinary: interdisciplinarity brings different disciplines together, but maintains their identification; transdisciplinarity takes place in the context of a real world problem in which the disciplines are tightly integrated, their methods and epistemologies synthesized and blended into a novel approach, perhaps even a new discipline). One such approach, in many ways an instantiation of the transdisciplinary paradigm, is convergence research. In 2016, the National Science Foundation (NSF) named “Growing Convergence Research” as one of its 10 Big Ideas for prioritizing future investments in science and engineering. At its most general, convergence has been defined “an approach to problem solving that cuts across disciplinary boundaries. It integrates knowledge, tools, and ways of thinking from life and health sciences, physical, mathematical, and computational sciences, engineering disciplines, and beyond to form a comprehensive synthetic framework for tackling scientific and societal challenges that exist at the interfaces of multiple fields” (National Research Council 2014). With convergence comes a new spectrum of challenges involving how we work across disciplinary lines, collaborate meaningfully in large groups, and develop healthy–meaning open, participatory, and resilient–connections among diverse stakeholders. Risk and hazard domains have been proving grounds for convergence research (White and Haas 1975; Quarantelli 1987; Prince 2009; Solnit 2009; Peek et al. 2020). The full spectrum of risk science means spanning natural hazard (the physical phenomenon) to risk (likelihood times consequence) to resilience (understanding of the system’s vulnerability and capacity) to societal impact. Crossing the spectrum requires convergence science. Indeed, tackling the problems we face as a society, whether global pandemics or climate change or complex systems, requires new levels of cooperation, facilitation, and synthesis (McGranaghan et al. 2022).

In Heliophysics, these transdisciplinary convergent approaches offer the potential to integrate space physics, space weather, and society. Two developments are needed: 1) creating a knowledge commons: a combination of intelligent information representation and the openness, governance, and trust required to create a participatory ecosystem whereby the whole community maintains and evolves this shared information space (Hess and Ostrom 2007; McGranaghan et al. 2021b); and 2) acknowledging the need for new literacies on scientific teams for facilitation (either within Heliophysics researchers or through new roles on science teams),^{Footnote 10} which represent capacities on teams for facilitation and creating healthy communities. As our science questions grow in complexity, so, too, must the information and the knowledge we bring to them. As information grows, the costs and challenges of communication grow. Convergence research is a and New roles to support convergence research

A framework for risk science and a concomitant emphasis on resilience may be poised intellectually and institutionally to conduct complexity science (Peek et al. 2020), bridge between predictive methods like AI/ML and fundamental science, and bring convergence research to Heliophysics. These inspire further studies on the nature of resilience from a systems level perspective of the Solar-Terrestrial connection, taking up the call of the 21st century “to integrate the sciences of complexity with machine learning and artificial intelligence” (Krakauer 2020).

8 Conclusion

The 21st century will be marked by complexity, according to Stephen Hawking, and this is especially true for the field of Heliophysics. Heliophysics has traditionally been characterized by categorizing and separating domains, but with the advent of new sensing capabilities, data analysis, and computational tools, there is a growing need for a paradigm shift towards Complexity Heliophysics. Complexity science is the study of phenomena that emerge from a collection of interacting objects and requires a plurality of frameworks that move between levels of a system. This lived and living review details the network of complexity studies in Heliophysics and provides a definition of the Complexity Heliophysics paradigm.

This review first outlined five dimensions of complexity science. Then, the analysis of the existing literature mapped into three parts: 1) a pivotal year for the paradigm: 1996; 2) transitional years that established dimensions of the paradigm between 1996-2010; and 3) emergent literature largely after 2010. For the final ternate, we drew on a much wider base of literature to situate Complexity Heliophysics in a broader context of the physical sciences, revealing trends and gaps. Several are proposed:

First, the ability to capture underlying structure and patterns in complex systems through coarse-graining is crucial in Heliohpysics. Two forms of coarse-graining, namely information theory and network science, are particularly important for the future of the field.

Second, reconciling the first principles with data-driven approaches, physics and complexity with artificial intelligence, is a grand challenge for the 21st century. We centered the discussion of Complexity Heliophysics in the tension between fundamental science vs. prediction-oriented science (e.g., basic science vs. applied science; physics-based modeling vs. artificial intelligence/machine learning) and suggest that this history of complexity science within Heliohpysics is instrumental in finding pathways between these extremes, therefore becoming inextricable from the future of Heliophysics research. The trend is clear: the future of Heliophysics and its applied counterpart, space weather, must explore the intersection between data-driven approaches with theory-driven science. Indeed, Klimas et al. (1996), where the review begins, pointed to a key challenge for Complexity Heliophysics: converging the autonomous and the local linear prediction filter methods, merging the benefits of interpretability with the success of data-driven approaches. We provided a vision that could help respond to the challenge in a risk science framework, which adopts probability and resilience as organizing concepts and identifies corresponding analysis methods. The technical challenges of complexity science are accompanied by socio-cultural challenges for which we conclude by relating the methods of convergence research.

Ultimately, this review provides a foundation for how complexity science can help address outstanding questions in Heliophysics and space weather science. The artifacts from this work include this review article; a glossary of terms that define Complexity Heliophysics and can be useful to search and discovery of related resources, individuals, and groups; and a new corpus of Complexity Heliophysics that is likely full of further discovery and generative of new research questions. With the paradigm shift, we will gain new capacities to understand the Heliophysics system, and this will guide researchers towards directions that are better equipped to respond to the challenges of the 21st century.

Notes

References

Akasofu SI (1979) Interplanetary energy flux associated with magnetospheric substorms. Planet Space Sci 27(4):425–431. https://doi.org/10.1016/0032-0633(79)90119-3
Article ADS Google Scholar
Akasofu SI (1980) The solar wind-magnetosphere energy coupling and magnetospheric disturbances. Planet Space Sci 28(5):495–509. https://doi.org/10.1016/0032-0633(80)90031-8
Article ADS Google Scholar
Akasofu SI (1981) Energy coupling between the solar wind and the magnetosphere. Space Sci Rev 28:121–190. https://doi.org/10.1007/BF00218810
Article ADS Google Scholar
Albert R, Barabási AL (2002) Statistical mechanics of complex networks. Rev Mod Phys 74:47–97. https://doi.org/10.1103/RevModPhys.74.47
Article ADS MathSciNet Google Scholar
Allen DS (2023) Justice by means of democracy. University of Chicago Press, Chicago
Book Google Scholar
Anderson PW (1972) More is different. Science 177(4047):393–396
Article ADS Google Scholar
Anderson C (2008) The end of theory: the data deluge makes the scientific method obsolete. Wired. https://www.wired.com/2008/06/pb-theory/
Angeler DG, Allen CR, Garmestani A et al. (2018) Resilience in environmental risk and impact assessment: concepts and measurement. Bull Environ Contam Toxicol 101:543–548. https://doi.org/10.1007/s00128-018-2467-5
Article Google Scholar
Angelopoulos V, Mozer FS, Mukai T et al. (1999) On the relationship between bursty flows, current disruption and substorms. Geophys Res Lett 26(18):2841–2844. https://doi.org/10.1029/1999GL900601
Article ADS Google Scholar
Armstrong JA, Fletcher L (2019) Fast solar image classification using deep learning and its importance for automation in solar physics. Sol Phys 294:80. https://doi.org/10.1007/s11207-019-1473-z
Article ADS Google Scholar
Aschwanden M (2011) Self-organized criticality in astrophysics: the statistics of nonlinear processes in the universe. Springer, Berlin. https://doi.org/10.1007/978-3-642-15001-2
Book Google Scholar
Aschwanden MJ (2019) Self-organized criticality in solar and stellar flares: are extreme events scale-free? Astrophys J 880:105. https://doi.org/10.3847/1538-4357/ab29f4
Article ADS Google Scholar
Aschwanden MJ, McTiernan JM (2010) Reconciliation of waiting time statistics of solar flares observed in hard X-rays. Astrophys J 717:683–692
Article ADS Google Scholar
Aschwanden MJ, Xu Y, Jing J (2014) Global energetics of solar flares: I. magnetic energies. Astrophys J 797:50. https://doi.org/10.1088/0004-637X/797/1/50
Article ADS Google Scholar
Aschwanden MJ, Crosby NB, Dimitropoulou M et al. (2016) 25 years of self-organized criticality: solar and astrophysics. Space Sci Rev 198(1–4):47–166. https://doi.org/10.1007/s11214-014-0054-6
Article ADS Google Scholar
Asimov I (1942) Runaround Astounding Science-Fiction
Axelrod R (1997) The complexity of cooperation: agent-based models of competition and collaboration. Princeton University Press, Princeton
Book Google Scholar
Baevski A, Zhou H, Mohamed A, Auli M (2020) wav2vec 2.0: a framework for self-supervised learning of speech representations. arXiv:2006.11477
Baggio JA, Brown K, Hellebrandt D (2015) Boundary object or bridging concept? A citation network analysis of resilience. Ecol Soc 20(2)
Bak P (1997) How nature works: the science of self-organized criticality. Copernicus, New York, NY. https://doi.org/10.1007/978-1-4757-5426-1
Book Google Scholar
Bak P, Tang C (1989) Earthquakes as a self-organized critical phenomenon. J Geophys Res 94:15,635–15,637
Article ADS Google Scholar
Bak P, Tang C, Wiesenfeld K (1987) Self-organized criticality: an explanation of the 1/f noise. Phys Rev Lett 59:381–384. https://doi.org/10.1103/PhysRevLett.59.381
Article ADS Google Scholar
Bak-Coleman JB, Alfano M, Barfuss W et al (2021) Stewardship of global collective behavior. Proc Natl Acad Sci 118(27). https://doi.org/10.1073/pnas.2025764118
Baker DN, Belian RD, Higbie PR et al. (1979) High-energy magnetospheric protons and their dependence on geomagnetic and interplanetary conditions. J Geophys Res 84:7138–7154
Article ADS Google Scholar
Baker DN, Higbie PR, Belian RD (1981a) Global properties of the magnetosphere during a substorm growth phase. J Geophys Res 86(A11):8941–8956. https://doi.org/10.1029/JA086iA11p08941
Article ADS Google Scholar
Baker DN, Hones EW, Payne JB et al. (1981b) A high time resolution study of interplanetary parameter correlations with ae. Geophys Res Lett 8:179–182
Article ADS Google Scholar
Baker DN, Bargatze L, Zwickl RD (1986) Magnetospheric response to the IMF - substorms. J Geomagn Geoelectr 38:1047–1073
Article ADS Google Scholar
Balasis G, Balikhin MA, Chapman SC et al. (2023) Complex systems methods characterizing nonlinear processes in the near-earth electromagnetic environment: recent advances and open challenges. Space Sci Rev 219:38. https://doi.org/10.1007/s11214-023-00979-7
Article ADS Google Scholar
Barabási A (1999) Emergence of scaling in random networks. Science 286(5439):509–512
Article ADS MathSciNet Google Scholar
Bargatze LF, Baker DN, McPherron RL et al. (1985) Magnetospheric impulse response for many levels of geomagnetic activity. J Geophys Res Space Phys 90(A7):6387–6394. https://doi.org/10.1029/JA090iA07p06387
Article ADS Google Scholar
Becker T, de Vries H, Eckhardt B (1995) Dynamics of a stochastically driven running sandpile. J Nonlinear Sci 5:167–188
Article ADS MathSciNet Google Scholar
Beltagy I, Lo K, Cohan A (2019) Scibert: a pretrained language model for scientific text. In: Conference on empirical methods in natural language processing
Google Scholar
Bentley R, Brooke J, Csillaghy A et al. (2011) HELIO: discovery and analysis of data in heliophysics. In: 2011 IEEE seventh international conference on eScience, pp 248–255. https://doi.org/10.1109/eScience.2011.42
Chapter Google Scholar
Berditchevskaia A, Maliaraki E, Stathoulopoulos K (2022) A descriptive analysis of collective intelligence publications since 2000, and the emerging influence of artificial intelligence. Collective Intelligence 1(1). https://doi.org/10.1177/26339137221107924
Bhamra R, Dani S, Burnard KJ (2011) Resilience: the concept, a literature review and future directions. Int J Prod Res 49:5375–5393
Article Google Scholar
Biffl S, Sabou M (2016) Semantic web technologies for intelligent engineering applications. Springer, Cham. https://doi.org/10.1007/978-3-319-41490-4
Book Google Scholar
Biggs N, Lloyd E, Wilson R (1986) Graph theory, 1736-1936. Clarendon, Oxford
Google Scholar
Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3(ull):993–1022
Google Scholar
Boccaletti S, Latora V, Moreno Y et al. (2006) Complex networks: structure and dynamics. Phys Rep 424(4):175–308. https://doi.org/10.1016/j.physrep.2005.10.009
Article ADS MathSciNet Google Scholar
Bommasani R, Hudson DA, Adeli E et al (2021) On the opportunities and risks of foundation models. arXiv:2108.07258
Börner K (2015) Atlas of knowledge: Anyone can map. MIT Press, Cambridge, MA
Google Scholar
Bornmann L, Mutz R, Haunschild R (2020) Growth rates of modern science: a latent piecewise growth curve approach to model publication numbers from established and new literature databases. Humanit Soc Sci Commun 8:1–15
Google Scholar
Borovsky JE (2013) Physical improvements to the solar wind reconnection control function for the Earth’s magnetosphere. J Geophys Res Space Phys 118(5):2113–2121. https://doi.org/10.1002/jgra.50110
Article ADS Google Scholar
Borovsky JE, Denton MH (2018) Exploration of a composite index to describe magnetospheric activity: reduction of the magnetospheric state vector to a single scalar. J Geophys Res Space Phys 123:7384–7412
Article ADS Google Scholar
Borovsky JE, Osmane A (2019) Compacting the description of a time-dependent multivariable system and its multivariable driver by reducing the state vectors to aggregate scalars: the Earth’s solar-wind-driven magnetosphere. Nonlinear Process Geophys 26:429–443
Article ADS Google Scholar
Borovsky JE, Yakymenko K (2017) Substorm occurrence rates, substorm recurrence times, and solar wind structure. J Geophys Res Space Phys 122(3):2973–2998. https://doi.org/10.1002/2016JA023625
Article ADS Google Scholar
Borovsky JE, Delzanno GL, Valdivia JA et al. (2020) Outstanding questions in magnetospheric plasma physics: the pollenzo view. J Atmos Sol-Terr Phys 208:105,377
Article Google Scholar
Bortnik J, Li W, Thorne RM et al. (2016) A unified approach to inner magnetospheric state prediction. J Geophys Res Space Phys 121:2423–2430
Article ADS Google Scholar
Brillinger DR (2001) Time series - data analysis and theory
Book Google Scholar
Brittnacher M, Spann J, Parks G et al. (1997) Auroral observations by the polar Ultraviolet Imager (UVI). Adv Space Res 20(4):1037–1042. https://doi.org/10.1016/S0273-1177(97)00558-9
Article ADS Google Scholar
Brown EJE, Svoboda F, Meredith NP et al. (2022) Attention-based machine vision models and techniques for solar wind speed forecasting using solar euv images. Space Weather 20(3):e2021SW002,976. https://doi.org/10.1029/2021SW002976
Article Google Scholar
Brunk GG (2001) Self-organized criticality: a new theory of political behaviour and some of its implications. Br J Polit Sci 31:427–445
Article Google Scholar
Buldyrev SV, Parshani R, Paul G et al. (2009) Catastrophic cascade of failures in interdependent networks. Nature 464:1025–1028. https://doi.org/10.1038/nature08932
Article ADS Google Scholar
Burgess A, Alemanno A, Zinn J (eds) (2016) Routledge handbook of risk studies. Routledge, London. https://doi.org/10.4324/9781315776835
Book Google Scholar
Bury TM, Sujith RI, Pavithran I et al. (2021) Deep learning for early warning signals of tipping points. Proc Natl Acad Sci USA 118(39):e2106140118. https://doi.org/10.1073/pnas.2106140118
Article MathSciNet Google Scholar
Bush V (1945) Science: the endless frontier. Report to the President. United States Government Printing Office, Washington
Buzan T, Buzan B (1994) The mind map book: How to use radiant thinking to maximize your brain’s untapped potential
Camporeale E (2019) The challenge of machine learning in space weather: nowcasting and forecasting. Space Weather 17(8):1166–1207. https://doi.org/10.1029/2018SW002061
Article ADS Google Scholar
Carpenter SR, Brock WAB (2006) Rising variance: a leading indicator of ecological transition. Ecol Lett 9(3):311–318
Article Google Scholar
Carpenter S, Walker B, Anderies JM et al. (2001) From metaphor to measurement: resilience of what to what? Ecosystems 4:765–781
Article Google Scholar
Casdagli M (1992) A dynamical systems approach to modeling input-output systems
Castiglione P, Falcioni M, Lesne A et al. (2008) Chaos and coarse graining in statistical mechanics. Cambridge University Press, Cambridge. https://doi.org/10.1017/CBO9780511535291
Book Google Scholar
Chang TTS (1992a) Low-dimensional behavior and symmetry breaking of stochastic systems near criticality-can these effects be observed in space and in the laboratory? IEEE Trans Plasma Sci 20:691–694
Article ADS Google Scholar
Chang T (1992b) Path integrals, differential renormalization-group, and stochastic systems near criticality. Int J Eng Sci 30:1401–1405
Article MathSciNet Google Scholar
Chang T (1998) Sporadic localized reconnections and multiscale intermittent turbulence in the magnetotail. In: Horwitz JL et al. (eds) Geospace mass and energy flow. Geophysical Monograph Series, vol 104. American Geophysical Union, Washington, DC, pp 193–200. https://doi.org/10.1029/GM104p0193
Chapter Google Scholar
Chang TN (1999) Self-organized criticality, multi-fractal spectra, sporadic localized reconnections and intermittent turbulence in the magnetotail. Phys Plasmas 6:4137–4145. https://doi.org/10.1063/1.873678
Article ADS Google Scholar
Chang T, Wu C (2007) Dynamical complexity, intermittent turbulence, coarse-grained dissipation, criticality and multifractal processes. AIP Conf Proc 932(1):161–166. https://doi.org/10.1063/1.2778959
Article ADS Google Scholar
Chang T, Tam SW, Wu CC et al. (2003) Complexity, forced and/or self-organized criticality, and topological phase transitions in space plasmas. Space Sci Rev 107:425–445. https://doi.org/10.1023/A:1025502023494
Article ADS Google Scholar
Chapman SC, Watkins NW (2000) Avalanching and self-organised criticality, a paradigm for geomagnetic activity? Space Sci Rev 95:293–307. https://doi.org/10.1023/A:1005236717469
Article ADS Google Scholar
Chapman SC, Watkins NW, Dendy R et al. (1998) A simple avalanche model as an analogue for magnetospheric activity. Geophys Res Lett 25(13):2397–2400
Article ADS Google Scholar
Chapman SC, Watkins NW, Rowlands G (1999) Signatures of dual scaling regimes in a simple avalanche model for magnetospheric activity. J Atmos Sol-Terr Phys 63:1361–1370
Article ADS Google Scholar
Chapman SC, Dendy R, Watkins NW (2004) Robustness and scaling: key observables in the complex dynamic magnetosphere. Plasma Phys Control Fusion 46:B157. https://doi.org/10.1088/0741-3335/46/12B/014
Article Google Scholar
Charbonneau P, McIntosh SW, Liu HL et al. (2001) Avalanche models for solar flares. Sol Phys 203:321–353. https://doi.org/10.1023/A:1013301521745
Article ADS Google Scholar
Chiang TK (2000) Catching crumbs from the table. Nature 405:517–517
Article Google Scholar
Chu X, Bortnik J, Li W et al. (2017) A neural network model of three-dimensional dynamic electron density in the inner magnetosphere. J Geophys Res Space Phys 122:9183–9197
Article ADS Google Scholar
Cilliers P (2000) Knowledge, complexity, and understanding. Emergence 2(4):7–13. https://doi.org/10.1207/S15327000EM0204_03
Article Google Scholar
Clausen LBN, Nickisch H (2018) Automatic classification of auroral images from the Oslo auroral themis (oath) data set using machine learning. J Geophys Res Space Phys 123(7):5640–5647. https://doi.org/10.1029/2018JA025274
Article ADS Google Scholar
Clauset A, Shalizi CR, Newman MEJ (2007) Power-law distributions in empirical data. SIAM Rev 51:661–703
Article ADS MathSciNet Google Scholar
Cohen IJ, Baker DN, Bortnik J et al (2023) Reimagining heliophysics: a bold new vision for the next decade and beyond. Bull AAS 55(3). https://doi.org/10.3847/25c2cfeb.f31e0ecb
Consolini G (1997) Sandpile cellular automata and magnetospheric dynamics. In: 8th GIFCO Conference – Cosmic physics in the year 2000, p 123.
Google Scholar
Consolini G (2002) Self-organized criticality: a new paradigm for the magnetotail dynamics. Fractals 10(03):275–283. https://doi.org/10.1142/S0218348X02001397
Article Google Scholar
Consolini G, Chang TS (2001) Magnetic field topology and criticality in geotail dynamics: relevance to substorm phenomena. Space Sci Rev 95:309–321. https://doi.org/10.1023/A:1005252807049
Article ADS Google Scholar
Consolini G, Michelis PD, Tozzi R (2008) On the earth’s magnetospheric dynamics: Nonequilibrium evolution and the fluctuation theorem. J Geophys Res 113:A08222. https://doi.org/10.1029/2008JA013074
Article ADS Google Scholar
Consolini G, Quattrociocchi V, D’Angelo G et al (2021) Electric field multifractal features in the high-latitude ionosphere: CSES-01 observations. Atmosphere 12(5). https://doi.org/10.3390/atmos12050646
Coppes W, Jansen L (2022) Beyond categorisation: on piet Mondrian’s artistry and success (1911-1919). Oud Holland – J Art Low Countries 135(2–3):138–156. https://doi.org/10.1163/18750176-1350203007
Article Google Scholar
Dakos V, van Nes EH, Donangelo R et al. (2010) Spatial correlation as leading indicator of catastrophic shifts. Theor Ecol 3:163–174. https://doi.org/10.1007/s12080-009-0060-6
Article Google Scholar
Davis TN, Sugiura M (1966a) Auroral electrojet activity index ae and its universal time variations. J Geophys Res 71(3):785–801. https://doi.org/10.1029/JZ071i003p00785
Article ADS Google Scholar
Davis TN, Sugiura M (1966b) Auroral electrojet activity index ae and its universal time variations. J Geophys Res 71:785–801
Article ADS Google Scholar
de Bruijn K, Buurman J, Mens M et al. (2017) Resilience in practice: five principles to enable societies to cope with extreme weather events. Environ Sci Policy 70:21–30. https://doi.org/10.1016/j.envsci.2017.02.001
Article Google Scholar
de Michelis P, Consolini G, Tozzi R (2015) Magnetic field fluctuation features at swarm’s altitude: a fractal approach. Geophys Res Lett 42:3100–3105
Article ADS Google Scholar
De Wolf T, Holvoet T (2005) Emergence versus self-organisation: different concepts but promising when combined. In: Brueckner SA, Di Marzo Serugendo G, Karageorgos A et al. (eds) Engineering self-organising systems. Springer, Berlin, pp 1–15
Google Scholar
Demirel Y, Gerbaud V (2019) Chap. 12 - stability analysis. In: Demirel Y, Gerbaud V (eds) Nonequilibrium thermodynamics, Forth edn. Elsevier, Amsterdam, pp 573–602. https://doi.org/10.1016/B978-0-444-64112-0.00012-5
Chapter Google Scholar
Denton MH, Borovsky JE, Stepanova M et al. (2016) Preface: unsolved problems of magnetospheric physics. J Geophys Res Space Phys 121(10):10,783–10,785. https://doi.org/10.1002/2016JA023362
Article Google Scholar
Denton MH (2021) In: Maggiolo R et al. (eds) Some unsolved problems of magnetospheric physics. Geophysical Monograph Series, vol 46. American Geophysical Union (AGU), Washington, pp 743–751. https://doi.org/10.1002/9781119815624.ch46.
Chapter Google Scholar
Descartes R (1968) Discourse on method. Harmondsworth, Penguin
Google Scholar
Devlin J, Chang MW, Lee K et al (2019) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805
Dods J, Chapman SC, Gjerloev JW (2015) Network analysis of geomagnetic substorms using the supermag database of ground-based magnetometer stations. J Geophys Res Space Phys 120(9):7774–7784. https://doi.org/10.1002/2015JA021456
Article ADS Google Scholar
Dods JE, Chapman SC, Gjerloev JW (2017) Characterizing the ionospheric current pattern response to southward and northward imf turnings with dynamical supermag correlation networks. J Geophys Res Space Phys 122:1883–1902
Article ADS Google Scholar
Donges JF, Zou Y, Marwan N et al. (2009) The backbone of the climate network. Europhys Lett 87:48,007
Article Google Scholar
Donovan EF, Mende SB, Jackel B et al. (2006) The themis all-sky imaging array—system design and initial results from the prototype imager. J Atmos Sol-Terr Phys 68:1472–1487
Article ADS Google Scholar
Dungey JW (1961) Interplanetary magnetic field and the auroral zones. Phys Rev Lett 6:47–48
Article ADS Google Scholar
Dutta C, Pandurangan G, Rajaraman R et al. (2013) On the complexity of information spreading in dynamic networks. In: Khanna S (ed) Proceedings of the 2013 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp 717–736. https://doi.org/10.1137/1.9781611973105.52
Chapter Google Scholar
Eastwood JP, Biffis E, Hapgood MA et al. (2017) The economic impact of space weather: where do we stand? Risk Anal 37(2):206–218. https://doi.org/10.1111/risa.12765
Article Google Scholar
Eastwood JP, Hapgood MA, Biffis E et al. (2018) Quantifying the economic value of space weather forecasting for power grids: an exploratory study. Space Weather 16(12):2052–2067. https://doi.org/10.1029/2018SW002003
Article ADS Google Scholar
Emardson R, Jarlemark P, Johansson JM et al. (2013) Spatial variability in the ionosphere measured with gnss networks. Radio Sci 48:646–652
Article ADS Google Scholar
Erdos PL, Rényi A (1984) On the evolution of random graphs. Trans Am Math Soc 286:257–257
Article MathSciNet Google Scholar
Farmer JD, Sidorowichl JJ (1989) Exploiting chaos to predict the future and reduce noise. In: Evolution, learning and cognition, pp 277–330
Chapter Google Scholar
Farrugia CJ, Freeman MP, Burlaga LF et al. (1993) The Earth’s magnetosphere under continued forcing - substorm activity during the passage of an interplanetary magnetic cloud. J Geophys Res 98:7657–7671
Article ADS Google Scholar
Finn E (2017) What algorithms want: imagination in the age of computing
Book Google Scholar
Fischer T, Rings T, Tabar MRR et al (2022) Towards a data-driven estimation of resilience in networked dynamical systems: Designing a versatile testbed. Frontiers in Network Physiology 2
Flack JC (2017) Coarse-graining as a downward causation mechanism. Philos Trans R Soc A, Math Phys Eng Sci 375(2109):20160,338. https://doi.org/10.1098/rsta.2016.0338
Article Google Scholar
Flack J, Mitchell MM (2021). Complex systems science allows us to see new paths forward. AEON. https://aeon.co/essays/complex-systems-science-allows-us-to-see-new-paths-forward
Flack JC, Ipeirotis P, Malone TW et al (2022) Editorial to the inaugural issue of collective intelligence. Collective Intelligence 1
Fortunato S (2009) Community detection in graphs. arXiv:0906.0612
Foster J (2011) Economic systems. In: Hooker C (ed) Philosophy of complex systems. Handbook of the philosophy of science, vol 10. North-Holland, Amsterdam, pp 509–530. https://doi.org/10.1016/B978-0-444-52076-0.50018-3
Chapter Google Scholar
Freeman MP, Morley SK (2004) A minimal substorm model that explains the observed statistical distribution of times between substorms. Geophys Res Lett 31
Freeman MP, Watkins NW, Riley DJ (2000) Evidence for a solar wind origin of the power law burst lifetime distribution of the ae indices. Geophys Res Lett 27(8):1087–1090. https://doi.org/10.1029/1999GL010742.
Article ADS Google Scholar
Fung SF, Shao X (2008) Specification of multiple geomagnetic responses to variable solar wind and imf input. Ann Geophys 26:639–652
Article ADS Google Scholar
Gabrielse C, Angelopoulos V, Runov A et al. (2014) Statistical characteristics of particle injections throughout the equatorial magnetotail. J Geophys Res Space Phys 119:2512–2535
Article ADS Google Scholar
Galam S (2012) Sociophysics: a physicist’s modeling of psycho-political phenomena. Springer, New York. https://doi.org/10.1007/978-1-4614-2032-3
Book Google Scholar
Galvez R, Fouhey DF, Jin M et al (2019) A machine learning dataset prepared from the NASA Solar Dynamics Observatory mission. Astrophys J Suppl 242:7. https://doi.org/10.3847/1538-4365/ab1005
Gell-Mann M (1995) What is complexity? Remarks on simplicity and complexity by the Nobel prize-winning author of the quark and the jaguar. Complexity 1(1):16–19. https://doi.org/10.1002/cplx.6130010105
Article ADS MathSciNet Google Scholar
Gell-Mann M, Low FE (1954) Quantum electrodynamics at small distances. Phys Rev 95:1300–1312
Article ADS MathSciNet Google Scholar
Germany GA, Parks GK, Brittnacher M et al. (1997) Remote determination of auroral energy characteristics during substorm activity. Geophys Res Lett 24(8):995–998. https://doi.org/10.1029/97GL00864
Article ADS Google Scholar
Gjerloev JW (2009) A global ground-based magnetometer initiative. Eos Trans AGU 90(27):230–231. https://doi.org/10.1029/2009EO270002
Article ADS Google Scholar
Gjerloev JW, Hoffman R (2014) The large-scale current system during auroral substorms. J Geophys Res Space Phys 119:4591–4606
Article ADS Google Scholar
Gjerloev JW, Hoffman R, Sigwarth JB et al (2007) Statistical description of the bulge-type auroral substorm in the far ultraviolet. J Geophys Res 112
Glansdorff P, Prigogine I, Hill RN (1973) Thermodynamic theory of structure, stability and fluctuations. Am J Phys 41(1):147–148
Article ADS Google Scholar
Goertz CK, Shan LH, Smith RA (1993) Prediction of geomagnetic activity. J Geophys Res 98:7673–7684
Article ADS Google Scholar
Golovchanskaya I, Kozelov BV, Sergienko T et al (2008) Scaling behavior of auroral luminosity fluctuations observed by auroral large imaging system (alis). J Geophys Res 113
González MC, Hidalgo CA, Barabási AL (2008) Understanding individual human mobility patterns. Nature 453:779–782. https://doi.org/10.1038/nature06958
Article ADS Google Scholar
Granger CWJ (1969) Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37(3):424–438. https://doi.org/10.2307/1912791
Article Google Scholar
Granovetter MS (1973) The strength of weak ties. Am J Sociol 78:1360–1380
Article Google Scholar
Granovetter MS (1978) Threshold models of collective behavior. Am J Sociol 83:1420–1443. https://doi.org/10.1086/226707
Article Google Scholar
Green L, Deighton R, Baker D (2016) Building space weather resilience in the finance sector
Gregersen NH (2002) From complexity to life: on the emergence of life and meaning
Book Google Scholar
Grèzes F, Blanco-Cuaresma S, Accomazzi A et al (2021) Building astroBERT, a language model for astronomy & astrophysics. arXiv:2112.00590
Gunderson LH (2000) Ecological resilience–in theory and application. Annu Rev Ecol Syst 31:425–439
Article Google Scholar
Haiducek JD, Welling DT, Morley SK et al (2019) Using multiple signatures to improve accuracy of substorm identification. J Geophys Res Space Phys 125
Halley JM (1996) Ecology, evolution and 1 f-noise. Trends Ecol Evol 11(1):33–37
Article Google Scholar
Haraway DJ (1976) Crystals, fabrics, and fields: metaphors of organicism in twentieth-century developmental biology. Yale University Press, New Haven and London
Google Scholar
Hayles NK (1999) How we became posthuman: virtual bodies in cybernetics. Literature, and informatics. University of Chicago Press, Chicago
Book Google Scholar
Helbing D (2013) Globally networked risks and how to respond. Nature 497:51–59. https://doi.org/10.1038/nature12047
Article ADS Google Scholar
Hernandez JV, Tajima T, Horton W (1993) Neural net forecasting for geomagnetic activity. Geophys Res Lett 20(23):2707–2710. https://doi.org/10.1029/93GL02848
Article ADS Google Scholar
Heschel AJ, Heschel S (1989) Moral grandeur and spiritual audacity: essays
Google Scholar
Hess C, Ostrom E (2007) Understanding knowledge as a commons: from theory to practice. MIT Press, Cambridge. https://doi.org/10.7551/mitpress/6980.001.0001
Book Google Scholar
Hidalgo C (2015) Why information grows: the evolution of order, from atoms to economies. Penguin, Baltimore
Google Scholar
Hobson EA, Ferdinand V, Kolchinsky A et al. (2018) Rethinking animal social complexity measures with the help of complex systems concepts. Anim Behav 155:287–296
Article Google Scholar
Hofstadter DR (1999) Godel escher Bach: an eternal golden braid. Basic Books, USA
Google Scholar
Holland JH (1975) Adaptation in natural and artificial systems. MIT Press, Cambridge, MA. https://doi.org/10.7551/mitpress/1090.001.0001
Book Google Scholar
Holland JH (1992) Genetic algorithms. Sci Am 267(1):66–73. https://www.jstor.org/stable/24939139
Article ADS Google Scholar
Holland JH (1995) Hidden order: how adaptation builds complexity. Perseus Books, Reading
Google Scholar
Holland JH (2000) Emergence: from chaos to order. OUP, Oxford
Google Scholar
Hones EW (1979) Transient phenomena in the magnetotail and their relation to substorms. Space Sci Rev 23:393–410. https://doi.org/10.1007/BF00172247
Article ADS Google Scholar
Hughes J, McGranaghan R, Kellerman AC et al. (2022) Revealing novel connections between space weather and the power grid: network analysis of ground-based magnetometer and geomagnetically induced currents (gic) measurements. Space Weather 20(2):e2021SW002,727. https://doi.org/10.1029/2021SW002727
Article Google Scholar
Hultman NE, Hassenzahl DM, Rayner S (2010) Climate risk. Annu Rev Environ Resour 35(1):283–303. https://doi.org/10.1146/annurev.environ.051308.084029
Article Google Scholar
Hwa K (1992) Avalanches, hydrodynamics, and discharge events in models of sandpiles. Phys Rev A 45(10):7002–7023
Article ADS Google Scholar
Jonas S, McCarron E, Murtagh W (2016) Space weather policy and effects. Insight 19(4):20–23. https://doi.org/10.1002/inst.12121
Article Google Scholar
Jonas S, Fronczyk K, Pratt LM (2018) A framework to understand extreme space weather event probability. Risk Anal 38(8):1534–1540. https://doi.org/10.1111/risa.12981
Article Google Scholar
Jurafsky D, Martin JH (2000) Speech and language processing - an introduction to natural language processing, computational linguistics, and speech recognition. Prentice Hall series in artificial intelligence. Prentice Hall, New York
Google Scholar
Kambhu J, Weidman ST, Krishnan N (2007) New directions for understanding systemic risk: a report on a conference cosponsored by the federal reserve bank of New York and the national academy of sciences. Econ Policy Rev 13:83
Google Scholar
Kamide Y, Akasofu SI (1983) Notes on the auroral electrojet indices. Rev Geophys 21:1647–1656
Article ADS Google Scholar
Kamide Y, Kokubun S (1996) Two-component auroral electrojet: importance for substorm studies. J Geophys Res Space Phys 101(A6):13,027–13,046. https://doi.org/10.1029/96JA00142
Article ADS Google Scholar
Kamide Y, Kokubun S, Bargatze L et al. (1999) The size of the polar cap as an indicator of substorm energy. Phys Chem Earth, Part C, Sol-Terr Planet Sci 24(1):119–127. https://doi.org/10.1016/S1464-1917(98)00018-X. International Symposium on Solar-Terrestrial Coupling Processes
Article Google Scholar
Kaneko K (1993) Theory and applications of coupled map lattices. Nonlinear science: theory and applications. Wiley, New York
Google Scholar
Karpatne A, Atluri G, Faghmous JH et al. (2016) Theory-guided data science: a new paradigm for scientific discovery from data. IEEE Trans Knowl Data Eng 29:2318–2331
Article Google Scholar
Kauffman S (1993) The origins of order: self-organization and selection in evolution. Oxford University Press, London
Book Google Scholar
Kauffman SA, Johnsen S (1991) Coevolution to the edge of chaos: coupled fitness landscapes, poised states, and coevolutionary avalanches. J Theor Biol 149(4):467–505
Article ADS Google Scholar
Kelly KF (2016) The inevitable: Understanding the 12 technological forces that will shape our future
Klein E (2023) This changes everything. New York Times. https://www.nytimes.com/2023/03/12/opinion/chatbots-artificial-intelligence-future-weirdness.html
Klimas AJ, Baker DN, Roberts DA et al. (1992) A nonlinear dynamical analogue model of geomagnetic activity. J Geophys Res 97(12):12,253–12,266
Article ADS Google Scholar
Klimas AJ, Baker DN, Vassiliadis D et al. (1994) Substorm recurrence during steady and variable solar wind driving: evidence for a normal mode in the unloading dynamics of the magnetosphere. J Geophys Res 99:14,855–14,861
Article ADS Google Scholar
Klimas AJ, Vassiliadis D, Baker DN et al. (1996) The organized nonlinear dynamics of the magnetosphere. J Geophys Res Space Phys 101(A6):13,089–13,113. https://doi.org/10.1029/96JA00563
Article ADS Google Scholar
Klimas AJ, Uritsky VM, Valdivia JA et al. (2000a) On the compatibility of the coherent substorm cycle with the complex plasma sheet. In: Wilson A (ed) 5th international conference on substorms, pp 165–168
Google Scholar
Klimas AJ, Valdivia JA, Vassiliadis D et al. (2000b) Self-organized criticality in the substorm phenomenon and its relation to localized reconnection in the magnetospheric plasma sheet. J Geophys Res Space Phys 105(A8):18,765–18,780. https://doi.org/10.1029/1999JA000319
Article ADS Google Scholar
Klimas AJ, Uritsky VM, Donovan EF (2010) Multiscale auroral emission statistics as evidence of turbulent reconnection in Earth’s midtail plasma sheet. J Geophys Res 115
Kozelov BV, Uritsky VM, Klimas AJ (2004) Power law probability distributions of multiscale auroral dynamics from ground-based tv observations. Geophys Res Lett 31
Krakauer D (2018) Worlds Hidden in Plain Sight: the Evolving Idea of Complexity at the Santa Fe Institute, 1984-2019. Santa Fe Institute of Science
Krakauer D (2019) Beyond borders: New complexity economics. Parallax (Fall 2019). https://sfi-edu.s3.amazonaws.com/sfi-edu/production/uploads/publication/2019/10/22/SFI-Parallax-Fall-2019.pdf
Krakauer D (2020) At the limits of thought. Aeon
Kuhn T (1962) The structure of scientific revolutions, vol II(2). University of Chicago Press, Chicago
Google Scholar
Kvammen A, Wickstrøm K, McKay D et al. (2020) Auroral image classification with deep neural networks. J Geophys Res Space Phys 125(10):e2020JA027,808. https://doi.org/10.1029/2020JA027808
Article Google Scholar
Ladyman J, Lambert J, Wiesner K (2020) What is a complex system? Eur J Philos Sci 3:33–67
Article Google Scholar
Langton CG et al. (eds) (1991) Artificial life II. Addison-Wesley, Redwood City, CA
Google Scholar
Leger JM, Jager T, Bertrand F et al. (2015) In-flight performance of the absolute scalar magnetometer vector mode on board the swarm satellites. Earth Planets Space 67:1–12
Article Google Scholar
Lent J (2017) The patterning instinct: a cultural history of humanity’s search for meaning
Google Scholar
Lenton TM, Held H, Kriegler E et al. (2008) Tipping elements in the Earth’s climate system. Proc Natl Acad Sci 105:1786–1793
Article ADS Google Scholar
Levin SA, Anderies JM, Adger WN et al (2021) Governance in the face of extreme events: Lessons from evolutionary processes for structuring interventions, and the need to go beyond. SSRN Electron J
Levitt M, Warshel A (1975) Computer simulation of protein folding. Nature 253:694–698
Article ADS Google Scholar
Liemohn MW, McCollough JP, Jordanova VK et al. (2018) Model evaluation guidelines for geomagnetic index predictions. Space Weather 16:2079–2102
Article ADS Google Scholar
Liemohn MW, Shane AD, Azari AR et al (2021) Rmse is not enough: Guidelines to robust data-model comparisons for magnetospheric physics. J Atmos Sol-Terr Phys
Liou K, Sotirelis T, Richardson I (2018) Substorm occurrence and intensity associated with three types of solar wind structure. J Geophys Res Space Phys 123(1):485–496. https://doi.org/10.1002/2017JA024451
Article ADS Google Scholar
Litt B, Esteller R, Echauz JR et al. (2001) Epileptic seizures may begin hours in advance of clinical onset a report of five patients. Neuron 30:51–64
Article Google Scholar
Lloyd S (2001) Measures of complexity: a nonexhaustive list. IEEE Control Syst Mag 21(4):7–8. https://doi.org/10.1109/MCS.2001.939938
Article Google Scholar
Lockwood M (2022) Solar wind-magnetosphere coupling functions: pitfalls, limitations, and applications. Space Weather 20(2):e2021SW002,989. https://doi.org/10.1029/2021SW002989
Article Google Scholar
Lockwood M, van Eyken AP, Bromage BJI et al. (1986) Eastward propagation of a plasma convection enhancement following a southward turning of the interplanetary magnetic field. Geophys Res Lett 13(1):72–75. https://doi.org/10.1029/GL013i001p00072
Article ADS Google Scholar
Longden N, Chisham G, Freeman MP (2014) Magnetic local time variation and scaling of poleward auroral boundary dynamics. J Geophys Res Space Phys 119:10,006–10,022
Article Google Scholar
López-Ruiz R, Mancini H, Calbet X (1995) A statistical measure of complexity. arXiv:1009.1498
Lui ATY (2001) Current controversies in magnetospheric physics. Rev Geophys 39(4):535–563. https://doi.org/10.1029/2000RG000090
Article ADS Google Scholar
Lui ATY, Chapman SC, Liou K et al. (2000) Is the dynamic magnetosphere an avalanching system? Geophys Res Lett 27(7):911–914. https://doi.org/10.1029/1999GL010752
Article ADS Google Scholar
Lundstedt H, Wintoft P (1994) Prediction of geomagnetic storms from solar wind data with the use of a neural network. Ann Geophys 12:19–24
Article ADS Google Scholar
Luo R, Sun L, Xia Y et al (2022) Biogpt: Generative pre-trained transformer for biomedical text generation and mining. Brief Bioinform
Maimaiti M, Kunduri BSR, Ruohoniemi JM et al. (2019) A deep learning-based approach to forecast the onset of magnetic substorms. Space Weather 17:1534–1552
Article ADS Google Scholar
Malik N, Bookhagen B, Marwan N et al. (2011) Analysis of spatial and temporal extreme monsoonal rainfall over south Asia using complex networks. Clim Dyn 39:971–987
Article Google Scholar
Manshour P, Balasis G, Consolini G et al (2021) Causality and information transfer between the solar wind and the magnetosphere–ionosphere system. Entropy 23
Martignon L (2001) Information theory. In: Smelser NJ, Baltes PB (eds) International encyclopedia of the social & behavioral sciences. Pergamon, Oxford, pp 7476–7480. https://doi.org/10.1016/B0-08-043076-7/00608-2
Chapter Google Scholar
Materassi M, Ciraolo L, Consolini G et al. (2011) Predictive space weather: an information theory approach. Adv Space Res 47:877–885
Article ADS Google Scholar
Mazzocchi F (2015) Could big data be the end of theory in science? EMBO Rep 16:1250–1255. https://doi.org/10.15252/embr.201541001
Article Google Scholar
McAteer RTJ, Aschwanden MJ, Dimitropoulou M et al. (2015) 25 years of self-organized criticality: numerical detection methods. Space Sci Rev 198:217–266. https://doi.org/10.1007/s11214-015-0158-7
Article ADS Google Scholar
McCarthy J, Minsky M, Rochester N et al. (2006) A proposal for the dartmouth summer research project on artificial intelligence, August 31, 1955. AI Mag 27:12–14
Google Scholar
McGranaghan R (2022) The evolution of heliophysics: complexity, community, and open science. Front Astron Space Sci 9. https://doi.org/10.3389/fspas.2022.951411
McGranaghan RM, Bhatt A, Matsuo T et al. (2017a) Ushering in a new frontier in geospace through data science. J Geophys Res Space Phys 122(12):12,586–12,590. https://doi.org/10.1002/2017JA024835
Article Google Scholar
McGranaghan RM, Mannucci AJ, Forsyth C (2017b) A comprehensive analysis of multiscale field-aligned currents: characteristics, controlling parameters, and relationships. J Geophys Res Space Phys 122(12):11,931–11,960. https://doi.org/10.1002/2017JA024742
Article Google Scholar
McGranaghan RM, Mannucci AJ, Verkhoglyadova O et al. (2017c) Finding multiscale connectivity in our geospace observational system: network analysis of total electron content. J Geophys Res Space Phys 122(7):7683–7697. https://doi.org/10.1002/2017JA024202
Article ADS Google Scholar
McGranaghan R, Borovsky JE, Denton MH (2018) How do we accomplish system science in space? Eos
McGranaghan R, Kellerman A, Arritt R et al (2020) The heliophysics and space weather open knowledge network: the convergence hub for the exploration of space science (CHESS). https://doi.org/10.1002/essoar.10503724.1
McGranaghan R, Camporeale E, Georgoulis MK et al (2021a) Space weather research in the digital age and across the full data lifecycle: Introduction to the topical issue. J Space Weather Space Clim
McGranaghan R, Klein S, Cameron A et al (2021b) The need for a Space Data Knowledge Commons. Structuring Collective Knowledge https://knowledgestructure.pubpub.org/pub/space-knowledge-commons
McGranaghan RM, Ziegler J, Bloch T et al. (2021c) Toward a next generation particle precipitation model: mesoscale prediction through machine learning (a case study and framework for progress). Space Weather 19(6):e2020SW002,684. https://doi.org/10.1029/2020SW002684.
Article Google Scholar
McGranaghan R, Kellerman AL, Olson MW (2022) Converging toward solutions to grand challenges. Eos
McPherron RL (1970) Growth phase of magnetospheric substorms. J Geophys Res 75(28):5592–5599. https://doi.org/10.1029/JA075i028p05592
Article ADS Google Scholar
McPherron RL, Rostoker G (1993) Comment on “prediction of geomagnetic activity” by C. K. Goertz, Lin-Hua Shan, and R. A. Smith. J Geophys Res 98:7685–7686
Article ADS Google Scholar
McPherron RL, Russell CT, Aubry MP (1973) Satellite studies of magnetospheric substorms on August 15, 1968: 9. Phenomenological model for substorms. J Geophys Res 78(16):3131–3149. https://doi.org/10.1029/JA078i016p03131
Article ADS Google Scholar
McPherron RL, Hsu TS, Chu X (2015) An optimum solar wind coupling function for the AL index. J Geophys Res Space Phys 120(4):2494–2515. https://doi.org/10.1002/2014JA020619
Article ADS Google Scholar
Meadows D, Wright D (2008) Thinking in systems: a primer. Chelsea, New York
Google Scholar
Mendillo M, Klobuchar JA (2006) Total electron content: Synthesis of past storm studies and needed future work. Radio Sci 41
Meng X, Verkhoglyadova OP (2021) Quantifying contributions of external drivers to the global ionospheric state. Space Weather 19(9):e2021SW002,752. https://doi.org/10.1029/2021SW002752
Article Google Scholar
Merkin VG, Panov EV, Sorathia KA et al. (2019) Contribution of bursty bulk flows to the global dipolarization of the magnetotail during an isolated substorm. J Geophys Res Space Phys 124:8647–8668
Article ADS Google Scholar
Merriam-Webster (2023) Systems. https://www.merriam-webster.com/dictionary/system
Milgram S (1967) The small world problem. Psychol Today 2:60–67
Google Scholar
Miller JH, Page SE (2009) Complex adaptive systems: an introduction to computational models of social life. Princeton University Press, Princeton
Book Google Scholar
Milne BT (1998) Motivation and benefits of complex systems approaches in ecology. Ecosystems 1:449–456
Article Google Scholar
Mitchell T (1997) Machine learning. McGraw-Hill international editions. McGraw-Hill, New York
Google Scholar
Mitchell M (2009) Complexity: a Guided Tour. Oxford University Press, London. https://doi.org/10.1093/oso/9780195124415.001.0001
Book Google Scholar
Moore GE (1998) Cramming more components onto integrated circuits. Proc IEEE 86:82–85
Article Google Scholar
Nanjo S, Nozawa S, Yamamoto M et al (2022) An automated auroral detection system using deep learning: real-time operation in Tromsø, Norway. Sci Rep 12
Narock T, Fox P (2012) From science to e-science to semantic e-science: a heliophysics case study. Comput Geosci 46:248–254. https://doi.org/10.1016/j.cageo.2011.11.018
Article ADS Google Scholar
National Research Council (2014) Convergence: facilitating transdisciplinary integration of life sciences, physical sciences, engineering, and beyond. The National Academies Press, Washington, DC. https://doi.org/10.17226/18722
Book Google Scholar
Nersessian NJ (2022) Interdisciplinarity in the making: models and methods in frontier science. MIT Press, Cambridge
Book Google Scholar
Newell PT, Gjerloev JW (2011a) Evaluation of supermag auroral electrojet indices as indicators of substorms and auroral power. J Geophys Res 116
Newell PT, Gjerloev JW (2011b) Substorm and magnetosphere characteristic scales inferred from the supermag auroral electrojet indices. J Geophys Res 116
Newell PT, Gjerloev J (2014) Local geomagnetic indices and the prediction of auroral power. J Geophys Res Space Phys 119:9790–9803
Article ADS Google Scholar
Newell PT, Sotirelis T, Liou K et al (2007) A nearly universal solar wind-magnetosphere coupling function inferred from 10 magnetospheric state variables. J Geophys Res Space Phys 112(A1). https://doi.org/10.1029/2006JA012015
Newman MEJ (2004) Power laws, Pareto distributions and Zipf’s law. Contemp Phys 46:323–351. https://doi.org/10.1080/00107510500052444
Article ADS Google Scholar
Newman M (2010) Networks: an introduction. Oxford University Press, London. https://doi.org/10.1093/acprof:oso/9780199206650.001.0001
Book Google Scholar
Newman MEJ (2010) Networks: an Introduction
Newman MEJ, Watts DJ, Strogatz SH (2002) Random graph models of social networks. Proc Natl Acad Sci USA 99:2566–2572
Article ADS Google Scholar
Niazi MA, Hussain A (2011) Agent-based computing from multi-agent systems to agent-based models: a visual survey. Scientometrics 89:479–499
Article Google Scholar
Nishida A, Iwasaki N, Nagata T (1966) Origin of fluctuations in the equatorial electrojet: a new type of geomagnetic variation. Ann Geophys 22:478–484
Google Scholar
Nishimura Y, Deng Y, Lyons LR et al. (2021) In: Multiscale dynamics in the high-latitude ionosphere. Am. Geophys. Union, Washington, pp 49–65. https://doi.org/10.1002/9781119815617.ch3
Chapter Google Scholar
Nishimura Y et al. (2022) Chap. 1 - multiscale processes in the m-i-t system. In: Nishimura Y, Verkhoglyadova O, Deng Y et al. (eds) Cross-scale coupling and energy transfer in the magnetosphere-ionosphere-thermosphere system. Elsevier, Amsterdam, pp 1–63. https://doi.org/10.1016/B978-0-12-821366-7.00007-X. https://www.sciencedirect.com/science/article/pii/B978012821366700007X
Chapter Google Scholar
Obayashi T, Nishida A (1968) Large-scale electric field in the magnetosphere. Space Sci Rev 8:3–31. https://doi.org/10.1007/BF00362569
Article ADS Google Scholar
Orr L, Chapman SC, Gjerloev JW (2019) Directed network of substorms using supermag ground-based magnetometer data. Geophys Res Lett 46(12):6268–6278. https://doi.org/10.1029/2019GL082824
Article ADS Google Scholar
Orr L, Chapman SC, Beggan CD (2021a) Wavelet and network analysis of magnetic field variation and geomagnetically induced currents during large storms. Space Weather 19(9):e2021SW002,772. https://doi.org/10.1029/2021SW002772
Article Google Scholar
Orr L, Chapman SC, Gjerloev JW et al (2021b) Network community structure of substorms using supermag magnetometers. Nat Commun 12
Osborne AR, Provenzale A (1989) Finite correlation dimension for stochastic systems with power-law spectra. Phys D: Nonlinear Phenom 35:357–381
Article ADS MathSciNet Google Scholar
Ottino J, Mau B (2022) The nexus: augmented thinking for a. Complex world–the new convergence of art, technology, and science. MIT Press, Cambridge
Google Scholar
Oughton EJ, Skelton A, Horne RB et al. (2017) Quantifying the daily economic impact of extreme space weather due to failure in electricity transmission infrastructure. Space Weather 15(1):65–83. https://doi.org/10.1002/2016SW001491
Article ADS Google Scholar
Oughton EJ, Hapgood M, Richardson GS et al. (2019) A risk assessment framework for the socioeconomic impacts of electricity transmission infrastructure failure due to space weather: an application to the United Kingdom. Risk Anal 39(5):1022–1043. https://doi.org/10.1111/risa.13229
Article Google Scholar
Page S (2011) Diversity and complexity. Princeton University Press, Princeton. https://doi.org/10.1515/9781400835140
Book Google Scholar
Palmerio E, Lee CO, Mays ML et al (2022) Cmes and seps during November-December 2020: a challenge for real-time space weather forecasting. Space Weather 20
Pankratius V, Li JD, Gowanlock MG et al. (2016) Computer-aided discovery: toward scientific insight generation with machine support. IEEE Intell Syst 31:3–10
Article Google Scholar
Panter-Brick C (2014) Health, risk, and resilience: interdisciplinary concepts and applications. Annu Rev Anthropol 43(1):431–448
Article Google Scholar
Papadimitriou CH, Raghavan P, Tamaki H et al. (1998) Latent semantic indexing: a probabilistic analysis. J Comput Syst Sci 61:217–235
Article MathSciNet Google Scholar
Papadimitriou C, Balasis G, Boutsi AZ et al (2020) Dynamical complexity of the 2015 St. Patrick’s day magnetic storm at swarm altitudes using entropy measures. Entropy 22
Parrish J, Viscido S, Grünbaum D (2002) Self-organized fish schools: an examination of emergent properties. Biol Bull 202(3):296–305. https://doi.org/10.2307/1543482
Article Google Scholar
Pastor-Satorras R, Vespignani A (2001) Epidemic dynamics and endemic states in complex networks. Phys Rev E 63:066117. https://doi.org/10.1103/PhysRevE.63.066117
Article ADS Google Scholar
Paton D, Smith LM, Violanti JM (2000) Disaster response: risk, vulnerability and resilience. Disaster Prev Manag 9:173–179
Article Google Scholar
Pavlos GP, Kyriakou GA, Rigas AG et al. (1992) Evidence for strange attractor structures in space plasmas. Ann Geophys 10:309–322
ADS Google Scholar
Peek L, Tobin J, Adams RM et al (2020) A framework for convergence research in the hazards and disaster field: The natural hazards engineering research infrastructure converge facility. Front Built Environ 6:110. https://doi.org/10.3389/fbuil.2020.00110
Perreault P, Akasofu SI (1978) A study of geomagnetic storms. Geophys J Int 54(3):547–573. https://doi.org/10.1111/j.1365-246X.1978.tb05494.x
Article ADS Google Scholar
Pines D (2018) Emerging syntheses in science: proceedings of the founding workshops of the Santa Fe institute. SFI Press
Book Google Scholar
Plant S (1995) The future looms: weaving women and cybernetics. Body Soc 1:45–64. https://doi.org/10.1177/1357034X95001003003
Article Google Scholar
Plenz D, Ribeiro TL, Miller SR et al (2021) Self-organized criticality in the brain. Front Phys
Pomerantz J (2015) Metadata. The MIT press essential knowledge series. MIT Press, Cambridge
Book Google Scholar
Porter MA, Onnela JP, Mucha PJ (2009) Communities in networks. Not AMS 56(9):1082–1166
MathSciNet Google Scholar
Price S (2019) Jason Reynolds calls for architects of understanding. American Libraries. https://americanlibrariesmagazine.org/blogs/the-scoop/jason-reynolds-opens-annual/
Price CP, Prichard D (1993) The non-linear response of the magnetosphere: 30 October 1978. Geophys Res Lett 20:771–774
Article ADS Google Scholar
Prichard D, Price CP (1992) Spurious dimension estimates from time series of geomagnetic indices. Geophys Res Lett 19:1623–1626
Article ADS Google Scholar
Prigogine I, Lefever R (1968) Symmetry Breaking Instabilities in Dissipative Systems. II. Journal of Chemical Physics 48:1695–1700. https://doi.org/10.1063/1.1668896
Article ADS Google Scholar
Prigogine I, Nicolis G (1967) On symmetry-breaking instabilities in dissipative systems. Journal of Chemical Physics 46:3542–3550. https://doi.org/10.1063/1.1841255
Article ADS Google Scholar
Prigogine I, Nicolis G (1971) Biological order, structure and instabilities. Quarterly Reviews of Biophysics 107–148. https://doi.org/10.1017/S0033583500000615
Prince SH (2009) Catastrophe and Social Change, Based upon a Sociological Study of the Halifax Disaster
Promislow DEL, Anderson RM, Scheffer M et al (2022) Resilience integrates concepts in aging research. IScience 25
Quarantelli EL (1987) Disaster studies: an analysis of the social historical factors affecting the development of research in the area. Int J Mass Emerg Disasters 5:285–310
Article Google Scholar
Radford A, Narasimhan K (2018) Improving language understanding by generative pre-training. https://gwern.net/doc/www/s3-us-west-2.amazonaws.com/d73fdc5ffa8627bce44dcda2fc012da638ffb158.pdf
Radicchi F, Castellano C, Cecconi F et al. (2003) Defining and identifying communities in networks. Proc Natl Acad Sci USA 101(9):2658–2663. https://doi.org/10.1073/pnas.0400054101
Article ADS Google Scholar
Ramasubramanian M, Virts KS, Shirey A et al (2020). Surveying the machine learning landscape in Earth sciences
Ridley AJ, Lu G, Clauer CR et al. (1997) Ionospheric convection during nonsteady interplanetary magnetic field conditions. J Geophys Res 102:14,563–14,579
Article ADS Google Scholar
Ridley AJ, Lu G, Clauer CR et al. (1998) A statistical study of the ionospheric convection response to changing interplanetary magnetic field conditions using the assimilative mapping of ionospheric electrodynamics technique. J Geophys Res 103:4023–4039
Article ADS Google Scholar
Riley P (2012) On the probability of occurrence of extreme space weather events. Space Weather 10:S02012. https://doi.org/10.1029/2011SW000734
Roberts DA (1991) Is there a strange attractor in the magnetosphere? J Geophys Res 96:16,031–16,046
Article ADS Google Scholar
Roberts DA, Baker DN, Klimas AJ et al. (1991) Indications of low dimensionality in magnetospheric dynamics. Geophys Res Lett 18:151–154
Article ADS Google Scholar
Rosas FE, Mediano PAM, Jensen HJ et al. (2020) Reconciling emergences: an information-theoretic approach to identify causal emergence in multivariate data. PLoS Comput Biol 16(12):1–22. https://doi.org/10.1371/journal.pcbi.1008289
Article Google Scholar
Ruelle D (1980) Strange attractors. Math Intell 2(126). https://doi.org/10.1007/BF03023053
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323:533–536
Article ADS Google Scholar
Runge J, Bathiany S, Bollt EM et al. (2019) Inferring causation from time series in earth system sciences. Nat Commun 10:2553. https://doi.org/10.1038/s41467-019-10105-3
Article ADS Google Scholar
Schank T, Wagner D (2005) Approximating clustering coefficient and transitivity. J Graph Algorithms Appl 9:265–275
Article MathSciNet Google Scholar
Scheffer M (2009) Critical Transitions in Nature and Society
Scheffer M, Carpenter S, Foley J et al. (2001) Catastrophic shifts in ecosystems. Nature 413:591–596. https://doi.org/10.1038/35098000
Article ADS Google Scholar
Scheffer M, Bascompte J, Brock WAB et al. (2009) Early-warning signals for critical transitions. Nature 461:53–59
Article ADS Google Scholar
Scheffer M, Bolhuis JE, Borsboom D et al. (2018) Quantifying resilience of humans and other animals. Proc Natl Acad Sci USA 115(11):11,883–11,890
Article Google Scholar
Schelling TC (1971) Dynamic models of segregation. J Math Sociol 1:143–186. https://doi.org/10.1080/0022250X.1971.9989794
Article Google Scholar
Schrijver CJ, Kauristie K, Aylward AD et al. (2015) Understanding space weather to shield society: a global road map for 2015–2025 commissioned by COSPAR and ILWS. Adv Space Res 55(12):2745–2807. https://doi.org/10.1016/j.asr.2015.03.023
Article ADS Google Scholar
Schunk RW, Scherliess L, Eccles V et al (2021) Challenges in specifying and predicting space weather. Space Weather 19
Sethna JP (2021) Statistical mechanics: entropy, order parameters, and complexity
Book Google Scholar
Shan LH, Goertz CK, Smith RA (1991a) On the embedding-dimension analysis of ae and al time series. Geophys Res Lett 18(8):1647–1650
Article ADS Google Scholar
Shan LH, Hansen P, Goertz C et al. (1991b) Chaotic appearance of the ae index. Geophys Res Lett 18(2):147–150
Article ADS Google Scholar
Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J 27:623–656
Article MathSciNet Google Scholar
Sharma AS, Vassiliadis D, Papadopoulos KD (1993) Reconstruction of low-dimensional magnetospheric dynamics by singular spectrum analysis. Geophys Res Lett 20:335–338
Article ADS Google Scholar
Sharma AS, Baker DN, Bhattacharyya A et al. (2012) Complexity and extreme events in geosciences: an overview. In: Sharma AS et al. (eds) Extreme events and natural hazards: the complexity perspective, pp 1–16. https://doi.org/10.1029/2012GM001233
Chapter Google Scholar
Sharma AS, Aschwanden MJ, Crosby NB et al. (2016) 25 years of self-organized criticality: space and laboratory plasmas. Space Sci Rev 198:167–216. https://doi.org/10.1007/s11214-015-0225-0
Article ADS Google Scholar
Shay MA, Drake JF, Denton RE et al. (1998) Structure of the dissipation region during collisionless magnetic reconnection. J Geophys Res 103:9165–9176
Article ADS Google Scholar
Shim JS (2009) Analysis of total electron content (tec) variations in the low- and middle-latitude ionosphere
Shimizu C, Mcgranaghan R, Eberhart A et al. (2020) Towards a modular ontology for space weather research. In: Workshop on ontology design and patterns (WOP)
Google Scholar
Simpson NP, Mach KJ, Constable A et al. (2021) A framework for complex climate change risk assessment. One Earth 4(4):489–501
Article ADS Google Scholar
Smyth WD, Nash JD, Moum JN (2019) Self-organized criticality in geophysical turbulence. Sci Rep 9:3747. https://doi.org/10.1038/s41598-019-39869-w
Article ADS Google Scholar
Sneppen K, Bak P, Flyvbjerg H et al. (1995) Evolution as a self-organized critical phenomenon. Proc Natl Acad Sci USA 92(11):5209–5213
Article ADS Google Scholar
Sobel AH (2022) The science of climate risk. In: AGU Fall Meeting abstracts, pp A23C–01
Google Scholar
Sobel AH, Tippett MK, Camargo SJ et al. (2014) Science-based risk assessments for rare events in a changing climate. In: AGU Fall Meeting abstracts, NH33B-3915
Google Scholar
Sober E, Wilson DS (2009) Unto others. In: Ruse M (ed) Philosophy after Darwin. Princeton University Press, Princeton, p 433
Google Scholar
Solnit R (2009) A paradise built in hell: the extraordinary communities that arise in disaster. Viking Press
Google Scholar
Song C, Havlin S, Makse HA (2006) Origins of fractality in the growth of complex networks. Nat Phys 2(4):275–281
Article Google Scholar
Sorathia KA, Merkin VG, Ukhorskiy AY et al. (2017) Energetic particle loss through the magnetopause: a combined global mhd and test-particle study. J Geophys Res Space Phys 122(9):9329–9343. https://doi.org/10.1002/2017JA024268
Article ADS Google Scholar
Spanswick EL, Donovan E, Liang J et al. (2018) First-light observations from the transition region explorer (TREx) ground-based network. In: AGU Fall Meeting abstracts
Google Scholar
Srivastava N, Mierla M, Zhang J (2021) Editorial: space weather prediction: challenges and prospects. Front Astron Space Sci. https://doi.org/10.3389/fspas.2021.818878
Article Google Scholar
Stanley HE, Amaral LAN, Buldyrev SV et al. (2002) Self-organized complexity in economics and finance. Proc Natl Acad Sci USA 99:2561–2565
Article ADS Google Scholar
Steinhaeuser K, Ganguly AR, Chawla N (2011) Multivariate and multiscale dependence in the global climate system revealed through complex networks. Clim Dyn 39:889–895
Article Google Scholar
Stephens GK, Sitnov MI, Korth H et al. (2019) Global empirical picture of magnetospheric substorms inferred from multimission magnetometer data. J Geophys Res Space Phys 124(2):1085–1110. https://doi.org/10.1029/2018JA025843
Article ADS Google Scholar
Strogatz S (2018) Nonlinear dynamics and chaos with applications to physics, biology, chemistry and engineering. CRC Press, Boca Raton
Google Scholar
Stumpo M, Consolini G, Alberti T et al (2020) Measuring information coupling between the solar wind and the magnetosphere–ionosphere system. Entropy 22
Syrjäsuo M, Donovan E (2002) Analysis of auroral images: detection and tracking. Geophysica 38(1–2):3–14
Google Scholar
Syrjäsuo MT, Donovan EF (2004) Diurnal auroral occurrence statistics obtained via machine vision. Ann Geophys 22:1103–1113
Article ADS Google Scholar
Szabo A (2014) NASA Wind satellite. In: Allahdadi F, Pelton J (eds) Handbook of cosmic hazards and planetary defense. https://doi.org/10.1007/978-3-319-02847-7_13-1
Chapter Google Scholar
Takalo J, Timonen J, Koskinen HEJ (1993) Correlation dimension and affinity of ae data and bicolored noise. Geophys Res Lett 20:1527–1530
Article ADS Google Scholar
Takalo J, Timonen J, Koskinen HEJ (1994) Properties of ae data and bicolored noise. J Geophys Res 99:13,239–13,249
Article ADS Google Scholar
Takalo J, Timonen J, Klimas AJ et al (1999) A coupled-map model for the magnetotail current sheet. Geophys Res Lett 26
Tamkin A, Brundage M, Clark J et al (2021) Understanding the capabilities, limitations, and societal impact of large language models. arXiv:2102.02503
Tegmark M (2017) Life 3.0: Being human in the age of artificial intelligence
Thayer J (2011) Coupling, energetics, and dynamics of atmospheric regions (cedar) the new dimension, strategic vision. https://cedarscience.org/sites/default/files/2021-10/CEDAR_Plan_June_2011_online.pdf
Theiler J (1986) Spurious dimension from correlation algorithms applied to limited time-series data. Phys Rev A, Gen Phys 34(3):2427–2432
Article ADS Google Scholar
Theiler J, Eubank S, Longtin A et al. (1992) Testing for nonlinearity in time series: the method of surrogate data. Phys D: Nonlinear Phenom 58:77–94
Article ADS Google Scholar
Topliff C, Cohen MB, Bristow WA (2020) Simultaneously forecasting global geomagnetic activity using recurrent networks. arXiv:2010.06487
Torr MR, Torr DG, Zukic M et al. (1995) A far ultraviolet imager for the international solar-terrestrial physics mission. Space Sci Rev 71:329–383. https://doi.org/10.1007/BF00751335
Article ADS Google Scholar
Torres L, Blevins AS, Bassett DS et al. (2021) The why, how, and when of representations for complex systems. SIAM Rev 63:435–485
Article MathSciNet Google Scholar
Tsonis AA, Swanson KL, Roebber P (2006) What do networks have to do with climate. Bull Am Meteorol Soc 87:585–595
Article ADS Google Scholar
Tsurutani BT, Sugiura M, Iyemori T et al. (1990) The nonlinear response of ae to the imf bs driver: a spectral break at 5 hours. Geophys Res Lett 17(3):279–282. https://doi.org/10.1029/GL017i003p00279
Article ADS Google Scholar
Turing AM (1950) Computing machinery and intelligence. Mind LIX:433–460
Article MathSciNet Google Scholar
Upendran V, Cheung MCM, Hanasoge SM et al (2020) Solar wind prediction using deep learning. Space Weather 18:e2020SW002478. https://doi.org/10.1029/2020SW002478
Uritsky VM, Pudovkin MI (1998) Low frequency 1/f-like fluctuations of the ae-index as a possible manifestation of self-organized criticality in the magnetosphere. Ann Geophys 16(12):1580–1588. https://doi.org/10.1007/s00585-998-1580-x
Article ADS Google Scholar
Uritsky VM, Klimas AJ, Vassiliadis D (2001) Comparative study of dynamical critical scaling in the auroral electrojet index versus solar wind fluctuations. Geophys Res Lett 28
Uritsky VM, Klimas AJ, Vassiliadis D et al. (2002) Scale-free statistics of spatiotemporal auroral emissions as depicted by polar uvi images: dynamic magnetosphere is an avalanching system. J Geophys Res Space Phys 107(A12):SMP 7–1–SMP 7–11. https://doi.org/10.1029/2001JA000281
Article Google Scholar
Uritsky VM, Paczuski M, Davila JM et al. (2007) Coexistence of self-organized criticality and intermittent turbulence in the solar corona. Phys Rev Lett 99(2):025,001
Article Google Scholar
Valdivia JA, Rogan J, Muñoz V et al. (2005) The magnetosphere as a complex system. Adv Space Res 51:1934–1941
Article ADS Google Scholar
Valente TW (1995) Network models of the diffusion of innovations. Comput Math Organ Theory 2:163–164. https://doi.org/10.1007/BF00240425
Article Google Scholar
Vassiliadis D, Sharma AK, Eastman TE et al. (1990) Low-dimensional chaos in magnetospheric activity from ae time series. Geophys Res Lett 17:1841–1844
Article ADS Google Scholar
Vassiliadis D, Klimas AJ, Baker DN et al. (1995) A description of the solar wind-magnetosphere coupling based on nonlinear filters. J Geophys Res 100:3495–3512
Article ADS Google Scholar
Vaswani A, Shazeer NM, Parmar N et al (2017) Attention is all you need. arXiv:1706.03762
Vespignani A (2010) Complex networks: the fragility of interdependency. Nature 464:984–985. https://doi.org/10.1038/464984a
Article ADS Google Scholar
Viall NM, Borovsky JE (2020) Nine outstanding questions of solar wind physics. J Geophys Res Space Phys 125(7):e2018JA026,005. https://doi.org/10.1029/2018JA026005
Article Google Scholar
Walker BW, Holling CS, Carpenter SR et al. (2004) Resilience, adaptability and transformability in social–ecological systems. Ecol Soc 9:5
Article Google Scholar
Watkins NW, Pruessner G, Chapman SC et al. (2015) 25 years of self-organized criticality: concepts and controversies. Space Sci Rev 198:3–44. https://doi.org/10.1007/s11214-015-0155-x
Article ADS Google Scholar
Watts DJ, Strogatz SH (1998) Collective dynamics of ‘small-world’ networks. Nature 393:440–442
Article ADS Google Scholar
Weigend AS, Gershenfeld NA (1994). Time series prediction: Forecasting the future and understanding the past. Science
West G (2017) Scale: the universal laws of life and death in organisms, cities and companies. Orion
Google Scholar
White GF, Haas JE (1975) Assessment of research on natural hazards. MIT Press, Cambridge
Google Scholar
Wiener N, Collection BLJF (1961) Cybernetics or control and communication in the animal and the machine. MIT Press, Cambridge
Google Scholar
Wilson EO (1998) Consilience: the unity of knowledge. Vintage Books
Google Scholar
Wiltberger M, Merkin VG, Lyon JG et al. (2015) High-resolution global magnetohydrodynamic simulation of bursty bulk flows. J Geophys Res Space Phys 120:4555–4566
Article ADS Google Scholar
Wing S, Johnson JR (2019) Applications of information theory in solar and space physics. Entropy 21(2). https://doi.org/10.3390/e21020140
Wing S, Johnson JR, Camporeale E et al. (2016) Information theoretical approach to discovering solar wind drivers of the outer radiation belt. J Geophys Res Space Phys 121:9378–9399
Article ADS Google Scholar
Wing S, Johnson JR, Vourlidas A (2018) Information theoretic approach to discovering causalities in the solar cycle. Astrophys J 854
Wisner B, Blaikie P, Cannon T et al. (2004) At risk: natural hazards, people’s vulnerability and disasters, 2nd edn. Routledge, London. https://doi.org/10.4324/9780203714775
Book Google Scholar
Wisner B, Gaillard JC, Kelman I (eds) (2011) The Routledge handbook of hazards and disaster risk reduction Routledge, London. https://doi.org/10.4324/9780203844236
Book Google Scholar
Wissel C (2004) A universal law of the characteristic return time near thresholds. Oecologia 65:101–107
Article ADS Google Scholar
Wolfram S (2002) A new kind of science. Wolfram Media
Google Scholar
Wood RE (1986) Task complexity: definition of the construct. Organ Behav Hum Decis Process 37(1):60–82. https://doi.org/10.1016/0749-5978(86)90044-0
Article Google Scholar
Zurek WH (1990) Complexity, entropy and the physics of information. CRC Press, Boca Raton. https://doi.org/10.1201/9780429502880
Book Google Scholar

Download references

Acknowledgements

Wide and deep thanks is due to a community of conversants, whose time, attention, and ideas were intellectual exhilaration and generativity for this manuscript. An incomplete list of those individuals includes (in no particular order): Joseph Borovksy, Juan Valdivia, Simon Wing; Eric Donovan, Massimo Materassi, John Dorelli; Jeffrey Thayer, Vadim Uritsky, Sandra Chapman, Jay Johnson, Josh Semeter, Seebany Datta-Barua, Olga Verkhoglyadova, Giuseppe Consolini, Elizabeth Butler, Anthony Mannucci, Paul Wong, Jacob Bortnik, Enrico Camporeale, Barbara Thompson, Madhulika Guhathakurta, Jesper Gjerloev, Nick Watkins, Xing Meng, Brian Thomas, and numerous past guests on the Origins Podcast (https://www.originspodcast.co/).

Several events were also formative for this work. To the organizers, conveners, and attendees I am deeply grateful: “Exploring Systems-Science Techniques for the Earth’s Magnetosphere-Ionosphere-Thermosphere” (July 2018 McGranaghan et al. 2018), the Santa Fe Institute’s Complexity Interactive held January 2022 (https://www.santafe.edu/engage/learn/programs/complexity-interactive), the Lorentz Center event “Space Weather: A Multidisciplinary Approach” held in September 2017 (https://event.cwi.nl/spaceweather2017/), the series of NASA Living With a Star Jack Eddy Symposia (especially the 3rd event held in June 2022 (https://cpaess.ucar.edu/meetings/eddy-symposium-2022), and the National Science Foundation (NSF) Convergence Hub for the Exploration of Space Science (CHESS) event “Simulating Space Weather Extremes: A Workshop to Identify Research Needs to Improve Power Grid Resilience to Geomagnetic Activity” held April 2022 (https://www.nsf.gov/awardsearch/showAward?AWD_ID=2131047).

The author gratefully acknowledges the support of the NASA Early Career Investigator Program (ECIP) Program (NASA Grant Number: 80NSSC21K0622) for the resources to research, pursue, and write articles on the philosophy of Heliophysics science and how to make breakthroughs in our epistemology such as this one. Additionally, the author is deeply appreciative of the NASA Center for HelioAnalytics, funded by NASA ISFM, for supporting this work and creating a community in which conversations like these occur.

Data and software supporting this review are available from a Github repository (https://github.com/rmcgranaghan/Complexity_Heliophysics) that provides information about generating the corpus for automated or natural language processing in support of this work as well as the glossary used to filter the articles and automated corpus itself. We acknowledge Omar Shalaby for the development of tools to programmatically query ADS and organize results.

Thank you to the NASA Center for HelioAnalytics and Heliophysics Digital Resource Library for creating, maintaining, and making available HelioCloud, a cloud computing service for Heliophysicists. The corpus generation and natural language processing analysis for this publication were carried out on that platform.

Inordinate thanks is due to Semantic Scholar (https://www.semanticscholar.org/) built by the Allen Insitute for AI (https://allenai.org/) for its aid in finding literature and appropriately and conveniently citing it for inclusion in this review.

Many thanks to the Lingo4G (https://carrotsearch.com/lingo4g/) team for support in getting their software up and running and for providing numerous trial licenses to complete the analysis detailed in this manuscript.

The research was in-part carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration (80NM0018D0004). ©2023. California Institute of Technology. Government sponsorship acknowledged.

Author information

Authors and Affiliations

NASA Jet Propulsion Laboratory, California Institute of Technology, Pasadena, 91109, CA, USA
Ryan M. McGranaghan

Authors

Ryan M. McGranaghan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ryan M. McGranaghan.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Acronyms

Term	Abbreviation
agent-based modeling	ABM
artificial intelligence and machine learning	AI/ML
auroral electrojet	AE
conditional mutual information	CMI
Coupled Map Lattice	CML
disaster risk reduction	DRR
disturbance storm-time	Dst
European Space Agency	ESA
interplanetary magnetic field	IMF
Geocentric Solar Ecliptic	GSE
Geocentric Solar Magnetic	GSM
geomagnetically induced currents	GIC
large language model	LLM
local-linear predictor	LLP
mutual information	MI
named entity recognition	NER
National Aeronautics and Space Administration	NASA
natural language processing	NLP
nonlinear prediction filter	NPF
Polar Ultraviolet Imager	Polar UVI
probability distribution function	PDF
Santa Fe Institute	SFI
self-organized criticality	SOC
state-space reconstruction	SSR
substorm current wedge	SCW
total electon content	TEC
Time History of Events and	THEMIS
transfer entropy	TE

Appendix B: Questions Identified in the Papers Reviewed in This Work to Guide Future Research

This appendix lists a few of the most potent questions either explicitly identified or implied by the studies reviewed for this paper. They are meant to be generative of future Complexity Heliophysics work.

(Chapman and Watkins 2000) What constitutes successful prediction of some observed activity?;
(From a collection of articles that each speak to this point) What are appropriate measurables for magnetospheric dynamics (e.g., AE)? What can we determine given the limitations of these measurables (e.g., for the AE index, the limitations are well-documented (Kamide and Akasofu 1983; Bargatze et al. 1985; Klimas et al. 1996))?;
In distinguishing magnetospheric dynamics from those convolved with solar wind driving, are bursts in AU/AL causally related to those in $\epsilon$ and $vB_{s}$(e.g., Freeman et al. 2000)? As we bring the complexity paradigm from the solar wind and magnetosphere into the upper atmosphere, what methods permit to connect or distinguish causal driving from internal dynamics?;
(Collection of questions) “Nine Outstanding Questions of Solar Wind Physics” (Viall and Borovsky 2020);
(Collection of questions) “Outstanding questions in magnetospheric plasma physics” (Borovsky et al. 2020);
Have observational networks capable of finer resolution (e.g., local time-dependent auroral electrojet indices from the SuperMAG network (Gjerloev 2009; Newell and Gjerloev 2011a,b, 2014)) provided the necessary data to resolve open questions about the dynamic behavior of the magnetosphere that have been ambiguous in existing data (e.g., limitations of the AE index (Bargatze et al. 1985; Klimas et al. 1996))?
(Uritsky et al. 2002) Given the observed power law statistics of the dynamic magnetosphere in auroral data and the corresponding inference of stationary critical dynamics in the magnetosphere, what can be learned about the role of cross-scale coupling in the development of geomagnetic disturbances?;
(Uritsky et al. 2002) Do statistics of mesoscale magnetosphere simulations match observed statistics from the SOC paradigm?;
Which of our models of the magnetosphere and geospace exhibit complexity (e.g., emergent behavior)? How can those behaviors be observed;
What are the time scales for the phenomena that define Heliophysics in the Sun-Earth system (e.g., solar flares, coronal mass ejections, bursty bulk flows, substorms, auroral precipitation, ionospheric conductivity) and how are these consistent across a wide continuum of stellar-planetary systems?
(McGranaghan et al. 2017c; Orr et al. 2019) Are topologies of Heliophysics systems (e.g., solar corona, ionosphere, magnetic environment) qualitatively different between low, moderate, and extreme periods of activity?
(McGranaghan et al. 2022; Peek et al. 2020) To what extent can the study of space weather be transformed by adopting frameworks for natural hazards that enable convergence research and objectify resilience?
(McGranaghan 2022) What are the new literacies required of Heliophysicists and space scientists to embrace the Complexity paradigm?

Appendix C: Generation and Analysis of the Complexity Heliophysics Corpus

The references that constitute this review article were largely manually curated. The reference list includes well over 300 articles. However, current best estimates Bornmann et al. (2020) place annual growth rate of scientific literature at 4% with a doubling time every 17 years. The impact is a significantly larger number of papers a researcher must read to be up to date on a field. More revealing is the growth in the number of papers a researcher must sift through to decide what to read. This ‘to read’ pile can and does easily become thousands of papers long.

The growth in publication and number of research artifacts is a trend that will only continue. The complexity paradigm exacerbates the problem, in which systems understanding demands information from more numerous and diverse sources to be integrated. It is likely that the effective growth rate when many disciplines must be considered is even greater. The outcome is that researchers must augment their traditional manual approach to curating and synthesizing literature with automated methods using databases of literature and research artifacts (e.g., Clarivate’s Web of Science^{Footnote 11} and those tailored to more specific contexts like NASA’s Astrophysics Data System (ADS)^{Footnote 12}). Artificial Intelligence and Machine Learning (AI/ML) will play a role in augmentation and synthesis.

Indeed, researchers are already squarely in the middle of this challenge to keep pace with the academic literature. To make this review an example of how to augment manual methods with automated approaches, we have used natural language processing (NLP) to examine the NASA ADS to construct a Complexity Heliophysics corpus (database of documents). We refer to this as the automated corpus, which is in addition to the manual corpus that is the reference list of this review. We describe our approach to create the automated corpus and provide the end product as an artifact with which future researchers can examine this new paradigm and also to experiment with AI/ML methods.

The process is as follows:

1.
Construct a ‘Complexity Heliophysics’ terms list. These terms attempt to encompass relevant topics, phenomena, and concepts of the paradigm. The terms list contains 153 terms and is provided in the Github repository that accompanies this manuscript (https://github.com/rmcgranaghan/Complexity_Heliophysics);
2.
Select a database of literature and research artifacts to explore. We chose the NASA ADS for its close context to Heliophysics and space physics research as opposed to something like Web of Science, which is much broader. We consider the years 1996 (when we picked up the thread of Complexity Heliophysics with Klimas et al. 1996) to the present in this review;
3.
Apply a ‘Heliophysics filter.’ From ADS we identify a subset of artifacts by selecting Heliophysics journals from the collection of journals that ADS indexes ^{Footnote 13}. We manually and in cooperation with the Heliophysics community selected 33 journals. We intentionally broadly conceived of Heliophysics in selecting these journals, effectively casting a wide net for articles we would identify. The journals are listed in the Github repository.
4.
Next we applied a ‘complexity science filter.’ For each article, we looked at the abstract and title and performed a simple match: if the abstract and title contained >N terms in the Complexity Heliophysics terms list, then it was considered a match and added to the corpus. N can be tuned based on desired inclusivity/exclusivity in the final corpus. Figure 18 shows the number of documents in the corpus for each threshold between three and ten, falling off roughly exponentially with the number of terms required to match. The graph indicates a slight knee between four and six matching terms. In an attempt to balance number of documents with relevance to Heliophysics, we chose five terms (i.e., only documents whose abstracts and titles include five or more distinct terms in the Heliophysics glossary were included). This give an automated database that is roughly one order of magnitude greater than the number considered manually (∼3000 vs. 300).
Fig. 18
Number of documents in the automated corpus vs. the threshold applied. The threshold indicates the number of distinct terms in the Complexity Heliophysics glossary that must be present in the abstract or title of a document for it to be included in the automated corpus. Numbers at the top of the bars indicate the size of the resultant corpus for that threshold. Dashed red line indicates the selected threshold.
Full size image

In our analysis, ∼120k articles were obtained from ADS between 1996-present. Among those articles, Geophysical Research Letters (36k), Advances in Space Research (15k), Journal of Geophysical Research (14k), and Journal of Geophysical Research: Atmospheres (14k) contributed more than 13k articles. After matching based on the glossary, the automated corpus consisted of $\sim2600$ documents.

The corpus can then be examined manually or programmatically. We provide a few examples that give a glimpse into the corpus. Readers can freely explore the corpus via the Github repository https://github.com/rmcgranaghan/ComplexityHelio_LivingReviews/tree/main/data.

First, Table 1 provides all words from the glossary that were found more than 100 times in the corpus (remember that this corpus only keeps papers with ≥5 words from the glossary, so this table represents the number of times that glossary word appears among such papers).

Table 1 Terms from the complexity glossary found more than 100 times in the corpus^*

Full size table

As an example, we used a text clustering tool Lingo4G^{Footnote 14} applied to the automated corpus to determine document clusters and topics. The mode used was Lingo4G’s phrase extractor that extracts frequent words and sequences of words from the corpus (in this case the titles and abstacts from the corpus, not their full texts). Those frequent words and sequences are assigned as labels.

Figure 19 shows a cluster map created from the automated corpus. Groups of thematically related labels are provided as high-level themes (overview plot on the left) and sub-themes (zoomed-in plot on the right). These clusters are helpful in myriad ways, including identifying content-wise similar documents (you can find each document within the cluster map), for identifying outliers as possible themes in need of more exploration, and ‘bridge’ themes that link two or more prominent themes in the corpus. To further explore the cluster map, we created a network map of the clusters. Each cluster was identified by an exemplar label, which serves as the description for the entire cluster. Then, related clusters and labels are connected to one another. The exemplar label of one cluster can be a member of another cluster. Figure 20 illustrates the network map created for the ‘magnetic’ label. Here, ‘flux,’ ‘flow,’ ‘particle,’ etc. are all members of the magnetic cluster, but also are exemplars of their own clusters. One use of this network map would be to discover resources related to time series fluctuations that are also part of the broader class of magnetic phenomena. This permits rapid identification of documents in specific areas and gap analysis for what has not been extensively explored.

Topic modeling methods like latent dirichlet allocation (LDA) Blei et al. (2003) could easily be applied to the corpus as an engine for recommendation and insight and different semantic mapping of the information in the corpus (e.g., Papadimitriou et al. (1998)). Additionally, more complex analyses of the automated corpus are possible.

We next analyzed the automated corpus using network analysis. To construct the network, we considered each paper in the automated corpus using the threshold of five (i.e., only papers whose titles and abstracts contained five distinct terms from the Complexity Heliophysics glossary). For each paper, we looked at all other papers and a connection was defined if two papers shared more than four terms from the glossary. For instance, ‘paper1’ and ‘paper2’ would be connected if their respective set of terms from the glossary are: 1) [‘complexity’, ’systems’, ’fractal’, ’multiscale’, ’network’, ’nonlinearity’, ’boundaries’] and 2) [exponential, bifurcation, ‘complexity’, ’systems’, ’dynamical’, ‘equilibrium’, ’multiscale’, ’network’, ’nonlinearity’]. We constructed a directed network (meaning the edges have a preferred direction) that points from the earlier paper to the later paper, perhaps more capably capturing a flow of ideas. To analyze and interpret the network, we constructed a random network containing the same number of nodes and connections as the resultant network. Perhaps unsurprisingly, we find a much higher average clustering coefficient Schank and Wagner (2005) for the automated corpus than the random network, indicating the presence of much more densely connected clusters or communities and stronger local structure. This network contains rich potential for discovery. To enable the community to explore the possibilities, we have visualized and made fully interactive the network using kumu.io, which can be accessed at https://embed.kumu.io/82d58b5453d1c4ba4f05d7240d142102. Figure 21 shows a few screenshots of the kumu visualization. Any node can be selected (Figure 21b), revealing title, abstract, and words from the glossary found in them. Any word can be selected and all nodes that contain it in the network will be highlighted (Figure 21c). Zooming in, one can explore local structures and clusters (Figure 21d).

The automated corpus and topic analysis augmented the manual identification and review of articles and is an additional artifact of this review; perhaps even one that can become standard for future reviews. It should be considered a resource that complements the extensive references cited in the body of this review and contains high potential for discovering trends and knowledge about Complexity Heliophysics. It is important to note that the manual and automated corpora are not disjoint nor is the manual corpus strictly a subset of the automated corpus. Many references are shared across them, lending validation to the process of generating the automated set, but there are many references in the manual set that are not included in the automated one. This points to the flexibility of the scientist-driven discovery process, pulling in relevant references and material that might be more distant or irregularly connected to the research at hand than the necessarily more rigid automated process. This review, in particular, read widely in gathering material, many connections of which an automated approach would likely not have captured. The point is there must be an intersection of manual and automated gathering of resources, the manual approach benefiting from flexibility and capacity to range widely and be discerning and the automated approach benefiting from the volume of resources it can examine.

Finally, given the corpus of articles from both manual and automated compilation, ‘mind maps’ Buzan and Buzan (1994), Börner (2015) were constructed from the main ideas in the articles. We do not present those mind maps, but a direct result from that mapping activity is the structure of this review such that the very sections and progression of this document reflects the achronological development of the ideas of Complexity Heliophysics.

Appendix D: Key Datasets

This appendix compiles a list of important datasets, and their original appearance in the literature, that appear across this review to aid readers who wish to compile datasets and explore data-driven research across the datasets that have factored importantly in the Complexity Heliophysics paradigm. This list is merely an introduction, certainly not exhaustive.

Citation(s)	Description
Bargatze et al. (1985)	34 intervals of high time resolution Solar Wind (IMP8)–AL index dataset.
	These data are the standard or benchmark dataset from most of the work
	addressed in Klimas et al. (1996)
Torr et al. (1995)	Polar spacecraft ultraviolet imager (UVI) observations.
Brittnacher et al. (1997)	High quality global images of auroral activity.
Uritsky et al. (2002)	15,500 POLAR UVI frames showing activity in the nighttime sector of the
	aurora (55 to 90^∘)
	MLAT, 2000 to 0400 MLT) in the Lyman-Birge-Hopfield-long filter mode.
	These data permitted a spatio-temporal technique that proves vital in auroral
	data analyses
Gjerloev (2009)	A worldwide collaboration of organizations and national agencies that operate
	more than 200 ground-based magnetometers. Provides measurements of
	magnetic field perturbations from all available stations in common
	coordinate frames, identical time resolution, and a common baseline removal
Leger et al. (2015)	Fluctuations of the Earth’s magnetic field as observed in-situ.
de Michelis et al. (2015)	European Space Agency (ESA) Swarm satellites vector field and absolute
de Michelis et al. (2015)	scalar magnetometers.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

McGranaghan, R.M. Complexity Heliophysics: A Lived and Living History of Systems and Complexity Science in Heliophysics. Space Sci Rev 220, 52 (2024). https://doi.org/10.1007/s11214-024-01081-2

Download citation

Received: 16 June 2023
Accepted: 21 May 2024
Published: 04 July 2024
DOI: https://doi.org/10.1007/s11214-024-01081-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Complexity Heliophysics: A Lived and Living History of Systems and Complexity Science in Heliophysics

Abstract

Similar content being viewed by others

Why is Complexity Science valuable for reaching the goals of the UN 2030 Agenda?

Introduction

Systems Science, Cybernetics, and Complexity

Explore related subjects

1 Introduction

1.1 Systems Science and Crossing Scales

1.2 Emergence, Self-Organization, and Scaling Theory

1.3 Information and Uncertainty Quantification

1.4 Networks, Network Science, and Collective Behavior

1.5 Risk Science and Resilience

1.6 Approach and Roadmap for This Review

1.6.1 The Use of Natural Language Processing (NLP) in This Review

2 Key Definitions

3 Setting the Stage for Complexity Heliophysics: From 1996

3.1 Autonomous Time Series

3.2 Input-Output Models: Analogue Models

3.3 Input-Output Models: Computationally Mature Input-Output Models

3.4 Important Themes Through 1996 That Set the Stage for Complexity Heliophysics

4 Emergence of the Connection Between Self-Organized Criticality and the Magnetosphere

5 Beyond 1996: Complexity Heliophysics

5.1 Power Laws in Heliophysics

5.2 From Time Series to Imagery

6 Emerging Literature: Topics and Trends

6.1 Metrics and Diagnostics of Complexity

6.2 Coarse-Graining

6.3 Disentangling Drivers and Parameters Amidst Nonlinearities: Information Theory

6.4 Network Science: A Future for Heliophysics and Space Weather

6.5 The Role of Natural Language Processing

6.6 Areas of Complexity Science That Have Not Yet Been Widely Explored in Heliophysics

7 Frontiers of Inquiry and Investigation Emerging from Complexity Science

7.1 Key Challenge for 21st Century Science: The Intersection of Complexity and Artificial Intelligence and a Framework to Explore It

7.2 Space Weather as a Risk Science

7.2.1 Risk Science

7.2.2 Systemic Risk

7.2.3 An Emphasis on Resilience

7.2.4 Critical Transitions

7.3 Convergence Research

8 Conclusion

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing Interests

Additional information

Publisher’s Note

Appendices

Appendix A: Acronyms

Appendix B: Questions Identified in the Papers Reviewed in This Work to Guide Future Research

Appendix C: Generation and Analysis of the Complexity Heliophysics Corpus

Appendix D: Key Datasets

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation