Abstract
We search for outliers in extreme events of statistical size distributions of astrophysical data sets, motivated by the Dragon-King hypothesis of Sornette, which suggests that the most extreme events in a statistical distribution may belong to a different population, and thus may be generated by a different physical mechanism, in contrast to the strict power-law behavior of self-organized criticality models. Identifying such disparate outliers is important for space weather predictions. Possible physical mechanisms to produce such outliers could be generated by sympathetic flaring. However, we find that Dragon-King events are not common in solar and stellar flares, identified in 4 out of 25 solar and stellar flare data sets only. Consequently, small, large, and extreme flares are essentially scale-free and can be modeled with a single physical mechanism. In very large data sets (N ≳ 104) we find significant deviations from ideal power laws in almost all data sets. Nevertheless, the fitted power-law slopes constrain physical scaling laws in terms of flare areas and volumes, which have the highest nonlinearity in their scaling laws.
1. Introduction
The largest natural catastrophes that can happen are obviously of highest interest, because they cause the largest threats and damages, such as the biggest earthquakes, landslides, wildfires, volcanic eruptions, terrorist attacks, stock market crashes, solar flares, coronal mass ejections, or magnetospheric storms (see reviews and textbooks by Bak 1996; Jensen 1998; Charbonneau et al. 2001; Hergarten 2002; Sornette 2004; Aschwanden 2011, 2013; Pruessner 2012; Aschwanden et al. 2016; Sharma et al. 2016; Watkins et al. 2016). The ubiquitous power-law–like size distributions that are universally found in the statistics of event sizes have been attributed to nonlinear scale-free processes. The observed size distributions can generally be subdivided into three regimes: (i) a range of incomplete sampling below the inertial range, (ii) an inertial range over which the power-law function holds, and (iii) a cutoff above the inertial range that approaches effects of the finite system size. The three regimes can be associated with (i) small events, (ii) large events, and (iii) extreme events. Generally, it is tacitly assumed that small, large, and extreme events belong to the same population, the same category of events, and be driven by the same physical event generation mechanism (for a given phenomenon). However, it has recently been suggested that the most extreme events do not develop similarly to the other events, at least for the case of stock market crashes, which are predictable to some extent based on their precursor behavior, a phenomenon that has been dubbed the “Dragon-King hypothesis” (Sornette 2009; Sornette & Ouillon 2012). This hypothesis suggests that extreme events may result from amplification and synchronization processes, and that the “big events” do have characteristic fingerprints enabling their forecasting and prediction. Another application of this hypothesis is the subcritical failure of materials under load, where a long phase of damage accumulation characterized by power-law distributed microfracture events finally give way to catastrophic breakdown (Baro et al. 2013). A third application are magnetospheric storms, where a growing recognition emerges that long-range correlations are an essential feature of systems that exhibit extreme events (Sharma 2017). In this study we investigate the question of whether extreme events of solar and stellar flares are consistent with a power-law distribution as an extension of a scale-free inertial range, or if they show significant deviations from ideal power laws.
2. Data Analysis and Results
2.1. Definition of Power-law Distributions
An ideal power-law distribution that represents a differential occurrence frequency distribution, also simply called a size distribution N(x) of events, as a function of some size parameter x, can be described by four parameters (),

where x1 and x2 are the lower and upper bounds of the power-law inertial range, αx is the power-law slope, and n0 is the normalization constant,

with nev being the total number of events contained in the inertial range [x1 ≤ x ≤ x2].
The ideal power-law function (Equation (1)) can be generalized with an additional “shift” parameter x0 (with respect to x), also called the Lomax distribution (Lomax 1954), generalized Pareto distribution (Hosking & Wallis 1987), or thresholded power-law size distribution (Aschwanden 2015),

with the normalization constant n0 for the range ,

This additional parameter x0 accommodates three different features: truncation effects due to incomplete sampling of events below some threshold (if x0 > 0), incomplete sampling due to instrumental sensitivity limits (if x0 > 0), or subtraction of event-unrelated background (if ), as it is common in astrophysical data sets.
While Equations (1) and (3) represent differential size distributions Ndiff(x), it is statistically more advantageous to employ cumulative size distributions Ncum(x), especially for small data sets and near the upper cutoff, where we deal with a small number of events per bin. We will use cumulative size distributions that include all events accumulated above some size x, such as for the ideal power-law function,

or for the thresholded power-law size distribution,

We show an example of a size distribution that contains the counts of events per bin ncts(x) in Figure 1(a), the corresponding differential size distribution in Figure 1(b), and the corresponding cumulative size distribution Ncum(>x) in Figure 1(c). The event count histogram (Figure 1(a)) displays the inertial range (x0, x2), the minimum (x1), and maximum value (x2) of the size parameter x, with a peak in the event count histogram at x0 (Figure 1(a)), which provides a suitable threshold definition, because incomplete sampling of small values
is manifested by the drop of detected events on the left side of the peak x0. This definition of a threshold x0 has been proven to be very useful for characterizing data sets with incomplete or sensitivity-limited sampling (Aschwanden 2015). This provides also a definition for the inertial range (x0, x2), which we will quantify by the number of decades as a logarithmic ratio, i.e.,
.
Figure 1. Statistics of solar flare hard X-ray peak photon rates are given in the form of a histogram with the number of events per logarithmic bin (a), the differential occurrence frequency distribution (b), and the cumulative occurrence frequency distribution (c), with the rank-order plot overlayed for the 10 most extreme events (black diamonds). The observed data are shown in the form of black histograms, while the fitted thresholded power-law function is shown with a red curve. The data range [x1, x2] is marked with dotted vertical lines, and the threshold value [x0] with a dotted dashed line. The least-square fit criterion is calculated for the entire distribution sampled above the threshold (χall = 1.46), and for the range of extreme events (χee = 0.56).
Download figure:
Standard image High-resolution imageAn equivalent method to the cumulative size distribution is the rank-order plot. If the statistical sample is rather small, in the sense that no reasonable binning of a histogram can be done, either because we do not have multiple events per bin or because the number of bins is too small to represent a distribution function, we can create a rank-order plot. A rank-order plot is essentially an optimum adjustment to small statistics, by associating a single bin to every event. From an event list of a parameter , which is generally not sorted, we first have to generate a rank-ordered list by ordering the events according to increasing size,

The bins are generally not equidistant, neither on a linear nor logarithmic scale, defined by the difference between subsequent values of the ordered xj,

In a rank-ordered sequence of nx events, the probability for the largest value is , for events that are larger than the second-largest event it is 2/nx, and so forth, while events larger than the smallest event occur in this event list with a probability of unity. Thus, the cumulative frequency distribution is simply the reversed rank order,

and the distribution varies from for j = 1 to
for
. We can plot a cumulative frequency distribution with
on the y-axis versus the size xj on the x-axis. The distribution is normalized to the number of events nx,

We overlay rank-order plots in each of the cumulative size distributions in Figures 1–6 (shown with black diamond symbols), for the nee = 10 most extreme events in every size distribution. If there is a “Dragon-King” event, it should manifest itself by a significant offset from the best-fit cumulative size distribution at the upper end in the zone of most extreme events near x ≲ x2, say by a factor of two at least. The example shown in Figure 1 exhibits a ratio of qee = 1.03 for the most extreme event, which is a very small offset from the best-fit cumulative size distribution that does not manifest a Dragon-King event.
2.2. Least-square Fits of Size Distributions
In order to test the consistency of the size distribution of extreme events with the size distribution of all events, we perform least-square fits of the cumulative size distributions, such as shown in Figure 1(c). The cumulative distribution of an observed parameter x is shown in form of a black histogram with error bars Ncum(>x), while the best-fit cumulative size distribution is shown with a red curve (fitted over the range (x0, x2) above the threshold), and the extrapolated range below the threshold x0 is shown with a dashed red curve. We see in Figure 1(c) that the reduced chi-square value is χall = 1.46 for all events, while the chi-square value is χee = 0.56 in the range that contains the nee = 10 most extreme events, and thus is self-consistent between the two ranges, so that we can conclude that the extreme events do not show any significant deviation (χ ≳ 2) from the overall (best-fit) thresholded power-law distribution in this particular data set.
We estimate the goodness-of-fit from the uncertainties σcum(xi) of the values in the fitted size distribution with the standard reduced χ2-criterion,

where nx is the number of bins, is the number of free parameters (
) of the fitted size distribution,
is the observed cumulative number of events in each bin,
is the fitted cumulative number of events in each bin, and σcum(xi) is the estimated uncertainty of the cumulative number of events,

This definition is slightly different from the estimate in the previous study (Aschwanden 2015), i.e., , which represents a lower limit on the uncertainty only, since the counts of events in each bin are not independent of each other in a cumulative distribution function. Our new approach takes only independent events into account for the estimation of the uncertainty, which in a cumulative distribution is the increment
, while the remainder of the counts in the bins
do not vary in the cumulative fit of bin
, and therefore do not contribute to the uncertainty
.
Besides the fits of the cumulative size distributions (Figure 1(c)), we also perform fits to the differential size distributions for consistency tests (Figure 1(b)). Since the differential size distribution is quantified in terms of counts per bin width, , the corresponding χ2-criterion is,

and the uncertainty in terms of Poisson statistics is,

An example of such a fit is shown in Figure 1(b). Note that the size distribution of the differential size distribution shows a straight power-law function even in the range of extreme events, while the cumulative size distribution shows an exponential-like drop-off towards the zero value at x2, as a consequence of the integral function defined in Equation (6). We can compare now the results of the two power-law fits and find a power-law slope of αdiff,all = 1.75 ± 0.02 for the differential size distribution (Figure 1(b)), while the the power-law slope is for the cumulative size distribution (Figure 1(c)). Now, if we inspect the χ2-values in the range of the ten most extreme events, we find χcum,all = 1.46 for all events (above the threshold x0), and a value of χcum,ee = 0.56 for the 10 most extreme events (Figure 1(c)). Therefore, not only the two methods of differential and cumulative distributions are self-consistent, but the two different ranges (of small-to-large and extreme events) lead to a self-consistent size distribution function also, so that we can conclude that extreme events belong to the same population as all other events, and that no significant deviation from the thresholded power-law distribution function is found for this data set, which was obtained from hard X-ray peak photon count rates in solar flares, using the Hard X-ray Burst Spectrometer (HXRBS) onboard the Solar Maximum Mission (SMM). The first size distributions of these solar flare data were published in Dennis (1985) and Crosby et al. (1993).
Alternatively to the chosen method of least-square fitting of differential or cumulative size distributions, other methods have been used to test the consistency of an ideal power-law function with observed size distributions, such as the maximum likelihood estimator, the Bayesian information criterion, the Anderson–Darling test, or the Kolmogorov–Smirnov statistic (Clauset et al. 2009). However, since no accommodation for (i) undersampling of small events, (ii) subtraction of event-unrelated background, or (iii) automated determination of a threshold is provided in those alternative methods, they are not suitable for modeling of astrophysical data (Aschwanden 2015).
2.3. Solar Flare Observations
The results of the data analysis are presented in Figures 2–6 and in Table 1. While we used the generic term “size” in the definition of occurrence frequency distributions, we should be aware that the size can be represented by any measure, magnitude, or physical quantity that can be measured in a set of events, but we should not expect to find the same power-law slope for different quantities, because there is often some nonlinear scaling involved. For the solar observations analyzed here we sample the following quantities: peak count rates of the hard X-ray photon fluxes of flares (P) (observed with the instruments HXRBS and the Burst and Transient Source Experiment (BATSE) on the Compton Gamma Ray Observatory (CGRO), and the Ramaty High Energy Solar Spectroscopic Imager (RHESSI) spacecraft), total counts (or fluences) of hard X-ray fluxes (C), flare durations (T), emission measure (EM) in soft X-rays and extreme ultra-violet (EUV), thermal flare energies (Eth), nonthermal flare energies (Enth), dissipated magnetic energies (Emag), flare volumes (V), flare areas (A), and flare length scales (L). The energetic parameters have been calculated in recent statistical studies on the global energetics of solar flares (Aschwanden et al. 2014, 2015, 2016).
Figure 2. Solar flare hard X-ray peak photon rates (a)–(c); fluences or counts (d)–(f); and flare durations (g)–(i) are histogrammed from HXRBS observations (a), (d), (g); BATSE observations (b), (e), (h); and RHESSI observations (c), (f), (i) (shown in the form of black histograms with uncertainties), and in the form of the best-fit cumulative distribution functions (red curves). The chi-square is given for all events above the threshold x0, χall(x > x0), and for the 10 most extreme events separately, χee(x > x3).
Download figure:
Standard image High-resolution imageTable 1. Results of Forward-fitting of Differential and Cumulative Size Distributions for Different Statistical
BG | N | NT | IR | x0 | αdiff | αcum | χdiff | χcum | χee | qee | |
---|---|---|---|---|---|---|---|---|---|---|---|
Solar flares: | |||||||||||
Peak HXRBS) | 58 | 10856 | 7579 | 4.0 | 19 | 1.75 ± 0.02 | 1.73 ± 0.02 | 0.96 | 1.46 | 0.56 | 1.03 |
Peak HXRBS | 41 | 6223 | 3979 | 3.8 | 31 | 1.69 ± 0.03 | 1.70 ± 0.02 | 1.18 | 1.86 | 1.12 | 1.09 |
Peak BATSE | 0 | 7234 | 4996 | 3.3 | 501 | 1.91 ± 0.03 | 1.81 ± 0.02 | 3.18 | 4.39 | 0.78 | 1.11 |
Peak RHESSI | 0 | 7996 | 4399 | 3.7 | 20 | 1.87 ± 0.03 | 1.86 ± 0.02 | 1.28 | 2.00 | 0.77 | 1.88 |
Peak RHESSI (A16) | 0 | 290 | 149 | 1.7 | 822 | 2.44 ± 0.19 | 2.28 ± 0.13 | 0.70 | 1.07 | 0.81 | 1.45 |
Counts HXRBS | 2 | 8129 | 3314 | 4.6 | 3100 | 1.72 ± 0.03 | 1.66 ± 0.02 | 1.54 | 2.68 | 0.65 | 1.40 |
Counts BATSE | 0 | 2748 | 2748 | 4.8 | 2000 | 1.48 ± 0.03 | 1.50 ± 0.03 | 1.67 | 5.20 | 2.94 | 1.07 |
Counts RHESSI | 0 | 11548 | 8081 | 4.9 | 4783 | 1.60 ± 0.02 | 1.64 ± 0.02 | 3.34 | 10.98 | 2.77 | 1.54 |
Counts RHESSI (A16) | 0 | 289 | 147 | 1.8 |
![]() |
2.40 ± 0.19 | 2.00 ± 0.12 | 1.17 | 1.95 | 1.95 | 1.63 |
Duration HXRBS | 2 | 11539 | 4341 | 2.2 | 126 | 2.61 ± 0.04 | 2.66 ± 0.02 | 4.43 | 5.45 | 1.58 | 0.90 |
Duration BATSE | 0 | 7241 | 4562 | 2.1 | 37 | 2.32 ± 0.03 | 2.19 ± 0.03 | 5.65 | 5.81 | 0.00 | 1.77 |
Duration RHESSI | 0 | 11445 | 8296 | 2.2 | 25 | 2.05 ± 0.02 | 1.90 ± 0.02 | 7.59 | 8.28 | 10.19 | 0.93 |
Duration (A14) | 0 | 171 | 123 | 1.3 | 0 | 2.76 ± 0.24 | 2.73 ± 0.21 | 0.58 | 0.83 | 0.56 | 1.15 |
Duration (A16) | 0 | 289 | 189 | 1.0 | 0 | 3.22 ± 0.22 | 3.33 ± 0.20 | 1.53 | 1.49 | 0.64 | 1.83 |
Length (A14) | 0 | 171 | 112 | 0.9 | 26 | 5.13 ± 0.44 | 5.03 ± 0.38 | 1.10 | 0.52 | 0.55 | 1.25 |
Length (A15) | 0 | 389 | 210 | 0.8 | 8 | 4.23 ± 0.27 | 4.45 ± 0.23 | 1.18 | 1.00 | 1.04 | 0.95 |
Area (A14) | 0 | 171 | 100 | 1.5 | 899 | 2.84 ± 0.27 | 2.82 ± 0.22 | 0.71 | 1.05 | 0.73 | 1.75 |
Volume (A14) | 0 | 171 | 96 | 2.2 | 30300 | 2.32 ± 0.22 | 2.09 ± 0.16 | 0.94 | 1.34 | 0.68 | 3.08 |
Volume (A15) | 0 | 389 | 239 | 2.1 | 640 | 1.83 ± 0.12 | 1.86 ± 0.09 | 0.56 | 0.72 | 0.55 | 1.07 |
Magnetic energy (A14) | 0 | 171 | 68 | 1.1 | 119 | 3.04 ± 0.34 | 2.83 ± 0.22 | 1.11 | 0.92 | 0.65 | 0.92 |
Nonthermal energy (A16) | 0 | 192 | 89 | 2.6 | 0.32 | 2.50 ± 0.25 | 1.59 ± 0.11 | 1.28 | 2.34 | 2.08 | 1.87 |
Thermal energy (A15) | 0 | 390 | 279 | 1.8 | 3 | 2.15 ± 0.13 | 2.16 ± 0.11 | 0.86 | 0.73 | 0.73 | 1.02 |
Emission measure (A15) | 0 | 391 | 251 | 1.4 |
![]() |
2.97 ± 0.19 | 2.82 ± 0.14 | 0.86 | 0.58 | 0.49 | 1.00 |
Stellar flares: | |||||||||||
Peak KEPLER | 0 | 208 | 124 | 2.8 | 60255 | 1.66 ± 0.14 | 1.68 ± 0.12 | 0.81 | 1.26 | 1.45 | 3.55 |
Bolom. KEPLER | 0 | 1537 | 727 | 1.5 | 3.8 × 1034 | 2.56 ± 0.09 | 2.55 ± 0.07 | 1.31 | 2.18 | 0.67 | 0.97 |
Terrestrial data: | |||||||||||
Earthquakes | 1 | 17425 | 9423 | 4.6 | 100 | 1.80 ± 0.02 | 1.85 ± 0.01 | 2.22 | 3.11 | 1.45 | 2.82 |
Fires | 1 | 55853 | 38190 | 5.3 | 2 | 1.56 ± 0.01 | 1.53 ± 0.01 | 13.90 | 30.65 | 30.93 | 0.92 |
Cities | 0 | 19445 | 12784 | 4.0 | 501 | 1.79 ± 0.02 | 1.79 ± 0.01 | 5.69 | 18.37 | 14.18 | 1.79 |
Blackouts | 0 | 209 | 134 | 1.7 | 50000 | 1.98 ± 0.16 | 1.90 ± 0.13 | 0.80 | 0.83 | 0.63 | 3.35 |
Terrorism | 1 | 2699 | 1949 | 2.4 | 2 | 2.48 ± 0.05 | 2.28 ± 0.04 | 4.86 | 2.25 | 0.43 | 6.15 |
Words | 1 | 6609 | 4980 | 3.6 | 1 | 2.15 ± 0.03 | 1.99 ± 0.02 | 6.31 | 3.33 | 1.09 | 1.99 |
Surnames | 2 | 2280 | 1663 | 2.2 | 15700 | 2.55 ± 0.06 | 2.50 ± 0.05 | 1.10 | 1.89 | 1.34 | 1.13 |
Weblinks | 1 |
![]() |
![]() |
2.4 | 4 | 2.47 ± 0.03 | 2.39 ± 0.01 | 4.39 | 223 | 0.67 | 1.07 |
Download table as: ASCIITypeset image
The results of forward-fitting differential and cumulative size distributions of different events are summarized in Table 1, which includes: background counts (BG), the number of all events (N), the number of events above threshold (NT), the logarithm of the inertial range (IR), the threshold value (x0), the power-law slope from fitting differential (αdiff) and cumulative occurrence frequency distributions (αcum), the χ2-values of differential (χdiff) and cumulative size distributions (χcum), and the mean χ2-values in the range of the 10 most extreme events (χee).
We determined the background level by minimizing the goodness-of-fit χ2-criterion as a function of the background level (see Figure 7 in Aschwanden et al. 2015). It turned out that only the HXRBS data needed background subtraction (by 41 and 58 cts s−1), while the BATSE and RHESSI flare catalogs provide background-subtracted flux values.
The total number of events (N in Table 1) encompasses all sampled events, but only those (NT) above the threshold x0 could be used in the fitting of size distributions. The inertial range is characterized by the logarithmic ratio , which ranges from 2 to 5 decades. Those with the largest inertial ranges and largest number of events provide the most accurate fits of size distributions.
2.4. Self-consistency of Size Distributions
A first important test is the self-consistency between the two different methods, i.e., the fitting of the differential and cumulative size distributions. We can simply compare the resulting power-law slopes, which are labeled as αdiff and αcum in Table 1. We find a satisfactory agreement between the two methods within the derived uncertainties of a few percent, as it can be seen for all data sets with large statistics (Table 1: events). This corroborates the self-consistency between the two methods. The high accuracy is mostly achieved by modeling the size distribution with a thresholded power-law distribution function (Equations (3) and (6)), which is more suitable to represent the data than an ideal power-law function.
A second important test is the comparison of the goodness-of-fit between the differential and cumulative size distributions, labeled as χdiff and χcum in Table 1. Among the nine fitted size distributions of three solar observables (P, C, T) with three different instruments each (HXRBS, BATSE, RHESSI; shown in Figure 2), we find that only one data set is consistent with the fitted (differential and cumulative) size distributions, namely P of HXRBS, with χdiff ≈ 2 and χcum ≲ 2, while the other eight data sets exhibit significant mismatches χdiff ≈ χcum ≈ 3–13. This indicates that the large-number statistics of these data sets is sensitive to apparently little but significant deviations from power laws. These deviations can clearly be recognized in the poor values χdiff,all and χcum,all in the plots of Figures 2(b)–(i). There is a tendency that peak count (P) size distributions match a power-law function closest, while the total counts (C) and the flare durations (T) deviate significantly. The reason for the latter property could be due to a violation of the principle of timescale separation that is required in self-organized criticality (SOC) models (Aschwanden et al. 2016), caused by confusion between overlapping short and long-duration flares.
2.5. Energetic Solar Flare Parameters
The flare observables (P, C, T) are easiest to measure and have often been used in modeling of SOC models. However, since these observables may not be a good proxy for the representation of the energy contained in SOC avalanches, we also inspect the size distribution of spatial and flare parameters, such as (), which have been determined under the scope of global energetics in solar flares in a series of ongoing studies (Aschwanden et al. 2014, 2015, 2016). The cumulative size distributions are presented in Figures 3 and 4. Since the number of events for which energetic parameters have been determined includes M and X Geostationary Operational Environmental Satellites (GOES) class flares only, we deal with small-number statistics in the order of
events per data set. Consequently, the resulting fits yield acceptable values for the goodness-of-fit, i.e., χ2 ≲ 2, but these small data sets have insufficient sensitivity to measure deviations from power-law functions. Nevertheless, since they contain large flares only, they are suitable to detect the outliers of extreme events.
Figure 3. The cumulative size distributions of solar flare quantities are shown for length scales (a), (f); flare areas (b); flare volumes (c), (g); flare durations (d); the dissipated magnetic energy (e); the EUV emission measure (h); and the thermal energy (i), calculated in two previous works (Aschwanden et al. 2014, 2015). Representation otherwise similar to Figure 2.
Download figure:
Standard image High-resolution imageThe only obvious deviation from a power law is detected for the size distribution of nonthermal energies (Figure 4(b)), which is suspected to arise due to an instrumental irregularity, such as pulse pile-up in RHESSI data.
Figure 4. The cumulative size distributions of solar flare quantities are shown for the dissipated magnetic energy (a), the nonthermal energy (b), the thermal energy (c), flare durations (d), peak counts (e), total counts (f), the peak intensity of stellar flares (g), and the stellar bolometric energy of stellar flares (h), calculated from two previous works (Aschwanden et al. 2016). Representation otherwise similar to Figure 2.
Download figure:
Standard image High-resolution image2.6. Extreme Events
We now investigate the so-called Dragon-King hypothesis (Sornette 2009; Sornette & Ouillon 2012). This hypothesis suggests that the most extreme events may belong to a different population of event size distributions than smaller or even large events. In this study we define three different parameters to characterize Dragon-King events: (i) a range of the largest extreme events in a data set, which we set arbitrarily to nee = 10 here; (ii) the ratio of the observed size to the expected size
of the most extreme event, based on the cumulative size distribution fit, i.e.,
, (which we may call the extreme event excess factor); and (iii) the goodness-of-fit χee of the cumulative size distribution fit in the range of the largest nee = 10 events. These three values are given in the lower right corner of each panel in Figures 1–6 and Table 1.
As an example we consider Figure 1, where the goodness-of-fit in the extreme event part of the size distribution is i.e., χee = 0.56, which compares well with the value obtained from fitting all events above the threshold, i.e., χall = 1.46, which does not yield any evidence for a different population. A second test is the extreme event excess factor, which for this event is qee = 1.03 and yields no evidence for a Dragon-King event either (Figure 1(c)).
We list all evaluated extreme event excess factors qee in Table 1 (also given in the panels of Figures 1–6). Among the solar flare parameters, all those factors vary in the range of qee ≈ 0.9–1.9, with a single outlier of qee = 3.08 (for one flare volume data set, see Figure 3(c)), out of 23 solar data sets. Therefore, we see no indication that solar extreme events belong to different populations of event distributions.
The estimation of an unknown tail distribution function can be derived with tools from extreme value theory, such as with the Pickands–Balkema–de Haan theorem (Balkema & de Haan 1974; Pickands 1975), the Fisher–Tippett–Gnedenko theorem (Fisher 1930), or as described in Gumbel (1958), which converges to three possible distributions (the Gumbel distribution, the Fréchet distribution, or the Weibull distribution). In our case, however, the cumulative form of the thresholded power-law size distribution (generalized Pareto) function is our preferred model of choice, due to its ability of taking incomplete sampling, background subtraction, and finite system size into account, and thus is known a priori, so that it does not need to be estimated with extreme value theory, as it would be needed in the case of an unknown tail distribution function. The theoretically defined distribution is then fitted to the observed distributions directly, and the extreme event excess factor qee is conveniently quantified in terms of standard deviations in common χ2-statistics.
2.7. Stellar Flare Observations
Impulsive flaring with rapid increases in the brightness in UV or EUV has been observed for a number of so-called flare stars, such as AD Leo, AB Dor, YZ Cmi, EK Dra, or Eri. These types of stars include cool M dwarfs, brown dwarfs, A-type stars, giants, and binaries in the Hertzsprung–Russell diagram. Most of these stars are believed to have hot, soft X-ray emitting coronae, similar to our Sun (a G5 star), and thus magnetic reconnection processes are believed to operate in a similar way as on our Sun (see, e.g., review by Güdel 2004).
We complement the solar flare data sets with three data sets of stellar flares, two recorded with KEPLER and one observed with the EUVE telescope, with the cumulative size distributions shown in Figures 4(g), (h) and 5. The fit of a cumulative power-law distribution shown in Figure 4(g) produces a size distribution with a power-law slope of α = 1.68 ± 0.12 for the peak counts, which is similar to the slope observed in solar flares, i.e., α = 1.70 ± 0.02 with HXRBS (Figure 2(a)), α = 1.81 ± 0.02 with BATSE (Figure 2(b)), and α = 1.86 ± 0.02 with RHESSI (Figure 2(c)). The goodness-of-fit for extreme events is χee = 1.45, which is similar to χall = 1.26 for all events (Figure 4(g)).
A second data set with stellar flare data from KEPLER, derived from the bolometric energy, is shown in Figure 4(h), yielding a power-law slope of , which is similar to the value expected for flare areas
(Table 2). Since the largest contributions to the bolometric intensity come from the solar or stellar photospheric surface area, it makes sense that its size distribution scales with the flare area (rather than with a 3D volume).
Table 2. Theoretically Predicted Power-law Slopes of Solar Flare Size Distributions
Parameter | Power-law Exponent | Power-law Exponent | Power-law Exponent | Data Set |
---|---|---|---|---|
(General Expression) |
![]() |
of Observed Solar Flares | ||
![]() |
αx | |||
Length L |
![]() |
![]() |
![]() |
A15 |
Length L |
![]() |
![]() |
![]() |
A14 |
Area A |
![]() |
![]() |
![]() |
A14 |
Magnetic energy Emag |
![]() |
![]() |
![]() |
A14, A16 |
Emission measure EM |
![]() |
![]() |
![]() |
A15 |
Bolom. energy Ebol |
![]() |
![]() |
![]() |
KEPLER |
Volume V |
![]() |
![]() |
![]() |
A14 |
Volume V |
![]() |
![]() |
![]() |
A15 |
Thermal energy Eth |
![]() |
![]() |
![]() |
A15, A16 |
Time duration T | αT = 1 + (S − 1)β/2 | αT = 2.00 | αT = 2.66 ± 0.03 | HXRBS |
Time duration T | αT = 1 + (S − 1)β/2 | αT = 2.00 | αT = 2.19 ± 0.03 | BATSE |
Time duration T | αT = 1 + (S − 1)β/2 | αT = 2.00 | αT = 1.90 ± 0.02 | RHESSI |
Time duration T | αT = 1 + (S − 1)β/2 | αT = 2.00 | αT = (2.73 ± 0.21) | A14, A16 |
Peak flux P | αP = 1 + (S − 1)/(γS) |
![]() |
![]() |
HXRBS |
Peak flux P | αP = 1 + (S − 1)/(γS) |
![]() |
![]() |
BATSE |
Peak flux P | αP = 1 + (S − 1)/(γS) |
![]() |
![]() |
RHESSI |
Peak flux P | αP = 1 + (S − 1)/(γS) |
![]() |
![]() |
A16 |
Peak flux P | αP = 1 + (S − 1)/(γS) |
![]() |
![]() |
KEPLER |
Fluence C |
![]() |
![]() |
![]() |
HXRBS |
Fluence C |
![]() |
![]() |
![]() |
BATSE |
Fluence C |
![]() |
![]() |
![]() |
RHESSI |
Fluence C |
![]() |
![]() |
![]() |
A16 |
Nonthermal energy Enth |
![]() |
![]() |
![]() |
A16 |
Note. The power-law slopes fitted to observed data sets (HXRBS, BATSE, RHESSI) are listed in the fourth column. Measurements with poor statistics and less reliable values are marked with parenthesis (...), obtained from the data sets A14, A15, and A16 (Aschwanden et al. 2014, 2015, 2016).
Download table as: ASCIITypeset image
In Figure 5 we present cumulative size distribution fits of 12 cool (type F to M) stars, observed by Audard et al. (2000), with typically 5–15 flare events per star. For most of these stars, the flare intensities closely follow the fitted cumulative distribution all the way to the most extreme event, except for two cases (out of 12): HD 2726 (F2 V) in the top left panel and CN Leo, 1994 (M6 V) in the bottom middle panel, which could qualify as outliers (or Dragon-King events). The power-law slopes inferred in this sample vary strongly within a range of , which tend to be somewhat higher than the mean values of solar flare size distributions, but are not reliable in such small-number statistics.
Figure 5. Cumulative frequency distributions of flare energies (total counts) observed for 12 cool (type F to M) stars with EUVE (Audard et al. 2000). The flare events are marked with diamonds, fitted with a power-law fit in the lower half (P; thick line), and fitted with a cumulative frequency distribution (C; curved function). Note the two stars with outliers at the largest events (HD 2726) and (CN Leo, 1994).
Download figure:
Standard image High-resolution imageIn summary, stellar flares fit cumulative size distributions at the upper cutoff (within an energy range of ≈0.8–1.4 decades), but the power-law slopes appear to be slightly steeper than observed in solar flares, and outliers of extreme events occur only rarely (in 2 out of 12 cases).
3. Discussion
3.1. Astrophysical Predictions of Power-law Slopes
While we are interested in the investigation of deviations from ideal power laws, which is a second-order effect, we should consider first the physical justification of the existence of power-law size distributions in the first place. The original discoverers of SOC models noted power-law–like size distributions with power-law slopes of of the sizes x of SOC avalanches, as well as power-law slopes of αT ≈ 2 for the durations T of SOC avalanches, obtained in cellular automaton simulations (e.g., Pruessner 2012). A more physical approach of modeling and predicting power-law slopes of size distributions observed from astrophysical SOC systems has been put forward in terms of the so-called fractal-diffusive (FD-SOC) model (Aschwanden 2013, 2015; Aschwanden et al. 2016). We juxtapose theoretical predictions of this model and mean observational values in Table 2.
The most fundamental assumption in the FD-SOC model is the scale-free probability conjecture, which directly predicts a power-law size distribution function of geometric length scales L from first principles, i.e.,

where S is defined as the Euclidean dimension of the SOC system (with possible values of ). From this fundamental power-law distribution, other power-law size distributions of any physical size parameter x that has a linear (x ∝ L), or a nonlinear relationship
to the length scale L (with power index p), can be derived. The following scaling laws have been used in the calculation of the power-law indices listed in Table 2:






where D2 is the area (A) fractal dimension, D3 is the volume (V) fractal dimension, β is the spatiotemporal diffusion coefficient for the time duration (T) of a SOC avalanche with classical diffusion (β = 1), γ is the flux-volume scaling of observed fluxes F of astrophysical sources, Vmax is the maximum fractal volume, which obeys the Euclidean dimension S of the SOC system, is the peak or maximum flux, and E = F T is the fluence, defined as the product of the flux F and time duration T.
The mean fractal dimension is estimated from the average of the minimum () and maximum
values,

which yields and
for SOC models that occupy either an Euclidean 2D or 3D dimensional space.
Since all relationships given in Equations (15)–(21) are expressed in terms of the variable L, the power-law indices αx for the size distribution of the parameters can be derived by substitution of variables, i.e.,
, yielding the power-law indices αx,







where the approximative numerical values are obtained by inserting .
In Table 2 we juxtapose these theoretical power-law slope values with the observed values. We see that there is mostly a good agreement (within a few percent) for solar flare size distributions with large statistics (), such as for peak fluxes (αP), total counts (αC), and flare durations (αT), while others with small-number statistics show larger deviations from the predictions, such as for length scales (αL), areas (αA), and volumes (αV).
New results are obtained here from the power-law slopes of energetic flare parameters. The dissipated magnetic energies Emag are found to have power-law indices that are close to the values of flare areas, i.e., versus
, which could be explained by the fact that the dissipation of free magnetic energies is concentrated in the chromospheric flare ribbons, which have a horizontal area-like surface geometry. This is also true for the bolometric flare energies observed in stellar flares, i.e.,
versus
, since the bolometric brightness is mostly irradiated in the photosphere of stellar surfaces. Also for the thermal EM, we find similarity with area-like features, i.e.,
versus
, which suggests that the soft X-ray EM in solar flares has an area-like extension with relatively low altitudes. For thermal flare energies, for which we expect mostly a volume dependence,

as the electron density and the electron temperature vary much less than the flare volume. Indeed, we find values of that are similar to the volume power-law index,
. For nonthermal flare energies we find a close coincidence of the power-law slope of
, which is close to the power-law slopes of fluences,
, which is expected because both the fluences (E = F T) and the nonthermal energies,
, are time-integrated flare quantities.
These comparisons of power-law indices thus reveal interesting relationships that are relevant for determining the physical scaling laws that govern solar flares.
3.2. Outliers or Dragon-King Events
From the 23 solar (Figures 3, 4) and 14 stellar data sets (Figures 4, 5) we found little evidence for outliers among the most extreme events in each data set. The largest outliers for the most extreme events are found for a small data set (N = 172) of flare volumes with an excessive extreme event factor of qee = 2.75 (Figure 3(c)), for a small stellar flare data set (N = 209) with qee = 3.55 (Figure 4(g)), and for the two stars HD 2726 and CN Leo with even less statistics (N ≈ 15; Figure 5). The fact that all these most extreme events occur in small data sets, while larger data sets with N ≳ 103 − 104 events (Figure 2) reveal no excesses larger than qee ≲ 2.0, provides little evidence for the existence of super-extreme events that are not part of the “normal” population of small and large flare events. Hence, we conclude that extreme events of solar and stellar flares are largely scale-free (over an inertial range of ≈3–4 decades) in their volumes, fluxes, or energies.
What does it mean that we find no significant outliers in the cumulative size distributions of solar and stellar flares? As the size distribution of length scales reveals in Figure 3(a), solar flares have length scales of Mm, which at the upper limit matches the maximum size of active regions on the solar surface. The largest possible flare that extends over the entire solar surface would have a circumference of
Mm, which is apparently impossible since we never observed a flare with a size larger than L ≲ 200 Mm. One effect that could help to increase the maximum flare size is the case of sympathetic flaring, which involves a magnetic coupling between adjacent (or even remote) active regions. However, the larger an unstable magnetic field configuration is, the smaller the filling factor of magnetic energy and heated plasma is with respect to the size of a flaring region. Consequently, the envisioned process of amplification and synchronization promoted in the Dragon-King hypothesis (Sornette 2009; Sornette & Ouillon 2012) seems to be questionable for solar and stellar flares.
3.3. Deviations from Power Laws
It has been argued that real data deviate from ideal power-law size distributions, as they have been reproduced by cellular automaton simulations (e.g., Pruessner 2012). Some skeptics went even as far as to deny the existence of power laws at all (Stumpf & Porter 2012). Traditional least-square fitting techniques have been criticized to be inaccurate, while maximum-likelihood fitting methods with goodness-of-fit tests based on the Kolmogorov–Smirnov statistic were preferred (Clauset et al. 2009). However, we have demonstrated here and earlier (Aschwanden 2015) that least-square fitting methods of both differential and cumulative size distributions yield self-consistent fits of the power-law slope, as long as data truncation, undersampling of data in small events below some threshold, subtraction of event-unrelated background noise, and the steep fall-off at the upper bound of the cumulative size distribution is properly modeled. In other words, a straight power-law function represents an oversimplified model for most cases, while inclusion of the effects enumerated above is essential in defining adequate models for fitting statistical distributions.
Now, with the availability of large data sets (with events), we are becoming much more sensitive to the smallest deviations from ideal power laws. So it is no surprise that deviations from power laws become the rule rather than the exception. The deeper reason for these power-law deviations is the inhomogeneity of the medium and the physical conditions. It is instructive to consider the sandpile analogy of SOC systems. Although the cone shape of a sandpile appears to have a constant slope or angle of repose, when viewed from distance, closer inspection reveals macroscopic channels, erosions, bumps, and dents, on top of the microscopic fine structure. As an illustration we mention some empirical data sets (chosen by Clauset et al. 2009) that are shown in Figure 6 and described in the Appendix. Some data sets clearly reveal a complete inadequacy to fit any power-law function to data with a broken power law (Figure 6(b): wildfires; Figure 6(c): cities). The case of weblinks has such a tremendous statistic (
events) that it is highly sensitive to the smallest deformations from an ideal power law, i.e., amounting to χall = 223 for a slight deficit near
(Figure 6(h)). A Dragon-King event with an excess of
is indicated in the data set of terrorism (Figure 6(e)).
Significant deviations from ideal power-law size distributions have also been detected in stellar flares, e.g., in the Kepler-based study of Sheikh et al. (2016), and most of the analyzed stars were found to exhibit no evidence for SOC avalanching. It is argued that these stellar flares are subject to “tuned criticality” rather than “SOC” (Sheikh et al. 2016).
3.4. Physical Mechanisms of SOC Avalanches
The observations of SOC avalanches exhibit power-law–like size distributions, which constrain (scale-free) scaling laws of physical parameters (see Section 3.1), but do not directly reveal the physical mechanism of SOC processes as such. The power-law slopes of the size distributions of solar and flare data have been found to be consistent with the scale-free probability conjecture (Equation (15)), fractal-diffusive transport (Equation (18)), and with EUV and soft X-ray emission (flux or intensity) that is quasiproportional to the fractal volume of an avalanche event (Equation (20)). These scaling laws are consistent with standard solar flare models, which are driven by a magnetic reconnection process in the solar corona (Lu & Hamilton 1991). However, the observed scaling laws are also consistent with other physical mechanisms, such as turbulent photospheric convection, which may be (stochastically) coupled with coronal reconnection events (Uritsky & Davila 2012; Uritsky et al. 2013; Knizhnik et al. 2018). Additional interpretations of physical mechanisms operating in SOC systems can be found in Aschwanden (2011) and Aschwanden et al. (2016).
The interpretation of SOC systems in terms of physical mechanisms is beyond the scope of this study, but the fact that we find virtually no outliers or Dragon-King events implies that only one single physical mechanism with appropriate properties is needed to explain the observed power-law size distributions. A dual energy dissipation system is likely to produce broken power-law distributions. However, the same physical process, such as a magnetic reconnection process, can have multiple secondary energy dissipation processes, including nonthermal particle acceleration (hard X-ray fluxes), thermal flare plasma heating (soft X-ray fluxes), launching of a coronal mass ejection (white light emission), or electron beam formation (radio emission). Quantitative energy budget calculations have revealed energy closure for the primary flare-dissipated magnetic energy and secondary energy dissipation mechanisms (Aschwanden et al. 2017). All energy that is dissipated during a solar flare is supplied by the free (magnetic) energy in an active region. Interestingly, multiscale intermittent dissipation with power-law–like SOC properties has also been detected in quiet sun regions at spatial scales of ≳3 Mm, controlled by turbulent photospheric convection (Uritsky et al. 2013). It appears that we observe a coexistence of coronal SOC systems (in active regions) and photospheric SOC systems (in quiet sun regions).
4. Conclusions
In this study we search for outlier events in astrophysical data sets from solar and stellar flares. Such an endeavor is motivated by the “Dragon-King hypothesis” (Sornette 2009; Sornette & Ouillon 2012), which suggests that the most extreme events in a statistical distribution may belong to a different population, and thus may be generated by a different physical mechanism, than the ubiquitous power-law distributions commonly observed in nonlinear systems driven by SOC. For this project we extracted 23 data sets of solar flare data (from HXRBS/SMM, BATSE/CGRO, RHESSI, AIA/SDO, and HMI/SDO), 12 data sets from stellar flares (from EUVE and KEPLER), and another 8 data sets of nonastrophysical data. We define a thresholded power-law distribution, which includes the effects of data truncation, undersampling of small events below a threshold, subtraction of event-unrelated background noise, and the steep fall-off at the upper bound of cumulative size distributions. We tested the accuracy of our size distribution forward-fitting method for both the differential and the cumulative occurrence frequency size distributions, as well as using the method of rank-order plots. Our least-square fitting techniques focus mostly on detecting deviations from ideal power laws (by means of the χ2-criterion) and on detecting outliers of the most extreme events in each statistical distribution. In the following we summarize the major conclusions.
- 1.Identifying Dragon-King events is important for accurate predictions and forecasting of the largest catastrophes. For solar and stellar flare data, the frequency and size of the most extreme events have generally been established from extrapolating power-law distributions, which may underestimate the largest events, if such outlier events belong to a different population and are generated by different physical mechanisms. Prediction and forecasting of large events generated by solar flares or coronal mass ejections have been recognized to play a central role in space weather predictions (e.g., Bloomfield et al. 2012; Gallagher et al. 2002; Barnes et al. 2007; Leka & Barnes 2003; Georgoulis & Rust 2007; Schrijver 2007, 2009; Barnes & Leka 2008; Aschwanden 2019, Sections 16.8 and 16.9 and references therein).
- 2.Dragon-King events are theoretically possible in solar and stellar flares, e.g., in the case of sympathetic flaring. Sympathetic flares are a pair of flares that occur almost simultaneously in different active regions, not by chance, but because of some physical connection (for instance by propagating Alfvénic waves). Statistical evidence for sympathetic flaring has been established for 48 events, some of them consisting of transequatorial loops (Moon et al. 2002). Another example is a quadrupolar configuration with two magnetic flux ropes that are located within a pseudostreamer, which have been observed to lead to two consecutive reconnections and eruptions, a scenario for twin-filament eruptions that can explain coupled sympathetic eruptions also (Török et al. 2011). Coupling flares in adjacent regions could double up the joint flare volumes and energies, if both active regions have similar sizes.
- 3.Dragon-King events are not common in solar and stellar flares. In our analysis we measured excessive extreme event factors qee for each data set, which characterizes the offset of outliers by the ratio of the observed most extreme event to the expected most extreme size based on a cumulative size distribution fit. We found that this factor amounts generally to less than a factor of qee ≲ 2, in 21 out of the 25 investigated solar and stellar size distributions. The four cases possibly containing Dragon-King events were found for solar flare volumes (qee = 3.08), stellar flare peak counts (qee = 3.55), and in flares from the red dwarf stars HD 2726 and AD Leo (Figure 5). However, Dragon-King events have been found in other, nonastronomical data sets (Figure 6), such as in earthquakes (qee = 2.82), power blackouts (qee = 3.35), and terrorism (qee = 6.15).
- 4.Extreme events of solar and stellar flares are scale-free. The fact that almost all extreme events of solar and stellar flares fit a power-law size distribution with a small value of the excessive extreme event factor (qee ≲ 1.5) indicates that solar and stellar flare processes are scale-free (within the inertial range), and thus can be modeled with the same power-law (SOC) model for small or large events. This implies also that small and large flare events are subject to the same physical mechanism, most likely caused by a magnetic reconnection process in active regions, driven by shearing and twisting of the magnetic field.
- 5.Deviations from power laws are common in large-number statistical distributions. Adversely, distributions with acceptable power-law fits are more likely to be found in size distributions with small-number statistics. The accuracy of fitting power-law slopes scales with the square root of the number of events, and hence the measurement of deviations from ideal power laws is a matter of instrumental sensitivity. However, large-number statistics (N ≳ 104) requires comprehensive flare catalogs or automated pattern recognition codes, which is the reason why we have a high accuracy of power-law fits for extensive data sets only, such as for peak counts (P), total counts (C), or time durations (T), while energetic parameters could only be measured in large (GOES M- and X-class) flares, yielding small-number statistics (N ≳ 102) with a lower degree of accuracy.
- 6.Accurate size distributions provide tests of physical scaling laws in solar and stellar flares. The fractal-diffusive SOC model predicts a set of power-law slopes (for avalanches in a SOC system) that can straightforwardly be compared with the corresponding observed size distributions. Consequently, the obtained power-law slopes provide consistency tests of the underlying physical scaling laws. In this study we discovered new correlations that indicate that the dissipated magnetic energies (
), flare EMs (
), and bolometric energies (
) scale with the flare area, that the thermal flare energy scales with the flare volume (
), and that the nonthermal energy scales with the total (time-integrated) hard X-ray counts (
). The reason why these flare scaling laws can be inferred mostly from the flare volume V or area A is the high nonlinearity of these geometric parameters (
,
), while other physical parameters (such as the electron density ne or temperature Te) scale only linearly in the definitions of the thermal energy and pressure.
There are a lot of new questions that arise from this study that can be addressed in follow-on work, e.g., what physical conditions are different in Dragon-King events? Does sympathetic flaring occur in the most extreme events? Which stars produce outliers and what is different in the flare productivity of these stars? If we increase the size of the data sets to large-number statistics, do we obtain more accurate power-law slopes that converge from different instruments? If we improve the statistics, do the power-law indices in physical scaling laws converge to the theoretically predicted values? Can stellar power laws be used to tell us which flare phenomena are controlled by the flare size and their relationship to the depth of the stellar convection zone?
This work was stimulated by the organizers of a workshop on “Mechanisms for extreme event generation” (MEEG) at the Lorentz Center at Snellius, Leiden, The Netherlands, 2019 July 8–12, organized by Drs. Norma Bock Crosby, Bertrand Groslambert, Alexander Milovanov, Jens Juul Rasmussen, and Didier Sornette. The author acknowledges the hospitality and partial support of two previous workshops on “Self-Organized Criticality and Turbulence” at the International Space Science Institute (ISSI) at Bern, Switzerland, during 2012 October 15–19, and 2013 September 16–20, as well as constructive and stimulating discussions with Sandra Chapman, Paul Charbonneau, Aaron Clauset, Norma Crosby, Michaila Dimitropoulou, Manolis Georgoulis, Stefan Hergarten, Henrik Jeldtoft Jensen, James McAteer, Shin Mineshige, Laura Morales, Mark Newman, Naoto Nishizuka, Gunnar Pruessner, John Rundle, Surja Sharma, Antoine Strugarek, Vadim Uritsky, and Nick Watkins. This work was partially supported by NASA contracts NNX11A099G “Self-organized criticality in solar physics,” NNG04EA00C of the SDO/AIA instrument, and NNG09FA40C of the IRIS instrument.
Appendix: Empirical Data
In the spirit of interdisciplinary research, let us visit eight nonastrophysical data sets of empirical data (Figure 6, Table 1) that have been compiled and scrutinized for deviations from ideal power laws by Clauset et al. (2009). If we investigate the goodness-of-fit χ2-values of all events in each data set, we find significant deviations from ideal power laws for most of the cases, except for blackouts (Figure 6(e): χcum = 0.83), but this phenomenon has also the smallest statistics (N = 210), and thus is not sensitive to small deviations from power laws. On the other hand, if we consider just the average goodness-of-fit χ2-values among the 10 most extreme events, we find consistency with thresholded power-law distributions for earthquakes (Figure 6(a): χee = 1.45), blackouts (Figure 6(d): χee = 0.63), terrorism (Figure 6(d): χee = 0.43), surnames (Figure 6(g): χee = 1.34), and weblinks (Figure 6(h): χee = 1.07). These results differ somewhat from those obtained in Clauset et al. (2009), although they also find most of the data are not consistent with an ideal power-law distribution, which is expected for fitting different functions to the observed distributions, such as the thresholded Pareto-type power-law function used here. The evaluation of power-law deviations in Clauset et al. (2009) is subject to an arbitrary truncation of small events, while our method derives a threshold automatically, which is important, because the choice of (scale-free) inertial range boundaries affects the outcome of the power-law slope value most.
Figure 6. Cumulative size distributions are shown for eight empirical data sets from Clauset et al. (2009): earthquake intensities (a), forest fires (b), population of cities (c), electric power blackouts (d), terrorist attack severity (e), count of words (f), frequency of surnames (g), and links to web sites (h). Representation otherwise similar to Figures 2–4.
Download figure:
Standard image High-resolution image