The GCP Experiment
The GCP recorded its first data on August 4, 1998.
Beginning with a few random sources, the network
grew to about 10 instruments by the beginning of 1999, and to 28 by
2000. It has continued to grow,
stabilizing at roughly 60 to 65 eggs by 2004.
The early experiment simply asked whether the network
was affected when powerful events caused large numbers of people to
pay attention to the same thing. This experiment was based on a
hypothesis registry
specifying a priori for each event a period of time
and an analysis method to examine the data for changes in statistical
measures.
Various other modes of analysis including attempts to find general
correlations of GCP statistics with other longitudinal variables
have been considered, and continue to be developed.
Purpose
In the most general sense, the purpose of the project was and is
to create and document a consistent database of parallel streams of
random numbers generated by high-quality physical sources. The goal
is to determine whether any correlations might be detectable of
statistics from these data with independent long-term physical
or sociological variables. In the original experimental design we
asked the more limited question
whether there is a detectable correlation of deviations
from randomness with the occurrence of major events in the world.
Hypothesis
The formal hypothesis of the original event-based experiment is very broad.
It posits that engaging global events will correlate with
deviations in the data. The identification of global events and the
times at which they occur are specified case by case, as are the recipes
for calculating the variance deviations. This latitude of choice makes
the original experiment complicated to analyse, but by standardizing
the results, we can obtain a composite outcome. This
constitutes a general test of the broadly defined formal hypothesis.
Analytical Recipes
The formal events are fully specified in a
hypothesis registry. Over the years, several different
analysis
recipes were invoked, though most analyses specify either the
"network variance" (the squared Stouffer Z) or the "device variance" method.
Each recipe stipulates how the event statistic is
calculated, by first specifying a block statistic within the
blocked examination period and then a method for combining these to give an
event statistic. Note that the test statistic is a single
value representing the deviation from expectation for the
whole period specified in the registry. The results table
has links to details of the analyses, typically including a "cumulative
deviation" graph tracing the history of the second-by-second
deviations during the event, leading to the terminal value
which is the test statistic.
The following table shows the precise algorithms for
the basic statistics used in the analyses.
Control Data
It is possible to generate various kinds of controls, including
matched analysis with a time offset in the actual database, or matched
analysis using a pseudorandom clone database. However, the most general
control analysis is achieved by comparisons with the empirical distributions of the test
statistics.
These provide a rigorous control background and confirm the
analytical results for the formal series of hypothesis tests.
Compound Result
Over the six years since the inception of the project, 170
replications of the basic hypothesis test have been accumulated.
The composite result is a statistically
significant departure from expectation of 4 standard deviations.
The combined result from
these analyses thus gives support for the formal hypothesis,
and this encourages a deeper look, beginning with a thorough re-analysis
of the original findings, and proceeding to extensive analysis using
other methods.
Sharpening the Focus
The focus of our effort turns now to a more comprehensive program of
rigorous analyses and incisive questions intended to characterize the
data more fully and to facilitate the identification of any non-random
structure.
We begin with thorough documentation of the analytical and
methodological background for the main result, to provide a solid
basis for new hypotheses and experiments. The goal is to
increase both the depth and breadth of our assessments, to develop
sound interpretations, and ultimately to elucidate the meaning
of the original findings.
Critical Assessments
A variety of analyses have been undertaken to establish the
quality of the data and characterize the output of individual devices
and the network as a whole.
The first stage is a careful search for any data that are problematic
because of equipment failure or other mishap. Such data are removed.
With all bad data removed,
each individual REG or RNG can be
characterized to provide empirical estimates for statistical parameters.
These are used to convert the database into a normalized, completely
reliable data resource to facilitate rigorous analysis.
The intent is to lay the basis for an
assessment of the multi-year database with sophisticated statistical and
mathematical techniques.
We then
can use a range of statistical tools to look for small, but reliable
changes from expected random distributions that may be correlated with
natural or human-generated variables.
Acceptable Events
A major effort was made to identify the "formal" events that could be
accepted according to rigorous criteria. This resulted in a set of
170 usable events over the first 6 years of the project.
A total of 13 events that were originally in the formal series were
excluded because they were partially redundant or overlapped others, or
were not unambiguously defined in the original narrative hypotheses.
Real Devices vs Theory
Ideally, the trials recorded from the REGs distribute like
binomial [200, 0.5] (mean 100, variance
50). But although they all are high-quality random sources, perfect
theoretical performance is not
the case for these real-life devices. A logical XOR of the raw
bit-stream with a fixed pattern of bits with exactly 0.5 probability
compensates mean biases of the regs.
Normalized and Standardized Data
After XOR'ing, the mean is
guaranteed over the long run to fit theoretical expectation. The trial
variances remain biased, however. The biases are small (about 1 part in
10,000) and generally stable on long timescales. We treat them as real
albeit tiny biases that need to be corrected by normalization for
rigorous analysis.
They are corrected by converting the trialsums for each individual egg
to standard normal variables (z-scores), based on the emprirical standard
deviations.
Re-Analysis of the
Event-based Experiment
The normalized and standardized data resource allows us to to a rigorous
re-analysis of the
event-based experiment
This was the primary analysis approach for the first few years of the
project, and it generated sufficient evidence of anomalous correlations
to justify deeper analysis, and more general correlation strategies.
In this approach, "global events" are identified and hypothesis that
specifies a time period and an analysis recipe is registered. The
analytical results are combined into a cumulative, or aggregate,
assessment of the hypothesis of correlated departures from expectation.
New Analyses: Extensions and Explorations
The background of careful preparation for rigorous analysis can be
envisioned as a conversion of the GCP database to a "data resource"
which can be examined with power and flexibility. As we proceed, new
materials will be added to this page.
The following excursions are examples of what can now be done with
some facility. Some provide deeper understanding of previous work,
others give new perspectives and insights.
We have developed a number of questions
that are capable of informing us deeply about the nature and quality
of the evidence. As we proceed, we expect to have many cases that, in
Peter's term, "will require a lot of mulling," but can learn much
from the ability to visualize the data in different ways.
Long Trends and Correlations
The rigorously normalized and standardized data resource
can be used for a wide variety of completely general analyses
that are not constrained to the event-based protocols.
For example, we ask whether there is any significant large scale
structure with questions addressing
long trends and correlations in the full, six-year database.
If there are long trends, we can in principle expect to find
correlations with independent external measures. Among the
possibilities are sociological data. For example, we compare GCP
network variance with perceptions of
presidental performance
measured in polls that ask the question:
"Do you approve or disapprove of the way the President is handling his job?"
Splitting the Data
A strong test of the hypothesis that there is structure in the data can
be made by determining if the same trends and patterns are found in
independent subsets. We look at this by
splitting the data into alternate seconds.
Sliding the Event Time
Here we look at the aggregate event zscore when the event examination
periods are shifted uniformly in time.
The question is what happens to the evidence for anomalous deviation
associated with an event as a result of
sliding the event periods over the
dbase in 1/2 hr steps and recalculating the aggregate Z at each step.
Impulse Events, Quakes, Distance
Examination of "Impulse" events shows structure and includes indications
that the response may begin a little early. Here we look at
events defined for the formal hypothesis test series, and compare them
with a new database of arguably similar events, namely big earthquakes.
They show remarkably similar trends, even though there are differences
related to the choice of statistics. This page also include some
modeling to assess the effect of geographic distance on pair
correlations.
Earthquakes, Population, Locality
Large earthquakes occur with sufficient frequency to allow assessment of
the relationship of GCP effects to the distribution of the Eggs. They
provide an opportunity to consider the question whether the effects are
nonlocal in the strong sense, and whether the proximity of people
affected by the earthquake is an essential contributor to the effects.
Assessing the Effect of Blocking
The earliest analysis method was a hand calculation
using 15-minute blocking of the data. In this method the
composite Z for each egg is computed for each time block in the event.
The sum of the resulting Zē values
is a Chisquare with degrees of freedom equal to the number of
blocks times eggs.
The early procedure was replaced for most events
by a "standard analysis" using the raw data with no blocking.
But an obvious question was what effect the
various blocking levels might have on the
outcome. One form of the question is, "What is the optimum blocking
level?"
Here we begin to look at this question in a rigorous and comprehensive
way.
New Year Celebrations
One of the most interesting recurring events that we have examined is
the New Year transition. We have made a hypothesis each year since
1998/1999 that the period around midnight on New Years eve will show
structure in the network variance -- the squared Stouffer Z across eggs.
Beginning in 1999/2000 we also examined the device variance -- the
sum of squared Z-scores per egg each second. These analyses accomodate
the moving locus of the New Year celebrations by doing a signal average
across time zones. The data resource allows a much more facile
exploration of the question whether the
New Year Variance Analysis shows structure.
Analysis of Periodic Variation
Fourier analysis gives us a general answer to the
question whether there is any indication of periodic structure in
the data. We wish to know, for example, if there is any diurnal
variation suggesting differences corresponding to time of day, or if
there are any longer term effects associated with the day or the week,
etc.