|
The GCP database has a small number of known errors due to hardware
problems in a computer or an REG/RNG source. The database cannot
be edited to exclude these errors, so they must be accounted for in
analysis. Fortunately, they affect very little of the time we have been
collecting data, but the list is presented here so that any analysis
which covers an affected date can exclude the problematic
egg's data.
A computer readable (CSV) file of egg errors
on available on request. It is the best source to use for excluding
bad data, and must be used for formal analyses.
Please refer to the Official Error List
for up to date information.
Following are the larger chunks of bad data that should be excluded from
any analysis.
-
Egg 1005, Defective data from December 10 1998 to December 17 1998.
-
Egg 1005, Defective data from October 31 to November 3 2002, with good
data beginning at 13:25 PST = 21:25 UTC again (analyst may drop this egg
on Nov 3).
New OS and machine, trouble bringing system up, resulting in a mean of
255 for several hours, 17:25 to 22:19:50, on 20021031.
-
Egg 106, Defective data on 18 November 1999.
-
Egg 1025, Defective data from 9 November 1999 to 21 November 1999.
-
Egg 1026, Defective data from 17 November 1999 to December 30, 1999.
-
Egg 1000, Defective data from 10:40:01 UTC 16-OCT-2000 until 07:39:58, 23-OCT-2000.
Running tests with substituted defective RNG.
-
Egg 28, Defective data from March 20 2001 to May 30 2001.
Later discovered the bad data started earlier. Exclude from Feb 20 to
May 30 2001.
Power supply replaced, data checked on 31st and are now nominal.
-
Egg 28, Defective data from August 15 2002 to Aug 17 2002; variance ~28,
indicating large bias corrected by the XOR.
Power supply again, most likely. I will substitute a Microreg.
-
Egg 1101, Defective data on December 25, 2001.
Unknown cause, Mean is ~15 or 16.
-
Egg 223, All data recorded from egg 223 must be discarded. It had an
out-of-spec power supply and sent data with extremely small variance,
largely determined by the XOR rather than the random source.
The egg has been changed to an Orion, with egg ID # 1223.
-
Egg 118, from March 28 to May 19 inclusive, 2001, all data should be excluded.
There may be none because 118 was down for change to new system, then
reporting impossible trial values of
255 during this time because of a serial port problem.
Good data begins on May 20 at 18:15.
The above supersedes an earlier note which said:
"Defective data from May 20 to end of June, 2001."
Intermittent, unknown cause, possibly power deficit? We have
established that likely cause was incorrect power from Mac serial port.
-
Egg 2222, Defective data from June 13 to June 15, and June 19, 2001
Brief shutoff at 14:15 to 15:00 on 13th, then Z's of 20 and 30, not subtle.
Cause was incorrect BIOS setting for serial ports. Unfortunately, the
incident of the 19th, comprised of impossibly high means for perhaps a
minute after a resumption following a break, means the data from 2222
may not be reliable in general, and should be excluded from primary
analyses. Subsequent monitoring indicates the egg is functioning
normally, so return to the reliable list as of July 1, 2001.
-
Egg 1000, uncertain data after restart on February 27 2002, continuing
to date.
No obvious bad trials, but many missings. The uncertainty and the
missings result from using a non-standard, untested system of eggsh and OS.
It is the new MAC OS X. By trial and error, the host discovered the
number of missings was reduced if the regtest program is running
concurrently with eggsh. I recommend excluding the data for analyses
that need guaranteed data, but otherwise have no reason to reject them.
-
Egg 226 had a 0 in the data from 2002-10-29. Apparently a hardware
crash generated at least one corrupt trial. Exclude the whole day.
-
Egg 161 had bad data from about 02:15 to 08:30 on 2002-12-03.
The egg was moved to new hardware about that time, but I don't yet know
what caused the problem. Exclude the whole day.
-
Egg 1022 had bad data on 2003-02-24, for a short period, surrounded by
no data. The computer had crashed and this was from a new installation,
all high trials, ranging from 110 to 124, mean 118.121.
The egg also had bad data on 2003-02-25. Exclude both days.
-
Egg 2236 had bad data on 2003-03-11, reporting both a 0 and a 251, and a
Variance of 55.722. Exclude 2236 the whole day.
-
Egg 142 had bad data on 2003-05-21, reporting a 0 and a 184, after which
it shut down. Exclude egg 142 for the whole day.
-
Egg 227 had bad data on 2003-05-25, reporting a series of 255 values.
It was being transferred to a new host and had bad data when it was
-
Egg 2235 had a 0 on 2003-06-23. That whole day for 2235 should be excluded.
The same thing on 2003-06024, so exclude that day also for 2235, and
shut down the egg -- as of June 26 00:10. Exclude also the 25th and 26th.
-
Egg 102 had a 0 and a 250 in 85863 trials on 2003-08-06. That whole day for
102 should be excluded. The host reported his system was damaged by the
heat wave and had repairs around this date.
-
Egg 102 had a 253 in 42153 trials on 2003-08-13. That whole day for
for 102 should be excluded. Host has been having trouble, see previous.
-
Egg 2235 was opened again for data in August, and ran OK for some weeks,
then had a 52 (6.8 sigma) on 2003-09-12, which indicates it still has
intermittent errors. The whole day of 2003-09-12 should be excluded
for that egg.
-
Egg 110 had a 0, probably because of an unexpected shutdown, so exclude
2003-10-05 00:00:00,2003-10-05 23:59:59,110
-
Egg 2235 had a 31 on 2003-10-07 and a 0 on 2003-10-08. Both days
should be excluded for that egg. Also the 9th.
For rigorous analyses, egg 2235 should
be excluded for all days beginning 2003-06-23. Another 0 on 031013 and
one on 031014. Exclude the 15th and 16th as well. Turned off on Oct 16 2003.
-
Egg 115 had a 173 around 21:15, with no indication of the source. Exclude
the whole day 2003-11-05 00:00:00,2003-11-05 23:59:59, egg 115.
-
Egg 2236 had bad data beginning 2003-11-09, identified with low variance
indicating failure of RNG or Serial port. Exclude 2236 from that date
until further notice.
-
Egg 115 had a 180 around 01:45 on Nov 16 2003, no indication of source. Exclude
the whole day 2003-11-16 00:00:00,2003-11-16 23:59:59, egg 115.
-
Egg 100 had bad data generated during a shutdown on Dec 11 2003.
Exclude the whole day 2003-12-11, 00:00:00 23:59:59, egg 100.
Please refer to the Official Error List
for up to date information.
In addition, there are occasional trials that are clearly out of range,
that apparently occur when something causes a computer shutdown. See
the following sections.
Device Standard Deviation
Because the three REG/RNG devices are real-world machines, they
cannot be "perfect" theoretical devices. The logical XOR does guarantee
that the mean will be unbiased, but it is possible for the standard
deviation (SD) to be consequentially reduced if the logic
corrects for a very large bias. Other factors can in principle affect
the empirical SD, for example, bit-wise autocorrelation.
The designs are excellent, so these potential distortions are
very small, indeed too small to be detected in analyses of hours or
days. However, with really large samples, using on the order of six
months of data, several of the devices do show significant decreases or
increases, and it is possible to see
"SD signatures" that
distinguish the the three REG types.
Mike Meyer has been carefully examining the actual behavior of the
REG/RNG devices over long time periods, to see how much the real-world
physical devices differ from their theoretical image. The most
comprehensive version of his work is summarized in tables
Comparing Theoretical and Empirical SD
for GCP calculations.
An earlier version of this effort looks at
SD for individual eggs.
Out of Range Trials
Mike's programs also checked for, and rejected all values of 0 and 200.
In the period tested for the individual SD calculations, the
results show four instances of rejection, and show that the zeros found have
the same pattern surrounding them. The pattern for all 4 days is:
-random number
-1
-1
-1
-184
-0
-a few other numbers
-then the generators produce no data for an unknown length of time
Here are the dates and times of the events:
Jun 22, 2000 21:43:16 egg id# 111
Dec 09, 2000 13:57:48 egg id# 115
Feb 27, 2001 01:33:04 egg id# 118
Mar 16, 2001 21:43:16 egg id# 106
The simplest procedure to deal with this infrequent but important
misbehavior is to include a step in analysis that excludes
all data with values <55 or >145.
It is possible that other instances exist, although the
general case is that the eggs either work
according to design, or they break badly and show obviously
defective data. We have discussed implementing some
filtering to remove bad data according to objective
criteria, but this has not been done. If you have ideas and skills
which you would like to apply to this task, please let
Roger Nelson know.
Synchronization
While most of the eggs run in complete synchrony to the second,
using software to
continually correct the time using timeservers or other precise sources,
a few dialup eggs are apparently not running such programs,
or cannot connect to
timservers during their relatively short dialup times. Their data-streams are
therefore not precisely aligned but are likely to be offset to varying
degrees from correct UTC time. In most cases the difference is on the
order of a few seconds, and will not make an important
difference in analyses using large
blocking units, although the offset will exert a conservative influence in
any fine-scale analyses. The following list are probably off by a few
seconds, and they should be excluded from analyses that strongly
depend on synchronization to the second: Eggs 100, 103, 107, 108,
114, 1025, 1026.
Other Information
In addition to actual errors, there are occasional differences from the
normal course of events, such as the following instance of testing with
a different random source than the one normally assigned to the node:
For EGG 2000, the period from UTC 16-Jan-01 13:53:59 until 17-Jan-01
04:24:08 was collected using Mindsong REG #139, rather than the normal
Orion
REG. There are short gaps in data before and after this period, so it
should
be clearly identifiable in the data.
This substitution should not make any important difference in results;
we have not detected any
differences in the performance of the three different REG/RNG sources.
Until recently, that is. Mike Meyer examined individual egg parameters
in data sequences of six months duration. He determined that while the
mean is within specifications, the standard deviation is slightly
different from theory.
The effect is probably not important in most analyses. The magnitude is
on the order 0.0001 on average. For best accuracy, the empirical standard
deviation calculated over a large database (at least several days)
should be used.
|