We wish to study how the performance of the different players and species on
this experiment has changed over time. Fig. 3.10 shows the sliding
window method applied to one robot. It reveals how inexact or ``noisy'' the
RS estimates are when too few games are put together. It is apparent that 100
games or more are needed to obtain an accurate measure.
Since each individual agent embodies a single, unchanging strategy for the game
of Tron, the model should estimate approximately the same strength value for
the same agent at different points in history. This is indeed the case, as seen
for example on figs. 3.10 (bottom) and 3.11a.The
situation with humans is very different: people change their game, improving
in most cases (fig. 3.11b).