Next: Results and Analysis Up: Learning Cooperative Procedures Previous: MODIFY-PKCB.

Section 5. Learning to Interleave Goals

As mentioned previously, using execution traces as a basis for future cooperative plans allows the agents to learn plans beyond the scope of the first-principles planner (or a traditional second-order planner based on it). Another example shows how lifter L2 learns to interleave two goals. In this example, HTO and L1 will act as in the first example, with the box in question now extra large box XLBOX1 rather than MBOX3. The following are L2's plans and actions, starting four steps before HTO makes her first request to L1.

1.
L2 creates a typical plan to get XLBOX1 onto the truck: to lift, carry and load the box jointly with L1. L1 agrees to lift the box together and they do so. L1 then agrees to carry the box to the street. However, XLBOX1 is too large to carry, even jointly, and the operator (and hence rest of the plan) fails.
2.
L2 creates a plan to get SBOX5 onto the truck. The plan consists of putting down XLBOX1 with L1's help and then lifting, carrying and loading SBOX5 onto the truck by herself. L2 is delayed in asking for L1's assistance because HTO calls first with a request to put XLBOX1 onto HANDTR2. L1 agrees to help HTO.
3.
When L2 does ask L1 to help achieve HANDEMPTY via putting XLBOX1 down together, L1 replies that she would rather load the box together onto the hand-truck. L2's planner adapts her current plan by replacing the PUT-DOWN-TOGETHER with the appropriate LOAD-TOGETHER. The agents then load XLBOX1 onto the hand-truck. L2 continues on with her plan and loads SBOX5 onto the truck. Meanwhile, HTO has pushed the hand-truck to the street and L1 has agreed to get XLBOX1 onto the truck.
4.
L2 constructs a plan to get large box LBOX2 onto the truck and moves back to ROOM1.
5.
L2 is interrupted before attempting to lift LBOX2 by a request from L1 to help unload XLBOX1 from the hand-truck. L2 constructs the plan of moving to the street and then unloading XLBOX1. The agents do so.
6.
L1 asks L2 to load XLBOX1 onto the truck and they do.

The sequence of actions L2 undertakes corresponds to six different calls to the planner. Nonetheless, L2 can extract a single cooperative procedure from her execution traces by the machinery of collective memory (showing the original literals instead of new variables for clarity):

((WAIT-FOR L1 (ON XLBOX1 HANDTR2))
 (LOAD-TOGETHER XLBOX1 HANDTR2)
 (LOAD SBOX5 TRUCK1)
 (WAIT-FOR L1 (ON XLBOX1 TRUCK1))
 (LOAD-TOGETHER XLBOX1 TRUCK1))

Interleaving loading and unloading a box onto the hand-truck with loading a small box onto the truck, as in this example, is the most useful procedure lifters learn in the current version of the system. Its utility arises from low idle time and wide applicability.


Next: Results and Analysis Up: Learning Cooperative Procedures Previous: MODIFY-PKCB.
Andrew Garland
1998-05-22