<?xml version="1.0"?><!DOCTYPE article SYSTEM "/project/take/software/searchbench_offline_processing/paperxml_generator/aclextractor/src/python/../resource/dtd/paperxml.dtd"><article><header><firstpageheader><page local="1"/><title>ON THE INTERPRETATION OF NATURAL LANGUAGE INSTRUCTIONS</title><author surname="Eugenio" givenname="Barbara Di"><org  name="University of Pennsylvania" country="USA" city="Philadelphia"/></author><author surname="White" givenname="Michael"><org  name="University of Pennsylvania" country="USA" city="Philadelphia"/></author></firstpageheader><frontmatter><p>On the Interpretation of Natural Language Instructions</p><p><b>Barbara Di Eugenio        Michael White</b></p><p><b>Department of Computer and Information Science University of Pennsylvania Philadelphia, PA, USA {dieugeni, mwhite}@linc.cis.upenn.edu</b></p></frontmatter><abstract><b>In this paper, we discuss the approach we take to the interpretation of instructions. Instructions describe actions related to each other and to other goals the agent may have; our claim is that the agent must actively compute the actions that s/lie has to perform, not simply "extract" their descriptions from the input.</b> <b>We will start by discussing some inferences that arc necessary to understand instructions, and we will draw some conclusions about action representation formalisms and inference processes. We will discuss our approach, which includes an action represen­tation formalism based on Conceptual Structures rJac90], and the construction of the structure of the agent's intentions. We will conclude with an example that shows why such representations help us in analyzing instructions.</b> </abstract></header><body><section number="1" title="Making sense of instructions"><p>Consider the following three instructions:</p><p>(la) <i>Go into the other room to get the urn of coffee.</i></p><p>(lb) <i>Before you pick it up, be sure to unplug it.</i></p><p><b>(lc)   </b><i>When you bring it back here, carry it carefully with both hands.</i></p><p>Let's consider (la). To understand this instruction, an agent must find the connection between the two actions <i>a</i><b>—</b><i>go into the other room, </i>and <i>ß</i><b>—</b><i>get the urn of coffee. </i>The infinitival <i>to </i>alerts the agent to the fact that <i>a </i>con­tributes to achieving <i>ß. </i>General knowledge about phys­ically getling objects requires that the agent move to the place where the object is located; therefore, the agent will infer that the (most direct) connection between these ac­tions has <i>go into the other room </i>fulfilling this requirement However, this is not enough. An assumption needs to be made for such connection to go through, namely, that <i>the urn is in the other room.</i></p><p>This example shows that to <i>make sense </i>of instructions, an agent must engage in the active computation of the action(s) to be executed, and cannot simply "extract" all such information from the input. This differentiates our work from others', as we will discuss shortly.</p><p>Another important point that arises from (la) is that the relation <i>contributes </i>holding between <i>a, </i>described in the matrix clause, and <i>ß, </i>described in the purpose clause<footnote anchor="1"/>, can be specilied either as <i>generation </i>or <i>enablement, </i>as a study of naturally occurring <i>purpose clauses </i>[Di 92a] shows.</p><p>Generation was introduced by [Gol70], Informally, if ac­tion <i>a </i>generates action <i>ß, </i>we can say that <i>ß </i>is exe­cuted by executing <i>a. </i>An example is <i>Turning on the light </i>by <i>flipping the switch.</i></p><p>Enablement. Following <b>[P0I86] </b>and [Bal90], action <i>a en­ables </i>action <i>ß </i>if and only if an occurrence of <i>a </i>brings about conditions necessary for the subsequent perfor­mance of <i>ß. </i>In <i>Unscrew the protective plate to expose the box, </i>"unscrew the protective plate" enables "tak­ing the plate off' which generates "exposing the box".</p><p>In <b>[P0I86], </b>it is shown that these two relations arc nec­essary to model action descriptions conveyed by Natural Language. We would like to add one further observation: such relations allow us to draw conclusions about action execution too. This is quite useful since we <i>do </i>have to execute (ie„ animate) the input instructions, as our work is taking place in the context of the <i>Animation from Natural Language (AnimNL) </i>project at the University of Pennsyl­vania [WBD*91].</p><p>As far as generation is concerned, while two actions are described, only <i>a, </i>the generator, needs to be performed; instead, if <i>a </i>enables <i>ß, </i>after executing <i>a, ß </i>still needs to be executed. In fact, if <i>a </i>enables <i>ß, a </i>has to begin, but not necessarily end, before <i>ß.</i></p><footnote label="1">We are using the term purpose clauses to informally designate sub­ordinate clauses — such as those introduced by to — that express the agent's purpose in executing Ihe action described in the matrix clause. The usage of the term purpose clause in the syntactic literature is some­what different — see [Jon85].</footnote><doubt alpha="51.2" length="43" tooSmall="False" monospace="0.0">Actes dk COL1NG-92, Nantes, 23-28 août 1992</doubt><doubt alpha="0.0" length="4" tooSmall="False" monospace="0.0">1147</doubt><doubt alpha="47.7" length="44" tooSmall="False" monospace="0.0">Proc. of COLING-92. Nantes, Auo. 23-28, 1992</doubt><page local="2"/><p>In both cases, the goal <i>ß </i>also <i>constrains </i>the interpreta­tion and / or execution of a. An example of this as regards generation is (2) <i>Cut the square in half to create two triangles.</i><i></i></p><p>The only action to be performed is <i>cut the square in half. </i>However, there is an infinite number of ways to cut a square in half: the goal <i>create two triangles </i>restricts the choice to <i>cut the square along one of the two diagonals.</i></p><p>We tum next to the second instructipn (lb). Observe that the agent understands <i>pick up </i>to be part of the sequence that achieves <i>get the urn of coffee. </i>This is not warranted by the preposition <i>before: </i>if (lb) were <i>Before you ruin it, be sure it's unplugged, </i>the agent clearly shouldn't infer that <i>ruin it </i>is part of <i>getting the urn\ </i>This shows that in <i>before a, ß, </i>the action o is not necessarily part of achieving a certain goal, even if <i>ß </i>is.</p><p>As far as (lc) goes, the agent has to understand that <i>bring it back here </i>is part of achieving <i>getting the urn; </i>that <i>carry it carefully with both hands </i>generates <i>bring it back here, </i>provided that <i>carry it carefully with both hands </i>is augmented with the destination <i>back here. </i>Notice that the action description carry <i>it carefully with both hands </i>is fairly complex, sporting two modifiers in addition to the traditional arguments of agent and patient.</p></section><section number="2" title="Problems and Proposed Solutions"><p>The following conclusions can be drawn from the obser­vations in the previous section:</p><p>1. NL action descriptions are fairly complex, including modifiers of many different types—see also [WD90]. An action representation formalism must be able to deal with complex descriptions, such as <i>carry it care­fully with both hands; </i>with descriptions at different levels of abstraction, such as <i>go </i>and <i>walk to, </i>or such as <i>cut the square in half </i>and <i>cut the square in half along the diagonal </i>in (2).</p><p>2. NL instructions include a wide variety of construc­tions, such as <i>purpose clauses </i>and <i>temporal clauses. </i>Instruction interpretation systems must be able to deal with complex imperatives and with the relations be­tween actions that they express.</p><p>3. An instruction interpretation system cannot assume that the descriptions of the actions to be performed are equivalent to the logical forms computed by the parser: such logical forms have to be constrained in various ways, e.g. by computing assumptions, as in (la), or more specific action descriptions, as in (2)<footnote anchor="2"/>. Notice that these constraints derive from the interac­tion between the actions to be executed and the goals the agent adopts. It is essential that this interaction is taken into account by such systems.</p><footnote label="2">In this paper we will only discuss the former type of constraint com­pulation; the latter is discussed in [Di 92b).</footnote><p>Work done in the past on understanding instructions has generally concentrated on simple positive commands, and has failed to address some of the desiderata listed above: [VB90] limits the interaction between new and preexist­ing goals to inserting the new goats in the list of goals if their execution does not violate preexisting œnstraints, otherwise they are rejected. [Cha91] proposes a model of instruction interpretation which seems useful at the level of the basic skills an agent is endowed with, but in which there is no internal structure to actions, and no distinction between the agent's actions and goals. [AZC91] instead does assume a rich relation between instructions and pre­existing goal(s). However, instructions arc not continually integrated into the plan the agent is developing; instead they are used as a resource when the stored knowledge about plans cannot be adapted to the situation at hand.</p><p>Turning now to our proposal, our approach to these prob­lems includes</p><p>1. An <b>action representation formalism </b>based on <b>lack­</b>endoff s Conceptual Structures [Jac90].</p><p>2. An <b>action KB </b>that contains simple plans that repre­sent common sense knowledge about actions.</p><p>3. A <b>plan graph </b>that represents the structure of the agent's intentions.</p></section><section number="3" title="Action representation"><p>We have chosen to use Jackendoff's Conceptual Structures [Jac90] for two reasons. First, as our point of departure is NL, there are the obvious benefits of using a linguistically motivated representational theory, e.g. easing the burden upon the parser to produce such representations [Whi92]. Second, there is significant mileage to be gained from using a decompositional theory of meaning, insofar as the prim­itives effectively capture important generalizations. In Ulis section we introduce the notation and some minor modifi­cations to the theory as presented in [Jac90]. We use <i>Go into the other room </i>as a representative example.</p><p>In Jackendoff's theory, an entity may be of ontological type Thing, Place, Path, Event, State, Manner or Property. The conceptual structure for a room is shown in (3a) below:</p><p>(3a)   <b>[Thing </b><b>hoom] </b>(3b)   <b>[Thing KITCHEN]</b></p><p>Square brackets indicate an entity of type Thing meet­ing the enclosed featural description. Small caps indi­cate atoms in conceptual structure, which serve as links to other systems of representation; for example, the con­ceptual structure for a kitchen (3b) differs from that of a to a body that generates a header.<page local="3"/> The <i>annotations </i>on the body specify the relations between the subactions; such re­lations include partial temporal ordering, enablement, and possibly others.</p><doubt alpha="57.5" length="40" tooSmall="False" monospace="0.0">ActesmiCOLING-92. Nantes, 23-28 août1992</doubt><doubt alpha="42.9" length="49" tooSmall="False" monospace="0.0">1148 Pkoc. oi-COLING-92, Nantes, Aug. 23-28, 1992</doubt><p>From the planning tradition, we retain the notions of <i>qualifiers </i>and <i>effects. </i>Qualifiers are conditions that make an action relevant: for example, <i>unplug x </i>is relevant only if <i>x </i>is plugged.</p><p>Notice the importance of using a representation such as Jackendoff's: it helps us capture the common characteris­tics of different actions, e.g. <i>get </i>and <i>carry. </i>The seman­tic representation for carry would also match the generic move-action template, and would add to it a qualification such as</p><doubt alpha="56.7" length="30" tooSmall="False" monospace="0.0">(10)[MannerWITH([T1„„gHANDS])]</doubt><p>Having such a representation is also useful for comput­ing qualifiers and effects in a systematic way: they can be precompiled from the representation itself. For example, for every action including a component <b><i>6 </i></b>such as we know tliat after <b><i>6,</i></b><b><i> </i></b><i>j </i>must be at /, therefore we can include this in the effects of the action. Given the further restriction that <i>j </i>cannot be in two places at once, we may infer that <i>j </i>cannot <b><i>\k </i></b>at / now, and thus precompute the qualifier<footnote anchor="8"/>.</p><doubt alpha="20.0" length="15" tooSmall="False" monospace="0.0">[«&gt;„.(,[™rr™])L</doubt></section><section number="4" title="The plan graph"><p>The <i>plan graph </i>represents the structure of the intentions that the agent adopts as a response to the instructions. It keeps track of the goals the agent is pursuing, of the hier­archical relations between the goals and the actions whose execution achieves such goals, and of various relations be­tween the actions. It also helps interpret the instructions that follow. In (t), establishing the initial goal <i>get the urn of coffee </i>provides the context in which the two following instructions have to be interpreted—a similar strategy is adopted for example by [Kau90]. In Fig. 2, we show the complete structure built after interpreting <b>(1).</b></p><p>A node in a plan graph contains the Conceptual Structure representation of an action, augmented with the consequent slate achieved alter the execution of that action<footnote anchor="9"/>. The arcs represent relations between actions; among them, those relevant to our example arc: <i>temporal, </i>such as <i>precedes </i>in Fig. 2; <i>enablement; generation, </i>and its generalization <i>substep, </i>used when <i>a </i>belongs to a sequence of more than one action that generates <i>ß.</i></p><p><b>BJackendoff suggests something analogous with his inference rules, which have yet to be formalized.</b></p><p><b>°In Fig. 2 the labels on the nodes are only mnemonics, and do not represent their real contents.</b></p><p><b>Al: UE(um. lNtfoliitr-rooin])) A2: DEtum, plüggcd-ln)</b></p><figure caption="Figure 2: The plan graph."></figure><p>There may also be assumptions associated with a plan graph. If an assumption is derived from the qualifiers as­sociated with an action, it is associated with the node de­scribing that action—A2 in Fig. 2; if it is derived while inferring a relation between two actions, it is associated with the corresponding arc—Al.</p><p>The plan graph is built by an interpretation algorithm that takes as its input the logical form constructed by the parser. The algorithm works by keeping track of the ac­tive <i>nodes, </i>which include the goal currently in focus, and the nodes just added to the tree. The topmost level of the algorithm invokes different procedures, according to the particular syntactic construction at hand - e.g. the con­struction <i>Do a to do ß </i>will trigger the hypothesis that <i>a </i>either generates or enables <i>ß </i>[Di 92b]. These procedures retrieve the plan(s) associated with the goal currently in focus, and then expand such plans in a hierarchical fash­ion.</p><p>These procedures embody various inference processes, that can be characterized either as <i>planning</i>—e.g. plan ex-paasion, subgoaling— or as <i>plan inference</i>—e.g. inferring assumptions, inferring the more abstract goal some actions are supposed to achieve. Space doesn't allow us to go into further delails about the algorithm or the inference pro­cesses; rather, in the next section we will give an example of how assumptions are computed.</p></section><section number="5" title="Making an Assumption"><p>We will now show how the assumption that the um is to be found in die other room is made while processing (la), <i>Go into the other room to gel the urn of coffee.</i></p><p>The process begins with the following representation constructed by the parser, where the FOR-function (de­rived from the <i>to </i>-phrase) encodes the <i>contributes </i>relation holding between the go-action a and the get-action <i>ß:</i></p><doubt alpha="56.1" length="41" tooSmall="False" monospace="0.0">Actes de COLING-92, Nantes, 23-28août1992</doubt><doubt alpha="0.0" length="5" tooSmall="False" monospace="0.0">114!)</doubt><doubt alpha="47.7" length="44" tooSmall="False" monospace="0.0">l'Roc.of COLING-92, Nantes, Aug. 23-28, 1992</doubt><page local="4"/><p>room only in its choice of constant, leaving the determi­nation of their similarities and differences to a system of representation better suited to the task<footnote anchor="3"/>.</p><p>To distinguish instances of a type, we follow [ZV91] in requiring every conceptual structure to have an index:</p><doubt alpha="60.0" length="15" tooSmall="False" monospace="0.0">(4)[Thingroom)!</doubt><p>Conceptual structures may also contain complex features generated by conceptual functions over other conceptual structures. For example, the conceptual function IN: Thing —&lt;■ Place may be used to represent the location <i>in the room </i>as shown in (Sa) below. Likewise, the function TO: Place —» Path describes a path that ends in the specified place, as shown in (5b) — (5c) is an equivalent representation of (5b), where the index / stands for the entire constituent<footnote anchor="4"/>:</p><p>(5a)  <b>bine </b>IN<b>([Thing </b>RooM<b>]t)]i</b></p><doubt alpha="48.7" length="39" tooSmall="False" monospace="0.0">(5b)[p,„hTO([pi„« IN([Thmgroom]t)]i)]„,</doubt><doubt alpha="41.2" length="17" tooSmall="False" monospace="0.0">(5c)[p.thTO(I)]„,</doubt><p>To complete our clause<footnote anchor="5"/>, it remains only to add the con­ceptual function GO: Thing x Path —» Event:</p><doubt alpha="64.0" length="25" tooSmall="False" monospace="0.0">its[EventGO([Thing]•, m)]</doubt><footnote label="1">' IP.th TO([IN ([OTHER -room])])] m</footnote><p>As there is no subject in our clause, the constituent <i>i </i>(prag­matically, the <b>agent) </b>in (6) is left unspecified.</p><p>To distinguish <i>Walk into the other room </i>from (6), we include an indication of manner<footnote anchor="6"/>:</p><doubt alpha="45.5" length="11" tooSmall="False" monospace="0.0">m \GO(i'm)1</doubt><p><b>L </b><b>[Manner </b><b>walking] </b><b>J</b></p><p>Finally, semantic fields, such as Spatial and Posses-sional, are intended to capture the similarities between sen­tences like <i>Jack went into the other room </i>and <i>The gift went to Bill, </i>as shown in (8) below:</p><doubt alpha="48.9" length="47" tooSmall="False" monospace="0.0">(8a)   [GOsp([jack],[TO([IN([otiieb-ROOM])])])]</doubt><doubt alpha="43.6" length="39" tooSmall="False" monospace="0.0">(8b)[GOPo„([gift], [TO([AT([BiLL])])])]</doubt><p>The idea is that verbs like <i>go </i>leave the semantic field un-derspecified, whereas verbs like <i>donate </i>specify a particular field. In addition to these semantic fields, we propose to add a new one called Control. It is intended to represent the functional notion of <i>having control over </i>some object. For example, in sports, the meanings of <i>having the ball, keeping the ball, </i>and <i>getting the ball </i>embody this notion, and are clearly quite distinct from their Spatial and Pos-sessional counterparts; (9) represents <i>Jack got the ball:</i></p><doubt alpha="42.9" length="42" tooSmall="False" monospace="0.0">(9)   [GOCtl.i([ball],[TO([AT([jack])])])]</doubt><footnote label="3">In our case, the action representation formalism is grounded in the animation system serving as the back-end to the AntmNL project.</footnote><p><b>* We will often adopt the representation in (5c), and leave out indices and ontologies, types, in order to lessen the typographical burden of representing large conceptual structurel.</b></p><p><b>ignoring, of course, the meaning of <i>other </i>for now.</b></p><footnote label="6">Though this is clearly intended, Jackendoff never explicitly represents such a distinction.</footnote><doubt alpha="100.0" length="6" tooSmall="False" monospace="0.0">Header</doubt><doubt alpha="54.8" length="31" tooSmall="False" monospace="0.0">[CAUSE([agent].,[GOsp(j, fc)])]</doubt><doubt alpha="50.0" length="24" tooSmall="False" monospace="0.0">FROM([AT(j)]) I TO(i-) I</doubt><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">Body</doubt><doubt alpha="44.4" length="27" tooSmall="False" monospace="0.0">- [GOSp([i([TOC[ATO)1)])]7I</doubt><p><b>- [CAUSE(i, [GOctriO. [TO([AT(i)])])])].</b></p><doubt alpha="48.3" length="29" tooSmall="False" monospace="0.0">[ GOSp(.,fc) ]" I[WITIIO)]J,3</doubt><p><b>- Annotations -</b> <b>Qualifiers</b> <b>-[NOTBEspO, DI</b></p><doubt alpha="63.6" length="22" tooSmall="False" monospace="0.0">- 71enables72enables73</doubt><doubt alpha="100.0" length="7" tooSmall="False" monospace="0.0">Effects</doubt><doubt alpha="38.5" length="13" tooSmall="False" monospace="0.0">- [BESp(j,1)]</doubt><figure caption="Figure 1: AMove Something SomewhereAction.3.1   The action KB"></figure><p>The action KB contains simple plans that represent com­mon sense knowledge about actions, and whose compo­nents arc expressed in terms of Jackendoff's semantic prim­itives. To discuss the characteristics of these plans, we will refer to the move-action KB entry shown in Fig. 1, which might be described as follows: go to where <i>j </i>is, get control over it, then take it to <i>I<footnote anchor="7"/>.</i></p><p>Actions have a <i>header </i>and a <i>body. </i>This terminology is reminiscent of planning operators; however we express the relations between these components in terms of <i>enablement </i>and <i>generation</i>—e.g. the body <i>generates </i>its header.</p><p>The representation does not employ preconditions, be­cause it is very difficult to draw the line between what is a precondition and what is part of the body of an action. One could say that <i>having control over </i>the object to be moved is a precondition for a move-action. However, if the object is heavy, the agent will start exerting force to lift it, and then carry it to the other location. It is not obvi­ous whether the lifting action is still part of achieving the precondition, or already part of the body. Therefore, we don't have preconditions, but only actions which are sub-steps in executing another action, that is, they may belong<page local="5"/></p><footnote label="7">This do-it-yourself method is but one way to move something from where it is to somewhere else. Other methods would be listed separately in the action kb.</footnote><doubt alpha="47.3" length="93" tooSmall="False" monospace="0.0">Actes de COLING-92, Nantes, 23-28 août 1992 1150 Proc. of COLING-92, Nantes, Aug. 23-28, 1992</doubt></section><section title="Acknowledgements"><doubt alpha="46.3" length="54" tooSmall="False" monospace="0.0">I"GOSp([agent](, [T()([IN([other-room])])])[ l'"OR(/5)</doubt><doubt alpha="62.5" length="40" tooSmall="False" monospace="0.0">[CAUSE(i,[GOsp([uriN-oi^coFFF,K]j,k)))]p</doubt><doubt alpha="43.5" length="23" tooSmall="False" monospace="0.0">f FROM([AT(j)]) [ TO(/)</doubt><p>Given Die presence of the <i>to </i>phrase, we know that <i>a </i>may be part of a sequence of actions that generate <i>ß. </i>To pursue this hypothesis, we begin by looking up <i>ß </i>in the action KB. <i>ß </i>matches the general move-action shown in Fig. 1 if the object to be moved <i>j </i>is bound to <i>the urn of coffee:</i></p><doubt alpha="66.7" length="18" tooSmall="False" monospace="0.0">j= [URN-OF-GOKFEE]</doubt><p>Next we try to match <i>a </i>with some subaction <b>7 </b>of <i>ß. a </i>matches the first action <b>71 </b>in <i>ß </i>if we take [AT(j)J and <b>[IN</b>([<b>otuer-room1)] </b>to be the same place. This is tanta­mount to making the following assumption:</p><doubt alpha="44.4" length="36" tooSmall="False" monospace="0.0">(11)   [BESp(j, [IN([OTHER-RO0M])j)]</doubt><p>Once the instruction is understood in this way, the two actions may be incorporated into the plan graph as shown in Fig. 2.</p><p>One should mention that assumption (11) could of course be wrong, say if there were a note in the next room saying <i>ha ha, it's not really in this room but the next.</i></p><p>Notice that even if there is already an urn of coffee in the current room, die instruction <i>Go into the other room to get the urn of coffee </i>is still understood to refer to an urn in the other room. This contrasts sharply with <i>Go into the other room to </i>wash <b>out </b><i>the urn of coffee, </i>where the most likely urn is the currently visible one. In the current framework, this difference would be captured in the following way. Unlike in the case of the <i>get </i>-action, the #o-action matches titc following subaction of <i>wash-out :</i></p><doubt alpha="63.4" length="41" tooSmall="False" monospace="0.0">|GOSp([i,rrO([AT(WASUING-MATERIAt.s])])]7</doubt><p>Therefore, assumption (11) will not be derived, permitting die possibility of Ute urn being in the current room.</p></section><section number="6" title="Summary and Future Research"><p>We have presented an approach to action representation and iastruction interpretation which we feel is more flexible than previously proposed formalisms: it allows us to use terms at different levels of specificity, and to perform the complex inferences that <b>NL </b>instructions require.</p><p>Future research includes exploring how to integrate a hierarchical organization of entities, actions and plans with the action KB.</p><p>The system is being implemented in Quintus Prolog, with substantial progress having been made in particular on the parser [Whi92], and on the action KB.</p><p><b>This research was supported by the following grants: DARPA no. N00014-90-J4863, ARO no. BAAL CO-89-C-0031, NSF no. IRI 90-16592, and Hen Franklin no. 91S.3078C-1. We would like to thank all the members of the AnimNL group, and in particular Bonnie Webber, Libby Levison and Chris Geib, for very stimulating and helpful discussions.</b></p></section><references><p><b>[AZC911 Richard Altcmian, Roland Zito-Wolf, and Tamitha Carpenter. <i>Interaction, Comprehension, and Instruc­tion Usage. </i>Technical Report CS-91-161, Iirandeis University, 1991.</b></p><p><b>[Bal90] Cécile Balkanski. <i>Modelling act-type relations in col­laborative activity. </i>Technical Report TR-23-90, Cen­ter for Research in Computing Technology, Harvard University, 1990.</b></p><p><b>[Cha91] David Chapman. <i>Vision, Instruction and Action. </i>Cambridge: MIT Press, 1991.</b></p><p><b>[Di 92a] llarbaru Di Eugenic. <i>Goals and Actions in Natural Language Instructions. </i>Technical Report MS-CIS-92-07, University of Pennsylvania, 1992.</b></p><p><b>[Di 92b] Barbara Di Eugenio. Understanding Natural Lan­guage Instructions: the Case of Purpose Clauses. In <i>Proceedings ACL 92, </i>1992.</b></p><p><b>[G0I7O] </b><b>Alvin Goldman. <i>A Theory of Human Action. </i>Prince­ton University Press, 1970.</b></p><p><b>[Jac90[ Ray Jackendoff. <i>Semantic Structures. Current Stud­ies in Linguistics Series, </i>Ttie MTT Press, 1990.</b></p><p><b>[Jon85] Charles Jones. Agent, patient, and control into pur­pose, clauses. In <i>Chicago Linguistic Society, 21, </i>1985.</b></p><p><b>[Kau90] Henry Kautz. A circumscriptive theory of plan recog­nition. In J. Morgan, P. Cohen, and M. Pollack, edi­tors. <i>Intentions in Communication, </i>MIT Press, 1990.</b></p><p><b>[P0I86J </b><b>Martha Pollack. <i>Inferring domain plans in question-answering. </i>PhD thesis. University of Pennsylvania, 1986.</b></p><p><b>[VB90| Steven Vere and Timothy Bickmorc. A basic agent. <i>Computational Intelligence, </i>6:41-60, 1990,</b></p><p><b>[WBD»9U Bonnie Webber, Norman Badlcr, Barbara Di Euge­nio, Libby Levison, and Michael White. Instruct­ing Animated Agents. In <i>Proc. US-Japan Workshop on Integrated Systems in Multi-Media Environments. Las Cruces, NM, </i>1991.</b></p><p><b>[WD90] Bonnie Webber and Barbara Di Eugenio. Free Ad­juncts in Natural Language Instructions. In <i>Proceed­ings COUNG 90, </i>pages 395^)00. 1990.</b></p><p><b>[Whi92] Michael White. Conceptual Structures and CCG: Linking Theory and Incorporated Argument Ad­juncts. 1992. COUNG '92.</b></p><p><b>[ZV91] Joost Zwarts and Henk Verkuyl. An Algebra of Con­ceptual Structure; an investigation into Jackendoff's Conceptual Semantics. 1992. To appear in <i>Linguis­tics and Philosophy.</i></b></p><doubt alpha="51.1" length="45" tooSmall="False" monospace="0.0">Actus de COLING-92,Nantes, 23-28août19921 151</doubt><doubt alpha="47.7" length="44" tooSmall="False" monospace="0.0">Proc. of COLING-92. Nantes, Aug. 23-28, 1992</doubt></references></body></article>