<?xml version="1.0"?><!DOCTYPE article SYSTEM "/project/take/software/searchbench_offline_processing/paperxml_generator/aclextractor/src/python/../resource/dtd/paperxml.dtd"><article><header><firstpageheader><page local="1" global="161"/><title>Meta-rules as a Basis for Processing Ill-Formed Input</title><pubinfo>Copyright 1984 by the Association for Computational Linguistics. Permission to copy without fee all or part of this material is granted provided that the copies are not made for direct commercial advantage and theJournalreference and this copyright notice are included on the first page. To copy otherwise, or to republish, requires a fee and/or specific permission. 0362-613X/83/030161-17$03.00 American Journal of Computational Linguistics,   Volume 9, Numbers 3-4,   July-December 1983</pubinfo><author surname="Weischedel" givenname="Ralph M."><org  name="University of Delaware" country="USA" city="Newark"/></author><author surname="Sondheimer" givenname="Norman K."><org  name="University of Delaware" country="USA" city="Newark"/></author></firstpageheader><frontmatter><p>Meta-rules as a Basis for Processing Ill-Formed Input<footnote anchor="1"/></p><p>Ralph M. Weischedel<footnote anchor="2"/></p><p>Department of Computer &amp; Information Sciences University of Delaware Newark, DE 19716</p><p>Norman K. Sondheimer</p><p>USC/lnformation Sciences Institute 4676 Admiralty Way Marina del Rey, CA 90292</p></frontmatter><abstract><b>If natural language processing systems are ever to achieve natural, cooperative behav­ior, they must be able to process input that is ill-formed lexically, syntactically, semantical-ly, or pragmatically. Systems must be able to partially understand, or at least give specific, appropriate error messages, when input does not correspond to their model of language and of context.</b> <b>We propose meta-rules and a control structure under which they are invoked as a framework for processing ill-formed input. The left-hand side of a meta-rule diagnoses a problem as a violated rule of normal processing. The right-hand side relaxes the violated rule and states how processing may be resumed, if at all.</b> <b>Examples discussed in the paper include violated grammatical tests, omitted articles, homonyms, spelling/typographical errors, unknown words, violated selection restrictions, personification, and metonymy. An implementation of a meta-rule processor within the framework of an augmented transition network parser is also described.</b> </abstract></header><body><section number="1." title="Introduction"><p>Natural language understanding systems have im­proved markedly in recent years, and natural language interfaces have even begun to enter the commercial marketplace, for example, the INTELLECT system of Artificial Intelligence Corporation (Harris 1978). These systems promise to make major improvements in the ease of use of data base management and other computer systems.  However, they have only begun to consider the problems of truly natural input. The emphasis has been, and continues to be, on the under­standing of well-formed inputs. True natural language input is often ill-formed <i>in the absolute sense </i>of being filled with misspellings, mistypings, mispunctuations, tense and number errors, word order problems, run-on sentences, extraneous forms, meaningless sentences, and impossible requests. In addition, natural input is ill-formed <i>in the relative sense </i>of containing requests that are beyond the limits of either the computer sys­tem or the natural language interface.</p><footnote label="1">This material is based upon work supported in part by the National Science Foundation under Grant Nos. IST-8009673 and IST-8311400 and in part by the Defense Advanced Research Pro­jects Agency under Contract No. MDA 903-8l-C-0335, ARPA Order No. 2223. Views and conclusions contained in this paper are the authors' and should not be interpreted as representing the official opinion or policy of DARPA, the U.S. Government, or any person or agency connected with them.</footnote><footnote label="2">Currently visiting at the Department of Computer &amp; Informa­tion Science, University of Pennsylvania, Philadelphia, PA 19104.</footnote><p>Evidence indicates that absolutely ill-formed input regularly occurs in a data base query environment. For instance, in an extensive study (Thompson 1980) including 1615 inputs, only 1093 were parsable, and an overall total of 446 contained various kinds of er­rors: 161 with vocabulary problems, 72 with punctua­tion errors, 62 with ungrammaticality, and 61 with spelling errors.<page local="2" global="162"/> Furthermore, 211 inputs were frag­mentary, including 91 parsed terse replies and 67 parsed terse questions.</p><p>In another experiment (Eastman and McLean 1981), 693 English queries to a data base system were analyzed. Co-occurrence violations, including subject-verb disagreement, tense errors, apostrophe problems, and possessive/plural errors occurred in 12.3% of the queries. Omitted words, extraneous words/phrases, telegraphic ellipsis, and incomplete sentences arose in 14% of the queries.</p><p>Our conjecture is that wherever natural language interfaces are employed, ill-formed input will occur. Any natural language interface, when faced with ill-formed input, must either intelligently guess at a user's intent, request direct clarification, or at the very least accurately identify the ill-formedness. As Wilks (1976) states, "Understanding requires, at the very least, ... some attempt to interpret, rather than merely reject, what seems to be ill-formed utterances."</p><p>Though experience has shown that users can adapt to the limitations of the system's well-formed antici­pated input (Harris 1977b, Hendrix et al. 1978), we feel that relying on such user adaptation ignores one of the most powerful motivations for English input: enabling infrequent users to access data without an intermediary and without extensive practice. Even the person who frequently uses such a system will be exas­perated if it cannot explain why it misunderstands an input.</p><p>In some circumstances, ill-formed input may be less frequent. For instance, Fineman (1983) reports that in an experiment where users were constrained to <i>dis­crete speech, </i>ill-formedness occurred in as little as 2% of the input. Another unusual environment can be created by informing the users that the system cannot really understand natural language, thus biasing lan­guage use.</p><p>In addition to natural language interfaces, process­ing ill-formed input is critical to correcting language use. Prototype systems have already been investigated in the language-learning environment (Weischedel et al. 1978) and in the office automation environment of document preparation (Miller et al. 1981). Even in published text unintentional ungrammaticalities occur.</p><p>Most natural language understanding systems deal with a few types of ill-formedness. Out of our own work, and that of others, we have produced a frame­work for processing ill-formed input. This approach treats ill-formedness as <i>rule-based. </i>First, natural lan­guage interfaces should process all input as presumably well-formed until the <i>rules </i>of normal processing are violated. At that point, error handling procedures based on <i>meta-rules </i>relating ill-formed input to wellformed structures through the modification of the vio­lated normal rules should be employed. These meta­rules correspond to types of errors.<footnote anchor="3"/></p><footnote label="3">The purpose of these meta-rules is therefore quite distinct from that of Gawron (1982).</footnote><p>The rest of the paper argues for this rule-based approach. Section 2 characterizes both the types of ill-formed input, and the types of possible approaches to them, including our proposal. Section 3 gives ex­amples of meta-rules for processing ill-formed input. Section 4 describes how some heuristics developed by others fit within our paradigm. An implementation is sketched in Section 5. Section 6 discusses limitations of the proposal. Sections 7 and 8 present directions for future work and conclusions.</p></section><section number="2." title="Approaches to Ill-formedness"><p>This section introduces the problem of interpreting ill-formed input. First, we discuss the types of ill-formed input briefly. Then we consider the range of approaches that have been tried for allowing for such input.</p><p>Ill-formedness phenomena can be divided into two sets. The first defines what we call <i>absolute ill-formedness. </i>An utterance is absolutely ill-formed if the typical listener considers it ill-formed. The defini­tion unfortunately appeals to subjective evaluations; these are known to differ widely (Ross 1979). But it seems to include the majority of typical cases and exclude the majority of types of good English sen­tences.</p><p>The second set defines <i>relative ill-formedness. </i>This is ill-formedness with respect to the normal processing rules of the formal computing system including the natural language interface and the underlying applica­tion system. The set of ill-formed inputs for an inter­face can be defined as the union of these two sets for that interface.</p><p>The set of ill-formed input captured by these defi­nitions can also be seen through the four typical phas­es of interpretation in natural language interfaces: lexical, syntactic, semantic, and pragmatic processing. In lexical processing, absolute ill-formedness can come from misspelling, mistyping, and mispronunciation; relative ill-formedness can arise from unknown words. In syntactic processing, absolute ill-formedness is seen in faulty subject-verb agreement, word order errors, omitted words, run-on sentences, etc; relative ill-formedness is seen in grammatical combinations of words that exceed the interface's grammar.</p><p>Semantic processing can be defined as the interpre­tation of the input in isolation. Knowledge of the task domain can be applied, but the context of input with respect to previous interactions and the state of the underlying computing system are only considered in pragmatic processing.</p><p>Absolute ill-formedness in semantics includes omit­ting needed information and violating of selectional restrictions.    Absolute ill-formedness in pragmatics includes breaking the rules of conversation, as when answering a question with a question, having presup­positions of the speaker fail, and failing to make clear an anaphoric reference.<page local="3" global="163"/> Relative ill-formedness in both cases includes "overshoot", requesting capabili­ties or information not covered by the system in its current state, and parenthetical expressions incompre­hensible to the system.</p><subsection number="2.1." title="Four alternative approaches to ill-formedness"><p>There are at least five approaches one can take to ill-formedness. This section outlines the four alterna­tives to the approach we have formulated; our ap­proach is covered in Section 2.2. In describing the five approaches, we use the following informal nota­tion. SYSTEMjs] refers to a system designed to proc­ess a set of sentences s. WELL-FORMED is a set of well-formed utterances; ILL-FORMED is a set of ill-formed utterances. Naturally, an approach that covers the broadest range of linguistic behaviour should be preferred.</p><p>One alternative is to treat the processing of ill-formed and well-formed inputs identically, by ignoring constraints. That is, one designs SYSTEMfWELL-FORMED U ILL-FORMED]. Schank et al. (1980) and Waltz (1978) have taken this approach toward gram­matical constraints. CASPAR (Hayes and Carbonell 1981) exhibits this approach for grammatical con­straints as well. Since there is much redundancy in language, the practice of not using certain constraints will often work. However, this will fail on many ut­terances, since it ignores rules that not only constrain search but also eliminate unintended interpretations. One can see this by considering subject-verb agree­ment, a grammatical constraint that people sometimes violate and that is often left out of natural language systems. Though other constraints, such as semantic (selection) restrictions between a verb and its subject, often indicate the intended interpretation, it is easy to think of examples where subject-verb agreement is crucial to understanding. Comparing examples (1) and (2) below, subject-verb agreement is crucial to determining whether the company or assets were pur­chased.</p><p>(2) List the assets of the company that were pur­chased by XYZ Corp.</p><p>A second approach is to build systems for well-formed input and for ill-formed input together; that is, one designs SYSTEM[ILL-FORMED] merged with SYSTEM!WELL-FORMED]. Unlike the first approach, well-formedness constraints are employed on well-formed input. LUNAR (Woods et al. 1972), an early English interface to a question-answering system, and</p><p>SOPHIE (Burton and Brown 1977), an intelligent tu­toring system with an English interface, both used this approach. The problem with this approach is that it does not reflect the fact that constraints indicate pref­erences in interpretation. For instance, though exam­ple (3) below has two legitimate syntactic interpreta­tions, the one that violates our model of the world is rejected, causing us to reject the "garden path" inter­pretation.</p><p>(3) I saw the Statue of Liberty flying to New York.</p><p>As another example, the two pronouns in "He shot him" are normally considered to refer to different people; the alternative that the speaker meant "He shot himself" does not arise unless there are strong expectations ahead of time that that is the correct proposition.</p><p>A third approach is to build two systems, but to use SYSTEM[ILL-FORMED] only if SYSTEMfWELL-FORMED] finds no interpretation. A commercially available English interface to data bases (Harris 1977) has taken this approach. The EPISTLE project (Jensen and Heidorn 1983, Miller et al. 1981) employs this alternative for grammatical violations. DYPAR (Carbonell et al. 1983) has taken this approach in an interface to an expert system. Kaplan (1978) devel­oped a strategy to give more useful responses when a data base query yields a negative response, for exam­ple, when no entity satisfies the desired conditions. Chang (1978) created a heuristic for inferring missing joins in incomplete queries to relational data bases. The defect in this model is that there is no means of relating strategies for processing ill-formedness explic­itly to the strategies for processing well-formedness. We argue here that one can explicitly relate the two classes of strategies.</p><p>A fourth approach is to build only one system, SYSTEM[WELL-FORMED], and to employ a metric to measure how far a postulated interpretation is from satisfying all well-formedness constraints. Charniak (1981) has advocated this for grammatical processing; Wilks (1975) has made this the basis of semantic processing during the interpretation phase. Of course, the notion of weighing alternatives and using metrics has been used for phenomena other than ill-formed­ness, such as parsing (Robinson 1982) and speech understanding (Walker 1978, Woods et al. 1976). Clearly, ranking alternative interpretations is neces­sary. However, if one relies <i>solely </i>on a metric and SYSTEMfWELL-FORMED], then an account of the fact that the ill-formedness often has specific implications is still needed. In example (4), the selection restriction that "like" requires animate agents is violated; a rea­sonable inference is that the speaker somewhat per­sonifies the computer in question.</p><p>(4) My home computer doesn't like to run BASIC.</p><page local="4" global="164"/><p>Nor does a metric reflect the fact that there are clear patterns of error, such as those that have been report­ed in linguistic studies (Fromkin 1973) and in applica­tion studies (Thompson 1980, Eastman and McLean 1981).</p><table caption="Table I summarizes these four approaches.2.2.Our approach"></table><p>Based on previous work, both our own and that of others, we propose a framework employing meta-rules to relate the processing of ill-formed input to well-formedness rules. This framework may be stated as follows:</p><p>1. Process the input using S YSTEMfWELL-FORMED].</p><p>2. If no interpretation is found by SYSTEM [WELL-FORMED], apply a meta-rule to the well-formedness rules, based on a ranking of the alter­natives, in order to</p><p>a) diagnose the problem, that is, the rule that is violated and how it is violated, b) relax the rule,</p><p>c) add a "deviance note" to the interpretation re­cording the violation,</p><p>d) resume processing via the well-formedness rules, if possible.</p><p>3. Repeat step 2 as necessary.</p><p>Each meta-rule should correspond to a pattern of ill-formedness and should account for utterances corre­sponding to only that pattern. SYSTEMflLL-FORMED] is therefore implicit in the meta-rules.</p><p>This framework has advantages lacking in one or more of the other approaches. Well-formedness con­straints, whether syntactic, semantic, or pragmatic, are employed  to  eliminate  unintended interpretations.</p><p>Well-formed interpretations are always preferred. Ill-formedness processing is explicitly related to the well-formedness rules. Only the constraint that seems to be violated is relaxed; all other well-formedness constraints are still effective. Furthermore, the devi­ance notes record the aspect that deviates from well-formedness, thus allowing pragmatic inferences by later processing.</p><p>In the next two sections, we propose a handful of primitives for syntactic and semantic problems and also propose a formalism for writing meta-rules. As supporting evidence, we state meta-rules for a number of problems and describe approaches for several oth­ers. These phenomena include the following:</p><p>failed grammatical tests, word confusions, spelling errors, unknown words, restarted sentences, resumptive pronouns and noun phrases, contextual ellipses, selection restriction violation, metonymy, personification, and presupposition failure.</p></subsection></section><section number="3." title="Examples of Meta-rules"><p>To further argue for meta-rules as a uniform frame­work for processing ill-formed input, we describe a wide variety of meta-rules in this section and the next. We adopt the following notation for meta-rules in this paper:</p><p><b>Approach </b>1</p><p>Characterization: Do not encode well-formedness constraints.</p><p>Flaw: Well-formedness rules convey meanings by constraining interpretations.</p><p><b>Approach 2</b></p><p>Characterization: Design systems for well-formed input and ill-formed input together.</p><p>Flaw: This gives no preference of well-formed interpretations over ill-formed ones.</p><p><b>Approach </b>3</p><p>Characterization: Search for well-formed interpretations prior to considering any ill-formed ones.</p><p>Flaw: This does not explicitly relate handling ill-formedness to processing well-formedness.</p><p><b>Approach </b>4</p><p>Characterization: Use a metric to rank ill-formedness interpretations, and select the one that comes closest to satisfying all constraints.</p><p>Flaw: This does not state what the deviation is so that one may draw inferences from the ill-formedness, nor does it capture actual patterns of error.</p><p><b>Table </b>1. Four Rejected Approaches.</p><page local="5" global="165"/><doubt alpha="35.7" length="28" tooSmall="False" monospace="0.0">Cl C2 ... Cn -&gt; Al A2 ... Am</doubt><p>The left-hand side (LHS) diagnoses what the problem might be; the right-hand side (RHS) states how to relax the failed constraint. The Ci are conditions on the computational state of the system; all must be true if the meta-rule is to apply. The Ai are actions stating how to rewrite the violated constraint and resume processing; all will be executed if the rule applies. Many of the actions we suggest here can be viewed as rewriting a rule of the normative system, for example, a grammar rule or case frame. However, some are more appropriately viewed as changing the computa­tional state when blockage occurs; an example is cor­recting the spelling of a word. In Section 5 we will argue that it is best to implement all of the actions as modifications of a blocked alternative.</p><p>Naturally, in rewriting a rule, pattern-matching and substitution are fundamental. We adopt the following definition of patterns. A pattern is a LISP s­expression. Atoms preceded by a question mark are variables. Expressions preceded by a dollar sign are evaluated using the LISP rules; their values are treated as patterns.<footnote anchor="4"/> If a period appears before a pattern vari­able that is the last item in a list, that pattern variable matches the tail of a list. All other items in patterns are literals. The scope of a pattern variable is the whole meta-rule. The first time a variable is bound in a meta-rule, it retains the binding throughout the rule.</p><p>Potentially, there may be many places where relax­ation can occur. If a meta-rule applies to more than one configuration, it will be applied to each in turn, creating a list of possibilities for processing after re­covery is complete. Consequently, the meta-rules will refer to only one failed configuration at a time.</p><subsection number="3.1." title="Meta-rules related to syntax"><p>First, let us consider meta-rules dealing with the gram­mar. Many of our examples here are reformulations of our earlier work (Weischedel et al. 1978, Kwasny and Sondheimer 1979, Weischedel and Black 1980) within the uniform framework of meta-rules. All meta-rules pertaining to syntax should have a first condition which is (SYNTAX-FAILED?); this is true iff the parser is blocked. Since all rules in this section would con­tain that predicate, we will not include it in the exam­ples.</p><p>Many syntactic formalisms have a similar frame­work for expressing rules: these include context-free grammars, augmented phrase structure grammars (Heidorn 1975), Programmar (Winograd 1972), the linguistic string parser (Sager 1981), Lifer (Hendrix et al. 1978), and augmented transition networks (ATNs) (Woods 1970). In fact, all of these can be viewed as formally equivalent to ATNs, and we will describe our techniques in that framework.</p><p>Figure 1 gives several predicates that should be useful in the LHS of meta-rules. The LHS of the meta­rule is matched against the environment in which an ATN arc failed. The environment is called a configu­ration and includes the current ATN state, the arc, all ATN registers, and the remainder of the input string.</p><footnote label="4">The expression $expr could be implemented as (*EVAL* expr). The pattern variable ?atom could be implemented as (*VAR* atom).</footnote><table class="main" frame="box" rules="all" border="0" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(IN-STATE? state)</p></td><td class="cell"><p>Did the configuration block in state?</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(CAT? category)</p></td><td class="cell"><p>Is the current word in category?</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(WRD? list)</p></td><td class="cell"><p>Is the current word a member of list?</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(NEXTCAT? category)</p></td><td class="cell"><p>Is the word after the current one in category?</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(NEXTWRD? list)</p></td><td class="cell"><p>Is the word after the current one in list?</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(FAILED-TEST? pattern)</p></td><td class="cell"><p>Is the pattern a predicate expression in the test of the arc and did both the expression</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>and the test evaluate to false?</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(FAILED-ARC? pattern)</p></td><td class="cell"><p>Does the failed arc match pattern?</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(HOLDLIST-NOT-EMPTY?)</p></td><td class="cell"><p>Is the hold list empty?</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(CONFUSION-WORD? x)</p></td><td class="cell"><p>Is x a word frequently confused with another? If so, the related word is returned.</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p><b>Figure 1. </b>Useful Conditions for Syntactic Meta-rules.</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><page local="6" global="166"/><p><b><i>State Arcs </i></b><b>(Figure 3.)</b></p><p>S/ (PUSH NP/ T ...</p><p>(SETR SURFACE-SUBJECT *) (* We think this is the subject) (TO S/NP)) (VIR NP T (SETR SURFACE-SUBJECT *) (* In relative clauses, this identifies a noun phrase from the hold list as surface subject) (TO S/NP))</p><p>S/NP (CAT VERB (SUBJECT-VERB-AGREE? (GETR SURFACE-SUBJECT) *) (* The predicate SUBJECT-VERB-AGREE? is true iff the number and person of the subject and verb are</p><doubt alpha="56.4" length="39" tooSmall="False" monospace="0.0">compatible) ... (SETR VERB *) (TO S/V))</doubt><p>S/V (JUMP S/POP (INTRANSITIVE-VERB? (GETR VERB))) (* Identifies a noun phrase from the hold list as the direct object in relative clauses) (AND (TRANSITIVE-VERB? (GETR VERB)) (NOT (REQUIRES-INDIRECT-OBJ? (GETR VERB)))))</p><doubt alpha="45.8" length="24" tooSmall="False" monospace="0.0">(CAT ADV T ... (TO S/V))</doubt><doubt alpha="58.2" length="67" tooSmall="False" monospace="0.0">(PUSH NP T (SETR OBJECT *)... (TO S/POP)) (VIR NP T (SETR OBJECT *)</doubt><doubt alpha="54.5" length="11" tooSmall="False" monospace="0.0">(TO S/POP))</doubt><doubt alpha="44.8" length="29" tooSmall="False" monospace="0.0">S/POP       (POP (BUILDQ ...)</doubt><p>NP/ (CAT PRO T ...</p><p>(SETR PRO *) (TO NP/POP)) (CAT DET T ...</p><doubt alpha="58.3" length="12" tooSmall="False" monospace="0.0">(SETR DET *)</doubt><p>(SETR NUMBER (GETNUMBER DET)) (TO NP/DET))</p><p>NP/DET    (CAT ADJ T ...</p><p>(* collecting adjectives before head noun) (TO NP/DET)) (CAT N T ...</p><doubt alpha="47.6" length="21" tooSmall="False" monospace="0.0">(SETR N *) (TO NP/N))</doubt><table class="main" frame="box" rules="all" border="0" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(EMPTY-HOLD)</p></td><td class="cell"><p>defines the hold list to be empty.</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(FAILED-CONSTRAINT pattern)</p></td><td class="cell"><p>adds the instantiation of the pattern to a list of violated constraints stored in the</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>configuration. The position of the parser within the input string will automatically be</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>recorded as well.</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(SUBSTITUTE-IN-ARC pattern 1</p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>pattern2)</p></td><td class="cell"><p>changes the arc in the failed configuration by replacing all expressions matching</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>pattern 1 by pattern2.</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(REPLACE-* x)</p></td><td class="cell"><p>makes x the current word in the blocked configuration.</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p><b>Figure 2. </b>Useful Actions for Syntactic Meta-rules.</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><page local="7" global="167"/><p>NP/N        (CAT N T ...</p><doubt alpha="66.7" length="90" tooSmall="False" monospace="0.0">(ADDR COMPOUND (GETR N)) (SETR N *) (* a possible nominal compound) (TO NP/N) (JUMP NP/POP</doubt><p>(DET&amp;NOUN-AGREE? (GETR NUMBER) (GETR N))</p><p>(* The predicate DET&amp;NOUN-AGREE? is true iff the determiner used is incompatible with the head noun) ... ) (PUSH RELCL/ T (SENDR TRACE (TRACE)) (* This sends a trace of a noun phrase to a relative clause) (SETR RELATIVE-CLAUSE *) (TO NP/POP))</p><doubt alpha="50.0" length="30" tooSmall="False" monospace="0.0">NP/POP    (POP (BUILDQ ...) T)</doubt><doubt alpha="55.4" length="56" tooSmall="False" monospace="0.0">RELCL/    (CAT RELPRO T (HOLD (GETR TRACE)) ... (TO S/))</doubt><p>(PUSH NP T (SETR SURFACE-SUBJECT *) (HOLD (GETR TRACE)) (* This allows for relative clauses where the subject is present) (TO S/NP))</p><p>(JUMP S/NP (SETR SURFACE-SUBJECT (GETR TRACE)) (* Allows for elided subjects in reduced relative clauses)) <b>Figure </b>3. A Simple ATN Graph.</p><p>An important action for the RHS is NEW-CONFIGURATION, which defines a new parser configuration, thus replacing the failed configuration that the meta-rule matched. It may take any number of arguments which set parts of the configuration. For example, SETR will define the new value of an ATN register. A list of useful arguments to NEW-CON­FIGURATION is given in Figure 2. Failed constraints fill the role of the deviance notes of Kwasny and Son­dheimer (1979). All parts of the failed configuration that are not explicitly changed in NEW-CONFIGURA­TION remain the same. Our implementation assumes that there is only one NEW-CONFIGURATION per meta-rule, though one could generalize this so that executing NEW-CONFIGURATION n times in a meta­rule gives n new configurations to replace the failed one. If a new configuration is generated, the parse can be resumed.</p><p>Figure 3 gives a trivial ATN which will be used for the sample meta-rules. The start state is S/. A list beginning with an asterisk in the actions of an arc is a comment.</p><subsubsection number="3.1.1." title="Simple grammatical tests"><p>Our earlier work showed how to relax tests that ap­pear on ATN arcs. In one study (Eastman and Mc­Lean 1981), subject-verb disagreement occurred in 2.3% of the English queries. Meta-rule (i) relaxes that agreement test. The new configurations here are the result of replacing the agreement test in each failed arc by the predicate T. Since a new configura­tion is generated, parsing is resumed using it. Though the substitution was trivial in this case, SUBSTITUTE-IN-ARC is a general pattern-matching and substitution facility. As an example, consider "A curious problem showing unusual conditions appear ..." A top-down, left-to-right parse using a grammar such as the one in Figure 3 would block at the word "appear". One of the blocked configurations would correspond to the agreement test failure in the arc leaving state S/NP; meta-rule (i) would apply, allowing the sentence to be parsed.</p><doubt alpha="61.6" length="73" tooSmall="False" monospace="0.0">(i)      (FAILED-TEST? (SUBJECT-VERB-AGREE? ?X ?Y)) -&gt; (NEW-CONFIGURATION</doubt><p>(FAILED-CONSTRAINT (SUBJECT-VERB-AGREE? ?X ?Y)) (SUBSTITUTE-IN-ARC (SUBJECT-VERB-AGREE? ?X ?Y) T))<page local="8" global="168"/></p><p>Another frequently occurring problem is omitting re­quired articles from count nouns. In the study by Eastman and McLean (1981) this occurred in 3.3% of all queries. In the grammar of Figure 3, blockage would occur at NP/N because of the test DET&amp;NOUN-AGREE? on an example such as "Print price of P27 over the last five years". Rule (ii) relaxes the test. When the parser starts on the new configura­tion, the modified test will be checked, verifying that no determiner is present. If none is, the message from FAILED-CONSTRAINT is available for error recovery.</p><p>The meta-rule approach allows for more sophisti­cated actions. Suppose that a linguistic study of utter­ances with missing determiners showed that a default assumption of definite reference is a good heuristic. In this case, one could simply add the action (SETR DET 'the) to the actions in NEW-CONFIGURATION.</p><p>One could argue that, in a data base environment, the grammar should simply treat omitted determiners as a normative construction. Even though determiners are frequently omitted in data base contexts, prefer­ring well-formed interpretations can eliminate some ambiguities in complex noun phrases such as "a ma­chine running programs". The determiner constraint suggests that "running programs" modifies the head noun "machine" rather than "machine" and "running" both modifying "programs".</p></subsubsection><subsubsection number="3.1.3." title="Confusion words"><p>A number of word pairs are frequently confused, such as homonyms and "good" for "well". Meta-rule (iii) allows for such errors, since REPLACE-* will modify the current word in the blocked configuration. MR-SETQ binds the value of its second argument to the pattern variable appearing as its first argument. Hence, "You performed good" would block at S/V, and the meta-rule would substitute "well" for "good".</p></subsubsection><subsubsection number="3.1.4." title="Resumptive pronouns"><p>Another kind of ill-formedness is resumptive pronouns and resumptive noun phrases. These occur in relative clauses where the entity referred to by the relative pronoun is improperly repeated in the relative clause as a pronoun or noun phrase. An example is "John's friend Mary married the man that she planned to mar­ry him", since there is no syntactic slot in the relative clause for the relative pronoun "that" to fill. A typi­cal ATN strategy for interpreting relative clauses is to put a place holder or trace on a "hold list"; the ATN processor prevents POPping from a level if the hold list is non-empty. That test prevents accepting clauses where traces are not used. Meta-rule (iv) provides for resumptive pronouns and resumptive noun phrases. One can imagine more complicated tests, since there are specific conditions (Kroch 1981) under which resumptive pronouns and resumptive noun phrases are more likely.</p><doubt alpha="63.6" length="66" tooSmall="False" monospace="0.0">(ii) (FAILED-TEST? (DET&amp;NOUN-AGREE? ?X ?Y)) --&gt; (NEW-CONFIGURATION</doubt><p>(SUBSTITUTE-IN-ARC (DET&amp;NOUN-AGREE? ?X ?Y) (NULL ?X)) (FAILED-CONSTRAINT (DETERMINER&amp;NOUN ?Y - MISSING DETERMINER))) (FAILED-CONSTRAINT (?X SUBSTITUTED FOR *))) --&gt; (NEW-CONFIGURATION (FAILED-CONSTRAINT (Resumptive Clause $?VALUE)) (EMPTY-HOLD)) --&gt; (PRINT-RESPONSE-PATTERN (I DO NOT UNDERSTAND YOUR USE OF THE VERB $(GETR VERB) /. WOULD YOU LIKE EXAMPLES OF WHAT I UNDERSTAND?)) (SELECTQ (READ)</p><doubt alpha="65.5" length="58" tooSmall="False" monospace="0.0">(iii) (MR-SETQ ?X (CONFUSION-WRD *)) —&gt; (NEW-CONFIGURATION</doubt><doubt alpha="57.1" length="14" tooSmall="False" monospace="0.0">(REPLACE-* ?X)</doubt><doubt alpha="61.8" length="76" tooSmall="False" monospace="0.0">(iv) (FAILED-ARC? (POP ?VALUE . ?Z)) (IN-STATE? S/POP) (HOLDLIST-NOT-EMPTY?)</doubt><doubt alpha="57.1" length="21" tooSmall="False" monospace="0.0">(v) (IN-STATE? S/POP)</doubt><doubt alpha="65.1" length="43" tooSmall="False" monospace="0.0">((YES Y) (PRINT-EXAMPLES (GETR VERB))) NIL)</doubt></subsubsection><subsubsection number="3.1.2." title="Omitted articles 3.1.2.   Omitted articles"><page local="9" global="169"/><p>(FAILED-SEMANTIC-TEST? pattern)      Is the pattern a predicate expression (on a constituent) and is the predicate false?</p><p>(MANDATORY-CASE-MISSING?) (TOO-MANY-FILLERS?) (SEMANTIC-CLASS? class) (MATRIX-TYPE? class) (VIEWABLE? x y z)</p><doubt alpha="66.7" length="12" tooSmall="False" monospace="0.0">(CASE? case)</doubt><p>Is a required case absent?</p><p>In trying to fill case, does it already have the maximum number assigned? Is class a semantic class predicate, for example, human?</p><p>Is the matrix to which we are trying to assign the current constituent in class?</p><p>Can entity x be viewed as a y in the context z? (This is a question for the pragmatic component.)</p><p>Is the constituent supposed to fill role case?</p><p><b>Figure </b>4. Predicates for Semantic Meta-rules.</p><p>(FAILED-CONSTRAINT pattern)       adds the instantiation of the pattern to a list of violated constraints stored in the configuration.</p><p>(SUBSTITUTE-IN-CASE patternl</p><p>pattern2)      replaces patternl by pattern2 everywhere in the constraint. (SUBSTITUTE-FOR-CASE case)        tries assigning the constituent as case.</p><p><b>Figure 5. </b>Actions for Semantic Meta-rules.</p></subsubsection><subsubsection number="3.1.5." title="Error messages"><p>Of course, the parser may not be able to recover at all due to either absolute or relative ill-formedness. Weischedel and Black (1980) presented a technique for associating error messages with states where the parser blocked. The only way to block in S/POP is if the verb complement expected for the main verb is not present. Meta-rule (v) could handle this simple case. Notice that this is a different class of meta-rule, for it does not resume computation. Naturally, such rules should be tried only after no other meta-rules are available. One could define different classes of meta­rules by appropriate declarations; alternatively, this class can be recognized easily, since none of the ac­tions resume processing.</p><p>This is not the only alternative in the face of failure to parse even with relaxation; Jensen and Heidorn (1983) present heuristics for what to pass to the se­mantic interpreter in this case, given bottom-up pars­ing.</p></subsubsection></subsection><subsection number="3.2." title="Meta-rules related to semantics"><p>In addition to these syntactic examples, semantic prob­lems can also be addressed within the formalism. If some semantic tests are included in the parser, say a certain arc test contains calls on the semantic compo­nent, specific semantic tests can be relaxed by the general mechanism we described for relaxing tests on ATN arcs.</p><p>Instead, suppose that semantic constraints are en­coded in a separate component. Semantic constraints may be expressed in several formalisms, such as se­mantic nets (Bobrow and Webber 1980a,b; Sondheimer et al. 1984), first-order logic, and production rules (for example, PROLOG, Warren et al. 1977). It is generally agreed that all are formally equivalent to first-order logic. For the purposes of this paper, we assume that the selection restrictions are encoded in first-order logic.</p><p>One of the most common designs for a semantic interpreter is based on selection restrictions and case frames (Bruce 1975). At least five kinds of con­straints may be violated:</p><p>1) what may fill a given case,</p><p>2) which cases are required for a complete constitu­ent,</p><p>3) which may have multiple fillers without conjunc­tion,</p><p>4) which are allowed for a given case frame, and</p><p>5) what order cases may appear in.</p><p>Figure 4 lists tests useful for diagnosing failures in such a semantic interpreter. Assume that any predi­cate on the semantic class of a constituent is encoded simply in LISP notation, for example, (HUMAN x) is true iff x is of class human. All meta-rules in this section can be assumed to include an initial test (SEMANTICS-FAILED?.)</p><p>For convenience, we have used the same names for some of the actions as in the syntactic cases (for ex­ample, FAILED-CONSTRAINT, NEW-CONFIGURA­TION, etc.). When implemented in a particular system, different names may be used, since the concept of configuration, blockage, etc., is usually different for the types of processing (for example, lexical, syntactic, semantic, and pragmatic). Figure 5 lists several ac­tions useful in semantic meta-rules.</p><page local="10" global="170"/><p>CONCRETE</p><p>INANIMATE</p><p>HUMAN ANIMAL <b>Figure 6. </b>A Fragment of a Semantic Hierarchy.</p><subsubsection number="3.2.1." title="Universal relaxation of semantic class"><p>Meta-rule (vi) is a very general rule. Assuming that semantic class tests are organized in a hierarchy, it states that the failed test is to be replaced by its par­ent in the hierarchy, yielding the next most general test.</p><p>An example of the use of meta-rule (vi) is "My car drinks gasoline". The restriction on the AGENT case could be the predicate ANIMATE. A fragment of a semantic hierarchy appears in Figure 6. In that, ANIMATE has a parent predicate of CONCRETE that would include cars. The failure of the initial sentence and the subsequent processing using the meta-rule would accept a sentence with the special deviance note identifying the semantic oddity.</p></subsubsection><subsubsection number="3.2.2." title="Personification"><p>In a way similar to our arguments against approach 4 in Section 2.1, we feel that general meta-rules such as (vi) will prove less valuable than specific rules. A particular test that can be relaxed is the requirement for a human; for instance, the verbs of saying and those of propositional attitude, such as "believe" and "think", normally have a restriction that their agent be human. Nevertheless, such a constraint is regularly violated through personification of pets, higher ani­mals, machines, etc.</p><p>Since personification is infrequent compared to the norm of descriptions designating humans, a case con­straint of "human" can trim the search space. Since personification conveys particular inferences (Lakoff and Johnson 1980), a relaxation rule that records the detected personification can trigger appropriate infer­ence processes. Figures of speech certainly are not absolutely ill-formed; we argue here that it is useful to treat them as relatively ill-formed.</p><p>Meta-rule (vii) is one simple relaxation for personi­fying animals. More specific ones may prove prefera­ble, if classes of personification are taxonomized.</p></subsubsection><subsubsection number="3.2.3." title="Metonymy"><p>There are at least seven classes of metonymy (Lakoff and Johnson 1980), including a part for the whole, the producer for the product, the object for its user, the controller for the controlled entity, the institution for the people responsible, the place for the institution, and the place for the event. This analysis suggests two kinds of strategies. A particular class of descrip­tions may occur in exactly the same linguistic environ­ments as their class of metonymous descriptions. For instance, institutions and people appear interchange­able as the logical subject of the verbs of saying and of propositional attitude. That can be encoded direct­ly in the case frames of those verbs.</p><p>However, many types of metonymy are conditioned on a highly specialized relationship. For instance, places can be used metonymously for events only if the speaker believes an event is identifiably associated with the location. For instance, compare the following examples:</p><p>Pearl Harbor caused us to enter the war.</p><p>*Fifth and Lombard caused us to reconsider graduated income taxes.</p><p>Therefore, a meta-rule such as (viii) seems appropriate to prefer the normal, but accept metonymous descrip­tions of events by places. In meta-rule (viii), we have assumed that there is a variable FILLER of the seman­tic interpreter that holds the constituent to be as­signed. Also, in the call to VIEWABLE?, CURRENT indicates that the pragmatic component should use its current context.</p></subsubsection><subsubsection number="3.2.4." title="Phrase ordering"><p>Failure in selectional restrictions can indicate other semantic errors. These include ordering problems, for example, "John killed with a gun Mary", and unex­pected prepositions, for example, "John killed Mary by a gun". The LHS of the appropriate meta-rules would begin with identification of selectional restriction fail­ures but would also include other tests. The RHS would change the assumed case. A rule for the first example is (ix). Here the assumption is that the or­dering problem will be first noted when "Mary" is tried as a time modifier. Using SUBSTITUTE-FOR-CASE postulates the constituent "Mary" to fill the object case and attempts to do so.</p><page local="11" global="171"/><p>--&gt; (NEW-CONFIGURATION</p><doubt alpha="63.2" length="57" tooSmall="False" monospace="0.0">(vi) (FAILED-SEMANTIC-TEST? (?Y ?Z)) (SEMANTIC-CLASS? ?Y)</doubt><doubt alpha="64.8" length="122" tooSmall="False" monospace="0.0">(FAILED-CONSTRAINT (?Y TOO RESTRICTIVE - USING PARENT $(PARENT-OF ?Y))) (SUBSTITUTE-IN-CASE (?Y ?Z) ($(PARENT-OF ?Y) ?Z)))</doubt><p>(vii) (FAILED-SEMANTIC-TEST? (HUMAN ?Z)) -&gt; (NEW-CONFIGURATION (FAILED-CONSTRAINT ((HUMAN ?Z) - ASSUMING PERSONIFICATION OF ANIMAL)) (SUBSTITUTE-IN-CASE (HUMAN ?Z)</p><p>(viii) (FAILED-SEMANTIC-TEST? (EVENT ?X)) (LOCATION FILLER) (VIEWABLE? FILLER 'EVENT 'CURRENT) -&gt; (NEW-CONFIGURATION</p><p>(SUBSTITUTE-IN-CASE (EVENT ?X) T) (FAILED-CONFIGURATION (METONYMY PLACE-FOR-EVENT FILLER))) (ix) (FAILED-SEMANTIC-TEST? (TIME ?Z)) (MATRIX-TYPE? 'CLAUSE) (CASE? 'TEMPORAL)</p><p>--&gt;  (NEW-CONFIGURATION  (FAILED-CONSTRAINT  (ORDERING PROBLEM - OBJECT CASE AS­SUMED))</p></subsubsection></subsection><subsection number="3.3." title="Generality of the approach"><p>Though we have experience in implementing our framework for ATN parsers only, we believe the framework to be applicable over a broad range of parsers. It assumes only that a "configuration" or "alternative" representing a blocked, partial interpre­tation can be stored, modified, and restarted. No as­sumption regarding the direction of processing (for example, left to right), the nature of search (for exam­ple, top-down vs. bottom-up), nor the class of problem (for example, lexical, syntactic, or semantic) is made. For instance, the design of an implementation for se­mantic meta-rules as in Section 3.2 is complete. The underlying semantic component is based on searching case frames breadth-first with both top-down and bottom-up characteristics. Except for the one meta­rule regarding incorrect phrase ordering in Section 3.2.4, the semantic meta-rules themselves are inde­pendent of whether proposing a phrase for a given case in a frame is based on syntactic considerations or other criteria (for example, Schank et al. 1980). Natu­rally, the primitive conditions and actions of a given set of rules will depend on a particular formalism. In the next section, we relate our framework to a variety of parsers and problems.</p></subsection></section><section number="4." title="Additional Supporting Evidence"><p>Many natural language interfaces have some heuristics for processing one or more classes of ill-formed input. Since an exhaustive analysis would be impossible here, we will review only a handful of techniques that have inspired us to develop the meta-rule framework. We describe each technique by showing how it could be phrased as a meta-rule within our paradigm.</p><p>The LADDER system (Hendrix et al. 1978) imple­ments three major techniques for processing ill-formed input. All fit within the framework we suggest. One deals with recovery from lexical processing. In this system, the developer of a question-answering system prepares only a dictionary of well-formed words. If a sentence contains a word that is not in the dictionary, the parser will fail. The system localizes the area of failure to the ATN state associated with the partial interpretation that has proceeded rightmost in the input and that is shallowest (in terms of incompleted ATN PUSH arcs). Candidates for the correct spelling are limited to the words that would permit the parser to proceed and that are close to the spelling that ap­pears. An equivalent meta-rule would check in the LHS that the parser failed. The RH S would compute a list of words expected next for each type of arc leav­ing that state, for example, the category members and literal words expected next. The next action would apply the Interlisp spelling correction algorithm to postulate a known word that was expected next. This word would replace the unrecognized one in the input and parsing would resume. A similar heuristic is run­ning in our current implementation, with the addition that, if the unrecognized word appears to have an inflected ending, spelling correction is performed on the possible root.</p><p>A second technique in LADDER deals with under­standing contextual ellipsis, if no parse for the input is found. This heuristic interprets "the fastest submarine" as "To what country does the fastest sub­marine belong", if it occurs after a query such as "To what country does each merchant ship in the North Atlantic belong". In Weischedel and Sondheimer (1982), we extended that heuristic to allow for turn-taking in dialogues and to allow expansions as well as substitutions, such as the elliptical form "Last month" following "Did you go to Chicago?<page local="12" global="172"/>"</p><p>A third technique in LADDER is the printing of error messages, in the same sense that meta-rule (v) above prints a message when all attempts have failed. We could phrase this heuristic as a meta-rule whose LHS would check that the parser has blocked. This meta-rule would be ordered strictly after the ones for spelling correction and contextual ellipsis. A state would be postulated as the locale of the problem by the same heuristic as for spelling correction. The RHS would print for each arc that leaves that state the cate­gory, constituent, or word that was expected by that arc.</p><p>Hayes and Mouradian (1980) emphasize recovery techniques for blocking during left-corner parsing (Aho and Ullman 1972). Their strategies are invoked only if the parser blocks. Two of them can be refor­mulated as meta-rules as follows. One meta-rule would check in its LHS that the parser was blocked and that a special parser variable (call it BLOCKED-PARSE) was empty. The RHS would save the blocked configuration in BLOCKED-PARSE, and start parsing as if the current word were the first word of the input. This would enable the system to ignore initial strings that could not be understood. A useful example of this is restarted inputs, such as "Copy all print all headers of messages". A second meta-rule is related. The LHS would check whether the parser was blocked and BLOCKED-PARSE had a configuration in it. Furthermore, the LHS would check to see that another parser variable (call it DONE-ONCE) was NIL. If so, the RHS would set DONE-ONCE to T. The RHS would then swap the current configuration with BLOCKED-PARSE and would try resuming the parse from the current word with that configuration. This heuristic is designed to ignore incomprehensible mate­rial in the middle of an input. For instance, it would enable skipping the parenthetical material in "List all messages, assuming there are any, from Brown".</p><p>In the area of pragmatics, solutions that could fit within our paradigm have been suggested for two classes of problems. One problem is the failure of presuppositions of an input. In the environment of an intelligent tutor for computer-assisted language in­struction, a technique suggested in Weischedel et al. (1978) could be formulated as a meta-rule as follows. The LHS would check whether processing was blocked due to a presupposition being false. Since that system would have a more complete knowledge of language than a beginning student of a foreign language, the system could treat the input as absolutely ill-formed. A sophisticated RHS could paraphrase the false pre­supposition for the student and indicate which word or syntactic construction was used inappropriately. Thus, the tutor could point out mistakes such as "Das Fraeu-lein ist Student", indicating that the student should look up the meaning of "Student" (which applies only to males).</p><p>Kaplan (1978) suggests an alternative heuristic for false extensional presuppositions in a data base envi­ronment. One can reformulate it as a meta-rule whose LHS would check that the query had requested a set as a response and that the set was empty. The RHS would compute queries corresponding to subsets that the original query presupposed would have a non­empty extension. The RHS would paraphrase the most general such query with an empty response set, report­ing to the user that the system knew of no such enti­ties.</p></section><section number="5." title="Implementation"><p>We implemented a grammatical meta-rule processor first for an ATN interpreter and more recently for an ATN compiler (Burton and Brown 1977). Our experi­ments have used RUS (Bobrow 1978), a broad-coverage grammar of English with calls to a semantic component to block anomalous interpretations pro­posed by the grammar.</p><p>Design and implementation of a meta-rule proc­essor for violation of semantic constraints is currently underway in two different semantic interpreters. In one, case constraints are expressed as sets of logical formulas; in the other, KL-ONE is used to encode case frames (Sondheimer et al. 1984).</p><p>Four design issues are considered in the following sections.</p><subsection number="5.1." title="Applying meta-rules"><p>The set of meta-rules dealing with the grammar or semantic system could be viewed formally as a func­tion f from a component's rules S to a new component's rules S'.</p><doubt alpha="33.3" length="9" tooSmall="False" monospace="0.0">f(S) = S'</doubt><p>S' is the transitive closure of applying every meta-rule pertaining to the system rules in every possible way. (Since it is the transitive closure, S is contained in S'). There are three alternatives. One is to compute S' and use it, rather than S, as the basis of processing, assuming that the transitive closure S' is a finite clo­sure. The second is to apply meta-rules only as need­ed, thus making S' a virtual system. The third alterna­tive is a combination of applying some meta-rules as needed and applying others in advance.</p><p>The first alternative is superficially similar to ap­proach 2 of Section 2.1, where ill-formedness process­ing is embedded in the normative system; however, S<footnote anchor="1"/>will maintain the preference for normal interpretations over ill-formed ones. We have rejected this alternative because of the combinatorial growth of rules needed for S<footnote anchor="1"/>. For instance, one can write meta-rules for handling relaxation of word categories and relaxation of predicates on ATN arcs.    Since both can occur throughout the grammar, they should not be expanded ahead of time. A similar argument is used to justify treating conjunction processing as a separate process rather than building it directly into the grammar (Woods 1973).<page local="13" global="173"/> Since the classes of ill-formedness can occur in combination, the number of relaxed rules in S' can be very large. Furthermore, since utterances where many, many combinations of errors occur should be rare, computing the transitive closure seems uncalled for.</p><p>The second alternative, generating a relaxed rule each time it is needed, is the one we implemented first in the context of an ATN interpreter. This alternative provides a kind of virtual system and avoids the in­creased memory necessary to hold S'.</p><p>The third alternative, applying some rules ahead of time and using others only as needed, offers the great­est flexibility and a variety of alternatives. We have implemented a version in which the underlying parser is the output of an ATN compiler. When the meta-rule processor applies a meta-rule at a given arc, the re­laxed version of the arc is compiled and saved.<footnote anchor="5"/> If the meta-rule is to be tried by the meta-rule processor at that arc again, the form of the relaxed arc need not be re-computed; it can simply be executed.</p><p>This third alternative also offers the potential of adapting the system to the idiosyncrasies of an individual's language and also the potential of extend­ing its own model of language. Obviously, this is an area for future research.</p><p>Alternatives two and three assume only that the processor applying well-formedness rules is able to store a "configuration" in a queue or agenda. No assumption about the type of processing (for example, bottom-up or top-down), nor the class of violated rule (for example, lexical, syntactic, semantic, or pragmat­ic) is necessary.</p></subsection><subsection number="5.2." title="What to store"><p>When a configuration blocks because of the well-formedness rules, should the blocked configuration be stored or the results of applying each relevant meta­rule? Both of the implementations in the ATN environ­ment save only the blocked configuration, namely, a blocked arc at the end of a path. The number of blocked configurations can be large. At present, there is insufficient evidence to determine whether a well-tuned set of meta-rules will yield a substantially larger (or smaller) number of relaxed configurations com­pared to the set of blocked configurations.</p><p>Some types of problems, for example, subject-verb agreement, may be so common, and some types of relaxation, for example, an unrecognized word, may be so diagnostically clear that the corresponding meta­rules should be applied immediately. In the case of subject-verb agreement, hand-compiling the meta-rule into the grammar may be appropriate (that is, writing an arc whose test is that subject-verb agreement failed and whose action places the new configuration on a queue that is tried only after all normal configurations have failed).</p><footnote label="5">The current implementation is limited somewhat; it saves the relaxed arc only if the RHS of the meta-rule modifies only the arc itself. Our misspelling meta-rule, for example, does not modify the arc at all, but rather the input string.</footnote></subsection><subsection number="5.3." title="Localizing the problem"><p>When processing ill-formed inputs, some means of ranking alternatives is appropriate, since the system must determine what is intended in the face of violat­ed constraints and possible error. Also, the number of relaxed configurations may be large, even with a set of well-tuned meta-rules designed to open the search space minimally.<footnote anchor="6"/> The ideal solution is that the ranking of alternatives should be based on syntactic, semantic, and pragmatic evidence, in addition to the diagnosis and recovery strategy.</p><p>The current implementation uses only some of those bases and employs a rather simple ranking. Since both grammatical constraints and selection re­strictions are employed while parsing with <b>RUS, </b>both syntactic and semantic evidence is used. Blocked con­figurations are ordered on the amount of input proc­essed; there is also a partial order on the meta-rules.</p><p>One of our students, Amir Razi, is designing an experiment to collect data on the performance of the system. The current system can be run in one of two modes: saving all blocked configurations or using only ones that proceeded rightmost in the input. One as­pect of the experiment is to determine the frequency with which the interpretations covering the most input in a left-to-right parse block at the true source of the problem. Some preliminary evidence (Weischedel and Black 1980) indicates that this heuristic frequently does indicate where the problem is, if the normative system is nearly deterministic, for example, because the grammar is a fairly constrained subset of English or because semantic criteria filter out parses that have no meaning in the application domain.</p><p>Our long-term goal is accurate determination of both the problem in an ill-formed utterance and what was intended. The current implementation represents the first step toward that by employing both syntactic and semantic evidence. We are investigating the use of pragmatic evidence for that purpose as well. In addition, we wish to explore techniques for examining both the left and right contexts of a blocked interpre­tation, for instance, by employing bottom-up process­ing.</p><footnote label="6">However, it is not clear whether the combinatorics alone for typical inputs will be a problem, given the rapid increase in proc­essor power/cost and the prospect of multi-processing.</footnote><page local="14" global="174"/><p>Hand-compilation of meta-rules as mentioned in Sec­tion 5.1 is just one way to pinpoint the configurations to which a meta-rule applies; another is providing an index from blocked configurations to the meta-rules that could apply. We have implemented a preproces­sor that builds an index from an ATN arc to the meta­rules that can apply to it. When loading the ATN grammar, our preprocessor localizes the syntactic meta-rules having IN-STATE?, FAILED-TEST?, and FAILED-ARC? in their LHS to the few arcs to which they could possibly apply. Clearly, if IN-STATE? is in the LHS, that meta-rule can apply to only the handful of arcs leaving one state. Since FAILED-ARC? and FAILED-TEST? require the arc to match a given pat­tern, meta-rules using these tests can be identified with the arcs satisfying those patterns.<footnote anchor="7"/> Such preproc­essing provides an index into the possible rules that apply to a blocked configuration, since the state and the arc will be part of the configuration. Furthermore, the pattern-matching operations in the LHS need not be repeated at run-time, since the preprocessor stores for each arc an altered form of the meta-rule (without the calls to the pattern matcher) and the bindings that pattern matching created.</p><p>Some meta-rules will not have any tests that local­ize their applicability; an example is the one for confu­sion words, which can appear almost anywhere. These are stored separately, and must be checked for any arc to which relaxation is to be tried.</p></subsection></section><section number="6." title="Limitations"><p>There are a number of points of caution. It should be clear that relaxation does not necessarily guarantee understanding. After all, relaxing any arc to (TST X T ...) will accept any word syntactically; yet that is no guarantee that the word will be understood. Relaxing constraints introduces additional potential for confu­sion.</p><p>What one classifies as "absolutely ill-formed" is clearly open to dispute, as Ross (1979) points out. Therefore, the system may classify something as ill-formed, ranking it behind other interpretations, even though the user views it as well-formed. We suspect that categorizing almost any particular constraint as normative could be the basis for argument. The crite­ria for deciding whether a constraint should be includ­ed in the normative system should include at least the following:</p><p>a) whether a native speaker would edit inputs that violate it,</p><p>b) whether violating the constraint can yield useful inferences,</p><footnote label="7">Of course, this preprocessing assumes that no patterns in the LHS contain a form Sexpr.</footnote><p>c) whether examples exist in which the constraint carries meaning,</p><p>d) whether the constraint, if classified as normative, trims the search space, and</p><p>e) whether a processing strategy for the constraint can be stated more easily as a modification of norma­tive processing, as in the case of conjunction (Woods 1973) or the case of contextual ellipsis in the data base environment ( and Sond­heimer, 1982).</p><p>Thus far we have considered only constraints that are associated with a single point in the processing, such as relaxing a single case frame or relaxing a single ATN arc. Obviously, this need not be the case if, for instance, word or phrase order is permuted. At pres­ent, we have no general way of dealing with such problems.</p></section><section number="7." title="Future Work"><p>The problems of processing ill-formed input require several substantial research efforts. One is collecting additional corpora to determine patterns of errors and their frequency of occurrence. This is particularly important for two reasons. First, the more detail un­covered on patterns of error, the tighter the meta-rules for relaxing constraints. In our experience, the effort to make relaxation procedures as constrained and ac­curate as warranted by the patterns of occurrence is highly worthwhile, not only in trimming the search space, but also in eliminating senseless interpretations. Second, the patterns of ill-formedness will depend on the user community and the modality of input. For instance, non-native speakers of a language make dif­ferent errors than native speakers. Typed input has a predominance of typographical/spelling errors; spo­ken input may have more restarted utterances.</p><p>As a correlate to the need for more corpora of ill-formed natural language, there is an obvious need to define highly specific heuristics (as meta-rules) to diagnose and recover from each type of ill-formedness. Some of the heuristics should involve clarification dialogue, another area for research.</p><p>There are many possible responses given a diag­nosed problem. Consider a simple problem: violation of selection restrictions. In German, the verb "fressen" presupposes that the one eating is an animal. To an input such as "Dieser Mann frisst oft",<footnote anchor="8"/> several recovery strategies could apply:</p><p>a) The selection restriction could be ignored.</p><p>b) The selection restriction could be generalized for future use.</p><p>c) The system could conclude that an error has occur­red, as in the aforementioned language learning environment.</p><p><b>"This man eats often."</b><page local="15" global="175"/></p><doubt alpha="58.3" length="48" tooSmall="False" monospace="0.0">5.4.   A meta-rule index5.4.   A meta-rule index</doubt><p>d) The system could engage in clarification dialog to determine whether the user intended to use that word.</p><p>e) The system could assume the user believes that the man referred to eats like an animal.</p><p>The conditions for selecting a strategy need to be studied. An explicit model of the user is needed for deciding the intent of the user and for appropriate recovery from ill-formedness.</p><p>Learning the idiosyncrasies of particular users and automatic extension of the system (based on detecting relatively ill-formed input) is very challenging. Some initial steps in this direction have been taken in Car­bonell (1979), but there is much to be done. A signif­icant aspect of the learning problem in this environ­ment is the substantial uncertainty about whether the system has the intended interpretation, and the effect on both the functional and time performance of the system as the abnormal is viewed as more normal (and the search space correspondingly grows).</p><p>For syntactic ill-formedness, pure bottom-up pars­ing is intuitively very appealing, since one has descrip­tions of what is present both to the left and the right of the problem(s). The EPISTLE project (Jensen and Heidorn 1983) is employing bottom-up parsing. The advantage of employing top-down strategies, including left-corner parsing strategies, is the strong expecta­tions available when a configuration blocks. Conse­quently, many relaxation strategies and systems in the literature (for example, Hendrix, et al. 1978; Kwasny and Sondheimer 1981; Weischedel and Sondheimer 1982) have been proposed and implemented in that framework. Use of bottom-up strategies offers inter­esting new classes of relaxation, such as rearranging constituents for ordering problems. It is not obvious how the combinatorics of bottom-up strategies will compare to those of top-down strategies. However, developing relaxation techniques for bottom-up proc­essing and extensive empirical studies comparing them to top-down are certainly needed.</p><p>One of the most critical problems is control. The need to relax the very rules that constrain the search for an interpretation is like opening Pandora's box. This affects not only the time required to understand an ill-formed input, but also ambiguity through the additional alternatives the system is prepared to ac­cept. There are several aspects to controlling this search. First, the well-formedness constraints should reflect strictly what is normative. Second, the relaxa­tion rules should be made as tight as warranted by patterns of ill-formedness in language use. Third, a partial order on the relaxations should be established. Fourth, not only syntactic constraints and selection restrictions should be used (as in our system) but also pragmatic information to suggest the most promising alternatives. We have begun research on how to use pragmatic knowledge in an information-seeking environment for this purpose; see Carberry (1983, 1984) and Ramshaw and Weischedel (1984). In the environ­ment of messages reporting events, Granger (1983) reports on using expectations based on stereotypical events for this purpose. Extensive empirical studies regarding effective control of the search space are needed.</p><p>The acid test for a framework, relaxation heuristics, and control strategies is not relaxing simple tests like subject-verb agreement or diagnosing obvious prob­lems like a word not in the dictionary. Rather the acid test is a wide spectrum of problems, including exam­ples like misspellings/typographical errors that result in a known word, because in this type of example, all of the local evidence can indicate that the incorrect word is perfectly correct. Trawick (1983) has initiat­ed work on such misspelling problems.</p></section><section number="8." title="Conclusions"><p>Ill-formed input cannot be ignored by natural language processing systems. This paper has suggested a uni­form framework for processing ill-formed input in the hope of providing a basis for standardizing work on ill-formedness.</p><p>Our framework has several advantages:</p><p>a) Well-formed interpretations are always preferred.</p><p>b) Ill-formedness processing is explicitly related to the well-formedness rules.</p><p>c) Only the constraint that seems to be violated is relaxed; all other well-formedness constraints are still effective for eliminating senseless interpreta­tions and trimming search.</p><p>d) Deviance notes record the aspect that deviates from well-formedness, thus allowing pragmatic inferences by later processing.</p><p>e) Though our approach is uniform, it permits encod­ing as much specific knowledge into the diagnosis and recovery procedure as one desires for highly specialized cases.</p><p>f) Though this paper has drawn most of its examples from ATN grammars and from case frame process­ing, as argued in Section 3.3., the framework is not dependent on a particular model of language proc­essing.</p><p>g) The framework should be applicable to lexical, syntactic, semantic, and pragmatic constraints.</p></section><section title="Acknowledgments"><p>The authors appreciate the helpful suggestions of Sudhir Advani, Robert J. Bobrow, Madeleine Bates, Sandra Carberry, Sheila Coyazo, Giorgio Ingargiola, Stan C. Kwasny, William Mann, William Mark, Geoff Pullum, Lance Ramshaw, Amir Razi, Richard L. Wex-elblat, and William A. Woods in discussions about the research.</p><p>Much of the coding for the implementation de­scribed was done by Amir Razi.</p><page local="16" global="176"/></section><references><p><b>Aho, A.V. and Ullman, J. 1972 <i>The Theory of Parsing, Translation, and Compiling, Vol. </i></b><b><i>I.</i></b><b><i> </i>Prentice-Hall, Englewood Cliffs, New Jersey.</b></p><p><b>Bates, M. 1976 Syntax in Automatic Speech Understanding. <i>American Journal of Computational Linguistics </i>Microfiche 45.</b></p><p><b>Bobrow, R.J. 1978 The RUS System. In Webber, B.L. and Bo-brow, R., eds., Research in Natural Language. BBN Report No. 3378. Bolt Beranek and Newman Inc., Cambridge, Massa­chusetts.</b></p><p><b>Bobrow, R.J. and Webber, B. 1980a PSI-KLONE: Parsing and Semantic Interpretation in the BBN Natural Language Under­standing System, <i>Proceedings of the 1980 Conference of the Canadian Society for Computational Studies of Intelligence. </i>CSCSI/SCEIO.</b></p><p><b>Bobrow, R.J. and Webber, B. 1980b Knowledge Representation for Syntactic/Semantic Processing. <i>Proceedings of the National Conference on Artificial Intelligence. </i>AAAI.</b></p><p><b>Bruce, B. 1975 Case Systems for Natural Language. <i>Artificial Intelligence </i>6(4): 327-360.</b></p><p><b>Burton, R.R. and Brown, J.S. 1977 Semantic Grammar: A Tech­nique for Constructing Natural Language Interfaces to Instruc­tional Systems. BBN Report No. 3587. Bolt Beranek and Newman Inc., Cambridge, Massachusetts.</b></p><p><b>Carberry, S. 1983 Tracking User Goals in an Information-Seeking Environment. <i>Proceedings of the National Conference on Artifi­cial Intelligence. </i>Washington, D.C.: 59-63.</b></p><p><b>Carberry, S. 1984 Understanding Pragmatically Ill-formed Input. <i>Proceedings of COLING 84.</i></b></p><p><b>Carbonell, J.G. 1979 Towards a Self-Extending Parser. <i>Proceed­ings of the 17th Annual Meeting of the Association for Computa­tional Linguistics. </i>La Jolla, California: 3-8.</b></p><p><b>Carbonell, J.G.; Boggs, W.M.; Mauldin, M.L.; and Anick, P.G. 1983 The XCALIBUR Project: A Natural Language Interface to Expert Systems. <i>Proceedings of the Eighth International Joint Conference on Artificial Intelligence. </i>Karlsruhe, West Germany: 653-656.</b></p><p><b>Chang, C.L.   1978   Finding Missing Joins for Incomplete Queries</b> <b>in Relational Data Bases.</b><b>    Research Report RJ2145. IBM</b></p><p><b>Research Laboratory, San Jose, California. Charniak, E.   1983   A Parser with Something for Everyone. In</b> <b>King, Margaret, ed.</b><b>, <i>Parsing Natural Languages. </i>Academic</b> <b>Ill-formed Input.</b><b>  <i>American Journal of Computational Linguistics</i></b> <b>7(4):</b><b> 257.</b></p><doubt alpha="65.2" length="89" tooSmall="False" monospace="0.0">Press, New York, New York. Eastman, CM. and McLean, D.S.   1981   On the Need for Parsing</doubt><p><b>Fineman, L. 1983 Questioning the Need for Parsing Ill-formed Inputs. <i>American Journal of Computational Linguistics </i>9(1): 22.</b></p><p><b>Fromkin, V.A., ed. 1973 <i>Speech Errors as Linguistic Evidence. </i>Janua Linguarum, Series maior 77. Mouton, The Hague.</b></p><p><b>Gawron, J.M.; King, J.; Lamping, J.; Loebner, E.; Paulson, E.A.; Pullum, G.K.; Sag, I.A.; and Wasow, T. 1982 The GPSG Linguistic System. <i>Proceedings of the 20th Annual Meeting of the Associaton for Computational Linguistics. </i>Cambridge, Massachu­setts: 74-81.</b></p><p><b>Granger, R.H.; Staros, C.J.; Taylor, G.B.; and Yoshii, R. 1983 Scruffy Text Understanding: Design and Implementation of the NOMAD System. <i>Proceedings of the Conference on Applied Natural Language Processing. </i>Santa Monica, California: 104­106.</b></p><p><b>Harris, L.R. 1977a ROBOT: A High Performance Natural Lan­guage Interface for Data Base Query. Technical Report TR 77-1. Department of Mathematics, Dartmouth College, Han­over, New Hampshire.</b></p><p><b>Harris, L.R. 1977b User Oriented Data Base Query with the ROBOT Natural Language Query System. <i>International Journal for Man-Machine Studies </i>9: 697-713.</b></p><p><b>Harris, L.R. 1978 The ROBOT System: Natural Language Pro­cessing Applied to Data Base Query.   <i>Proceedings 1978 Annual</i></b> <b><i>Conference, Association for Computing Machinery.</i></b><b><i> </i>Washington, DC: 165-172.</b></p><p><b>Hayes, P.J. and Carbonell, J.G. 1981 Multi-strategy Construction-Specific Parsing for Flexible Data Base Query and Update. <i>Proceedings of the Seventh International Joint Conference on Arti­ficial Intelligence. </i>Vancouver, BC: 432-439.</b></p><p><b>Hayes, P. and Mouradian, G. 1980 Flexible Parsing. <i>Proceedings of the 18th Annual Meeting of the Association for Computational Linguistics and Parasession on Topics on Interactive Discourse. </i>Philadelphia, Pennsylvania: 97-103.</b></p><p><b>Heidorn, G.E. 1975 Augmented Phrase Structure Grammars. <i>Proceedings of the Workshop: Theoretical Issues in Natural Lan­guage Processing. </i>Cambridge, Massachusetts: 1-5.</b></p><p><b>Hendrix, G.G.; Sacerdoti, E.E.; Sagalowicz, D.; and Slocum, J. 1978 Developing a Natural Language Interface to Complex Data. <i>ACM Transactions on Database Systems </i>3(2): 105-147.</b></p><p><b>Jensen, K.E. and Heidorn, G.E. 1983 The Fitted Parse: 100% Parsing Capability in a Syntactic Grammar of English. <i>Proceed­ings of the Conference on Applied Natural Language Processing. </i>Santa Monica, California: 93-98.</b></p><p><b>Kaplan, S.J. 1978 Indirect Responses to Loaded Questions. <i>Theo­retical Issues in Natural Language Processing 2. </i>University of Illinois at Urbana-Champaign.</b></p><p><b>Kroch, A.S. 1981 On the Role of Resumptive Pronouns in Amnes­tying Island Constraint Violations. <i>The Proceedings of the 17th Regional Meeting of the Chicago Linguistic Society.</i></b></p><p><b>Kwasny, S.C. and Sondheimer, N.K. 1981 Relaxation Techniques for Parsing Ill-formed Input. <i>American Journal of Computational Linguistics </i>7(2): 99-108.</b></p><p><b>Linde, C. and Labov, W. 1975 Spatial Network as a Site for the Study of Language and Thought. <i>Language </i>51(4): 924-938.</b></p><p><b>Malhotra, A. 1975 Design Criteria for a Knowledge-Based English Language System for Management: An Experimental Analysis. MAC TR 146. Project MAC, Massachusetts Institute of Tech­nology, Cambridge, Massachusetts.</b></p><p><b>Miller, L.A.; Heidorn, G.E.; and Jensen, K. 1981 Text-critiquing with the EPISTLE System: An Author's Aid to Better Syntax. <i>AFI PS Conference Proceedings, 1981 NCC. </i>AFIPS Press, Mont-vale, New Jersey: 649-655.</b></p><p><b>Ramshaw, L.A. and Weischedel, R.M. 1984 Problem Localization Strategies for Pragmatics in Natural Language Front Ends. <i>Proceedings of COLING 84.</i></b></p><p><b>Robinson, J.J. 1982 DIAGRAM: A Grammar for Dialogues. <i>Communications of the ACM </i>25(1): 27-46.</b></p><p><b>Ross, J.R. 1979 Where's English. In Fillmore, C.J.; Kempler, D.; and Wang, W.S-Y., eds., <i>Individual Differences in Language Ability and Language Behavior. </i>Academic Press, New York, New York: 127-163.</b></p><p><b>Sager, N. 1981 <i>Natural Language Information Processing: A Com­puter Grammar of English and its Applications. </i>Addison-Wesley, Reading, Massachusetts.</b></p><p><b>Schank, R.C.; Lebowitz, M.; and Birnbaum, L. 1980 An Integrat­ed Understander. <i>American Journal of Computational Linguistics </i>6(1): 13-30.</b></p><p><b>Sondheimer, N.K.; Weischedel, R.M.; and Bobrow, R.J. 1984 A Knowledge Representation for Semantic Interpretation. <i>Proceed­ings of COLING 84.</i></b></p><p><b>Thompson, B.H. 1980 Linguistic Analysis of Natural Language Communication with Computers. <i>Proceedings of the Eighth International Conference on Computational Linguistics. </i>Tokyo, Japan: 190-201.</b></p><p><b>Trawick, D.J. 1983 Robust Sentence Analysis and Habitability. Technical Report 5074:TR:83. Computer Science Department, California Institute of Technology, Pasadena, California.</b></p><p><b>Walker, D.E. 1978 <i>Understanding Spoken Language. </i>North-Holland, New York, New York.</b></p><p><b>Waltz, D.L. 1978 An English Language Question Answering System for a Large Relational Database. <i>Communications of the ACM </i>21(7): 526-539.</b></p><page local="17" global="177"/><p><b>Warren, D.H.D.; Pereira, L.M.; and Pereira, F. 1977 PROLOG -The Language and its Implementation Compared to LISP. <i>SIG-PLAN Notices </i>12(8): 109-115.</b></p><p><b>Weischedel, R.M. and Black, J. 1980 Responding Intelligently to Unparsable Inputs. <i>American Journal of Computational Lingustics </i>6(2): 97-109.</b></p><p><b>Weischedel, R.M. and Sondheimer, N.K. 1982 An Improved Heuristic for Ellipsis Processing. <i>Proceedings of the 20th Annual Meeting of the Association for Computational Linguistics. </i>Cam­bridge, Massachusetts: 85-88.</b></p><p><b>Weischedel, R.M.; Voge, W.M.; and James, M. 1978 An Artificial Intelligence Approach to Language Instruction. <i>Artificial Intelligence </i>10: 225-240.</b></p><p><b>Wilks, Y.A. 1975 A Preferential Pattern-Seeking Semantics for Natural Language Inference. <i>Artificial Intelligence </i>6: 53-74.</b></p><p><b>Wilks, Y. 1976 Natural Language Understanding Systems Within the AI Paradigm - A Survey and Some Comparisons. <i>American Journal of Computational Linguistics </i>Microfiche 40.</b></p><p><b>Winograd, T.    1972    <i>Understanding Natural Language. </i>Academic</b></p><p><b>Press, New York, New York. Woods, W.A.    1970   Transition Network Grammars for Natural</b> <b>Language Analysis.</b><b>    <i>Communications of the ACM </i>13(10):</b></p><p><b>591-606.</b></p><p><b>Woods, W.A. 1973 Progress in Natural Language Understanding -An Application to Lunar Geology. <i>AFIPS Conference Proceed­ings, 42. </i>AFIPS Press, Montvale, New Jersey: 441-450.</b></p><p><b>Woods, W.A.; Kaplan, R.M.; and Nash-Webber, B. 1972 The Lunar Sciences Natural Language Information System: Final Report. BBN Report No. 2378. Bolt Beranek and Newman Inc., Cambridge, Massachusetts.</b></p><p><b>Woods, W.A.; Bates, M.; Brown, G.; Cook, C; Klovstad, J.; Mak-houl, J.; Nash-Webber, B.; Schwartz, R.; Wolf, J.; and Zue, V. 1976 Speech Understanding Systems: Final Report. Volumes 1-5. Bolt Beranek and Newman Inc., Cambridge, Massachu­setts.</b></p></references></body></article>