<?xml version="1.0"?><!DOCTYPE article SYSTEM "/project/take/software/searchbench_offline_processing/paperxml_generator/aclextractor/src/python/../resource/dtd/paperxml.dtd"><article><header><firstpageheader><page local="1" global="402"/><title>Maintaining Consistency and Plausibility in Integrated Natural Language Understanding</title><author surname="Nishida" givenname="Toyoaki"><org  name="Kyoto University" country="Japan" city="Kyoto"/></author><author surname="Liu" givenname="Xuemin"><org  name="Kyoto University" country="Japan" city="Kyoto"/></author><author surname="Doshita" givenname="Shuji"><org  name="Kyoto University" country="Japan" city="Kyoto"/></author><author surname="Yamada" givenname="Atsushi"><org  name="Kyoto University" country="Japan" city="Kyoto"/></author></firstpageheader><frontmatter><p>Maintaining Consistency and Plausibility in 'Integrated Natural</p><p>Language Undeistanding</p><p><b>Toyoaki Nishida, Xuemin Liu, Shuji Doshita, and Atsushi Yamada</b></p><p><b>Department of information Science Kyoto University Sakyo-ku, Kyoto 606, Japan</b></p><p>phone: 81-75~751-?,lll ext. 5396 email: nishida%doshita.kuis.kyoto-u.junet%j apan@relay.cs.net</p></frontmatter><abstract>in tliis paper, we present an inference mecha­nism called the <b><i>integrated parsing engine </i></b>which provides a uniform abd active inference mech­anism for natural language understanding. It can (1) make plausible assumptions, (2) rea­son with multiple alternatives, (3) switch the search process to the maximally plausible alter­native, (4) detect contradiction and tame con-clutions which depend on inconsistent assump­tions, and (5) update plausibility factor of each belief based on new observations. We demon­strate that a natural language understanding system using the integrated parsing engine as a subsystem can pursue a guided search for most plausible interpretation by making use of syn­tax, semantics, and contextual information. </abstract></header><body><section number="1" title="Introduction"><p>Natural language understanding involves lots of hard issues such as various types of ambiguities, indeterminacies caused by ellipses or fragmental utterances, or ill-formedness. Being confronted with these difficulties, it does not seem reason able to seek for a method of <b><i>logically </i></b>deducing the speaker's intended meaning or plan from utterances. Instead, it is much more natural to characterize natural language understanding as an abd active process of exploring most plausi­ble interpretation which can explain given ut­terances.</p><p>In this paper, we present an abductive in­ference mechanism, called the <b><i>integrated pars-</i></b> <b><i>ing engine, </i></b>for natural language understanding. The integrated parsing engine is able to:</p><p>• make plausible assumptions at appropriate time</p><p>e reason with multiple alternatives based on different sets of assumptions</p><p>• switch the search process to the maximally plausible alternative</p><p>• detect contradiction resulting from incon­sistent assumptions and eliminate all con­flations which depends on these assump­tions</p><p>® update plausibility factor of each belief based on new observations.</p><p>Thus, the integrated parsing engine is general enough to carry out linguistic and nonlinguis tic inferences in a uniform manner, by drawing information from various sources: syntax, se­mantic, discourse, pragmatics, or real world.</p><p>In the remainder of this paper, we first de­scribe mechanisms for maintaining consistency and plausibility. We then show how these two mechanisms interact to guide the inference pro cess. Finally, we use an implemented exam­ple to demonstrate how the integrated parsing engine is used to interpret sentences by taking contextual factors into account.</p></section><section number="2" title="Maintaining Consistency"><p>The (/ME (Consistency Maintenance Engine) is a. component of the integrated parsing engine responsible for maintaining consistency among beliefs.<page local="2" global="403"/> Basic design principles of the CME is based on de Kleer's ATMS (Assumption-based Truth Maintenance Engine) [de 86]. The CM]'] maintains a set of alternative be­liefs, each of which, consists of a set of as­sumptions and their conclusions, as follows:</p><doubt alpha="52.2" length="46" tooSmall="False" monospace="0.0">alternative t{All, • • • ?A\mx}B\11 • • •tBimi</doubt><doubt alpha="44.2" length="52" tooSmall="False" monospace="0.0">alternativen      {A„:t, .•^•J^nra,}Bn1.1 • •- ,Bnm^</doubt><p><b><i>cnvir onment conclusions </i></b>An external problem solver is assumed to exist which makes assumption, adds conclusion, and detects contradietion.</p><p>The main task of CME is to maintain alterna­tive beliefs by removing all alternatives whose net of assumptions has turned out contradic­tory. Like ATMS, the CME takes advantage of the following monotonie property:</p><p>if a contradiction is derived from a set of assumptions <b><i>A, </i></b>then contradiction is also derived from any set of assump­tions <b><i>B </i></b>such that <b><i>B D A.</i></b></p><p>Thus, if contradiction is derived from a set of assumptions {0S.D}, alternative interpreta­tions depending on sets of assumptions such as <b><i>{B, </i></b><i>C, </i><b><i>D), </i></b><i>{A,</i><i> </i><i>B,</i><i> </i>£&gt;}, <b><i>{A, </i></b>B, <b><i>Ct D\, </i></b>... are re­moved. In addition, the CME keeps records of contradictory sets of assumptions to prevent any interpretation depending on them from be­ing considered in future.</p><p>Unlike ATMS whose control regime is bread-first, our CME uses a tree called the environ­ment tree, or the <i>E-tree </i>for short, to guide the search process. Each node of the E-tree rep­resents an environment, a set of assumptions. Each arc of the E tree represents that a lower node is derived from the upper node by mak­ing one more assumption. Thus in figure 1, <i>Eq </i>is the root node, and it represents an environ-mnet without any assumption. Nodes below &amp;o represent environments with one or more assumption added to its parent node's envi­ronment. Thus, <i>Ei </i>-~ <i>Eq </i>U <b>M</b><b>i}</b><b> </b><b>=</b><b> </b><i>En ~ Et </i>U {An} = {/ii, /In}, and so on.</p><p>We assume that a set of assumptions made at the same parent node are mutually exclusive. Although this is a rather strong assumption, it makes sense in natural language understand­ing .since many assumptions being made dur­ing the natural language understanding process are mutually exclusive. Even if this is not the case, any set of assumptions can be transformed into a set of mutually exclusive assumptions by adding appropriate conditions. Although this is a cumbersome solution, it does not often take place in natural language understanding and most importantly it saves the amount of com­putation.</p><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">ho</doubt><doubt alpha="48.8" length="43" tooSmall="False" monospace="0.0">= {yli,^ii}-{Ai,Aini} ~{An,Anl} = {An,Anm,}</doubt><figure caption="Figure 1: The E-tree"></figure><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">E0</doubt><doubt alpha="41.2" length="17" tooSmall="False" monospace="0.0">Pi^^/r^~~^-J,K/Pi</doubt><doubt alpha="43.5" length="23" tooSmall="False" monospace="0.0">Ex={Ai}E2= {A2}En= {An}</doubt><doubt alpha="44.4" length="45" tooSmall="False" monospace="0.0">= {Ai,An}   ={Ai,/iini} ={A„,Anl}  = {An,AnmJ</doubt><figure caption="Figure 2: The E-tree with Conditional Proba­bilities"></figure><p>Note that the CME alone cannot determine which way to go when there is more than one possibility of extending the set of beliefs. This information is provided by the PME, as de­scribed in the next section.</p></section><section number="3" title="Maintaining Plausibility"><p>The PME (Plausibility Maintenance Engine) maintains estimations of how plausible each en­vironment is. This information is given as con­ditional probabilities and it is kept as annota­tions to each arc of the E-tree. Thus, in figure 2, which is a slightly more precise version of fig­ure 1, <b><i>pi </i></b>stands for <b><i>P(Ei), pij </i></b>for <b><i>P(Ej\Ai), Pijk </i></b>for <b><i>P{Ek\Ai, Aj), </i></b><i>etc.</i></p><p>It follows from the property of conditional probability that if <b><i>i -fi j </i></b>and <b><i>E, </i></b>and <b><i>Ej </i></b>are immediate children<page local="3" global="404"/></p><doubt alpha="26.3" length="19" tooSmall="False" monospace="0.0">P(Ei\...Ej...) = 0,</doubt><doubt alpha="33.3" length="3" tooSmall="False" monospace="0.0">4B3</doubt><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">E0</doubt><p><b>(a) initial B-tree</b> <b>Linguistic and Nonlinguistic Pioblem Solver</b> <b>(b) The E-tree after </b><i>-iEt </i><b>is observed.</b><b></b></p><doubt alpha="9.1" length="11" tooSmall="False" monospace="0.0">1/2^--"\l/2</doubt><p><b><i>E3</i></b><b><i> </i></b><b><i>E4</i></b><b><i> </i></b><b><i>Es</i></b><b><i></i></b>Figure 3: A Sample E-tree with Annotation</p><doubt alpha="62.0" length="50" tooSmall="False" monospace="0.0">of the same parent. Furthermore,P(Ei\..^Ej...)=,0,</doubt><p>if <b><i>Ej </i></b>is a parent node of .</p><p>Initial value of <b><i>p^s </i></b>are to be given from the external problem solver. The PME's role is to maintain estimation of plausibility by taking into account given observations. Currently we only take <b>-i </b>./■/', the event of environment <b><i>E </i></b>run­ning into contradiction, as an observation. We use a Bayes' law to modify <b><i>P(A) </i></b>into <b><i>P(A\-iE). </i></b>Thus, if <b><i>Ei </i></b>and <b><i>Ej </i></b>aie brothers, (l) is further simpli­fied to:</p><doubt alpha="62.5" length="8" tooSmall="False" monospace="0.0">P(EihE3)</doubt><doubt alpha="64.3" length="14" tooSmall="False" monospace="0.0">Pj^EAEj)•P(Ej)</doubt><doubt alpha="51.6" length="31" tooSmall="False" monospace="0.0">P(-,Ej)(1-PjEAPj)).P(Ej)l-P(Ej)</doubt><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">(1)</doubt><doubt alpha="50.0" length="8" tooSmall="False" monospace="0.0">l-P(Ej)-</doubt><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">(2)</doubt><p>For example, suppose it has turned out that environment <i>e4 </i>is in contradiction and hence <i>~&lt;Ei </i>is observed (figure 3(a)). The annotations to the E-tree are updated as in figure 3(b). Notice that the update of conditional proba­bility can be done based on local information.</p><p><b>Knowledge Base</b></p><p><b>Problem Solving Engine (PSE)</b></p><p><b>Working Memory</b></p><p><b>Associative Networks Previous Topic</b></p><p><b>The Integrated Parsing Engi</b></p><p><b>The Integrated Parsing Engine (CME)</b> <b>Plausibility Maintenance Engine (PME)</b></p><doubt alpha="66.7" length="6" tooSmall="False" monospace="0.0">"Ni Pi</doubt><p><b>E-true ft .</b></p><doubt alpha="80.0" length="5" tooSmall="True" monospace="0.0">Ik Hi</doubt><p>Figure 4:  The Structure of a Natural Lan­guage Understanding System with the lute grated Parsing Engine as a subsystem</p><p>4 Natural Language Un­derstanding System Us­ing the Integrated Pars­ing Engine as a Subsys­tem,</p><p>The integrated parsing engine consists of the CME and the PME. The architecture of a natu­ral language understanding system with the in­tegrated parsing engine as a subsystem is shown in figure 4.</p><p>The knowledge base contains various types of information for language comprehension, in eluding lexicon, morphology, syntax, semantics, discourse, pragmatics, commonsenses, and so on. The whole system is controled by the prob­lem solving engine (PSE). The PSE can access to the knowledge base and use the integrated parsing engine as an aid to seek for most plau­sible interpretation. Input texts are analyzed in a sentence-by-sentence manner. The discourse structure is maintained as a previous topic in the working memory.</p><p>When it scans a new sentence, the PSE first initialize the E-tree with only the root node. Then the PSE repeats the following cycle:</p><p>(step 1) choose a leaf node with the high est probability as a working enviri ornent (step 2) repeatedly derive conclusions from<page local="4" global="405"/></p><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">E0</doubt><p><b>the library the xerox the meeting room room room</b></p><doubt alpha="52.6" length="19" tooSmall="False" monospace="0.0">icy Ikey 2    key 3</doubt><figure caption="Figure 5: Sample. Dialog Environment"></figure><p>believed propositions until either (a) the goal is achieved,  (b) contra diction is derived, or (c) no more conclusion is derived unless malting more assumption.</p><p>In case (a), the process halts.</p><p>In case (b), the process is passed to the PME, which modifies current es­timation of plausibility so that this fact is reflected, then an alternative of mexhuuui plausibility is chosen and is suggested to the CME.</p><p>In case (c), the process also is passed to the PME, which assigns plausibil ity to new nodes, and working envi­ronment is chosen again.</p><p>The integrated parsing engine has been writ­ten in Lisp. It is running with a small exmeri-mental grammar for Japanese. The next section shows how it works.</p></section><section number="5" title="An Example"><p>Suppose a dialog environment in which a pro lessor speaks to a clerk to borrow a key of some rooms (figure 5) and utters the following Japanese sentence:</p><doubt alpha="60.9" length="23" tooSmall="False" monospace="0.0">(3) KA SH I TBKUDA SA Ï</doubt><doubt alpha="54.1" length="85" tooSmall="False" monospace="0.0">(a/the) key Object&gt;        lend       could you...?"could you lend (me) (a/the) key?"</doubt><doubt alpha="27.3" length="33" tooSmall="False" monospace="0.0">1/3^-^^2/3 { @ weird-1} {@word-2}</doubt><figure caption="Figure 6: E tree after assumptions@word-land@ word-2are made"></figure><p>The referential meaning of this sentence is ambiguous if there is more than one key in a given situation. Suppose three keys are there: <i>keyl</i><i> </i>for a library room, <i>key2</i><i> </i>for a xerox room, and <i>keyS</i><i> </i>for a meeting room.</p><p>Although sentence (3) is ambiguous in nor­mal contexts, it becomes much less so if it fol­lows sentences like:</p><doubt alpha="63.9" length="72" tooSmall="False" monospace="0.0">(4) HO N WO KO PI I SHI TA I NO DE SU GA "I'd like to xerox some books."</doubt><p>Even if no previous sentence is spoken, sen tence (3) is acceptable in a situation where the speaker and the hearer mutually believe that the xerox room is accessed so often that "the key" is usually used to refer to <i>key</i><i>2, </i>the one for the xerox room.</p><p>Note that the omission of the patient case does not matter in usual situations, since there is a strong default that the filler of this case is the speaker.</p><p>Now let us show how sentence (3) is ana­lyzed in a context where sentence (4) was pre­viously uttered. The task of analyzing input starts <b>from </b>recognizing words. Lots of ambi­guities arise in this phase. For sentence (3), <b>*KA' </b>might be a single word lKA' (postposi­tion marking interrogative) or a part of a longer word 'KAGI' (key). Since longer match is con­sidered to be more plausible in general case in Japanese analysis, we assign larger number of probability to the latter possibility. Following this analysis, the PSE makes the assumptions to the integrated parsing engine:</p><p><i>&lt;§&gt;word~j </i><b>(take the sequence 'KA' as a word): probability 1/3.</b></p><p><i>®\nord-2 </i><b>(take the sequence 'KAGI' as a word): =&gt; probability 2/3.</b></p><p>Accordingly, the CME extends the initial E-tree as in figure 6. Since, the environment <b><i>E\</i></b><b><i> </i></b>has the highest plausibility, the CME chooses it for the next environment and control is returned to the PSE.</p><doubt alpha="33.3" length="3" tooSmall="False" monospace="0.0">*U5</doubt><page local="5" global="406"/><doubt alpha="50.0" length="8" tooSmall="False" monospace="0.0">key I —■</doubt><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">key3-</doubt><p><i>the library ~ room</i> <i>xeroxing</i> <i>.</i><i>.the meeting</i>--<i>meeting</i></p><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">room</doubt><figure caption="Figure Y: An Associative Network between Concepts"></figure><p>Now the PSE tries to derive further conclu­sion in the chosen environment. After having recognized that the part of speech of the word 'KAGI' is noon, the PSE tries to find out the referent of the noun and realizes that three am­biguities arise in this situation. Again, the PSE calls the CME to make assumptions. At the same time, the PSE is called for to assign esti­mated conditional probabilities to each assump­tion.</p><p>Currently, the system uses an associative net­work as shown in figure <b><i>7 </i></b>to determine plausi­bility. Nodes of this network represent either a concept or an instance, and arcs mean that the two concepts or instants at its both ends have a certain relation. Those items which have dense connections to previous subjects are considered to be plausible as a referent. In our example, since the node <b><i>xerox </i></b>is marked as the previous subject <b><i>key 2 </i></b>is considered most plausible, while <b><i>keyl </i></b>is less plausible and <b><i>key3 </i></b>much less. Thus, the following assumptions are made: <footnote anchor="1"/></p><p><i>©referen1-l </i><b>(consider 'KAGI' to refer to </b><i>key]): </i><b>=&gt; probabiliy 1/3.</b></p><p><b>@re/eren&lt;-2 (consider 'KAGI' to refer to </b><i>key2): </i><b>=&gt; probabiliy 1/2.</b></p><p><i>@referen1-3 </i><b>(consider 'KAGI' to refer to </b><i>key3): </i><b>=&gt; probabiliy 1/6.</b></p><p>In case no previous utterance is given, the PSE will consult information given as <b><i>a priori </i></b>measurements.</p><doubt alpha="65.5" length="87" tooSmall="False" monospace="0.0">The E-tree now becomes as in figure 8, and {@word.~2,  @referent~2},  which is the most</doubt><footnote label="1">Currently we use a very simple algorithm for assign­ing those value: when there are three alternatives, the densest connection receives the value (1/3), the second (1/2), and the third (1/6), regardless of how closely they are related to each other. We plan to develop a much more precise method in a near future.</footnote><p><b>1/3.</b></p><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">Eo</doubt><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">2/3</doubt><doubt alpha="47.4" length="19" tooSmall="False" monospace="0.0">{@word-l&gt; {@word-2}</doubt><doubt alpha="41.4" length="29" tooSmall="False" monospace="0.0">{@word-2, {@word-2, {©word-2,</doubt><doubt alpha="56.8" length="44" tooSmall="False" monospace="0.0">@referent-l}       @referent-2} @referent-3}</doubt><figure caption="Figure 8: E-tree after assumptions about the referent of 'KAGI* (key) are made"></figure><p><b>meaning--2 <i>meaniüg-3 mcan'mg-4</i></b></p><p><b><i>referent-2-ßrcferent-2 </i></b><i>X</i> <b>meaning--<i>1</i></b> <b>word-J <i>word-2</i></b> <b>WO KA SHI TE KU DA SAÏ</b> <b>Notice that all part of this network is not explored in actual processing.</b><b></b></p><doubt alpha="100.0" length="1" tooSmall="True" monospace="0.0">I</doubt><figure caption="Figure 9: Dependency of Beliefs"></figure><p>plausible environment at this point, is chosen as the next environment. The analysis is contin­ued this way until the semantic representation is obtained for the whole sentence. The inter pretation obtained this case is:</p><doubt alpha="57.6" length="66" tooSmall="False" monospace="0.0">event    —   asking-for actor    =   &lt;the speaker&gt; object   = key2</doubt><figure caption="Figure 9 shows the dependency structure of be­liefs related to this analysis."></figure><p>Notice that the efficiency of the analysis is significantly improved when strong expectation exists. For example, although character <b>'SHï' </b>in sentence (3) has many possible interpretations in Japanese, the system is not annoyed by those ambiguities, since this part of the sentence just goes as expected. The system may come to sus­pect it only when most of its expectation fails.</p><doubt alpha="0.0" length="5" tooSmall="False" monospace="0.0">4 8 6</doubt><p><b><i>object-ctise-1 objcct-casv-2 </i>object-case-</b> <b>referent-1</b><page local="6" global="407"/></p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">1/2</doubt><doubt alpha="47.4" length="19" tooSmall="False" monospace="0.0">{@word-l) {@word-2}</doubt><doubt alpha="41.4" length="29" tooSmall="False" monospace="0.0">{@word-2, {@word-2, {@word-2,</doubt><doubt alpha="57.8" length="45" tooSmall="False" monospace="0.0">@referent,-l}      ©referent-2} @refererit-3}</doubt><p>addition, the integrated parsign engine provides a concise and high level mechanism for abd ac­tive reasoning. We have carefully chosen a set of reasonably high-level functions necessary for abductive reasoning. This serves, to much sim­plifying natural language understanding system than otherwise.</p><p>Figure 10: E-tree after assumptions about the proposed interpretation based on {@word-2, ©referent-2} is rejected</p><p>Now suppose the above interpretation is re­jected for some reason, say by explicitly negated by the speaker. Then the system will eventu­ally produce an alternative interpretation tak­ing <i>keyl</i><i> </i>as a referent, by changing annotations to the E-tree as in figure 10.</p></section><section number="6" title="Related Work"><p>This paper was inspired by a number of works. A massively parallel parsing by Waltz and Pollack <b>I</b>WP85] has demonstrated the effect of integration through a uniform computa­tion mechanism (marker passing) in context-dependent comprehension of discourse. They have pointed out the importance of non-logical, associative relation between concepts. Char-niak has pointed out the abductive nature of language comprehension. Charniak's Wimp [Cha86] uses a marker passing mechanism as a basis of abductive inference engine for lan­guage comprehension. But it is not used alone; it is augmented by a logical process called path proof. In a parser used in Lytinen's MOP-TRANS [Lyt86], a mechanism is provided to allow close interaction between syntax and se­mantics, while keeping the modularity of the system. Another thing to note is that Lytinen's integrated parser makes use of strong semantic expectation to constrain the search.</p><p>The integrated parsing engine presented in this paper takes advantages of these preced­ing works. Unlike Waltz and Pollack, and like Charniak and Lytinen, our integrated parsing engine has a hybrid architecture for logical and non-logical inferences. What is novel with our integrated parsing engine is the method of inte­grating and maintaining logical and non-logical information obtained from various source. In</p></section><section number="7" title="Concluding Remarks"><p>We have presented an inference engine for inte­grated natural language understanding, based on a characterization of natural language un­derstanding as an abductive process. The essence of our approach is connecting con­sistency maintenance engine and plausibility maintenance engine closely enough to allow their dense interaction. Although we have shown rather "low level" issues, we believe the same idea is applicable to "higher level" prob­lems such as inferring speaker's intention and plan.</p></section><references><p>[Cha86] Eugine Charniak. A neat theory of marker passing. <b><i>ïn. Proceedings </i></b><i>AAA</i><i> </i><b><i>l-</i></b><i>86, </i>pages 584-588, 1986.</p><p>[de 86] Johan de Kleer. An assumption-based has. <b><i>Artificial Intelligence, </i></b>28:127­162, 1986.</p><p>[Lyt86] Steven Lytinen. Dynamically combin­ing syntax and semantics in natural language processing. In <b><i>Proceedings </i></b><b><i>AAAI</i></b><i>-86, </i>pages 574-578, 1986.</p><p>[WP85] D. Waltz and J. B. Pollack. Massively parallel parsing: a strongly interactive model of natural language interpreta­tion. <b><i>Cognitive Science, </i></b>9:51-74, 1985.</p><doubt alpha="33.3" length="6" tooSmall="False" monospace="0.0">4 LI 7</doubt></references></body></article>