<?xml version="1.0"?><!DOCTYPE article SYSTEM "/project/take/software/searchbench_offline_processing/paperxml_generator/aclextractor/src/python/../resource/dtd/paperxml.dtd"><article><header><firstpageheader><page local="1"/><title>Word Sense Disambiguation and Text Segmentation Based on Lexical Cohesion</title><author surname="Okumura" givenname="Manabu"><org  name="Japan Advanced Institute of Science and Technology" country="Japan" city="Nomi"/></author><author surname="Honda" givenname="Takeo"><org  name="Nara Institute of Science and Technology" country="Japan" city="Nara"/></author></firstpageheader><frontmatter><p>Word Sense Disambiguation and Text Segmentation Based on Lexical Cohesion</p><p>OKUMURA Manabu, HONDA Taken School of Information Science, Japan Advanced Institute of Science and Technology (Talsunokuchi, Ishikawa 923-12 Japan) c- in ai 1 : { ok u, hone 1 a} (d&gt; j aist. ac. j p</p></frontmatter><abstract>In this paper, we describe how word sense am­biguity can be resolved with the aid of lexical co­hesion. By checking lexical cohesion between the current word and lexical chains in the order of the salience, in tandem with generation of lexical chains, we realize incremental word sense disant biguation based on contextual information that lexical chains,reveal. Next, we describe how seg­ment boundaries of a text can be determined with the aid of lexical cohesion. We can measure the plausibility of each point in the text as a segment boundary by computing a degree of agreement of the start and end points of lexieal chains. </abstract></header><body><section number="1" title="Introduction"><p>A text is not a mere set of unrelated sentences. Rather, sentences in a text are about the same thing and connected to each other[10]. <i>Cohesion </i>and <i>coherence </i>are said to contribute to such con­nection of the sentences. While coherence is a semantic relationship and needs computationally expensive processing for identification, cohesion is a surface relationship among words in a text and more accessible than coherence. Cohesion is roughly classified into <i>reference<footnote anchor="1"/>, conjunction, </i>and <i>lexical cohesion<footnote anchor="2"/>.</i></p><p>Except conjunction that explicitly indicates the relationship between sentences, the other two classes are considered to be similar in that the re­lationship between sentences is indicated by two semantically same(or related) words. But lexical cohesion is far easier to identify than reference be­cause both words in lexical cohesion relation ap­pear in a text while one word in reference relation is a pronoun or elided and has less information to infer the other word in the relation automatically.</p><footnote label="1">Reference by pronouns and ellipsis in Halliday and Hasan's classification^] are included here.</footnote><footnote label="2">Reference by full NPs, substitution and lexical cohe­sion in Halliday and Hasan's classification a.re included here.</footnote><p>Based on this observation, we use lexical cohe­sion as a linguistic device for discourse analysis. We call a sequence of words which are in lexical cohesion relation with each other a <i>lexical chain </i>like [10]. Lexical chains fend to indicate portions of a text that form a semantic unit. And so vari­ous lexical chains tend to appear in a text corre­sponding to the change of the topic. Therefore,</p><p>I . lexical chains provide a local context to aid in the resolution of word sense ambiguity;</p><p>2. lexical chains provide a clue for the determi­nation of segment boundaries of the text[10],</p><p>in this paper, we first describe how word sense ambiguity can be resolved with the aid of lexical cohesion. During the process of generating lex­ical chains incrementally, they are recorded in a register in the order of the salience. The <i>salience </i>of lexical chains is based on. their recency and length. Since the more salient lexical chain rep resents the nearby local context, by checking lexi­cal cohesion between the current word and lexical chains in the order of the salience, in tandem with generation of lexical chains, we realize incremen­tal word sense disambiguation based on contex ­tual information that; lexical chains reveal.</p><p>Next, we describe how segment boundaries of a text can be determined with the aid of lexical cohesion. Since the start and end points of lexical chains in the text tend to indicate the start and end points of the segment, we can measure the plausibility of each point in the text as a segment boundary by computing a degree of agreement of the start and end points of lexical chains.</p><doubt alpha="0.0" length="4" tooSmall="False" monospace="0.0">75.5</doubt><page local="2"/><p>Morris and IIirst[10] pointed out the above two importance of lexical cohesion for discourse analysis and presented a way of computing lexical chains by using Roget's International Thesaurus[15]. However, in spite of their mention to the importance, they did not present the way of word sense disambiguation based on lexical co­hesion and they only showed the correspondences between lexical chains and segment boundaries by their intuitive analysis.</p><p>McRoy's work[8] can be considered as the one that uses the information of lexical cohesion for word sense disambiguation, but her method docs not take into account the necessity to arrange lexical chains dynamically. Moreover, her word sense disambiguation method based on lexical co­hesion is not evaluated fully.</p><p>In section two we outline what lexical cohe­sion is. In section three we explain the way of incremental generation of lexical chains in tan­dem with word sense disambiguation and describe the result of the evaluation of our disambiguation method. In section four we explain the measure of the plausibility of segment boundaries and de­scribe the result of the evaluation of our measure.</p></section><section number="2" title="Lexical Cohesion"><p>Consider the following example, which is the English translation of the fragment of one of Japanese texts that we use for the experiment later.</p><p>In the universe that continues expand­ing, a number of stars have appeared and disappeared again and again. And about ten billion years after the birth of the universe, in the same way as the other stars, a primitive galaxy was formed with the primitive sun as the center.</p><p>Words {universe, star, universe, star, galaxy, sun} seem to be semantically same or related to each other and they are included in the same cat­egory in Roget's International Thesaurus. Like Morris and Hirst, we compute such sequences of related words(lcxical chains) by using a thesaurus as the knowledge base to take into account not only the repetition of the same word but the use of superordinates, subordinates, and synonyms.</p><p>We use a Japanese thesaurus 'Bunrui-goihyo'[l]. Bunrui-goihyo has a similar organi­zation to Roget's: it consists of 798 categories and has a hierarchical structure above this level. For each word, a list of category numbers which corresponds to its multiple word senses is given. We count a sequence of words which are included in the same category as a lexical chain. It might be clear that this task is computationally trivial. Note that we regard only a sequence of words in the same category as a lexical chain, rather than using the complete Morris and Hirst's framework with five types of thesaural relations.</p><p>The word sense of a word can be determined in its context. For example, in the context {universe, star, universe, star, galaxy, sun}, the word 'earth' has a 'planet' sense, not a 'ground' one. As clear from this example, lexical chains can be used as a contextual aid to resolve word sense ambiguity[10]. In the generation process of lexical chains, by choosing the lexical chain that the current word is added to, its word sense is determined. Thus, we regard word sense dis­ambiguation as selecting the most likely category number of the thesaurus, as similar to [16].</p><p>Earlier we proposed incremental disambigua­tion method that uses intrasentential informa­tion, such as selectional restrictions and case frames[12]. In the next section, we describe incre­mental disambiguation method that uses lexical chains as intersentential(contextual) information.</p></section><section number="3" title="Generation of Lexical Chains"><p>In the last section, we showed that lexical chains can play a role of local context. However, multi­ple lexical chains might cooccur in portions of a text and they might vary in their plausibility as local context. For this reason, for lexical chains to function truly as local context, it is necessary to arrange them in the order of the salience that indicates the degree of the plausibility. We base the salience on the following two factors: the re­cency and the length. The more recently updated chains are considered to be the more activated context in the neighborhood and are given more salience. The longer chains are considered to be more about the topic in the neighborhood and are given more salience.</p><p>By checking lexical cohesion between the cur­rent word and lexical chains in the order of the salience, the lexical chain that is selected to add the current word determines its word sense and plays a role of local context.</p><p>Based on this idea, incremental generation of lexical chains realises incremental word sense dis ­ambiguation using contextual information that lexical chains reveal. During the generation of lexical chains, their salience is also in­crementally updated.<page local="3"/> We think incremental disambiguation]<footnote anchor="0"/>^ is a better strategy, because a combinatorial explosion of the number of to tal ambiguities might occur if ambiguity is not resolved as early as possible during the analyt­ical process. Moreover, incremental word sense disambiguation is indispensable during the gen­eration of lexical chains if lexical chains are used for incremental analysis, because the word sense ambiguity might cause many undesirable lexical chains and they might degrade the performance of the analysis (in this case, the disambiguation itself).</p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">756</doubt><subsection number="3.1" title="The Algorithm"><p>First of all, a Japanese text is automatically seg­mented into a sequence of words by the morpho­logical analysis]!!]. From the result of the mor­phological analysis, candidate words are selected to include in lexical chains. We consider only nouns, verbs, and adjectives, with some excep­tions such as nouns in adverbial use and verbs in postpositional use.</p><p>Next lexical chains are formed. Lexical cohe­sion among candidate words inside a sentence is first checked by using the thesaurus. Here the word sense of the current word might be deter­mined. This preference for lexical cohesion inside a sentence over the intersenfential one reflects our observation that the former might be tighter.</p><p>After the analysis inside a sentence, candidate words are tried to be added to one of the lexi­cal chains that are recorded in the register in the order of the above salience. The first chain that the current word has the lexical cohesion relation is selected. The salience of the selected lexical chain gets higher and then the arrangement in the register is updated.</p><p>Here not only the word sense ambiguity of the current word is resolved but the word sense of the ambiguous words in the selected lexical chain can also be determined. Because the lexical chain gets higher salience, other word senses of the ambigu­ous words in the lexical chain which correspond to other lexical chains can be rejected. There fore, lexical chains can be used not only as prior context but also later context for word sense dis­ambiguation.</p><p>If a candidate word can not be added to the existing lexical chain, new lexical chains for each word sense are recorded in the register.</p><p>As clear from the algorithm, rather than the truly incremental method where the register of lexical chains is updated word by word in a sen­tence, we adopt, the incremental method where updates are performed at the end of each sentence because we regard intrasentential information as more important.</p><p>The process of word sense disambiguation us­ing lexical chains is illustrated in Figure 1. The most salient lexical chain is located at the top in the register. In the initial state the word <i>Wl </i>re­mains ambiguous. When the current unambigu­ous word <i>W2 </i>is added, the chain <i>b </i>is selected(top left). The chain <i>b </i>becomes the most salient (top-right). Here the word sense ambiguity of the word <i>WI </i>in the chain &amp; is resolved(bottom-left). If the word to be added is ambiguous( <i>W3), </i>the word sense corresponding to the more salient lexical chain(/7J.'//) is selected(bottom-right).</p></subsection><subsection number="3.2" title="The Evaluation"><p>We apply the algorithm to five texts. Table I shows the system's performance.</p><p>The 'correctness' of the disambiguation is judged by one of the authors. The system's per­formance is computed as the quotient, of the num­ber of correctly disambiguated words by the num­ber of ambiguous words minus the number of wrongly segmented words(morphological analysis errors)'<footnote anchor="1"/>.</p><p>Words that remain ambiguous arc those that do not form any lexical chains with other words. Kxeept by the errors in the morphological analy­sis, most of the errors in the disambiguation are caused by being dragged into the wrong context.</p><p>The average performance is 63.4 %. We think the system's performance is promising for the fol­lowing reasons:</p><p>I . Lexical cohesion is not the only knowledge source for word sense disambiguation and proves to be useful at least as a source sup­plementary to our earlier framework that used case frames[I2].</p><p>2. In fact, higher performance is reported in [16], that uses broader context acquired by</p><p>''The accuracy ot the morphological analysis will be im­proved by adding new word entries or the like.</p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">757</doubt><page local="4"/><doubt alpha="30.0" length="10" tooSmall="False" monospace="0.0">{W2,[ID2])</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">D</doubt><doubt alpha="57.1" length="14" tooSmall="False" monospace="0.0">hain bW1[ID11]</doubt><doubt alpha="60.0" length="25" tooSmall="False" monospace="0.0">fSiaincW1[ID12]^)^haind ^</doubt><doubt alpha="42.9" length="56" tooSmall="False" monospace="0.0">est of ambiguous words(W1,[ID11,ID12])   (...,[.......])</doubt><doubt alpha="60.9" length="23" tooSmall="False" monospace="0.0">^hain bW2[ID2] W^IDIlj)</doubt><doubt alpha="44.8" length="58" tooSmall="False" monospace="0.0">List of anibiguous^fords(W1,[ID11,ID12])   (...,[.......])</doubt><doubt alpha="44.4" length="27" tooSmall="False" monospace="0.0">(diain bW2[ID2] W1[ID11] ^)</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">^ha</doubt><doubt alpha="71.4" length="7" tooSmall="False" monospace="0.0">^hain d</doubt><p>List: of airbiguous words</p><p><u>v (-[.......)) ........</u></p><doubt alpha="29.4" length="17" tooSmall="False" monospace="0.0">(W3,(ID31,ID32])/</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">/</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">I</doubt><doubt alpha="44.4" length="27" tooSmall="False" monospace="0.0">J^hain bW2[ID2] W1[ID11] ~\</doubt><doubt alpha="55.6" length="9" tooSmall="False" monospace="0.0">^hain d ^</doubt><p>e
.st of anibitruous words <b>(...[.......« ........</b></p><figure caption="Figure 1: The process of word sense disambiguation"></figure><table caption="Table 1: The performance for the disambiguation" class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p>number of</p></td><td class="cell"><p>number of</p></td><td class="cell"><p>number of</p></td><td class="cell"><p>number of</p></td><td class="cell"><p>system's</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>text</p></td><td class="cell"><p>number of</p></td><td class="cell"><p>candidate</p></td><td class="cell"><p>ambiguous</p></td><td class="cell"><p>words that</p></td><td class="cell"><p>correctly</p></td><td class="cell"><p>performance</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>sentences</p></td><td class="cell"><p>words</p></td><td class="cell"><p>words</p></td><td class="cell"><p>remain</p></td><td class="cell"><p>disambiguated</p></td><td class="cell"><p>(%)</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p>ambiguous</p></td><td class="cell"><p>words</p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.l</p></td><td class="cell"><p>41</p></td><td class="cell"><p>481</p></td><td class="cell"><p>166</p></td><td class="cell"><p>7</p></td><td class="cell"><p>126</p></td><td class="cell"><p>87.5</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.2</p></td><td class="cell"><p>26</p></td><td class="cell"><p>197</p></td><td class="cell"><p>71</p></td><td class="cell"><p>13</p></td><td class="cell"><p>32</p></td><td class="cell"><p>51.6</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.3</p></td><td class="cell"><p>24</p></td><td class="cell"><p>212</p></td><td class="cell"><p>57</p></td><td class="cell"><p>12</p></td><td class="cell"><p>34</p></td><td class="cell"><p>64.2</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.4</p></td><td class="cell"><p>38</p></td><td class="cell"><p>433</p></td><td class="cell"><p>123</p></td><td class="cell"><p>19</p></td><td class="cell"><p>71</p></td><td class="cell"><p>60.1</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.5</p></td><td class="cell"><p>24</p></td><td class="cell"><p>163</p></td><td class="cell"><p>82</p></td><td class="cell"><p>11</p></td><td class="cell"><p>42</p></td><td class="cell"><p>53.8</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><page local="5"/><p>training on large corpora, but our method can attain such tolerable level of performance without any training.</p><p>However, our salience of lexical chains is, of course, rather naive and must be refined by us­ing other kinds of information, such as Japanese topical marker 'wa'.</p><p>chains I text</p><doubt alpha="25.0" length="4" tooSmall="False" monospace="0.0">I1 2</doubt><doubt alpha="26.5" length="34" tooSmall="False" monospace="0.0">start-endI123456789012345678901234</doubt><doubt alpha="5.7" length="35" tooSmall="False" monospace="0.0">( i - 24)I ************************</doubt><doubt alpha="4.8" length="21" tooSmall="False" monospace="0.0">( 4 - 13)I **********</doubt><doubt alpha="7.1" length="14" tooSmall="False" monospace="0.0">(14 - 16)I ***</doubt><doubt alpha="7.1" length="14" tooSmall="False" monospace="0.0">( 8 -   9)I **</doubt><doubt alpha="6.2" length="16" tooSmall="False" monospace="0.0">(14 - 18)I *****</doubt></subsection></section><section number="4" title="Text Segmentation by Lexi­cal Chains"><p>The second importance of lexical chains is that they provide a clue for the: determination of seg­ment boundaries. Certain spans of sentences in a text form semantic units and are usually called segments. It is crucial to identify the segment boundaries as a first step to construct the struc­ture of a text [2].</p><subsection number="4.1" title="The Measure for Segment Bound­aries"><p>When a portion of a text forms a semantic unit, there is a tendency for related words to be used. Therefore, if lexical chains can be found, they will tend to indicate the segment boundaries of the text. When a lexical chain ends, there is a tendency for a segment to end. If a new chain begins, this might be an indication thai, a new segment has begunflO]. Taking into account this correspondence of lexical chain boundaries to seg­ment boundaries, we measure the plausibility of each point in the text as a segment; boundary: for each point between sentences <i>n </i>and <i>n </i>f I (where n ranges from 1 to the number of sentences in the text minus 1), compute the sum of the number of lexical chains that end at the sentence <i>n </i>and the number of lexical chains that begin at the sentence n + 1. We call this naive measure of a degree of agreement, of the start and end points of lexical chains <i>vi(n,n + </i>1) <i>boundary strength </i>like [14]. The points in the text are selected in the order of boundary strength as candidates of seg­ment boundaries.</p><p>Consider for example the live lexical chains in the imaginary text that consists of 24 sentences in Figure 2. In this text, the boundary strength can be computed as follows: w(3,4) = l,u&gt;(7,8) = l,w(9,J0) = l,w(13,14) ^ 3,....</p><figure caption="Figure 2: Lexical chains in the text4.2   The Evaluation"></figure><p>We try to segment the texts in section 3.2 and apply the above measure to the lexical chains that were formed. We pick out three toxts(No.3,4,.r&gt;), which are from the exam ques­tions of the Japanese language that ask us to par­tition the texts into a given number of segments. The system's performance is judged by the com­parison with segment boundaries marked as an attached model answer. Two more t,ext,s(No.6,7) from the questions are also tried to be segmented.</p><p>Here we do not take into account the informa tion of paragraph boundaries, such as the inden­tation, at, all in the following reasons:</p><p>• Because our texts are from the exam ques­tions, many of them have no marks of para­graph boundaries ;</p><p>• In case of Japanese, it is pointed out that paragraph and segment boundaries do not always coincide with each other[J3].</p><p>'table 2 shows the performance in case where the system generates the given number of segment boundaries<footnote anchor="4"/> in the order of the strength. From Table 2, we can compute tire system's marks as an examinee in the test that consists of these five questions. 'Fable 3 shows the performance in case where segment boundaries are generated down to half of the maximum strength. The metrics that we use for the evaluation are as follows: <i>Recall </i>is the quotient of the number of correctly identified boundaries by the total number of correct bound­aries. <i>Precision </i>is the quotient of the number of correctly identified boundaries by the number of generated boundaries.</p><p>We think the poor result for the text No.5 might be caused by the difficulty of the text itself because it is written by one of the most difficult writers in Japan, KOBAYASIII Hideo.<page local="6"/> Table 2 shows that our system gets 8(l+3+3+l)/15(l+6+l+4+3)= 53 % in the test. From Table 3, the average recall and pre­cision rates are 0.52 and 0.25 respectively. Of course these results are unsatisfactory, but we think this measure for segment boundaries is promising and useful as a preliminary one.</p><footnote label="4">The number of boundaries to be given is the number of segments given in the question nanus J .</footnote><table caption="Table 3: The performance for the segmenta­tion^)"></table><p>Since lexical chains are considered to be dif­ferent in their degree of contribution to segment boundaries, we are now refining the measure by taking into account their importance. We base the importance of lexical chains on the following two factors:</p><p>1. The lexical chains that include more words with topical marker 'wa' get more impor­tance.</p><p>2. The longer lexical chains tend to represent a semantic unit and get more importance.</p><p>The start and end points of the more impor­tant lexical chains can get the more boundary strength. This refinement of the measure is in the process and yields a certain extent of improve­ment of the system's performance.</p><p>Moreover, this evaluation method is not nec­essarily adequate since partitioning into a larger number of smaller segments might be possible and be necessary for the given texts. And so we will have to consider the evaluation method that the agreement with human subjects is tested in future. However, since human subjects clo not al­ways agree with each other on segmentation[6, 4, 14], our evaluation method using the texts in the questions with model answers is considered to be a good simplification.</p><p>Several other methods to text segmentation have been proposed. Kozima[7] and Youmans[17] proposed statistical measures(they are named LCP and VMP respectively), which indicate the plausibility of text points as a segment bound­ary. Their hills or valleys tend to indicate seg­ment boundaries. However, they only showed the correlation between their measures and segment boundaries by their intuitive analysis of few sam­ple texts, and so we cannot compare our system's and their performance precisely.</p><p>Hearst[5] independently proposes a similar measure for text segmentation and evaluates the performance of her method with precision and re­call rates. However, her segmentation method depends heavily on the information of paragraph boundaries and always partitions a text at the points of paragraph boundaries.</p></subsection></section><section number="5" title="Conclusion"><p>We showed that lexical cohesion can be used as a knowledge source for word sense disambiguation and text segmentation. We think our method is promising, although only partially successful re­sults can be obtained in the experiments so far. Here we reported some preliminary positive re­sults and made some suggestions for how to im­prove the method in future. The improvement of the method is now under way.</p><p>In addition, because computation of lexical chains depends completely on the thesaurus used, we think the comparison among the results by different thesauri would be insightful and are now planning. It it also necessary to incorporate other textual information, such as clue words, which can be computationally accessible to improve the performance.</p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">760</doubt><table caption="Table 2: The performance for the segmentation^)" class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>text</p></td><td class="cell"><p>given number of</p></td><td class="cell"><p>number of</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>boundaries</p></td><td class="cell"><p>correct boundaries</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.3</p></td><td class="cell"><p>1</p></td><td class="cell"><p>1</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.4</p></td><td class="cell"><p>6</p></td><td class="cell"><p>3</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.5</p></td><td class="cell"><p>1</p></td><td class="cell"><p>0</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.6</p></td><td class="cell"><p>4</p></td><td class="cell"><p>3</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.7</p></td><td class="cell"><p>3</p></td><td class="cell"><p>1</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><table caption="Table 3: The performance for the segmentation^)" class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>number of</p></td><td class="cell"><p>number of</p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>text</p></td><td class="cell"><p>generated</p></td><td class="cell"><p>correct</p></td><td class="cell"><p>rec.</p></td><td class="cell"><p>prec.</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>boundaries</p></td><td class="cell"><p>boundaries</p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.3</p></td><td class="cell"><p>3</p></td><td class="cell"><p>1</p></td><td class="cell"><p>1</p></td><td class="cell"><p>0.33</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.4</p></td><td class="cell"><p>10</p></td><td class="cell"><p>3</p></td><td class="cell"><p>0.5</p></td><td class="cell"><p>0.30</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.5</p></td><td class="cell"><p>3</p></td><td class="cell"><p>0</p></td><td class="cell"><p>0</p></td><td class="cell"><p>0</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.6</p></td><td class="cell"><p>7</p></td><td class="cell"><p>3</p></td><td class="cell"><p>0.75</p></td><td class="cell"><p>0.43</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>No.7</p></td><td class="cell"><p>5</p></td><td class="cell"><p>1</p></td><td class="cell"><p>0.33</p></td><td class="cell"><p>0.20</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><page local="7"/></section><references><p>[I] <i>Bunrui-Goihyo. </i>Shuei Shuppan., 1964. in Japanese.</p><p>[2] B.J. Gros/, and CL. Sidner. Attention, intentions, and the structure of discourse. <i>Computational Linguistics, </i>12(3):175-204, 1986.</p><p>[3] H. A. K. Halliday and R. Hassan. <i>Cohesion in English. </i>Longman, 1976.</p><p>[4] M.A. Hearst. Texttiling: A quantitative ap­proach to discourse segmentation, 'techni­cal Report 93/24, University of California, Berkeley, 1993.</p><p>[5] M.A. Hearst, Multi-paragraph segmenta­tion of expository texts. Technical Report 94/790, University of California, Berkeley, 1994.</p><p>[6] J. Hirschberg and B. Grosz. lntonational fea­tures of local and global discourse structure. In <i>Proc. of the Uarpa Workshop on Speech and Natural Language, </i>pages 441 446, 1992.</p><p>[7] II. Kozima. Text segmentation based on sim­ilarity between words. In <i>Proc. of the 31st Annual Meeting of the Association for Com­putational Linguistics, </i>pages 286 -288, 1993.</p><p>[8] S.W. McRoy. Using multiple knowledge sources for word sense discrimination. <i>Com­putational Linguistics, </i>J8( 1 ): 1 30, 1992.</p><p>[9] C.S. Mellish. <i>Computer Interpretation of Natural Language Descriptions. </i>Ellis Hor-wood, 1985.</p><p>[10] J. Morris and G. Hirst. Lexical cohesion computed by thesaural relations as an indi­cator of the structure of text. <i>Computational Linguistics, </i>17(1):21--48, 1991.</p><p>[II] Nagao Lab., Kyoto University. <i>Japanese Morphological Analysts System JUMAN Manual Version 1.0, </i>1.993. in Japanese.</p><p>[12] M. Okumura and II. Tanaka. Towards in­cremental disambiguation with a general­ized discrimination network. In <i>['roc. of the 8th National Conference on Artificial Intel­ligence, </i>pages 990 995, 1990.</p><p>[13] '1'. Ookuma. Gengo tan'i toshite no bun-shou. <i>Nihongo gaku, </i>ll(4):20-25, 1992. in Japanese.</p><p>[14] R.J. Passonneau. Intention-based segmenta­tion: Human reliability and correlation with linguistic cues. In <i>Proc. of the 31st An­nual Meeting of the Association for Compu­tational linguistics, </i>pages 148-155, 1993.</p><p>[15] P. Roget. <i>Roget's International Thesaurus, Fourth Edition. </i>Harper and Row Publishers Inc., 1977.</p><p>[16] D. Yarowsky. Word-sense disambiguation using statistical models of roget's categories trained on large corpora. In <i>Proc. of the Utth International Conference on Computational Linguistics, </i>pages 454-460, 1992.</p><p>[17] G. Youmans. A new tool for discourse anal­ysis: The vocabulary-management profile. <i>Language, </i>67:763-789, 1991.</p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">761</doubt></references></body></article>