<?xml version="1.0"?><!DOCTYPE article SYSTEM "/project/take/software/searchbench_offline_processing/paperxml_generator/aclextractor/src/python/../resource/dtd/paperxml.dtd"><article><header><firstpageheader><page local="1" global="833"/><title>Prediction of Maximal Projection for Semantic Role Labeling</title><pubinfo>Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008),pages 833-840 Manchester, August 2008</pubinfo><author surname="Sun" givenname="Weiwei"><org  name="Peking University" country="China" city="Beijing"/></author><author surname="Sui" givenname="Zhifang"><org  name="Peking University" country="China" city="Beijing"/></author><author surname="Wang" givenname="Haifeng"><org  name="Peking University" country="China" city="Beijing"/></author></firstpageheader><frontmatter><p><b>Prediction of Maximal Projection for Semantic Role Labeling</b></p><p><b>Weiwei Sun,* Zhifang Sui Haifeng Wang</b></p><p>Institute of Computational Linguistics Toshiba (China) R&amp;D Center</p><p>Peking University 501, Tower W2, Oriental Plaza</p><p>Beijing, 100871, China Beijing, 100738, China</p><p>{ws,   szf}@pku.edu.cn wanghaifeng@rdc.toshiba.com.cn</p></frontmatter><abstract>In Semantic Role Labeling (SRL), argu­ments are usually limited in a syntax sub­tree. It is reasonable to label arguments lo­cally in such a sub-tree rather than a whole tree. Lo identify active region of argu­ments, this paper models Maximal Pro­jection (MP), which is a concept in D-structure from the projection principle of the Principle and Parameters theory. Lhis paper makes a new definition of MP in S-structure and proposes two methods to pre­dict it: the anchor group approach and the single anchor approach. Lhe anchor group approach achieves an accuracy of 87.75% and the single anchor approach achieves 83.63%. Experimental results also indicate that the prediction of MP improves seman­tic role labeling. </abstract></header><body><section number="1" title="Introduction"><p>Semantic Role Labeling (SRL) has gained the in­terest of many researchers in the last few years. SRL consists of recognizing arguments involved by predicates of a given sentence and labeling their semantic types. As a well defined task of shallow semantic parsing, SRL has a variety of applications in many kinds of NLP tasks.</p><p>A variety of approaches has been proposed for the different characteristics of SRL. More re­cent approaches have involved calibrating features (Gildea and lurafsky, 2002; Xue and Palmer, 2004;</p><p>This work was partial completed while this author was at Toshiba (China) R&amp;D Center.</p><p>© 2008. Licensed under the <i>Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported </i>li­cense (http://creativecommons.Org/licenses/by-nc-sa/3.0/). Some rights reserved.</p><p>Pradhan et al., 2005), analyzing the complex input - syntax trees (Moschitti, 2004; Liu and Sarkar, 2007), exploiting the complicated output - the predicate-structure (Toutanova et al., 2005), as well as capturing paradigmatic relations between predicates (Gordon and Swanson, 2007).</p><p>In prior SRL methods, role candidates are ex­tracted from a whole syntax tree. Though sev­eral pruning algorithms have been raised (Xue and Palmer, 2004), the policies are all in global style. In this paper, a statistical analysis of Penn Prop-Bank indicates that arguments are limited in a local syntax sub-tree rather than a whole one. Prior SRL methods do not take such locality into account and seek roles in a wider area. The neglect of local­ity of arguments may cause labeling errors such as constituents outside active region of arguments may be falsely recognized as roles.</p><p>This paper uses insights from generative lin­guistics to guide the solution of locality of argu­ments. In particular, Maximal Projection (MP) which dominates<footnote anchor="1"/> active region of arguments ac­cording to the projection principle of principle and parameters. Two methods, the anchor group ap­proach and the single anchor approach, are pro­posed to find the active sub-tree which is rooted by MP and covers all roles. The solutions put forward in this paper borrow ideas from NP-movement principle in generative linguistics and are in statis­tical flavor. The anchor group approach achieves an accuracy of 87.75%, and the single anchor ap­proach achieves 83.63%. Though the accuracy is lower, the single anchor approach fits SRL better.</p><footnote label="1">Dominate is an concept in X-bar theory are modeled. As­suming a and /? are two nodes in a syntax tree: a dominates f3 means a is ancestor of /?.</footnote><page local="2" global="834"/><doubt alpha="36.8" length="19" tooSmall="False" monospace="0.0">JJ       NN-4 NNS-2</doubt><p>Most of the stock selling pressure came from wall street professionals , including computer program traders</p><doubt alpha="85.7" length="7" tooSmall="False" monospace="0.0">-guided</doubt><figure caption="Figure 1: A sentence from WSJ test corpus of CoNLL-2005 shared task"></figure></section><section number="2" title="Maximal Projection and Its Government of Arguments"></section><section number="2" title=".1   Maximal Projection"><p>Principle and parameters theory is a framework of generative grammar. X-bar theory, as a module of principle and parameters, restricts context-free phrase structure rules as follows:</p><p>1. a phrase always contains a head of the same type, i.e. NPs Ns, VPs Vs, PPs Ps, etc.</p><doubt alpha="50.0" length="26" tooSmall="False" monospace="0.0">2. XP(X") -»■ specifier X'</doubt></section><section number="3." title="X'^X complement(s)"><p>These structural properties are conventionally rep­resented as shown in figure 2.</p><p>specifier X'</p><p>X complement(s)</p><figure caption="Figure 2: X-bar structure"></figure><p>X is the head of the phrase XP. X' and XP(X") are called projections of X. The head is also called the zero projection. X-bar structure is integrated with the properties of lexical items via the <b>Projec­tion Principle </b>of principle and parameters. This principle is summed up as the properties of lexi­cal information project onto the syntax of the sen­tence. For instance:</p><p>• Sue likes Picasso</p><doubt alpha="66.7" length="12" tooSmall="False" monospace="0.0">• *Sue likes</doubt><p>The subcategorization frame of the lexical item <i>like </i>[_,NP] ensures that the verb is followed by an NP and the second sentence is of ungrammatical form.</p><p><b>Maximal Projection </b>(MP) is the constituent which is projected to the highest level of an X-bar structure from lexical entities and is therefore the top node XP of the X-bar structure.</p><p>Take figure 1 for instance, <i>S </i>is the MP of the predicate <i>come. </i>Though the syntax tree is not in D-structure (deep structure), the S-structure (surface structure) headed by <i>come </i>is similar to its genuine D-structure. In a latter part of this section, a spe­cific definition of MP in S-structure will be given for application.</p><subsection number="2.2" title="MP Limits Active Region of Arguments"><p>MP holds all lexical properties of heads. In partic­ular, the MP of a predicate holds predicate struc­ture information and the constituents out of its do­main cannot occupy argument positions, ^-theory and government are two modules of principle and parameters. They both suggest that the possi­ble positions of semantic roles are in the sub-tree rooted by MP.</p><page local="3" global="835"/><p>Concerning assignment of semantic roles to constituents, ^-theory suggests that semantic roles are assigned by predicates to their sisters (Chom­sky, 1986). Furthermore, in a X-bar theory, com­plements are assigned semantic roles by the pred­icate and specifiers get roles from the V. In both situations the process of roles assignment is in sis­terhood condition and limited in the sub-structure which is dominated by the MP. Only constituents under MP can get semantic roles. <b>The Case As­signment Principle </b>also points out: Case is as­signed under government (Chomsky, 1981). Take figure 1 for instance, only <i>NP-1 </i>and <i>PP-2 </i>can get semantic roles of the head <i>come.</i></p><p>From generative linguists' point, MP limits sub­tree of arguments. Therefore, finding the MP is equivalent to finding the active region of predicate structure.</p></subsection><subsection number="2.3" title="Definition of MP in S-structure"><p>Though a clear enough definition of MP in D-structure has been previously illustrated, it is still necessary to define a specific one in S-structure for application, especially for automatic parsing which are not exactly correct. This paper de­fines <b>MP in S-structure </b>(hereinafter denote MP for short) as following: for every predicate <i>p </i>in the syntax tree T, there exists one and only one MP <i>mp</i><i> </i>s.t.</p></subsection></section><section number="1." title="mp  dominates all arguments of p;"><p><i>2. </i>all descendent nodes of mp don't satisfy the former condition.</p><p>Due to its different characteristics from argu­ments, adjunct-like arguments are excluded from the set of arguments in generative grammar and many other linguistic theories. For this reason, this paper does not take them into account.</p><p>For gold syntax tree, there exists a one-to-one mapping between arguments and nodes of syn­tax trees, whereas automatic syntactic parsing con­tains no such mapping. This paper do not take arguments which cannot get corresponding con­stituents into account to reduce the influence of au­tomatic parsing error.</p><p>Take the sentence of figure 1 to illustrate our definition of MP: <i>S </i>is MP of <i>come </i>since <i>NP-1 </i>and <i>PP-2 </i>are arguments of it. There is no node map­ping to the argument <i>Wall Street professionals </i>in the parsing tree. Instead of covering argument's fragments, we simply take it PP-4 as MP.</p><subsection number="2.4" title="Using MP Information in SRL"><p>The boundaries of a predicate structure are two word positions of the sentence. It is difficult to model these two words. On the contrary, MP, as one ancestor of predicate, has a clear-cut meaning and is ideal for modeling. In this paper, the pol­icy to predict MP rather than two word positions is carried out to deal with locality of arguments.</p><p>Automatic prediction of MP can be viewed as a preprocessing especially a pruning preprocessing for SRL. Given a sentence and its parsing, SRL systems can take seeking the active sub-tree rooted by MP as the first step. Then SRL systems can work on the shrunk syntax tree, and follow-up la­beling processes can be in a various form. Most of previous SRL methods still work without spe­cial processing. Take figure 1 for example: when labeling <i>include, </i>as the MP is <i>PP-4, </i>just <i>NP-7 </i>will be extracted as argument candidate.</p></subsection></section><section number="3" title="Analysis of Locality of Arguments"><p>Principle and parameters suggests that MP bounds arguments. Additionally, a statistical analysis shows that possible positions of arguments are lim­ited in a narrow region of syntax tree. An opposite experiment also shows that MP information is use­ful for SRL.</p><subsection number="3.1" title="Data and Baseline System"><p>In this paper, CoNLL-2005 SRL shared task data (Carreras and Marquez, 2005) is used as cor­pus. The data consists of the Wall Street Jour­nal (WSJ) part of the Penn TreeBank with infor­mation on predicate argument structures extracted from the PropBank corpus. In addition, the test set of the shared task includes three sections of the Brown corpus. Statistical analysis is based on sec­tion 02-21 of WSJ. Experiments are conducted on WSJ and Brown corpus. As defined by the shared task, section 02-21 of PropBank are used for train­ing models while section 23 and Brown corpus are used for test. In terms of syntax information, we use Charniak parser for POS tagging and full pars­ing.</p><p>A majority of prior SRL approaches formulate the SRL propblem as a multi-class classification propblem. Generally speaking, these SRL ap­proaches use a two-stage architecture: i) argument identification; ii) argument classification, to solve the task as a derivation of Gildea and Jurafsky's pioneer work (Gildea and Jurafsky, 2002). UIUC<page local="4" global="836"/></p><p>Semantic Role Labeler <footnote anchor="2"/> (UIUC SRLer) is a state-of-the-art SRL system that based on the champion system of CoNLL-2005 shared task (Carreras and Marquez, 2005). It is utilized as a baseline system in this paper. The system participated in CoNLL-2005 is based on several syntactic parsing results. However, experiments of this paper just use the best parsing result from Charniak parser. Param­eters for training SRL models are the same as de­scribed in (Koomen, 2005).</p></subsection><subsection number="3.2" title="Active Region of Arguments"><p>According to a statistical analysis, the average depth from a target predicate to the root of a syntax tree is 5.03, and the average depth from a predicate to MP is just 3.12. This means about 40% of an­cestors of a predicate do not dominate arguments directly. In addition, the quantity of leaves in syn­tax tree is another measure to analyze the domain. On average, a syntax tree covers 28.51 leaves, and MP dominates only 18.19. Roughly speaking, only about 60% of words are valid for semantic roles. Statistics of corpora leads to the following conclu­sion: arguments which are assigned semantic roles are in a local region of a whole syntax tree.</p></subsection><subsection number="3.3" title="Typical Errors Caused by Neglect of Locality of Arguments"><p>The neglect of the locality of arguments in prior SRL methods shows that it may cause errors. Some constituents outside active region of argu­ments may be falsely labeled as roles especially for those being arguments of other predicates. A sta­tistical analysis shows 20.62% of falsely labeled arguments are constituents out of MP domain in labeling results of UIUC SRLer. Take figure 1 for instance, UIUC SRLer makes a mistake when la­beling <i>NP-1 </i>which is Argl of the predicate <i>come </i>for the target <i>include; </i>it labels ArgO to <i>NP. </i>In fact, the active region of <i>include </i>is the sub-tree rooted by <i>PP-4.</i><i> </i>Since <i>NP-1 </i>is an argument of another predicate, some static properties of <i>NP-1 </i>make it confusing as an argument.</p><footnote label="2">http://12r.cs . uiuc.edu/ cogcomp/srl-demo.php</footnote></subsection><subsection number="3.4" title="SRL under Gold MP"><p>If MP has been found before labeling semantic roles, the set of role candidates will be shrunk, and the capability to identify semantic roles may be improved. An opposite experiment verifies this idea. In the first experiment, UIUC SRLer is re­trained as a baseline. For comparison, during the second experiment, syntax sub-trees dominated by gold MP are used as syntactic information. Both training and test data are preprocessed with gold MP information. That is to say we use pruned data for training, and test is conducted on pruned syntax sub-trees.</p><p>Table 1 and 2 show that except for Arg4, all ar­guments get improved labeling performance, espe­cially ArgO. Since arguments except for ArgO are realized as objects on the heel of predicate in most case, the information of MP is not so useful for them as ArgO. The experiment suggests that high performance prediction of MP can improve SRL.</p></subsection></section><section number="4" title="Prediction of MP"><p>Conforming to government and ^-theory, MP is not too difficult to predict in D-structure. Unfor­tunately, sentences being looked at are in their sur­face form and region of arguments has been ex­panded. Simple rules alone are not adequate for finding MP owing to a variety of movement be­tween D-structure and S-structure. This paper de­signs two data driven algorithms based on move­ment principles for prediction of MP.</p><subsection number="4.1" title="NP-movement and Prediction of MP"><subsubsection number="4.1.1" title="NP-movement in Principle and Parameters"><p>The relationship between D-structure and S-structure is movement:   S-structure equals Dstructure plus movement.<page local="5" global="837"/> NP-movement prin­ciple in principle and parameters indicates that noun phrases only move from A-positions (argu­ment position) which have been assigned roles to A-positions which have not, leaving an NP-trace. On account of ^-theory and government, A-positions are nodes m-commanded<footnote anchor="3"/> by predicates in D-structure. In NP-movement, arguments move to positions which are C-commanded <footnote anchor="4"/> by target predicate and m-commanded by other predicates. Broadly speaking, A-positions are C-commanded by predicates after NP-movement. The key of the well-known pruning algorithm raised in (Xue and Palmer, 2004) is extracting sisters of ancestors as role candidates. Those candidate nodes are all C-commanders of a predicate. NP-movement can give an explanation why the algorithm works.</p><table caption="Table 1: SRL performance of UIUC SRLer" class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>Precision</p></td><td class="cell"><p>Recall</p></td><td class="cell"><p>F/3=i</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>ArgO</p></td><td class="cell"><p>86.28%</p></td><td class="cell"><p>87.01%</p></td><td class="cell"><p>86.64</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Argl</p></td><td class="cell"><p>79.37%</p></td><td class="cell"><p>75.06%</p></td><td class="cell"><p>77.15</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg2</p></td><td class="cell"><p>69.48%</p></td><td class="cell"><p>62.97%</p></td><td class="cell"><p>66.07</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg3</p></td><td class="cell"><p>69.01%</p></td><td class="cell"><p>56.65%</p></td><td class="cell"><p>62.22</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg4</p></td><td class="cell"><p>72.64%</p></td><td class="cell"><p>75.49%</p></td><td class="cell"><p>74.04</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><table caption="Table 2: SRL performance of UIUC SRLer using information of gold MP" class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>Precision</p></td><td class="cell"><p>Recall</p></td><td class="cell"><p>F/3<b>=l</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>ArgO</p></td><td class="cell"><p>91.84%</p></td><td class="cell"><p>89.98%</p></td><td class="cell"><p>90.90</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Argl</p></td><td class="cell"><p>81.73%</p></td><td class="cell"><p>75.93%</p></td><td class="cell"><p>78.72</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg2</p></td><td class="cell"><p>69.86%</p></td><td class="cell"><p>63.06%</p></td><td class="cell"><p>66.29</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg3</p></td><td class="cell"><p>71.13%</p></td><td class="cell"><p>58.38%</p></td><td class="cell"><p>64.13</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg4</p></td><td class="cell"><p>73.08%</p></td><td class="cell"><p>74.51%</p></td><td class="cell"><p>73.79</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table></subsubsection><subsubsection number="4.1.2" title="Definition of Argument Anchor"><p>To capture the characteristics of A-positions, we make definition of A-anchor as following. For ev­ery predicate <i>p </i>in the syntax tree T, denote <i>A </i>the set of C-commanders of <i>p:</i></p><p>• a <b>left-A-anchor </b>satisfies:</p></subsubsection><subsubsection number="4.1.3" title="Anchor Model for Prediction of MP"><p>Parents of A-anchors and first branching ances­tor of the predicate can cover 96.25% of MP and the number of those ancestors is 2.78 times of the number of MP. The number of all ancestors is 6.65 times. The data suggests that taking only these kinds of ancestors as MP candidates can shrink the candidate set with a relatively small loss.</p><footnote label="3">M-command is an concept in X-bar syntax. Assuming a and /? are two nodes in a syntax tree: a m-commands f3 means a C-commands /? and the MP of a dominates /?</footnote><footnote label="4">C-command is an concept in X-bar theory. Assuming a and f3 are two nodes in a syntax tree: a C-commands /? means every parent of a is ancestor of /?.</footnote></subsubsection></subsection></section><section number="1." title="left-A-anchor belongs to A;"><p><i>2. </i>left-A-anchor is a noun phrase (includ­ing NNS, NNP, etc.) or simple declara­tive clause (S); 3. left-A-anchor is on the left hand of <i>p.</i></p><p>• a <b>right-A-anchor </b>satisfies:</p></section><section number="1." title="right-A-anchor belongs to A;"><p><i>2. </i>right-A-anchor is a noun phrase (includ­ing NNS, NNP, etc.); 3. right-A-anchor is on the right hand of <i>p.</i></p><p>Take figure 1 for example, <i>NP-1, NP-4 </i>and <i>NP-6 </i>are left-A-anchors of <i>include, </i>and no right-A-anchor. There is a close link between A-position and the A-anchor that we defined, since A-anchors occupy A-positions.</p><subsection number="4.2" title="Anchor Group Approach"><p>MP is one ancestor of a predicate. An natural ap­proach to predict MP is searching the set of all ancestors. This idea encounters the difficulty that there are too many ancestors. In order to reduce the noise brought by non-anchors' parents, the an­chor group approach prunes away useless ances­tors which are neither parents of A-anchors nor first branching node upon predicate from MP can­didate set. Then the algorithm scores all candidates and chooses the MP in argmax flavor. Formally, we denote the set of MP candidates <i>C </i>and the score function <i>S{.).</i></p><p><i>rap = </i>arg maxcec <i>S{mp\c)</i></p><p>Probability function is chosen as score func­tion in this paper. In estimating of the probability <i>P(MP\C),</i><i> </i>log-linear model is used. This model is often called maximum entropy model in research of NLP Let the set {1,-1} denotes whether a con­stituent is MP and $(c, {-1,1}) <i>e Rs </i>denotes a feature map from a constituent and the possible class to the vector space <i>W.</i><i> </i>Formally, the model of our system is defined as:</p><p><i>nip = </i>arg maxc€C <i>e&lt;^(J^c^'^(cfl)^&gt;</i>The algorithm is also described in pseudo code as following.</p><p>Ancestor Algorithm:</p><p>1: collect parents of anchors and the first branching ancestor, denote them set <i>C 2: </i>for every c G <i>C </i>3:   calculate <i>P{mp</i><i> </i><i>\</i><i> </i><i>c)</i><i> </i>4: return c that gets the maximal <i>P{mp\</i><i> </i><i>c)</i></p><subsubsection number="4.2.1" title="Features"><p>We use some features to represent various as­pects of the syntactic structure as well as lexical information. The features are listed as follows:</p><p><b>Path </b>The path features are similar to the path feature which is designed by (Gildea and Jurafsky, 2002).A path is a sequential collection of phrase tags. There are two kinds of path features here: one is from target predicate through to the candidate; the other is from the candidate to the root of the syntax tree. For <i>include </i>in the sentence of figure 1, the first kind of path of <i>PP-2 </i>is <i>VBG+PP+NP+PP </i>and the second is <i>PP+VP+S.</i></p><page local="6" global="838"/><p><b>C-commander Thread </b>As well as path features, C-commander threads are other features which reflect aspects of the syntactic structures. C-commander thread features are sequential contain­ers of constituents which C-command the target predicate. We design three kinds of C-commander threads: 1) down thread collects C-commanders from the anchor to the target predicate; 2) up thread collects C-commanders from the anchor to the left/right most C-commander; 3) full thread collects all C-commanders in the left/right direc­tion from the target predicate. Direction is depen­dent on the type of the anchor - left or right anchor.</p><p>Considering the grammatical characteristics of phrase, we make an equivalence between such phrase types:</p><doubt alpha="60.0" length="20" tooSmall="False" monospace="0.0">• JJ, JJR, JJS, ADJP</doubt><p><b>seek right most left</b>-A<b>-anchor</b> <b>predict action</b> <b>return first branching node</b> <b>return parent of right most left</b>-A<b>-anchor</b></p><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">down</doubt><figure caption="Figure 3: Flow diagram of the single anchor ap­proach"></figure><doubt alpha="57.6" length="33" tooSmall="False" monospace="0.0">• NN, NNP, NNS, NNPS, NAC, NX, NP</doubt><p>Besides the equivalent constituents, we discard these types of phrases:</p><doubt alpha="58.3" length="24" tooSmall="False" monospace="0.0">• MD, RB, RBS, RBR, ADVP</doubt><p>For <i>include </i>in figure 1, the up thread of <i>NP-4 </i>is <i>VBG+,+NP+NP; </i>the down thread is <i>NP+IN+VBD+NP; </i>the full thread is <i>VBG+, +NP+NP+IN+ VBD+NP.</i></p><p>The phrase type of candidate is an important fea­ture for prediction <b>Candidate </b>of MP. We also select the rank num­ber of the current candidate and the number of all candidates as features. For the former example, the two features for <i>PP-2 </i>are 2 and <i>3, </i>since NP-4 is the second left-A-anchor and there are three A-anchors of <i>include.</i></p><p><b>Anchor </b>Features of anchor include the head word of the anchor, the boundary words and their POS, and the number of the words in the anchor. Those features are clues of judgment of whether the anchor's position is an A-position.</p><p><b>Forward predicate </b>For the former example, the forward predicate of <i>NP-4 </i>is <i>come. </i>The features include the predicate itself, the Levin class and the SCF of the predicate.</p><p><b>predicate </b>Features of predicate include lemma, Levin class, POS and SCF of the predicate.</p><p><b>Formal Subject </b>An anchor may be formal sub­ject. Take <i>It is easy to say the specialist is not do­ing his job </i>for example, the formal subject will be recognized as anchor of <i>do. </i>We use a heuristic rule to extract this feature: if the first NP C-commander of the anchor is "it" and the left word of predicate is "to", the value of this feature is 1; otherwise 0.</p><p><b>The Maximal Length of C-commanders </b>Con­stituent which consists of many words may be a barrier between the predicate and an A-position. For the former example, if the target predicate is <i>include, </i>this feature of <i>NP-1 </i>is 2, since the largest constituent <i>NP-4 </i>is made up of two words.</p></subsubsection></subsection></section><section number="4" title=".3 Single Anchor Approach"><p>Among all A-anchors, the right most left-A-anchor such as <i>NP-6 </i>of <i>include </i>in figure 1 is the most im­portant one for MP prediction. The parent of this kind of left-A-anchor is the MP of the predicate, obtaining a high probability of 84.59%. The single anchor approach is designed based on right most left-A-anchor. The key of this approach is an ac­tion prediction that when right most left-A-anchor is found, the algorithm predicts next action to re­turn which node of syntax tree as MP. There is a label set of three types for learning - <i>here, up, down. </i>After action is predicted, several simple rules are executed as post process of this predic­tion: i) if there is no left-A-anchor, return the root of the whole syntax tree as MP; ii)if the predicted label is <i>here, </i>return the parent of right most left-A-anchor; iii) if the predicted label is <i>down, </i>return<page local="7" global="839"/></p><table class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p><b>i</b></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p><b>return root</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p><b><i>i</i></b></p><p><b>up</b></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><p>Prediction Accuracy</p><table caption="Table 3: Accuracy of the anchor group ap­proach"></table><p>Table 5: SRL performance of UIUC SRLer us­ing information of predicted MP; the anchor group approach; WSJ test corpus</p><table caption="Table 4: Accuracy of the single anchor ap­proach"></table><p>Table 6: SRL performance of UIUC SRLer us­ing information of predicted MP; the single an­chor approach; WSJ test corpus the first branching node upon the predicate; iv) if the predicted label is <i>up,</i><i> </i>return the root. The ac­tion prediction also uses maximum entropy model. Figure 3 is the flow diagram of the single anchor approach. Features for this approach are similar to the former method. Features of the verb which is between the anchor and the predicate are added, including the verb itself and the Levin class of that verb.</p></section><section number="5" title="Experiments and Results"><p>Experiment data and toolkit have been illustrated in section 3. Maxent<footnote anchor="5"/>, a maximum entropy model­ing toolkit, is used as a classifier in the experiments of MP prediction.</p><subsection number="5.1" title="Experiments of Prediction of MP"><p>The results are reported for both the anchor group approach and the single anchor approach. Table 3 summaries the accuracy results of MP prediction for the anchor group approach; table 4 summaries results of both action prediction and MP prediction for the single anchor approach. Both the anchor group approach and the single anchor approach have better prediction performance in Brown test set, though the models are trained on WSJ cor­pus. These results illustrate that anchor approaches which are based on suitable linguistic theories have robust performance and overcome limitations of training corpus.</p><footnote label="5">http://homepages.inf.ed.ac.uk/s0450736/maxent_toolkit.h</footnote></subsection><subsection number="5.2" title="Experiments of SRL Using MP Prediction"><p>Like the experiments in the end of section 3, we perform similar experiments under predicted MP. Both training and test corpus make use of predicted MP information. It is an empirical tactic that pre­dicted information of maximal projection, instead of gold information, is chosen for a training set. Experiments suggest predicted information is bet­ter. Table 5 is SRL performance using the anchor group approach to predict MP; Table 6 is SRL per­formance using the single anchor approach.</p><p>Compared with table 1 on page 4, table 5 and table 6 both indicate the predicted MP can help to label semantic roles. However, there is an interest­ing phenomenon. Even though the anchor group approach achieves a higher performance of MP, the single anchor approach is more helpful to SRL. 18.56% of falsely labeled arguments are out of MP domain using the single anchor approach to predict MP, compared to 20.62% of the baseline system.</p><p>In order to test robustness of the contribution of MP prediction to SRL, another opposite exper­iment is performed using the test set from Brown corpus. Table 7 is the SRL performance of UIUC SRLer on Brown test set. Table 8 is the corre­sponding performance using MP information pre­dicted by the single anchor approach. Comparison between table 7 and table 8 indicates the approach of MP prediction proposed in this paper adapts to other genres of corpora.</p><p>Capability of labeling ArgO gets significant im­provement. Subject selection rule, a part of the-<page local="8" global="840"/></p><table caption="Table 3: Accuracy of the anchor group approach" class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Corpus</p></td><td class="cell"><p>Action</p></td><td class="cell"><p>MP</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>WSJ</p></td><td class="cell"><p>-</p></td><td class="cell"><p>87.75%</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Brown</p></td><td class="cell"><p>-</p></td><td class="cell"><p>88.84%</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><table caption="Table 4: Accuracy of the single anchor approach" class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Corpus</p></td><td class="cell"><p>Action</p></td><td class="cell"><p>MP</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>WSJ</p></td><td class="cell"><p>88.45%</p></td><td class="cell"><p>83.63%</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Brown</p></td><td class="cell"><p>90.10%</p></td><td class="cell"><p>85.70%</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><table class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>Precision</p></td><td class="cell"><p>Recall</p></td><td class="cell"><p>F/3<b>=l</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>ArgO</p></td><td class="cell"><p>86.23%</p></td><td class="cell"><p>87.90%</p></td><td class="cell"><p>87.06</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Argl</p></td><td class="cell"><p>80.21%</p></td><td class="cell"><p>74.79%</p></td><td class="cell"><p>77.41</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg2</p></td><td class="cell"><p>70.09%</p></td><td class="cell"><p>62.70%</p></td><td class="cell"><p>66.19</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg3</p></td><td class="cell"><p>71.74%</p></td><td class="cell"><p>57.23%</p></td><td class="cell"><p>63.67</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg4</p></td><td class="cell"><p>74.76%</p></td><td class="cell"><p>75.49%</p></td><td class="cell"><p>75.12</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><table class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>Precision</p></td><td class="cell"><p>Recall</p></td><td class="cell"><p>F/3<b>=l</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>ArgO</p></td><td class="cell"><p>87.03%</p></td><td class="cell"><p>87.59%</p></td><td class="cell"><p>87.31</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Argl</p></td><td class="cell"><p>80.24%</p></td><td class="cell"><p>74.77%</p></td><td class="cell"><p>77.41</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg2</p></td><td class="cell"><p>70.35%</p></td><td class="cell"><p>63.06%</p></td><td class="cell"><p>66.51</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg3</p></td><td class="cell"><p>71.43%</p></td><td class="cell"><p>57.80%</p></td><td class="cell"><p>63.90</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg4</p></td><td class="cell"><p>73.33%</p></td><td class="cell"><p>75.49%</p></td><td class="cell"><p>74.40</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><p>Table 8: SRL performance of UIUC SRLer us­ing information of predicted MP; the single an­chor approach; Brown test corpus matic hierarchy theory, states that the argument that the highest role (i.e. proto-agent, ArgO in PropBank) is the subject. This means that ArgO is usually realized as a constituent preceding a predi­cate and has a long distance from the predicate. As a solution of finding active region of arguments, MP prediction is helpful to shrink the searching range of arguments preceding the predicate. From this point, we give a rough explanation why exper­iment results for ArgO are better.</p></subsection></section><section number="6" title="Conclusion"><p>Inspired by the locality phenomenon that argu­ments are usually limited in a syntax sub-tree, this paper proposed to label semantic roles locally in the active region arguments dominated by maximal projection, which is a concept in D-structure from the projection principle of the principle and param­eters theory. Statistical analysis showed that MP information was helpful to avoid errors in SRL, such as falsely recognizing constituents outside ac­tive region as arguments. To adapt the projection concept to label semantic roles, this paper defined MP in S-structure and proposed two methods to predict MP, namely the anchor group approach and the single anchor approach. Both approaches were based on NP-movement principle of principle and parameters. Experimental results indicated that our MP prediction methods improved SRL.</p><p><b>Acknowlegements</b></p><p>The work is supported by the National Natu­ral Science Foundation of China under Grants No. 60503071, 863 the National High Technol­ogy Research and Development Program of China under Grants No.2006AA01Z144, 973 Natural Basic Research Program of China under Grants NO.2004CB318102.</p><table caption="Table 7: SRL performance of UIUC SRLer; Brown test corpus" class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>Precision</p></td><td class="cell"><p>Recall</p></td><td class="cell"><p><i>F </i><i>13=1</i></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>ArgO</p></td><td class="cell"><p>82.88%</p></td><td class="cell"><p>85.51%</p></td><td class="cell"><p>84.17</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Argl</p></td><td class="cell"><p>66.30%</p></td><td class="cell"><p>63.17%</p></td><td class="cell"><p>64.70</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg2</p></td><td class="cell"><p>50.00%</p></td><td class="cell"><p>45.58%</p></td><td class="cell"><p>47.69</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg3</p></td><td class="cell"><p>0.00%</p></td><td class="cell"><p>0.00%</p></td><td class="cell"><p>0.00</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg4</p></td><td class="cell"><p>60.00%</p></td><td class="cell"><p>20.00%</p></td><td class="cell"><p>30.00</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><table class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p>Precision</p></td><td class="cell"><p>Recall</p></td><td class="cell"><p>F/3<b>=l</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>ArgO</p></td><td class="cell"><p>83.85%</p></td><td class="cell"><p>86.22%</p></td><td class="cell"><p>85.02</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Argl</p></td><td class="cell"><p>66.67%</p></td><td class="cell"><p>63.02%</p></td><td class="cell"><p>64.79</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg2</p></td><td class="cell"><p>50.38%</p></td><td class="cell"><p>44.90%</p></td><td class="cell"><p>47.48</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg3</p></td><td class="cell"><p>0.00%</p></td><td class="cell"><p>0.00%</p></td><td class="cell"><p>0.00</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Arg4</p></td><td class="cell"><p>60.00%</p></td><td class="cell"><p>20.00%</p></td><td class="cell"><p>30.00</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table></section><references><p>Carreras, Xavier and Llufs Marquez. 2005. Introduc­tion to the CoNLL-2005 shared task: semantic role labeling. In <i>Proceedings of Conference on Natural Language Learning.</i></p><p>Chomsky, Noam. 1981. <i>Lectures on Government and Binding. </i>Foris Publications, Dordrecht.</p><p>Chomsky, Noam. 1986. <i>Barriers. </i>MIT Press, Barriers.</p><p>Gildea, Daniel and Daniel Jurafsky. 2002. Automatic Labeling of Semantic Roles. <i>Computional Linguis­tics, </i>28(3):245-288.</p><p>Gordon, Andrew and Reid Swanson. 2007. Generaliz­ing Semantic Role Annotations Across Syntactically Similar Verbs. In <i>Proceedings of Conference on As­sociation for Computational Linguistics.</i></p><p>Koomen, Peter, Vasina Punyakanok, Dan Roth and Wen-tau Yih. 2005. Generalized Inference with Multiple Semantic Role Labeling Systems. In <i>Pro­ceedings of Conference on Natural Language Learn­ing.</i></p><p>Liu, Yudong and Anoop Sarkar. 2004. Experimen­tal Evaluation of LTAG-Based Features for Semantic Role Labeling. In <i>Proceedings of Joint Conference on Empirical Methods in Natural Language Process­ing and Computational Natural Language Learning.</i></p><p>Mocshitti, Alessandro. 2004. A Study on Convolu­tion Kernels for Shallow Semantic Parsing. In <i>Pro­ceedings of Conference on Association for Compu­tational Linguistics.</i></p><p>Pradhan, Sameer, Kadri Hacioglu, Valerie Krugler, Wayne Ward, James Martin and Daniel Jurafsky. 2005. Support Vector Learning for Semantic Argu­ment Classification. In <i>Proceedings of Conference on Association for Computational Linguistics.</i></p><p>Toutanova, Kristina, Aria Haghighi and Christopher Manning. 2005. Joint Learning Improves Seman­tic Role Labeling. In <i>Proceedings of Conference on Association for Computational Linguistics.</i></p><p>Xue, Nianwen and Martha Palmer. 2004. Calibrating Features for Semantic Role Labeling. In <i>Proceed­ings of Empirical Methods in Natural Language Pro­cessing.</i></p></references></body></article>