<?xml version="1.0"?><!DOCTYPE article SYSTEM "/project/take/software/searchbench_offline_processing/paperxml_generator/aclextractor/src/python/../resource/dtd/paperxml.dtd"><article><header><firstpageheader><page local="1"/><title>Cross-Serial Dependencies Are Not Hard to Process</title><author surname="Vogel" givenname="Carl"><org  name="University of Stuttgart" country="Germany" city="Stuttgart"/></author><author surname="Hahn" givenname="Ulrike"><org  name="University of Edinburgh" country="United Kingdom" city="Edinburgh"/></author><author surname="Branigan" givenname="Holly"><org  name="University of Edinburgh" country="United Kingdom" city="Edinburgh"/></author></firstpageheader><frontmatter><p><b>Cross-Serial Dependencies Are Not Hard to Process</b></p><p><b>Carl Vogel Ulrike Hahn Holly Branigan</b></p><p>Institute for Computational Department of Experimental Centre for Cognitive Science</p><p>Linguistics University of Stuttgart Azenbergstr. 12 D-70174 Stuttgart Germany</p><p>Psychology University of Oxford South Parks Road Oxford 0X1 3UD England</p><p>University of Edinburgh 2 Buccleuch Place Edinburgh EH8 9LW Scotland</p><p>{vogel,holly ,ueh}&lt;3cogsci .ed.ac.uk</p></frontmatter><abstract>Cross-serial dependencies in Dutch and Swiss-German are the only known extra-context free natural language syntactic phenomena. Psycholinguistic evidence suggests cross-serial orderings tend to be easier to process than nested construc­tions. We argue that the expressivity re­quirements of the corresponding formal languages do not actually entail that pro­cessing reduplication languages require the worst-case time complexity for lan­guages of the same expressive class. We distinguish between context-free repre-sentability and context-free processing. We show that for any language with up to context free expressive power, <i>pro­cessing </i>cross-serial dependencies can be accommodated without affecting parsing complexity. This is related to other work on reduplication phenomena in formal models of computation. </abstract></header><body><section number="1" title="Introduction"><p>The cross-serial dependencies in Dutch and Swiss-German are the only known constituent-level syn­tactic phenomena which make natural languages not representable in context free languages (Gaz-dar, 1985; Gazdar and Pullum, 1985). Psycholin­guistic study of the cross-serial dependencies re­veals that the cross-serial orderings tend to be preferred over nested constructions (Bach et al., 198C).<footnote anchor="1"/> Bach et al. argue from this that the push­down stack cannot be the universal basis of the human parsing mechanism (since the pushdown automaton is essentially a context free recognition device which cannot represent cross-serial depen­dencies). Stabler (1994), on the other hand, con­siders the findings of Bach et al. (198G) as evi­dence for finite human sentence processing capac­ity. In this paper, we distinguish between context-free rcpresentability and context-free processing.</p><p>tested constructions are a quintcssentially con­text free phenomenon.</p><p>We show that for any language with up to con­text free expressive power, <i>processing </i>cross-serial dependencies can be accommodated without af­fecting parsing complexity. While this does essen­tially imbue the language with indexed expressiv­ity, it does so while allowing us to retain context, free (or even regular) parsing complexity. Essen­tially, it is possible to carve out a cross-section of the expressivity hierarchy with the desired pro­cessing complexity. The result is based un the sim­ple observation that the cross-serial dependencies are idealized by the string duplication language (whereas the nested dependencies are idealized by the palindrome language;), and that it is trivial to provide a context-free (or regular) language parse for half of the string, followed by a test, of equal­ity for the remaining half of the string. This is consistent, with findings that cross-serial depen­dencies are not hard to process, but qualifies the interpretation that Bach et al. give to their re­sults and the implications on the human parsing mechanism. In particular, this suggests that, with an additional operation the pushdown stack can be adequate for processing human languages. It also suggests an explanation for the finding that, Dutch cross-serial dependencies are easier to pro­cess than German nested dependencies. We out­line further consequences of our proposal in terms of patterns of disfiuencics that are likely to occur in languages that admit cross-serial dependencies and propose a strategy for empirical investigation.</p></section><section number="2" title="Preliminaries"><p>To calibrate our discussion, we quickly review the salient terminology from formal language; theory and the current understanding of the import, for natural languages.</p><subsection number="2.1" title="Terminology"><p>Let <i>Ci </i>denot e; the hierarchy of languages gener­ated by the corresponding hierarchy of grammars (according to the usual hierarchy (Hopcroft and Ullman, 1979)). Tims, <i>£0 </i>denotes the class of lan­guages generated by type 0 grammars. They are characterized by unrestricted grammar production rules.<page local="2"/> £1 is the class of languages generated by context sensitive grammars—the sole restric­tion on production rules in this type of grammar is that the right hand side (RHS) of each rule is at least as long as the left hand side (LHS). £1.5 de­notes the class of languages generated by indexed grammars. Gazdar (1985) provides the most per­spicuous notation for the restricted forms that production rules may take in such grammars:<footnote anchor="2"/></p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">157</doubt><doubt alpha="13.3" length="15" tooSmall="False" monospace="0.0">1.A[...] W[...}</doubt><doubt alpha="17.6" length="17" tooSmall="False" monospace="0.0">2.A[...]^B[i,...}</doubt><doubt alpha="16.7" length="18" tooSmall="False" monospace="0.0">3.A[i,...]—&gt;W[...]</doubt><p>Indexed grammars incorporate a notion of stack­ing; rules of the form in (2) describe push opera­tions, and those of the form in (3) involve pops. Rules of the form (1) are copy operations. The elipses indicate that the remainder of the stack is passed on from the LHS to each nonterminal (and only the nonterminals) on the RHS. <i>C2 </i>is the class of context free languages generated by gram­mars whose productions are restricted such that the LHS of each is a single nonterminal symbol, and each RHS is a sequence of terminals and non­terminals. Finally, the regular languages, £3 are those produced by regular grammars, character­ized by rules that have a single nonterminal sym­bol on the LHS and on the RHS, either a terminal symbol or a terminal and a single nonterminal.</p><p>These classes of languages can be arranged into a hierarchy based on proper containment rela­tions among them: £3 <b>c </b>£2 <b>c </b>£1.5 C £1 C £0 (£0 is the least restrictive, the most expres­sive). Aho (1968) shows the existence of lan­guages that are a proper subset of the indexed languages and a proper superset of the context free. Joshi et al. (1989) conjecture that there is actually a convergence in expressive power among the 'mildly context sensitive' (MCS) lan­guages, but other work points out exceptions (Sav-itch, 1989; Vogel and Erjavec, 1994). Since the reduplication languages (Savitch, 1989) are cen­tral to the point of this paper we define them— the languages homomorphic to the set of strings <i>{ww\w £ {a, b}*}. </i>The string duplication lan­guages are not context free, although they are closely related to the string reversal languages <i>({wwR\w e {a, b}*}, </i>where the <i>R </i>indicates the re­versal operator) which are context free. The two languages induce different dependency relation­ships which is best described as <i>nesting </i>in the con­text free case and <i>cross-serial </i>in the indexed case:</p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">+—+</doubt><footnote label="2">The bracketed material indicates a stack of in­dices; W denotes a sequence of elements of terminals and nonterminals; A,B denote nonterminals.</footnote><p>An important property of the each of the lan­guage classes is that it is closed under both in­tersection with regular languages (e.g., the inter­section of a context free language and a regular language is no more expressive than a context free language) and homomorphism (e.g., an or­der preserving map of each symbol in a language to a single element (possibly a string) of a context free language implies that the first language is also context free). It is convenient to refer to languages with homomorphisms to <i>{wwR\w £ {a, b}*} </i>and <i>{ww\w </i>6 <i>{a, b}*} </i>as <i>wwR </i>and <i>ww, </i>respectively.</p><p>Corresponding to expressivity class and the as­sociated model of computation is the complex­ity of recognition for each class. Table 1 gives an informal ranking of the language classes with their corresponding worst case recognition com­plexity on the standard model of computation. Thus, given a context free grammar for <i>wwR </i>and a string of length n, then in the worst case it will take an amount of time proportional to the cube of the length of the string to determine whether the string is in <i>wwR </i>(and identify its structure). While the expressivity hierarchy is useful for dif­ferentiating classes of languages in precise terms like worst-case recognition complexity, it is easy to use the hierarchy incorrectly. For instance, it is not valid to conclude that because a lan­guage is in a particular language class all subsets of that language are also included that language class (e.g. <i>wwR </i>is a proper subset of <i>w, </i>yet <i>w </i>e£3 <i>wwR </i>g£2). Also, in most cases the structural de­scriptions that underlie strings of a language are of more interest than the string sets themselves. For this reason it is useful to distinguish <i>weak </i>and <i>strong </i>containment of a grammar in a language class: e.g., a grammar is <i>weakly </i>context free if its stringset is context free; a grammar- is <i>strongly </i>context free if its treeset is also context free.</p></subsection><subsection number="2.2" title="Applicability to Natural Language"><p>Pullum and Gazdar (1982) survey the arguments up to the time they wrote for the non-corltext-freeness of natural language. The most interesting were those that considered idealizations of linguis­tic phenomena in terms of the string duplicating language, <i>ww. </i>In each case they found the ar­gument flawed: the phenomena in question did not yield languages whose stringsets were homo­morphic to the duplication language. Bresnan et al. (1982) argue that Dutch is not strongly con­text free. Shieber (1985) provides a stringset ar­gument about a dialect of Swiss-German, which has a class of verb phrases with cross-serial depen­dencies (through case marking) between NPs and their Vs, which establishes even the weak-non-context-freeness of natural language because of homomorphism to <i>ww. </i>Manaster-Ramer (1987) re-analyzes an argument considered by Pullum and Gazdar (1982) about Dutch and produces a corrected stringset argument that Dutch licences <b><i>anbnc" </i></b>constructions, which are MCS.<page local="3"/> No known syntactic phenomenon requires greater than in­dexed language expressivity.</p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">158</doubt><p>The point of this paper is to emphasize that al­though a particular Swiss-German dialect renders natural language syntax non-context free, it does not entail that natural languages, including the ones that license cross-serial dependencies, incur the worst case recognition complexity costs for in­dexed languages. In fact, we argue in the next section that <i>ww </i>is fairly straightforward to pro­cess. Essentially, we consider languages <i>xx </i>homo-morphic to <i>ww, </i>where <i>x </i>can be either £3 or £2, and argue that the recognition for <i>xx </i>is no worse than worst case recognition for £3 if <i>x </i>e£3 and no worse than the worst case for £2 if <i>x </i>€£2, even though <i>xx </i>is itself indexed.</p></subsection></section><section number="3" title="Cross-Serial Dependencies Are Not Hard to Process"><p>It is always possible to compile less restrictive grammar formalisms into more restrictive covering formalisms, allowing different constituent analy­ses and potential stringset overgeneration. Meta-grammatical techniques give an alternative that preserve coverage, but use special purpose pro­cessing. We suggest a parsing method for lan­guages that rely on <i>ww </i>which does not cost a greater complexity fee than the worst case for parsing context free grammars. The method is metagrammatical and therefore akin to propos­als put forward previously for handling coordina­tion (Dahl and McCord, 1983) with logic gram­mars and TAGs (Shieber, 1995) or for extraposi­tion (Milward, 1994). The method is constrained enough not to augment overall processing com­plexity, implying that <i>ww </i>does not require the worst case recognition complexity for its charac­teristic class, the MCS languages.</p><subsection number="3.1" title="Why not?"><p>Trivially, the string duplication languages can be recognized with time complexity proportional to the length of the string — if the string is of even length, and its first half is identical to the sec­ond half, then this can be established in just lin­ear time. Though trivial in the sense of being about mere recognition, this is nonetheless inter­esting. In particular, under the reasonable hy­pothesis that humans are not in general reverse-wired<footnote anchor="3"/> it is easier to process serial orders than their reverse. In this trivial recognition model we could take the serial ordering as primitive, but to use the same model as a recognizer for the <i>con­text free </i>string reversal languages would require an additional step of reversing the second half of the string before checking equivalence, which means the recognition complexity is <i>nlogn. </i>Thus, for trivial recognition the string duplication lan­guages aie easier to process than the string rever­sal languages. This is a concrete illustration that not every language costs the worst case recogni­tion complexity for its expressivity class.</p><p>However, in the case of natural languages, pars­ing is of greater interest than mere recognition. A generalization of the recognizer method can be used inside a parsing approach as well. Suppose some <i>i </i>such that <i>i </i>&gt; 2; suppose we want a rec­ognizer for {tuu)|to 6 {a,ft}*} where <i>w € Ci, </i>then we can use a parser that is no worse than cubic (if <i>i </i>= 2) and which can be linear (if <i>i </i>= 3) to determine if <i>w</i><i> </i><i>€d.</i><i> </i>Thus, if we parse exactly half of the string using a processor designed for languages in <i>Ci, </i>and then ascertain whether the remaining half is identical, then we remain in the same processing complexity class, since the iden­tity check occurs after the parse and only requires linear time, but we also have structural informa­tion about the sentence as a whole.<page local="4"/> We know the structure of the first half of the string, and the sec­ond half of the string but not the structure of the second half (the grammar for <i>w </i>could be ambigu­ous), although we can assume that the second <i>w </i>was licensed by exactly the same tree structure as the first. This method also preserves a relative dif­ference between par sing <i>ww </i>and <i>wwR, </i>at least for £3. Since <i>wwR </i>can be represented directly within £2 it can be argued that we should not be required to use the metagrammatical method of parsing it, just to keep symmetry with the duplication lan­guages. Interestingly, if <i>w </i>is in £2 and we use the metagrammatical parsing method, then <i>wwR </i>also requires more processing time than <i>ww </i>for the same reason as the trivial case. Suppose instead that we allow <i>wwR </i>to be parsed without; using the metagrammatical method. In that case <i>ww </i>is relatively even easier to process since it costs <i>\w\a</i>to parse with the metagrammatical approach but <i>wwR </i>will cost (2|w|)<footnote anchor="3"/> in the direct approach. It might be claimed that just as we argue <i>ww </i>not to require the worst case complexity for its language-class (£1.5), neither need <i>wwR </i>for £2; but, the reversal language is a canonical example of a lan­guage that makes maximal use of the stack in the PDA. In any case, the metagrammatical method for parsing <i>ww </i>costs no more than just parsing strings in the characteristic language class of <i>w.</i></p><footnote label="3">While there actually is structural reverse wiring, psychological effects, like child learning of the dis­tinction between left and right hands on themselves and on a person facing them, suggest that there is a difference in processing time required between recog­nizing a copy and an inverse copy. Another example comes from the recognition of rotated objects. There is a robust effect for which given a reference object and a rotated object-in-question it takes time linear in the amount of rotation to recognize the objects as copies. Mirror-image objects are isomorphic, yet it takes strictly more time to recognize reflected copies than to recognize nonreflected copies (Cooper, 1975).</footnote><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">159</doubt><table caption="Table 1: Models of Grammar and Computation" class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Hierarchy Level</p></td><td class="cell"><p>Language Type</p></td><td class="cell"><p>Model of Computation</p></td><td class="cell"><p>Complexity</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>0</p></td><td class="cell"><p>unrestricted phrase structure grammar (= r.e.)</p></td><td class="cell"><p>Turing Machine (TM)</p></td><td class="cell"><p>undecidable</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>1</p></td><td class="cell"><p>context sensitive (C recursive)</p></td><td class="cell"><p>Linear bounded Automata <i>CUBA)</i></p></td><td class="cell"><p>PSPACE</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>1.5</p></td><td class="cell"><p>indexed</p></td><td class="cell"><p>Nested    Stack Automata (NSA)</p></td><td class="cell"><p>NP-Complete</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>1.75</p></td><td class="cell"><p>mildly context sensitive</p></td><td class="cell"><p>Embeded Pushdown Automata (EPDA)</p></td><td class="cell"><p>n<footnote anchor="7"/></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>2</p></td><td class="cell"><p>context free</p></td><td class="cell"><p>Pushdown Automata (PDA)</p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>3</p></td><td class="cell"><p>regular</p></td><td class="cell"><p>Finite State Machines (FSM)</p></td><td class="cell"><p>linear</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><p>If this were the complete st ory then we could only recognize languages homomorphic to the du­plication languages. Clearly even the Zürich di­alect of Swiss-German allows other constructions, all of which we can assume are context free (Pul­lum and Gazdar, 1982). Essentially we want to be able to write arbitrary £3 or £2 grammars and also be able to parse the string duplication lan­guage for whichever <i>Ci </i>we choose. The language defined by such a union is no longer <i>Ci, </i>but will not contain arbitrary £1.5 strings, and if <i>i = </i>3 then the union will not even contain arbitrary con­text free strings. However, the situation is more involved than the basic approach since there needs to be a way to indicate where the metagrammat­ical approach is to be invoked. Add a single fea­ture to the grammar int erpreted by the processor as 'expect a copy'.<footnote anchor="4"/></p><doubt alpha="38.5" length="13" tooSmall="False" monospace="0.0">1.A —&gt; WB[c]Y</doubt><p>We allow context free productions of the form shown in (1), where <i>A </i>and <i>B </i>are nonterminals and <i>W, Y </i>are (possibly empty) sequences of terminals and nonterminals, <i>B </i>possibly occurring among the nonterminals of <i>Y.</i><i> </i>For an ambiguous CFG, there is no guarantee that multiple instances of a nonterminal will rewrite to through the same sequence of productions to yield the same string.</p><footnote label="4">Once we admit 'interpretability by the processor' we in principle have TM power. However we make quite restricted use of such interpretation. The rule format makes clear that it is less expressive than in­dexed grammars when interpreted directly.</footnote><p>There are any number of ways that this basic notation can be used in a metagrammatical ap­proach. In the first instance, we take c to be a signal to the processor to generate an expectation for a duplicate of the terminal sequence that the nonterminal it is attached to gets rewritten to, and that this expectation must be satisfied by the next nonterminal of the same name and in the same local domain.<footnote anchor="5"/> This approach will require that the sequence of terminals rewritten from the first <i>B </i>in (1) will be duplicated by the terminal sequence rewritten from the first instance of <i>B </i>(if any) that occurs in <i>Y. </i>The restriction will not hold of subsequent instances of the nonterminal marked for copying in the same local domain nor at different levels in the analysis. A stronger in­terpretation could require an expectation for the same constituent analysis of the nonterminal as well. Since we do not allow the feature to stack, the string-based method does not yield the full expressive power of indexed languages. The point is just that it's possible to keep a CF (or regular) grammar, and supplement; the processor with a string-duplication operator which can be invoked at the subsentence level. This is sufficient to yield languages that more closely resemble the Zürich dialect in having other constructions besides the duplication construction, yet remaining efficiently processablc.<footnote anchor="6"/></p><p>We have implemented the interpreter in a chart parser that can be used in cither top-down or bottom-up fashion. Edges in the chart are marked with a category (some nonterminal or preter­minal symbol from the grammar), constituents, substring span and expectations (along with a unique identifier for each edge). This is modi­fied to include a list, of constraints, which for the present purposes is presumed to be just duplica­tion checks. An edge with no expectations is in­active (saturated) and one with expectations is active. In the completer step, when active edges combine with adjacent, inactive edges whose cate­gory satisfies the current expectation of the ac­tive, the usual process of creating a new edge with one less expectation is augmented with an­other: if the current expectation has an associ­ated copy feature, then the new edge is marked with a constraint interpreted by the parser as in­dicated above — the nonterminal symbol and the string spanned by the inactive edge are noted so that the next inactive edge of the same category (if one is expect ed) will have; to span an identi­cal string.<page local="5"/> Constraints of this form are not passed on after satisfied once, and arc not, passed out of the local domain. Wit hin the same set of restric­tions the implemented constraint could have been 'expect a reversed copy'. This would require com-putafing the string's reverse before annotating the constraint list.</p><footnote label="5">We take a local domain, in tree terms, as a node and the set of nodes that it immediately dominates.</footnote><footnote label="6">To get closer still to the Zürich dialect, we require that the duplication operator be applied at the level of preterminals, with complementation, to get the pair­ings of case-marked NPs and Vs.</footnote><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">160</doubt></subsection></section><section number="4" title="Discussion"><p>The context free languages have already been studied from the perspective; of minimal addition to incorporate copy languages. Savitch (1989) docs exactly that by presenting the model of com­putation required for the class of languages de­fined by augment ing the CFLs with reduplication: a Reduplication PDA (RPDA). An RPDA is just a PDA which has a special type of symbol that, can be put onto the stack to make the machine! treat the part, of the! stack above it as if it, were; a queue. Essentially, this obtains the reversal be­havior needed of a staek to process copy languages as well as reversals. Multiple; instances of the special symbol can be; placed on the stack. Sav­itch presents a characterization of the languages in terms of stringsets and the requisite compu­tational structures. The; family that we charac­terized above in terms of grammars are properly a subset, of the languages recognized by RPDA, a restriction of RPDA languages which Savitch (1989) terms <i>simple RPDA languages. </i>The model of computation here is an RPDA in which oidy one special symbol is allowe;d on the stack at any one; time. We; have not, proven the equiva­lence we conjecture between our metagrammatical method and the <i>reduplication context-free gram­mars </i>(RCFGs) that, Savitch introduces as genera­tive of simple RPDA languages. Savitch's (1989) grammars are; stated in terms of rule schemata (a finite set) that generate potentially infinite sets of rewrite rules. This is the tradeoff between doing things metagrammatically and directly.</p><p>Joshi and Rainbow (.Toshi, 1990; Rainbow and Joshi, 1994) have also considered the performance data associated with processing crossed vs. nested dependencies and present an alternative com­putation model, the <i>bottom-up embedded PDA </i>(BEPDA), designed for a variant, of tree-adjoining grammar (it uses a stack of stacks and a mon; complex operation for emptying the stack). Rain­bow and Joshi (1994) use; the processing model to demonstrate that, if can account for the difference between crosse;d and nested dependencies in terms of the amount of time; associated objects spend in the pushdown store of the BEPDA using a mildly context free; language; model that captures depen­dencies direct ly, rather than metagrammatically.<footnote anchor="7"/></p><footnote label="7">,Toshi (1990) gives a similar analysis for EDPAs.</footnote><p>Essentially, their analysis concludes the same: when judging string isomorphisms, it is easier to make the judgment of identically ordered pairs than it is to reversely ordered pairs. Thus, the cross-serial dependencies needn't cost the; worst, case complexity for parsing indexed or mildly con­text sensitive languages. Parsing <i>ww </i>languages requires, at worst, the worst case complexity of parsing <i>w </i>in whichever language class <i>w </i>is re­stricted to. Shieber (1985) pointed out without proof that, the nonCF data associated Zürich di­alect is linearly parsablc; our task has been to clarify how this follows from the language theory.</p><subsection number="4.1" title="A Caveat"><p>For efficient processing of <i>ww </i>to entail correspond­ing complexity for natural languages that license cross-serial dependencies hinges crucially on there being efficiently computable homomorphisms be­tween the natural language; and the string dupli­cation languages. This is an open que;stion. How­ever, give;n that, empirical work that compares pro­cessing of crossed and nested dependencies and concludes that the cross-serial dependencies are preferred to nested ones (Bach et, al., 198G), and give;n our argument, that, cross-serial dependencies are in theory easier to process, we feel it reason­able; to entertain the; assumption that something such exists. This does not require us to assume that people; actually use context-free! grammars and compute: homomorphisms in order to under­stand natural languages, just, that, the computa­tional model should be; at least, approximately as efficient, as pc;ople.</p></subsection><subsection number="4.2" title="Implications"><p>Our nietagrammatical approach to dealing with cross serial dependencies involves the; assumption of an operation for testing string duplication. We hinted earlier that we feel there to be sullicie;nf reason to believe that copy-checking is a basic; cog­nitive function, and although we don't, suppose that, people have built, in production systems and processors isomorphic to our chart parser and base: language, we: do think that this copy-checking is invoked in the processing of crossed dependencies. Our approach to accounting for the processing complexity that the string duplication languages should t ake does make empirical predictions and these can be tested. For instance, if it; is the case that such a rnee;hanism exists, then patterns of string-copy elisfluency should occur with different frequency in languages that license cross-serial de­pendencies than in those that, do not. A string-copy elisfluency is just one: that involves a repe:at of part; of the sentence: uttered so far:</p><p>1. <i>We went to the to the store to buy some flour.</i></p><p>The idea is that speakers of languages with <i>ww </i>homomorphisms have a different, pattern of in­voking copy-checking than those who speak languages that do not admit cross serial dependen­cies.<page local="6"/> These differences should be manifest in speech corpora like those that are currently being accumulated (Anderson et al., 1992; Miller, 1995), but which need augmentation by a corpus derived from copy-language dialects. Verifying this would, for example, establish whether the copied strings need to be constituents, and this has a bearing on whether processing models designed for incremen­tal interpretation (Milward, 1992) are the best de­scriptors of human performance. We do not offer arguments that our metagrammatical approach is the best description of human processing of cross-serial dependencies, just that it is another theo­retical justification for the difference in process­ing nested dependencies and efficient processing of crossed dependencies.</p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">161</doubt></subsection></section><section title="Acknowledgements"><p>Vogel is grateful to the SFB 340 for funding his stay Stuttgart; Hahn acknowledges the sup­port of ESRC grant No. R004293341442; Brani-gan, EPSRC research studentship No. 92315069. All would like to thank Catherine Collin, Tomaz Erjavec, Tsutomu Fujinami, Merce Prat, Fred Popowich, Mark Steedman, and the anonymous reviewers.</p></section><references><p>Alfred V. Alio. 1968. Indexed grammars—an exten-tion to context-free grammars. <i>Journal of the As­sociation for Computing Machinery, </i>15(4):647-671.</p><p>Anne H. Anderson, Miles Bader, Ellen Gurman Bard, Elizabeth H. Boyle, Gwyneth M. Doherty, Simon C. Garrod, Stephen D. Isard, Jacqueline C. Kowtko, Jan M. McAllister, Jim Miller, Catherine F. Sotillo, Henry S. Thompson, and Regina Weinert. 1992. The HCRC Map Task corpus. <i>Language and Speech, </i>34(4):351-366.</p><p>Emrnon Bach, Colin Brown, and William Marslen-Wilson. 1986. Cross and nested dependencies in german and dutch: A psycholinguistic study. <i>Lan­guage and Cognitive Processes, </i>l(4):249-262.</p><p>Joan Bresnan, Ron Kaplan, Stanley Peters, and Annie Zaenen. 1982. Cross-serial dependencies in dutch. <i>Linguistic Inquiry, </i>13(4):613-35.</p><p>Lynn Cooper. 1975. Mental rotation of random two-dimensional shapes. <i>Cognitive Psychology, </i>7:23-43.</p><p>Veronica Dahl and Michael McCord. 1983. Treating coordination in logic grammars. <i>American Journal of Computational Linguistics, </i>9(2):69-91.</p><footnote label="8">Note that thé English 'respectively' constructions require a special intonational behavior in the sing­song litany-voice that is required for a speaker to make an extended 'respectively' construction inter­prétable, thus arguments for specifically metagram­matical treatment do exist (where intonational facts are considered evidence for a signal to the processor to do something unusual).</footnote><p>Gerald Gazdar and Geoffrey Pullum. 1985. Compu­tationally relevant properties of natural languages and their grammars. Technical Report CSLI-85-24, Stanford: Center for the Study of Language and Information.</p><p>Gerald Gazdar. 1985. Applicability of indexed gram­mars to natural language. Technical Report CSLI-85-34, Stanford: Center for the Study of Language and Information.</p><p>John E. Hopcroft and Jeffrey D. Ullman. 1979. <i>Intro­duction to Automata Theory, Languages, and Com­putation. </i>Addison-Wesley Publishing Co., Reading MA.</p><p>Aravind K. Joshi, K. Vijay-Shanker, and David Weir. 1989. The convergence of mildly context-sensitive grammar formalisms. Technical Report MS-CIS-89-14; LINC LAB 144, Department of Computer and Information Science University of Pennsylva­nia, Philadelphia, PA.</p><p>Aravind Joshi. 1990. Processing crossed and nested dependencies: An automaton perspective on the psycholinguistic results. <i>Language and Cognitive Processes, </i>5(l):l-27.</p><p>Alexis Manaster-Ramer. 1987. Dutch as a formal language. <i>Linguistics and Philosophy, </i>10(2):221-46.</p><p>Jim Miller. 1995. Focus in the languages of europe. To appear in G. Bernini (ed.) <i>Pragmatic organiza­tion in the languages </i>(Volume I. of <i>Typology of the languages of Europe). </i>Mouton-de Gruyter.</p><p>David Milward. 1992. Dynamics, dependency gram­mar and incremental interpretation. In <i>COL-ING92, </i>pages 1095-9.</p><p>David Milward. 1994. Dynamic dependency gram­mar. <i>Linguistics and Philosophy, </i>17:561-605.</p><p>Geoffrey Pullum and Gerald Gazdar. 1982. Natural languages and context-free languages. <i>Linguistics and Philosophy, </i>4:471-504.</p><p>Owen Rambow and Aravind Joshi. 1994. A process­ing model for free word order languages. In L. Fra-zier C. Clifton, Jr. and K. Rayner, editors, <i>Perspec­tives on Sentence Processing. </i>Lawrence Erlbaum.</p><p>Walter Savitch. 1989. A formal model for context-free languages augmented with reduplication. <i>Compu­tational Linguistics, </i>15(4):250-61.</p><p>Stuart Shieber. 1985. Evidence against the context-freeness of natural language. <i>Linguistics and Phi­losophy, </i>8(3).-333-43.</p><p>Stuart Shieber. 1995. What is wrong with tags. In­vited talk at the <i>Seventh Conference of the Euro­pean Chapter of the Association for Computational Linguistics. </i>Belfield, Dublin, Ireland.</p><p>Edward P. Stabler. 1994. The finite connectivity of linguistic structure. In Lyn Frazier Charles Clifton, Jr. and Keith Rayner, editors, <i>Perspectives on Sen­tence Processing. </i>Hillsdale, NJ: Lawrence Erlbaum.</p><p>Carl Vogel and Tomaz Erjavec. 1994. Restricted dis­continuous phrase structure grammar and its rami­fications. In Carlos Martin-Vide, editor, <i>Current Is­sues in Mathematical Linguistics. </i>The Netherlands: Elsevier Science Publishers.</p><doubt alpha="0.0" length="3" tooSmall="False" monospace="0.0">162</doubt></references></body></article>