<?xml version="1.0"?><!DOCTYPE article SYSTEM "/project/take/software/searchbench_offline_processing/paperxml_generator/aclextractor/src/python/../resource/dtd/paperxml.dtd"><article><header><firstpageheader><page local="1"/><title>The Combinatory Morphemic Lexicon</title><pubinfo>© 2GG2 Association for Computational Linguistics</pubinfo><author surname="Bozsahin" givenname="Cem"><org  name="Middle East Technical University" country="Turkey" city="Ankara"/></author></firstpageheader><frontmatter><p><b>The Combinatory Morphemic Lexicon</b></p><p>Cem Bozsahin*</p><p>Middle East Technical University</p></frontmatter><abstract><i>Grammars that expect words from the lexicon may be at odds with the transparent projection of syntactic and semantic scope relations of smaller units. We propose a morphosyntactic framework based on Combinatory Categorial Grammar that provides flexible constituency, flexible category consistency, and lexical projection of morphosyntactic properties and attachment to grammar in order to establish a morphemic grammar-lexicon. These mechanisms provide enough expressive power in the lexicon to formulate semantically transparent specifications without the necessity to confine structure forming to words and phrases. For instance, bound morphemes as lexical items can have phrasal scope or word scope, independent oftheir attachment characteristics but consistent with their semantics. The controls can be attuned in the lexicon to language-particular properties. The result is a transparent interface of inflectional morphology, syntax, and semantics. We present a computational system and show the application of the framework to English and Turkish.</i> </abstract></header><body><section number="1." title="Introduction"><p>The study presented in this article is concerned with the integrated representation and processing of inflectional morphology, syntax, and semantics in a unified grammar ar­chitecture. An important issue in such integration is mismatches in morphological, syntactic, and semantic bracketings. The problem was first noted in derivational mor­phology. Williams (1981) provided examples from English; the semantic bracketings in (1a-2a) are in conflict with the morphological bracketings in (1b-2b).</p><p>(l) a.</p><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">-ity</doubt><p>b.</p><doubt alpha="100.0" length="5" tooSmall="False" monospace="0.0">hydro</doubt><p><i>hydro electric</i> <i>electric -ity</i> (2) a.</p><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">-ing</doubt><doubt alpha="85.7" length="7" tooSmall="False" monospace="0.0">b.Gödel</doubt><p><i>Godel number</i> <i>number -ing</i></p><p>If the problem were confined to derivational morphology, we could avoid it by making derivational morphology part of the lexicon that does not interact with gram­mar. But this is not the case. Mismatches in morphosyntactic and semantic bracketing</p><p>* Computer Engineering and Cognitive Science, Middle East Technical University, 06531 Ankara, Turkey. E-mail: bozsahin@metu.edu.tr.</p><page local="2" global="146"/><p>also abound. This article addresses such problems and their resolution in a computa­tional system.<footnote anchor="1"/></p><p>Müller (1999, page 401) exemplifies the scope problem in German prefixes. (3a) is in conflict with the bracketing required for the semantics of the conjunct (3b).</p><p>(3) a. <i>Wenn </i>[ <i>Ihr Lust </i>]     <i>und </i>[ <i>noch nichts   anderes vor- ]habt,</i> if      you pleasure and yet   nothing else intend <i>können wir sie    ja vom      Flughafen abholen</i> can     we them PARTICLE from.the airport   pick up 'If you feel like it and have nothing else planned, we can pick them up at the airport.' b. <i>Ihr Lust habt UND noch nichts anderes vorhabt</i></p><p>Similar problems can be observed in Turkish inflectional suffixes. In the coordi­nation of tensed clauses, the tense attaches to the verb of the rightmost conjunct (4a) but applies to all conjuncts (4b). Delayed affixation appears to apply to all nominal inflections (4c-e).</p><p>(4) a. <i>Zorunlu     deprem      sigortasi   </i>[ <i>yururluge girmis </i>] <i>ama</i> mandatory earthquake insurance effect       enter-ASP but [ <i>tam anlamiyla uygulanamamis ]-ti</i> exactly apply-NEG-ASP-TENSE 'Mandatory earthquake insurance had gone into effect, but it had not been enforced properly.' b. <i>yururluge girmis-ti ama tam anlamiyla uygulanamamis-ti</i></p><p>c. <i>Adam-in   </i>[<i>arabave   ev]-i </i>man-GEN car    and house-POSS 'the man's house and car' d. <i>Araba-yi  </i>[<i>adamve   cocuk]-lar-a goster-di-m</i></p><p>Car-ACC man and child-PLU-DAT show-TENSE-PERS1 '(I) showed the car to the men and the children.' e. <i>Araba-yi sen-in     </i>[ <i>dost ve   tanidik]-lar-in-a goster-di-m</i></p><p>Car-ACC you-GEN friend and acq.-PLU-POSS-DAT showed '(I) showed the car to the your friends and acquaintances.'</p><p>1 Our use of the term <b>morphosyntax </b>needs some clarification. Some authors, (e.g., Jackendoff 1997), take it to mean the syntax of words, in contrast to the syntax of phrases. By morphosyntax we mean those aspects of morphology and syntax that collectively contribute to grammatical meaning composition. This is more in line with the inflectional-morphology-is-syntax view. In this respect, we will not address problems related to derivational morphology; its semantics is notoriously noncompositional and does not interact with grammatical meaning. Moreover, without a semantically powerful lexicon such as Pustejovsky's (1991), even the most productive fragment of derivational morphology is hard to deal with (Sehitoglu and Bozsahin 1999).</p><page local="3" global="147"/><p>Phrasal scope of inflection can be seen in subordination and relativization as well. In (5a), the entire nominalized clause marked with the accusative case is the object of <i>want. </i>In (5b), the relative participle applies to the relative clause, which lacks an object. The object's case is governed by the subordinate verb, whose case requirements might differ from that of the matrix verb (5c). As we show later in this section, the coindexing mechanisms in word-based unification accounts of unbounded extraction face a conflict between the local and the nonlocal behavior of the relativized noun, mainly due to applying the relative participle <i>-dig-i </i>to the verbal stem <i>ver </i>rather than the entire relative clause. A lexical entry for <i>-dig-i </i>would resolve the conflict and capture the fact that it applies to nonsubjects uniformly.</p><doubt alpha="60.4" length="53" tooSmall="False" monospace="0.0">(5) a.Can[Ayse'ninkitab-i      oku-ma-si ]-ni iste-di</doubt><p>C.NOM A.-GEN book-ACC read-INF-AGR-ACC want-TENSE 'Can wanted Ayse to read the book.' lit. 'Can wanted Ayse's-reading-the-book.'</p><doubt alpha="63.3" length="60" tooSmall="False" monospace="0.0">b.Ben[Mehmet'in cocug-a/*-u ver]-dig-i      kitab-i oku-du-m</doubt><p>I.NOM M-GEN child-DAT/*ACC give-REL.OP book-ACC read-TENSE-PERS1 'I read the book that Mehmet gave to the child.'</p><doubt alpha="66.5" length="182" tooSmall="False" monospace="0.0">c.Ben[Mehmet'in kitab-i       ver]-dig-i      cocug-u/*-a gor-dii-mI.NOM M-GEN     book-ACC give-REL.OP child-ACC/*DAT see-TENSE-PERS1 'I saw the child to whom Mehmet gave the book.'</doubt><p>The morphological/phrasal scope conflict of affixes is not particular to morpho­logically rich languages. Semantic composition of affixes in morphologically simpler languages poses problems with word (narrow) scope of inflections. For instance, <i>fake trucks </i>needs the semantics (plu(faketruck)), which corresponds to the surface brack­eting [ <i>fake truck]-s, </i>because it denotes the nonempty nonsingleton sets of things that are not trucks but fake trucks (Carpenter 1997). <i>Four trucks, </i>on the other hand, has the semantics (four(plu truck)), which corresponds to <i>four </i>[ <i>truck]-s, </i>because it denotes the subset of nonempty nonsingleton sets of trucks with four members.</p><p>The status of inflectional morphology among theories of grammar is far from settled, but, starting with Chomsky (1970), there seems to be an agreement that deriva­tional morphology is internal to the lexicon. Lexical Functional Grammar (LFG) (Bresnan 1995) and earlier Government and Binding (GB) proposals e.g. (Anderson 1982) consider inflectional morphology to be part of syntax, but it has been del­egated to the lexicon in Head-Driven Phase Structure Grammar (HPSG) (Pollard and Sag 1994, page 35) and in the Minimalist Program (Chomsky 1995, page 195). The representational status of the morpheme is even less clear. Parallel develop­ments in computational studies of HPSG propose lexical rules to model inflectional morphology (Carpenter and Penn 1994). Computational models of LFG (Tomita 1988) and GB (Johnson 1988; Fong 1991), on the other hand, have been noncommittal re­garding inflectional morphology. Finally, morphosyntactic aspects have always been a concern in Categorial Grammar (CG) (e.g., Bach 1983; Carpenter 1992; Dowty 1979; Heylen 1997; Hoeksema 1985; Karttunen 1989; Moortgat 1988b; Whitelock 1988), but the issues of constraining the morphosyntactic derivations and re­solving the apparent mismatches have been relatively untouched in computational studies.</p><p>We briefly look at Phrase Structure Grammars (PSGs), HPSG, and Multimodal CGs (MCGs) to see how word-based alternatives for morphosyntax would deal with the issues raised so far.<page local="4" global="148"/> For convenience, we call a grammar that expects words from the lexicon a <b>lexemic </b>grammar and a grammar that expects morphemes a <b>morphemic </b>grammar. A lexemic PSG provides a lexical interface for inflected words (X0s) such that a regular grammar subcomponent handles lexical insertion at X0.<footnote anchor="2"/> In (4d), the right conjunct <i>cocuk-lar-a </i>is analyzed as <i>N0 </i><i>— </i>cocuk-PLU-DAT (or <i>N0 </i><i>— </i><i>N</i><i>y </i>-DAT, <i>N</i><i>0i </i><i>— </i><i>N0» </i>-PLU, N0" <i>— </i>Stem, as a regular grammar). Assuming a syncategorematic coordination schema, that is, <i>X </i><i>— </i><i>X and X, </i>the N0 in the left and right conjuncts of this example would not be of the same type. Revising the coordination schema such that only the root features coordinate would not be a solution either. In (4e), the relation of possession that is marked on the right conjunct must be carried over to the left conjunct as well. What is required for these examples is that the <i>syntac­tic </i>constituent <i>X </i>in the schema be analyzed as <i>X</i>-PLU(-POSS)-DAT, after <i>N</i>0 <i>and N</i>0 coordination.</p><p>What we need then is not a lexemic but a morphemic organization in which brack­eting of free and bound morphemes is regulated in syntax. The lexicon, of course, must now supply the ingredients of a morphosyntactic calculus. This leads to a the­ory in which semantic composition parallels morphosyntactic combination by virtue of bound morphemes' being able to pick their domains just like words (above <i>X</i>0, if needed). A comparison of English and Turkish in this regard is noteworthy. The English relative pronouns <i>that/whom </i>and the Turkish relative participle <i>-dig-i </i>would have exactly the same semantics when the latter is granted a representational status in the lexicon (see Section 6).</p><p>Furthermore, rule-based PSGs project a rigid notion of surface constituency. Steed-man (2000) argued, however, that syntactic processes such as identical element dele­tion under coordination call for flexible constituency, such as SO (subject-object) in the SVO &amp; SO gapping pattern of English and SV (subject-verb) constituency in the OSV &amp; SV pattern of Turkish. Nontraditional constituents are also needed in specifying semantically transparent constituency of words, affixes, clitics, and phrases.</p><p>Constraint-based PSGs such as HPSG appeal to coindexation and feature passing via unification, rather than movement, to deal with such processes. HPSG also makes the commitment that inflectional morphology is internal to the lexicon, handled either by lexical rules (Pollard and Sag 1994) or by lexical inheritance (Miller and Sag 1997). We look at (5c) to highlight a problem with the stem-and-inflections view. As words en­ter syntax fully inflected, the sign of the verb <i>ver-dig-i </i>in the relative clause (5c) would be as in (6a), in which the SUBCAT list of the verb stem is, as specified in the lexi­cal entry for ver, unsaturated. The participle adds coindexation in MOD<i>\-</i><i> </i><i>•</i><i> </i><i>•</i><i> </i>|INDEX. The HPSG analysis of this example would be as in Figure 1. Although passing the agreement features of the head separately (Sehitoglu 1996) solves the case problem alluded to in (5c), however, structure sharing of the <i>NP dat </i>with the SLASH, INDEX, and CONTENT features of <i>ver-dig-i </i>is needed for semantics (GIVEE), but this conflicts with the head features of the topmost <i>NPacc </i>in the tree. The relative participle as a lexical entry (e.g., (6b)) would resolve the problem with subcategorization because its SUBCAT list is empty (like the relative pronoun <i>that </i>in English), hence there would be no indirect dependence of the nonlocal SLASH feature and the local SUBCAT feature via semantics (CONTENT). Such morphemic alternatives are not considered in HPSG, however, and require a significant revision in the theory. Furthermore, HPSG's lexical</p><p>2 But see Creider, Hankamer, and Wood (1995), which argues that the morphotactics of human languages is not regular but linear context free.</p><page local="5" global="149"/><p>assignment for trace introduces phonologically null elements into the lexicon, which, as we show later, is not necessary.</p><doubt alpha="42.1" length="19" tooSmall="False" monospace="0.0">(6) a. ver-dig-i :=</doubt><doubt alpha="100.0" length="5" tooSmall="False" monospace="0.0">LOCAL</doubt><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">CAT</doubt><doubt alpha="100.0" length="7" tooSmall="False" monospace="0.0">CONTENT</doubt><p>[PERSON <i>thirdl\ </i>HEAD      A        [NUMBER   <i>sing \ </i>CASE <i>dat</i></p><p>SUBCAT <b><i>&lt;SM </i></b>NP[gen], 0 NP[acc]ji] NP[dat]&gt; MOD | MODSYN | LOCAL | CONT | INDEX 0 . 'RELN give" GIVER go</p><p>GIVEE GIFT</p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">0</doubt><doubt alpha="63.3" length="30" tooSmall="False" monospace="0.0">NONLOCAL | TO-BIND | SLASH {□}</doubt><doubt alpha="41.7" length="12" tooSmall="False" monospace="0.0">b. -dig-i :=</doubt><p>CONTENT    <i>npro </i>[INDEX □] NONLOCAL | INHER | SLASH {□}</p><doubt alpha="65.3" length="75" tooSmall="False" monospace="0.0">MCGs (Hepple 1990a; Morrill 1994; Moortgat and Oehrle 1994) allow different</doubt><p>modes of combination in the grammar. In addition to binary modes such as wrapping and commutative operations, unary modalities provide finer control over the cate­gories. Heylen (1997, 1999) uses unary modalities as a way of regulating morphosyn-tactic features such as case, number, and person for economy in lexical assignments. For instance, <i>Frau </i>has the category □casenfem□sgn3pndeclN, which underspecifies it for case and declension. Underspecification is dealt with in the grammar using inclusion postulates (e.g., (7)). The interaction of different modalities is regulated by distribution postulates.</p><p>(7)    Dcaser <i>h X      </i>Dcaser h <i>X</i></p><p>Dnomr <i>h X </i>naccr <i>h X</i></p><p>Lexical assignments to inflected words carry unary modalities: <i>boys </i>has the type □plN, in contrast to ^sgN for boy. Although such regulation of inflectional features successfully mediates, for example, subject-verb agreement or NP-internal case agree­ment (as in German), it is essentially word-based, because type assignments are to inflected forms; morphemes do not carry types. This reliance on word types neces­sitates a lexical rule-based approach to some morphosyntactic processes that create indefinitely long words, such as ki-relativization in Turkish (see Section 6.5). But lexical rules for such processes risk nontermination (Sehitoglu and Bozsahin 1999). Our main point of departure from MCG accounts is the morphemic versus lexemic nature of the lexicon: The morphosyntactic and attachment modalities originate from the lexicon; they are not properties of the grammar (we elaborate more on this later). This paves the way to the morphemic lexicon by licensing type assignments to units smaller than words.</p><p>Besides problems with lexical rules, the automata-theoretic power of MCGs is problematic: Unrestricted use of structural modalities and postulates leads to Tur­ing completeness (Carpenter 1999). Indeed, one of the identifiable fragments of Mul-</p><p>HEAD    <i>noun </i>[acc or datj<page local="6" global="150"/></p><doubt alpha="61.1" length="18" tooSmall="False" monospace="0.0">LOCAL | [SUBCAT &lt;&gt;</doubt><p>timodal languages that is computationally tractable is Combinatory Categorial lan­guages (Kruijff and Baldridge 2000), which we adopt as the basis for the framework presented here. We propose a morphosyntactic Combinatory Categorial Grammar (CCG) in which the grammar and the morphemic lexicon refer to morphosyntactic types rather than syntactic types. We first introduce the syntactic CCG in Section 2. Morphosyntactic CCG is described in Section 3. In Section 4, we look at the compu­tational aspects of the framework. We then show its realization for some aspects of English (Section 5) and Turkish (Section 6).</p><page local="7" global="151"/></section><section title="2. Syntactic Types"><p>CG is a theory of grammar in which the form-meaning relation is conceived as a transparent correspondence between the surface-syntactic and semantic combinatorics (Jacobson 1996). A CCG sign can be represented as a triplet <i>n — a: </i>where <i>n </i>is the prosodic element, <i>a</i><i> </i>is its syntactic type, and <i>\i</i><i> </i>its semantic type. For instance, the lexical assignment for <i>read </i>is (8).<footnote anchor="3"/></p><doubt alpha="57.5" length="40" tooSmall="False" monospace="0.0">(8) read :=read — (S\NP)/NP:Ax.Ay.readxy</doubt></section><section title="Definition (Syntactic Types)"><p><i>• </i>The set of basic syntactic categories: <i>As </i><i>= </i><i>{N,NP,S,S-t,S+t}</i></p><p><i>• </i>The set of complex syntactic categories: <i>Bs</i></p></section><section title="— A s CB s"></section><section title="— If X eB s  and Y eB s , then X\Y and X/Y eB s"><p>The classical Ajdukiewicz/Bar-Hillel (AB) CG is weakly equivalent to Context-Free Grammars (Bar-Hillel, Gaifman, and Shamir 1960). It has function application rules, defined originally in a nondirectional fashion. The directional variants and their associated semantics are as follows:</p><doubt alpha="58.9" length="90" tooSmall="False" monospace="0.0">(9) Forward Application (&gt;):4X/Y:fY:a   ==X:fa Backward Application (&lt;):   Y:aX\Y:f=&gt; X:fa</doubt><doubt alpha="51.9" length="79" tooSmall="False" monospace="0.0">CCG (Steedman 1985, 1987, 1988; Szabolcsi 1983, 1987) is an extended version of</doubt><p>AB that includes function composition (10), substitution, and type raising (11). These extensions make CCGs mildly context sensitive.</p><doubt alpha="58.0" length="112" tooSmall="False" monospace="0.0">(10) Forward Composition(&gt;B):X/Y:fY/Z:g==   X/Z: Xx.f (gx)Backward Composition(&lt;B):Y\Z:gX\Y:f==   X\Z: Xx.f (gx)</doubt><doubt alpha="50.9" length="112" tooSmall="False" monospace="0.0">(11) Forward Type Raising(&gt;T):5X:a   =   T/(T\X): Xf f[a] Backward Type Raising(&lt;T):X:a   ==   T\(T/X): Xf .f[a]</doubt><p>Type raising is an order-preserving operation. For instance, Lambek's (1958) cat­egory <i>S/(S\NP) </i>is a positional encoding of the grammatical subject as a function</p><p>3 We take <i>n </i>to be the surface string for simplicity. We use the "result-first" convention for CG. For instance, transitive verbs of English are written as <i>(S\NP)/NP, </i>which translates to <i>(NP\S)/NP </i>in the "result-on-top" convention.</p><p>4 We omit the prosodic element for ease of exposition. For instance, the complete definition of forward application is s1 <i>— X/Y: </i><i>fs</i><i>2 </i><i>— Y: a </i>=&gt; s1 <i>• s</i><i>2 </i><i>— X: fa, </i>where • is prosodic combination and <i>fa </i>is the application of <i>f </i>to a. The <i>• </i>will play a crucial role in the lexicalization of attachment later on.</p><p>5 The lambda term <i>f [a] </i>denotes internal one-step ^-reduction of <i>f </i>on a. In parsing, we achieve the same effect by partial execution (Pereira and Shieber 1987). <i>\f.f [a] </i>is encoded as (a"F)"F in Prolog, where " is lambda abstraction. We opted for the explicit <i>f[a] </i>notation mainly for ease of exposition (cf. the semantics of raising verbs, relative participles, etc. in Section 6). Moreover, as Pereira and Shieber noted, (a"F)"F is not a lambda term in the strict sense because a is not a variable.</p><page local="8" global="152"/><p>looking for a VP (= <i>S\NP) </i>to the right to become S. The reversal of directionality such as topicalization (e.g., <i>This book, I recommend) </i>requires another schema. The reversal is with respect to the position of the verb, which we shall call <b>contraposition </b>and formulate as in (12).<footnote anchor="6"/> (&lt;XP) is leftward extraction of a right constituent, and (&gt;XP) is rightward extraction of a left constituent, both of which are marked constructions. Directionally insensitive types such as <i>T\(T\X) </i>cause the collapse of directionality in surface grammar (Moortgat 1988a).</p><doubt alpha="55.7" length="61" tooSmall="False" monospace="0.0">(12) Leftward Contraposition (&lt;XP):   X:a==S+t/(S/X):Xf f [a]</doubt><doubt alpha="41.2" length="17" tooSmall="False" monospace="0.0">/(S+t/X):Xf f [a]</doubt><doubt alpha="55.6" length="63" tooSmall="False" monospace="0.0">Rightward Contraposition (&gt;XP):   X:a   =   S_t\(S\X): Xf f [a]</doubt><doubt alpha="45.0" length="20" tooSmall="False" monospace="0.0">S-t\(S_t\X):Xf f [a]</doubt><p>The semantics of contraposition depends on discourse properties as well. We leave this issue aside by (2) noting that it is related to type raising in changing the function-argument relation and (2) categorizing the sentence as S+t (topicalized) or S_t (detopi-calized), which are not discourse equivalent to <i>S. </i>Syntactic characterization as such also helps a discourse component do its work on syntactic derivations.</p><p>CCG's notion of interpretation is represented in the Predicate-Argument Structure (PAS). Its organization is crucial for our purposes, since the bracketing in the PAS is the arbitrator for reconciling the bracketings in morphology and syntax via proper lexical type assignments. It is the sole level of representation in CCG (Steedman 1996, page 89).<footnote anchor="7"/> It is the level at which the conditions on objects of interpretation, such as binding and control, are formulated. For instance, Steedman (1996) defines c-command and binding conditions A, B, and C over the PAS. The PAS also reflects the obliqueness order of the arguments:</p><p><i>Predicate </i><i>...</i><i> Tertiary-Term Secondary-Term Primary-Term</i></p><p>Assuming left associativity for juxtaposition, this representation yields the brack­eting in (13) for the PAS. Having the primary argument as the outermost term is motivated by the observations on binding asymmetries between subjects and comple­ments in many languages (e.g., <i>^Himself saw John, *heself).</i></p><doubt alpha="0.0" length="4" tooSmall="False" monospace="0.0">(13)</doubt><doubt alpha="61.9" length="21" tooSmall="False" monospace="0.0">Predicate   • • •ierm</doubt></section><section title="3. Morphosyntactic Types"><p>A syntactic type such as <i>N </i>does not discriminate morphosyntactically. A finer dis­tinction can be made as singular nouns, plural nouns, case-marked nouns, etc. For</p><p>6 In fact, topicalization of nonperipheral arguments <i>(This book, I would give to Mary) </i>requires that (12) be finitely schematized over valencies, such as S, <i>S/NP, S/PP </i>(Steedman 1985).</p><p>7 We will not elaborate on the theoretical consequences of having this level of representation; see, for instance, Dowty (1991) and Steedman (1996).</p><page local="9" global="153"/><doubt alpha="62.5" length="8" tooSmall="False" monospace="0.0">free (f)</doubt><doubt alpha="60.0" length="10" tooSmall="False" monospace="0.0">n-case (c)</doubt><doubt alpha="60.0" length="20" tooSmall="False" monospace="0.0">n-comp (m) n-poss(o)</doubt><doubt alpha="55.6" length="9" tooSmall="False" monospace="0.0">n-num (n)</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">i</doubt><doubt alpha="60.0" length="10" tooSmall="False" monospace="0.0">n-base (b)</doubt><p>s-person (s)</p><doubt alpha="50.0" length="36" tooSmall="False" monospace="0.0">s-njodal (m) s-^nse (t) s-a^l (a)(g)</doubt><doubt alpha="55.6" length="9" tooSmall="False" monospace="0.0">s-imp (i)</doubt><doubt alpha="60.0" length="10" tooSmall="False" monospace="0.0">s-pass (p)</doubt><doubt alpha="60.0" length="10" tooSmall="False" monospace="0.0">s-caus (u)</doubt><doubt alpha="63.6" length="11" tooSmall="False" monospace="0.0">s-recip (c)</doubt><doubt alpha="62.5" length="24" tooSmall="False" monospace="0.0">n-relbase (l) n-root (r)</doubt><doubt alpha="57.1" length="21" tooSmall="False" monospace="0.0">n-num (n) s-tense (t)</doubt><doubt alpha="57.1" length="21" tooSmall="False" monospace="0.0">n-base (b) s-base (v)</doubt><doubt alpha="33.3" length="3" tooSmall="False" monospace="0.0">(a)</doubt></section><section title="Figure 2"><p>The lattice of diacritics for (a) Turkish and (b) English.</p><doubt alpha="33.3" length="3" tooSmall="False" monospace="0.0">(b)</doubt><p>instance, the set of number-marked nouns can be represented as M N, where M is a morphosyntactic modality ("equals") and <i>n </i>is a diacritic (for number). <i>Books </i>is of type M <i>N, </i>but <i>book </i>is not. The type for <i>books </i>can be obtained morphosyntactically by as­signing <i>-s </i>(-PLU) the functor type Mn\ Mn, where <i>b </i>stands for base. A syntactic type such as <i>N\N </i>overgenerates.</p><p>Another modality, <i>&lt; </i>("up to and equals"), allows wider domains in morphosyntactic typing. For instance, <i>&lt; N </i>represents the set of nouns marked on number or any other diacritic that is lower than number in a partial order (e.g., Figure 2). The inflectional paradigm of a language can be represented as a partial ordering us­ing the modalities.<footnote anchor="8"/> For instance, if the paradigm is Base-Number-Case, we have <i>v( &lt; N) C v( &lt; N) C v( &lt; N), </i>where <i>v(t) </i>is the valuation function from the mor­phosyntactic type <i>t </i>to the set of strings that have the type <i>t</i>. The M modality is more strict than <i>&lt; </i>to provide finer control; although <i>v( &lt; N) C v( &lt; N), v( M N) C v( M N), </i>because a noun can be number marked but not case marked or vice versa. Also, <i>v( M N) C v( &lt; N) </i>for any diacritic <i>i </i>since, for instance, the set of nouns marked up to and including case includes case-marked, number-marked, and unmarked nouns.</p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">n</doubt><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">b n c</doubt><doubt alpha="57.1" length="7" tooSmall="False" monospace="0.0">n c n c</doubt><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">ii</doubt><p>The lattice consistency condition is imposed on the set of diacritics to ensure category unity.<footnote anchor="9"/> In other words, the syntactic type <i>X </i>can be viewed as an abbreviation for the morphosyntactic type <i>&lt; X </i>where <i>T </i>is the universal upper bound. It is the</p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">T</doubt><p>8 See Heylen (1997) on use of unary modalities for a similar purpose in lexemic MCG.</p><p>9 In a lattice <i>L, x &lt; y </i>(morphosyntactically, <i>x &lt; y) </i>is equivalent to the consistency properties <i>x A y = x </i>and <i>x </i>V <i>y = y. </i>We use the join operator for this check, thus it suffices to have a join semilattice.</p><page local="10" global="154"/><p>most underspecified category of <i>X </i>which subsumes all morphosyntactically decorated versions of X. Figure 2 shows the lattice for English and Turkish.</p></section><section title="Definition (Morphosyntactic Types)"><p><i>•</i><i> </i><i>V</i><i> </i><i>=</i><i> </i>finite set of diacritics</p><doubt alpha="58.6" length="29" tooSmall="False" monospace="0.0">•Join semilatticeL= (V, &lt;, =)</doubt><p><i>• </i>The set of basic morphosyntactic types: <i>Ams.</i></p><p><i>— &lt; X </i><i>eAms </i>and M <i>X </i><i>eAms </i>if <i>i </i><i>eV</i><i> </i>and <i>X </i><i>eA</i><i> </i>(see definition of syntactic types for As)</p><p>— (M corresponds to lattice condition =)</p><p>— (&lt; corresponds to lattice condition &lt;)</p><p><i>• </i>The set of complex morphosyntactic types: <i>Bms</i></p></section><section title="— If X eB ms  and Y eB ms , then X\Y and X/Y eB ms"><p>For instance, the infinitive marker <i>-ma </i>in (14a) can be lexically specified to look for untensed VPs—functions onto <i>&lt; S</i>—to yield a complex noun base (14b), which, as a consequence of nominalization (result type <i>N), </i>receives case to become an argument of the matrix verb. The adjective in <i>fake trucks </i>can be restricted to modify unmarked Ns to get the bracketing <i>[fake truck]-s </i>(14c).</p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">a</doubt><p>(14) a. <i>Mehmet [[kitab-i oku]-ma]-yi istiyor </i>M.NOM book-ACC read-INF-ACC wants 'Mehmet wants to read the book.'</p><doubt alpha="38.5" length="39" tooSmall="False" monospace="0.0">b. -INF :=ma — &lt; N\( &lt; S\ &lt; NPmm):Xf .f</doubt><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">bb</doubt><doubt alpha="52.9" length="34" tooSmall="False" monospace="0.0">c. fake :=fake — &lt; N/ &lt; N:Xx.fakex</doubt><p>Different attachment characteristics of words, affixes, and clitics must be factored into the prosodic domain as a counterpart of refining the morphosyntactic description. In Montague Grammar, every syntactic rule is associated with a certain mode of at­tachment, and this tradition is followed in MCG; attachment types are related with the slash (e.g., <i>/</i><i>w </i>for wrapping), which is a grammatical modality.<footnote anchor="10"/> In the present frame­work, however, attachment is projected from the lexicon to the grammar as a prosodic property of the lexical items.<footnote anchor="11"/> The grammar is unimodal in the sense that <i>/ </i>and <i>\ </i>simply indicate the function-argument distinction in adjacent prosodic elements. The lexical projection of attachment further complements the notion of morphemic lexicon so that bound morphemes are no longer parasitic on words but have an independent</p><p>10 See Dowty (1996) and Steedman (1996) for a discussion of bringing nonconcatenative combination into grammar.</p><p>11 There is a precedent of associating attachment characteristics with the prosodic element rather than the slash in CG (Hoeksema and Janda 1988). In Hoeksema and Janda's notation, arguments can be constrained on phonological properties and attachment. For instance, the English article <i>a </i>has its <i>NP/N </i>category spelled out as &lt;/CX/N,NP,Pref&gt;, indicating a consonantal first segment for the noun argument and concatenation to the left.</p><page local="11" global="155"/><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">a</doubt><table caption="Table l"></table><p>Attachment properties of some Turkish morphemes.</p><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">sb b</doubt><doubt alpha="53.6" length="28" tooSmall="False" monospace="0.0">uzun (long) :=ouzun-&lt; N/ &lt; N</doubt><doubt alpha="56.4" length="39" tooSmall="False" monospace="0.0">oku (read) :=ooku-&lt; S\ &lt; NPnom\ &lt; NPacc</doubt><doubt alpha="60.0" length="15" tooSmall="False" monospace="0.0">-EMPH :=ode-X\X</doubt><doubt alpha="42.1" length="19" tooSmall="False" monospace="0.0">-LOC :=ode-&lt; N\ &lt; N</doubt><p><i>uzun yol </i>long road 'long road' <i>adam    kitab-i oku-du</i></p><p>man book-ACC read-TENSE 'the man read the book.' <i>Ben-de    kalem var</i></p><doubt alpha="63.2" length="19" tooSmall="False" monospace="0.0">Ben    de yaz-ar-im</doubt><doubt alpha="62.8" length="43" tooSmall="False" monospace="0.0">I       too write-TENSE-PERS 'I write too.'</doubt><doubt alpha="61.8" length="34" tooSmall="False" monospace="0.0">I-LOC    pen exist'I have a pen.'_</doubt><p>representational status of their own. We write <i>o</i><i>s </i>to denote the attachment modality <i>i </i>(affixation, syntactic concatenation, cliticization) of the prosodic element s.</p><p>Table 1 shows some lexical assignments for Turkish (e.g., the sign <i>o</i><i>s </i><i>— </i><i>X\Y: ^ </i>characterizes a suffix). The morphosyntactic calculus of CCG is defined with the ad­dition of morphosyntactic types and attachment modalities as follows (similarly, for other combinatory rules):</p><doubt alpha="62.1" length="29" tooSmall="False" monospace="0.0">(15) Forward Application (&gt;):</doubt><p><i>O </i><i>si </i><i>- </i><i>X/ </i>si Y:f</p><doubt alpha="40.0" length="10" tooSmall="False" monospace="0.0">Os?-??Y: a</doubt><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">kk</doubt><doubt alpha="44.4" length="9" tooSmall="False" monospace="0.0">O(si•s?)-</doubt><p>if <i>a?</i>□iai in lattice L, for:</p><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">X: fa</doubt><doubt alpha="40.9" length="44" tooSmall="False" monospace="0.0">□i,□?g{m,&lt; },ai,a?EDin L,i, j, k E{a, s, c},</doubt><p>Forward Composition (&gt;B):</p><doubt alpha="60.9" length="23" tooSmall="False" monospace="0.0">Osi-X/siY:fOs?-s?Y/Z: g</doubt><doubt alpha="43.8" length="16" tooSmall="False" monospace="0.0">■ X/Z: Xx.f (gx)</doubt><doubt alpha="50.0" length="12" tooSmall="False" monospace="0.0">kko(si • s?)</doubt><p>if <i>a?^iai </i>in lattice L, for:</p><doubt alpha="40.9" length="44" tooSmall="False" monospace="0.0">□i,□?e{M,&lt; },ai,a?EDinL, i, j, k E{a, s, c},</doubt><p>The main functor's argument specification (□1 of □! <i>Y </i>in (15)) determines the lattice condition in derivations.<footnote anchor="12"/> Hence the morphosyntactic decoration in lexical as­signments propagates its lattice condition to grammar as in a2^1a1 (cf. Heylen [1997], in which the grammar rule imposes a <i>fixed </i>partial order, e.g., <i>X/Y </i>combines with <i>Z </i>if</p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&gt;</doubt><p>12 This coincides with Steedman's (1991b) observation that directionality of the main functor's slash is also a property of the same argument. The main functor is the one whose result type determines the overall result type (i.e., <i>X/Y </i>in (15)).</p><page local="12" global="156"/><p>Z &lt; Y). This is another prerequisite that must be fulfilled for the morphemic lexicon to project the lexical specification of scope.</p><p>The grammar is not fixed on the attachment modality either (unlike a lexemic grammar, which is fixed on combination of words). Hence another requirement is the propagation of attachment to grammar. This is facilitated by the lexical types o <i>s </i><i>— </i><i>a:</i></p><doubt alpha="33.3" length="9" tooSmall="False" monospace="0.0">i     j k</doubt><p>where <i>m </i>is an attachment type. The attachment calculus <i>o o\-a o </i>in (15), which reads "attachment types <i>i </i>and <i>j </i>yield type k," relates attachment to prosodic combination in the grammar.<footnote anchor="13"/> It can be attuned to language-particular properties.</p><p>We can specify some prosodic properties of the attachment calculus for Turkish as follows (x indicates stress on the prosodic element x): syntactic concatenation   <i>x </i>• <i>y   </i>= <i>xy </i>affixation   <i>x »y   </i>= <i>xy </i>cliticization   <i>x »y   </i>= xy</p></section><section title="4. Morpheme-Based Parsing"><p>To contrast lexemic and morphemic processing, consider the Turkish example in (16a). We show some stages of the derivation to highlight prosodic combination (•) as well. Every item in the top row is a lexical entry. Allomorphs, such as that of tense, have the same category in the lexicon (16b). Vowel harmony, voicing, and other phonological restrictions are handled as constraints on the prosodic element. Constraint checking can be switched off during parsing to obtain purely morphosyntactic derivations.</p><doubt alpha="48.0" length="25" tooSmall="False" monospace="0.0">(16)     a.Can Ayse C.NOM</doubt><p>nin kitab -GEN(agr) book oku read</p><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">-ACC</doubt><doubt alpha="45.8" length="24" tooSmall="False" monospace="0.0">3NPgm\&lt; N &lt; N &lt; Nacc\&lt; N</doubt><p>masi -SUB1G</p><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">ni</doubt><p>iste want <i>iste a di—</i> <i>as</i><i> </i><i>a</i><i> </i><i>s</i><i> </i><i>a</i><i> </i><i>a</i><i> </i><i>t</i><i> </i><i>f</i><i> </i><i>t</i><i> </i><i>f</i><i> </i><i>f</i> <i>s</i><i> </i><i>as</i><i> </i><i>a</i><i> </i><i>s</i><i> </i><i>a</i><i> </i><i>as</i><i> </i><i>a</i></p><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">di</doubt><doubt alpha="83.3" length="6" tooSmall="False" monospace="0.0">-TENSE</doubt><doubt alpha="48.1" length="79" tooSmall="False" monospace="0.0">3S\f&lt;NPnom &lt;N\f&lt;NPgen &lt;N\&lt;NTV( &lt; S\&lt; NP) \fNPacc      \( &lt;S\fNPnom) \( &lt; S\fNP)</doubt><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">asvf</doubt><doubt alpha="48.4" length="31" tooSmall="False" monospace="0.0">kitab • i • oku — &lt; S \ &lt; NPnom</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">t f</doubt><doubt alpha="63.6" length="11" tooSmall="False" monospace="0.0">a S\ &lt;NPnom</doubt><doubt alpha="55.6" length="9" tooSmall="False" monospace="0.0">\ &lt; NPacc</doubt><doubt alpha="33.3" length="15" tooSmall="False" monospace="0.0">a  s       a of</doubt><doubt alpha="51.4" length="35" tooSmall="False" monospace="0.0">(kitab • i • oku) • masi—&lt; N\ &lt;NPgm</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">N</doubt><doubt alpha="47.0" length="83" tooSmall="False" monospace="0.0">((ayse • nin) • (kitab • i • oku) • masi) • ni— (&lt; S\ &lt;NPnom)/(&lt; S\ &lt;NPnom \&lt;NPacc)</doubt><doubt alpha="48.4" length="64" tooSmall="False" monospace="0.0">can • (ayse • nin • kitab • i • oku • masi • ni) • (iste • di) —</doubt><p>: want(read book ayse)can 'Can wanted Ay§e to read the book.'</p><doubt alpha="45.6" length="68" tooSmall="False" monospace="0.0">b. -TENSE :=◦ di\di\du\dü\ti\ti\tu\tü - (&lt; S\ &lt;NP)\( &lt; S\ &lt;NP): Xf.f</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">S</doubt><p>13 Clearly, much more needs to be done to incorporate intonation into the system. The motive for attachment types is to provide the representational ingredients on behalf of the morphemic lexicon. As one reviewer noted, CCG formulation of the syntax-phonology interface moved from autonomous prosodic types (Steedman 1991a) to syntax-directed prosodic features (Steedman 2000b). The present proposal for attachment modality is computationally compatible with both accounts: Combinatory prosody can match prosodic types with morphosyntactic types. Prosodic features are associated with the basic categories of a syntactic type in the latter formulation, hence they become part of the featural inference that goes along with the matching of categories in the application of combinatory rules.</p><page local="13" global="157"/><p>The lexicalization of attachment modality helps to determine the prosodic domain of postconditions. For instance, for Turkish, vowel harmony does not apply over word boundaries, which can be enforced by applying it when the modality is o and <i>o</i>, but not o. Voicing applies to o and o, but not to <i>o</i>.</p><p>The basic categories <i>N, NP, S, </i>S+t, and S_t carry agreement features of fixed arity (e.g., tense and person for <i>S, </i>S+t, and S_t, and case, number, person, and gender for <i>N </i>and NP). Positional encoding of such information as in Pulman (1996) allows efficient term unification for the propagation of these features.<footnote anchor="14"/> Term unification also handles the matching of complex categories in the CCG schema. For instance, <i>n\ </i><i>A/(</i>S2<i>B\ </i>S3 <i>C)</i> combines with ^<i>B\ </i>^5<i>C </i>via (&gt;) for B, <i>C </i><i>eA</i><i>s</i><i>,if</i><i> f</i><i>h</i><i> </i>^2 «2, /?3 ^3 «3 (□/ <i>&amp;{</i><i>&lt;</i>, X}). Apart from the matching of syntactic types and agreement, uniication does no linguistic work in this framework, in contrast to structure-sharing in HPSG and slash passing in Unification CG (Calder, Klein, and Zeevat 1988).</p><p>CCG is worst-case polynomially parsable (Vijay-Shanker and Weir 1993). This re­sult depends on the inite schematization of type raising and bounded composition. Assuming a maximum valence of four in the lexicon (Steedman 2000a), composition (B") is bounded by <i>n &lt; </i>3. The refinement of the type raising schema (11) for finite schematization is shown in (17).</p><doubt alpha="56.6" length="136" tooSmall="False" monospace="0.0">(17) a. Revised Forward Type Raising(&gt;T):NP:a   ==   T/(T\NP):Xf .f [a]b. Revised Backward Type Raising(&lt;T):NP:a   ==   T\(T/NP):Xf f[a]</doubt><p><i>T e{S,S\NP, S\NP\NP, </i><i>S</i><i>\</i><i>NP</i><i>\</i><i>Np</i><i>\</i><i>np</i><i>}</i><i>.</i></p><p>The inite schematization of type raising suggests that it can be delegated to the lexicon, for example, by a lexical rule that value-raises all functions onto <i>NP </i>to their type-raised variety, such as <i>NP/N </i>to <i>(S/(S\NP))/N. </i>But this move presupposes the presence of such functions in the lexicon, that is, a language with determiners. To be transparent with respect to the lexicon, we make type raising and other unary schema (contraposition) available in the grammar. Since both are inite schemas in the revised formulation, the complexity result of Vijay-Shanker and Weir still holds. Checking the lattice condition as in (15) incurs a constant factor with a inite lattice.</p><p>Type raising and composition cause the so-called spurious-ambiguity problem (Wittenburg 1987): Multiple analyses of semantically equivalent derivations are pos­sible in parsing. This is shown to be desirable from the perspective of prosody; for example, different bracketings are needed to match intonational phrasing with syn­tactic structure (Steedman 1991). From the parsing perspective, the redundancy of analyses can be controlled by <i>(1) </i>grammar rewriting (Wittenburg 1987), <i>(2) </i>checking the chart for PAS equivalence (Karttunen 1989; Komagata 1997), <i>(3) </i>making the proces­sor parsimonious on using long-distance compositions (Pareschi and Steedman 1987), or <i>(4) </i>parsing into normal forms (Eisner 1996; Hepple 1990b; Hepple and Morrill 1989; Konig 1989; Morrill 1999). We adopt Eisner's method, which eliminates chains of com­positions in 0(1) time via tags in the grammar, before derivations are licensed. There is a switch that can be turned off during parsing to obtain all surface bracketings.</p><p>14 Mediating agreement via unification, type subsumption, or set-valued indeterminacy has important consequences on underspecification, the domain of agreement, and the notion of "like categories" in coordination (see Johnson and Bayer 1995; Dalrymple and Kaplan 2000; Wechsler and Zlatic 2000). Rather than providing an elaborate agreement system, we note that Pulman's techniques provide the mechanism for implementing agreement as atomic uniication, subsumption hierarchies represented as lattices, or set-valued features. The categorial ingredient of phrase-internal agreement can be provided by endotypic functors when necessary (see Sections 5 and 6).</p><page local="14" global="158"/><p>There is also a switch for checking the PAS equivalence, with the warning that the equivalence of two lambda expressions is undecidable.</p><p>The parser is an adaptation of the Cocke-Kasami-Younger (CKY) algorithm (Aho and Ullman 1972, page 315), modified to handle unary rules as well: In the <i>kth </i>iteration of the CKY algorithm to build constituents of length k, the unary rules apply to the CKY table entries T[<i>a(,ai+k</i>],<i>i = </i>0,1,<i>n </i><i>- </i>k; that is, k-length results of binary rules are input to potential unary constituents of length <i>k. </i>In practice, this allows, for instance, a nominalized clause to be type-raised after it is derived as a category of type N. The remaining combinatory schema is already in Chomsky Normal Form, as required by CKY. The finite schematization of CCG rules and constant costs incurred by the normal form and lattice checking provide a straightforward extension of CKY-style context-free parsing for CCG. Komagata (1997) claims that the average complexity of CCG parsing is 0(n<footnote anchor="3"/>) even without the finite schematization of type raising (based on the parsing of 22 sentences consisting of around 20 words, with a lexicon of 200 entries and no derivation of semantics in the grammar; a morphological analyzer provided five analyses per second to the parser). Statistical techniques developed for lexicalized grammars (e.g., Collins 1997), readily apply to CCG to improve the average parsing performance in large-scale practical applications (Hockenmaier, Bierner, and Baldridge 2000). Both Collins and Hockenmeier, Bierner, and Baldridge used section 02-21 of the Wall Street Journal Corpus of Penn Treebank for training, which contains 40,886 words (70,151 lexical entries). A recent initiative (Oflazer, et al. 2001) aims to provide such a resource of around one million words for Turkish. It encodes in the Treebank surface-syntactic relations and the morphological breakdown of words. The latter is invaluable for training morphemic grammars and lexicons.</p><p>In morpheme-based parsing, lattice conditions help eliminate the permutation problem in endotypic categories. Such categories are typical of inflectional morphemes. For instance, assume that three morphemes m1, <i>m2, </i>and <i>m3 </i>have endotypic categories (say <i>N\N), </i>that they can appear only in this order, and that they are all optional. The because <i>k</i>0 <i>&lt; k</i>3</p><doubt alpha="66.4" length="143" tooSmall="False" monospace="0.0">categorization ofmias&lt; N\ &lt; Nsuch thatk[&lt;Kifor all i, andk'-_1&lt;Kjforj =1,2,3 allows omissions (18a-b) but rules out the permutations (18c-d).15</doubt><doubt alpha="44.4" length="18" tooSmall="False" monospace="0.0">(18) a.stem m1m2m3</doubt><doubt alpha="57.1" length="14" tooSmall="False" monospace="0.0">k0K1kiK2k2K3k3</doubt><doubt alpha="41.2" length="17" tooSmall="False" monospace="0.0">&lt;N&lt;N\&lt;N&lt;N\&lt;N&lt;N\&lt;N</doubt><doubt alpha="57.9" length="19" tooSmall="False" monospace="0.0">k1 &lt;&lt;Nbecausek0&lt; k1</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&lt;</doubt><doubt alpha="66.7" length="15" tooSmall="False" monospace="0.0">&lt;Nbecausek1&lt; k2</doubt><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">k3</doubt><doubt alpha="66.7" length="15" tooSmall="False" monospace="0.0">&lt;Nbecausek2&lt; k3</doubt><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">m3</doubt><doubt alpha="83.3" length="6" tooSmall="False" monospace="0.0">b.stem</doubt><doubt alpha="33.3" length="3" tooSmall="False" monospace="0.0">&lt; N</doubt><p>15 Three asterisks in the line indicate that the derivation is not licensed.</p><page local="15" global="159"/><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">m3</doubt><doubt alpha="71.4" length="7" tooSmall="False" monospace="0.0">c.*stem</doubt><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">m2</doubt><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">mi</doubt><doubt alpha="61.5" length="13" tooSmall="False" monospace="0.0">becausek0&lt; «2</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&lt;</doubt><doubt alpha="16.7" length="6" tooSmall="False" monospace="0.0">«2&lt; «i</doubt><doubt alpha="47.6" length="21" tooSmall="False" monospace="0.0">becausek1&lt;«[ &lt; k2&lt;k'2</doubt><doubt alpha="71.4" length="7" tooSmall="False" monospace="0.0">d.*stem</doubt><p>because k0 <i>&lt; k1</i> because k1 <i>&lt; k3</i></p><doubt alpha="0.0" length="5" tooSmall="False" monospace="0.0">«3&lt;«2</doubt><doubt alpha="50.0" length="18" tooSmall="False" monospace="0.0">becausek2&lt;k'&lt;«3&lt;«3</doubt><p>The lattice and its consistency condition on derivability offer varying degrees of flexibility. A lattice with only <i>T </i>and the relation <i>&lt; </i>would undo all the effects of parameterization; it would be equivalent to a syntactic grammar in which every basic category <i>X </i>stands for <i>&lt; X.</i><i> </i>To enforce a completely lexemic syntax, a lattice with <i>T </i>and <i>free </i>would define all functional categories as functions over free forms.</p><p>Morphological processing seems inevitable for languages like Turkish, and mor­phological and lexical ambiguity such as that shown in (19) must be passed on to syntax irrespective of how inflectional morphology is processed (isolated from or in­tegrated with syntax). For the verbal paradigm, Jurafsky and Martin (2000) reports Oflazer's estimation that inflectional suffixes alone create around 40,000 word forms per root. In the nominal paradigm, iterative processes such as fo'-relativization (Sec­tion 6.5) can create millions of word forms per nominal root (Hankamer 1989).</p><doubt alpha="58.8" length="17" tooSmall="False" monospace="0.0">(19) a.kazma-lari</doubt><p>pickaxe-POSS3p their pickaxe</p><p>b. <i>kazma-lar-i </i>pickaxe-PLU-POSS3p their pickaxes c. <i>kazma-lar-i</i> pickaxe-PLU-POSS3s 'his/her pickaxes'</p><p>d. <i>kaz-ma-lari </i>dig-SUB-AGR their digging</p><p>The questions that need to be answered related to processing are <i>(1) </i>What should a (super)linear fragment of processing for morphology deliver to (morpho)syntax? and <i>(2) </i>Is the syntax lexemic or morphemic? The problems with lexemic syntax, which stem from mismatches with semantics, were highlighted in the introduction. In other<page local="16" global="160"/></p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">t</doubt><doubt alpha="81.8" length="11" tooSmall="True" monospace="0.0">kaz-SUB-AGR</doubt><doubt alpha="80.0" length="5" tooSmall="False" monospace="0.0">PF-LF</doubt><doubt alpha="100.0" length="5" tooSmall="True" monospace="0.0">pairs</doubt><p>(a) Lexemic syntax and lexicon (b) Morphemic syntax and split lexicon</p><doubt alpha="62.5" length="8" tooSmall="False" monospace="0.0">-ma -lar</doubt><doubt alpha="80.0" length="5" tooSmall="False" monospace="0.0">-lari</doubt><p>Phonological Form (PF) Logical Form (LF)</p><p>(c) Morphemic syntax and lexicon</p></section><section title="Figure 3"><p>The processing of <i>kazmalari </i>in three different architectures (see Example (19) for glosses).</p><p>affix lexicon</p><p>words, a lexemic grammar (e.g., Figure 3a) is computationally nontransparent when interpretation is a component of an NLP system.</p><p>Regarding the first question, let us consider two architectures from the perspective of the lexicon for the purpose of morphology, morphemic syntax, and semantics inter­face. The architecture in Figure 3b incorporates the current proposal as an interpretive front end to a morphological analyzer such as Oflazer's (1994), which delivers the anal­yses of words as a stream of morphemes out of which the bound morphemes have to be matched with their semantics from the affix lexicon to be interpretable in grammar. The advantage of this model is its efficiency; morphological parsing of words is—in principle—linear context free; hence, finite-state techniques and their computational advantages readily apply. But the uninterpretable surface forms of bound morphemes must match with those of the affix lexicon, and this is not necessarily a one-to-one mapping because of multiple lexical assignments for capturing syntactic-semantic dis­tinctions (e.g., dative case as a direct object, indirect object, or adjunct marker or <i>-i </i>as a possessive and/or compound marker). Surface form-semantics pairing is not a trivial task, particularly in the case of lexically composite affixes, which require se­mantic composition as well as tokenization. The matching process needs to be aware of all the syntactic contexts in which certain affix sequences act as a unit, for exam­ple, relative participles and agreement markers <i>(-dig-i </i>relative participle as -OP-POSS or -OP-AGR), possessive and compound markers, etc., for Turkish. The factorization of syntactic issues into a morphological analyzer would also make the separate mor­phological component nonmodular or expand its number of states to factor in these concerns (e.g., treating the -OP-POSS sequence as a state different from -OP followed<page local="17" global="161"/></p><table class="main" frame="box" rules="all" border="1" regular="True"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>root lexicon</p></td><td class="cell"><p>kazma kaz</p></td><td class="cell"><p>morphological parsing</p></td><td class="cell"><p>kazma-POSS3p</p><p>kazma-PLU-POSSSp kazma-PLU-POSS3s</p></td><td class="cell"><p>syntax and interpretation</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><table class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p>morpheme-</p><p>semantics</p><p>matching</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>root lexicon</p></td><td class="cell"><p>kazma kaz</p></td><td class="cell"><p>morphological parsing</p></td><td class="cell"><p>kazma-POSS3p</p><p>kazma-PLU-POSS3p kazma-PLU-POSS3s</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><table class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>root and affix lexicon</p></td><td class="cell"><p></p></td><td class="cell"><p>syntax and interpretation</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>kaz</p><p>kazma</p><p>-lar</p><p>-lari -ma</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table></section><section title="Table 2"><p>Parsing performance.</p><p><i>Note: </i>CPU times are for a Sun UltraSparc-4 running SICStus Prolog; lexical items include stems and inflec­tional affixes.</p><p>by -POSS, in which -POSS is not interpreted with the semantics of possession but that of agreement marking). Not knowing how many of the syntactic distinctions are han­dled by the morphological analyzer, a subsequent interpreter may need to reconsult the grammar if scoping problems arise.</p><p>The architecture in Figure 3c describes the current implementation of the pro­posal. Bound morphemes are fed to the parser along with their interpretation. This model is preferred over that presented in Figure 3b for its simplicity in design and extendibility.<footnote anchor="16"/> The price is lesser efficiency due to context-free processing of inflec­tional morphology. By one estimate (Oflazer, Gocmen, and Bozsahin 1994), Turkish has 59 inflectional morphemes out of a total of 166 bound morphemes, and Oflazer (personal communication) notes that the average number of bound morphemes per word in unrestricted corpora is around 2.8, including derivational affixes. In a news corpus of 850,000 words, the average number of inflections per word is less than two (Oflazer et al. 2001). This is tolerable for sentences of moderate length in terms of the extra burden it puts on the context-free parser. Table 2 shows the results of our tests with a Prolog implementation of the system on different kinds of constructions. The test cases included 10 lexical items on average, with an average parsing time of 0.32 seconds per sentence. A relatively long sentence (12 words, 21 morphemes) took 2.9 seconds to parse. The longest sentence (20 words, 37 morphemes) took 40 seconds. The lexicon for the experiment included 700 entries; 139 were free morphemes and 561 were bound morphemes compiled out of 105 allomorphic representations (includ­ing all the ambiguous interpretations of bound morphemes and the results of lexical rules). For a rough comparison with an existing NLP system with no disambiguation</p><p>16 The morphological analyzer would be in no better position to handle morpheme-semantics pairing if the architecture in Figure 3b were implemented with an integrated lexicon of roots and affixes. For instance, -POSS would still require distinct states because of the difference in the semantics of possession and agreement marking coming from the lexicon.</p><table class="main" frame="box" rules="all" border="0" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p><b>Average number</b></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p><b>Sample text</b></p></td><td class="cell"><p><b>Number of items</b></p></td><td class="cell"><p><b>of parses/grammatical</b></p></td><td class="cell"><p><b>Average CPU time</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p><b>type</b></p></td><td class="cell"><p></p></td><td class="cell"><p><b>in text</b></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p><b>input</b></p></td><td class="cell"><p><b>per test (milliseconds)</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p><b>Normal</b></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p><b>Normal</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p><b>PAS</b></p></td><td class="cell"><p><b>form</b></p></td><td class="cell"><p></p></td><td class="cell"><p><b>PAS</b></p></td><td class="cell"><p><b>form</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p><b>tests</b></p></td><td class="cell"><p><b>words</b></p></td><td class="cell"><p><b>morphs</b></p></td><td class="cell"><p><b>check</b></p></td><td class="cell"><p><b>parse</b></p></td><td class="cell"><p><b>Unrestr.</b></p></td><td class="cell"><p><b>check</b></p></td><td class="cell"><p><b>parse</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Word order and</p></td><td class="cell"><p>SS</p></td><td class="cell"><p>216</p></td><td class="cell"><p>SS4</p></td><td class="cell"><p>1.26</p></td><td class="cell"><p>S.6S</p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p>S0</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>case</p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Subordination</p></td><td class="cell"><p>14</p></td><td class="cell"><p>70</p></td><td class="cell"><p>1S7</p></td><td class="cell"><p>S.00</p></td><td class="cell"><p>S.09</p></td><td class="cell"><p>267</p></td><td class="cell"><p>270</p></td><td class="cell"><p>1S0</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Relativization</p></td><td class="cell"><p>2S</p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p>2.04</p></td><td class="cell"><p>2.S2</p></td><td class="cell"><p>796</p></td><td class="cell"><p></p></td><td class="cell"><p>266</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Control verbs</p></td><td class="cell"><p>SS</p></td><td class="cell"><p>147</p></td><td class="cell"><p>291</p></td><td class="cell"><p>1.42</p></td><td class="cell"><p>S.S4</p></td><td class="cell"><p>166</p></td><td class="cell"><p>16S</p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Possessives and</p></td><td class="cell"><p>26</p></td><td class="cell"><p>109</p></td><td class="cell"><p>200</p></td><td class="cell"><p>1.2S</p></td><td class="cell"><p>2.47</p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p>9S</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>compounds</p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>Adjuncts</p></td><td class="cell"><p>14</p></td><td class="cell"><p>S7</p></td><td class="cell"><p>100</p></td><td class="cell"><p>1.12</p></td><td class="cell"><p>4.S7</p></td><td class="cell"><p>S9</p></td><td class="cell"><p>SS</p></td><td class="cell"><p>72</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p><i>-ki </i>relatives</p></td><td class="cell"><p>24</p></td><td class="cell"><p>66</p></td><td class="cell"><p>179</p></td><td class="cell"><p>1.07</p></td><td class="cell"><p>1.S4</p></td><td class="cell"><p>S6</p></td><td class="cell"><p>S6</p></td><td class="cell"><p>SS</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><page local="18" global="162"/><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&lt;</doubt><p>aids, GungordU and Oflazer (1995) reported average parsing times of around 10 sec­onds per sentence for a lexicon of 24,000 free morphemes, and their morphological analyzer delivered around two analyses per second to a lexemic grammar. Oflazer's later (1996) morphological analyzer contained an abstract morphotactic component of around 50 states for inflections, which resulted in compilation to 30,000 states and 100,000 transitions when the morphophonemic rules were added to the system.</p><p>In conclusion, we note that the current proposal for a morphemic lexicon and grammar is compatible with both a separate morphological component (Figure 3b) and syntax-integrated inflectional morphology (Figure 3c). The architecture in Figure 3b may in fact be more suitable for inflecting languages (e.g., Russian) in which the surface forms of bound morphemes are difficult to isolate (e.g., <i>meste, </i>locative singular of <i>mesto) </i>but can be delivered as a sequence of morpheme labels by a morphological analyzer (e.g. mesto-SING-LOC) to be matched with the lexical type assignments to -SING and -LOC for grammatical interpretation.</p><p>It might be argued that in computational models of the type in Figure 3b, the lattice is not necessary, because the morphological analyzer embodies the tactical component. But not only tactical problems (cf. Example (18) and its discussion) but also transparent scoping in syntax and semantics is regulated by the use of lattice in type assignments, and that is our main concern. We show examples of such cases in the remainder of the article. Thus the nonredundant role of the lattice decouples the morphemic grammar-lexicon from the kind of morphological analysis performed in the back end.</p></section><section title="5. Case Study: The English Plural"><p>In this section, we present a morphosyntactic treatment of the English plural mor­pheme. The lattice for English is shown in Figure 2b. We follow Carpenter (1997) in categorizing numerical modifiers and intersective adjectives as plural noun modifiers: <i>four boys </i>is interpreted as four(plu boy) and <i>green boxes </i>as green(plu box). This bracketing reflects the "set of sets" interpretation of the plural noun; four(plu boy) denotes the set of nonempty nonsingleton sets of boys with four members. The type assignments in (20) correctly interpret the interaction of the plural and these modifiers (cf. 21a-b). The endotypic category of the plural also allows phrase-internal number agreement for languages that require it; the agreement can be regulated over the category <i>N </i>before the specifier is applied to the noun group to obtain NP.</p><doubt alpha="60.0" length="20" tooSmall="False" monospace="0.0">(20) -PLU four green</doubt><doubt alpha="43.5" length="23" tooSmall="False" monospace="0.0">:=os-&lt; N\ &lt; N: Xx.plu x</doubt><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">s n n</doubt><doubt alpha="62.5" length="24" tooSmall="False" monospace="0.0">:=ofour-&lt; N/MN: Ax.fourx</doubt><doubt alpha="57.1" length="28" tooSmall="False" monospace="0.0">:=ogreen-&lt; N/ &lt; N: Xx.greenx</doubt><doubt alpha="36.0" length="25" tooSmall="False" monospace="0.0">(21) a.four        boy -s</doubt><doubt alpha="71.4" length="7" tooSmall="False" monospace="0.0">nnb n b</doubt><doubt alpha="24.0" length="25" tooSmall="False" monospace="0.0">&lt; N/MN    &lt; N    &lt; N\ &lt; N</doubt><doubt alpha="63.6" length="11" tooSmall="False" monospace="0.0">&lt; N:plu boy</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&gt;</doubt><doubt alpha="61.1" length="18" tooSmall="False" monospace="0.0">&lt; N: four(plu boy)</doubt><page local="19" global="163"/><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">n</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&lt;</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&gt;</doubt><p>b. <i>four boy -s</i></p><doubt alpha="0.0" length="7" tooSmall="False" monospace="0.0">-***-&gt;-</doubt><doubt alpha="57.8" length="45" tooSmall="False" monospace="0.0">&lt; N:four boy&lt;N\&lt;Nbecause n-base = n-num -***-</doubt><doubt alpha="57.9" length="19" tooSmall="False" monospace="0.0">&lt; N:* plu(four boy)</doubt><p>Carpenter (1997) points out that nonintersective adjectives (e.g, <i>toy, fake, alleged)are </i>unlike numerical modifiers and intersective adjectives in that their semantics requires phrasal (wide) scope for -PLU, corresponding to the "set of things" interpretation of the plural noun. Thus, <i>toy guns </i>is interpreted as plu(toygun) because the plural outscopes the modification. It denotes a nonempty nonsingleton set of things that are not really guns but toy guns. *toy(plu gun) would interpret plu over guns. The situation is precisely the opposite of (21); we need the second derivational pattern to go through and the first one to fail. The following category for nonintersective adjectives derives the wide scope for -PLU but not the narrow scope:</p><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">s b b</doubt><doubt alpha="41.7" length="36" tooSmall="False" monospace="0.0">(22)   toy   :=otoy-&lt; N/&lt; N: Xx.toyx</doubt><doubt alpha="47.1" length="17" tooSmall="False" monospace="0.0">(23) a.toy gun -s</doubt><doubt alpha="0.0" length="4" tooSmall="False" monospace="0.0">- -&lt;</doubt><doubt alpha="45.0" length="20" tooSmall="False" monospace="0.0">&lt;N/&lt;N&lt;N:plu gun-***-</doubt><p><i>&lt; N </i>: *toy(plu gun) because n-num n-base</p><doubt alpha="44.4" length="18" tooSmall="False" monospace="0.0">b.toy       gun -s</doubt><doubt alpha="20.8" length="24" tooSmall="False" monospace="0.0">&lt;N/&lt;N    &lt; N    &lt; N\ &lt; N</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">b</doubt><doubt alpha="63.6" length="11" tooSmall="False" monospace="0.0">&lt; N:toy gun</doubt><doubt alpha="62.5" length="16" tooSmall="False" monospace="0.0">&lt; N:plu(toy gun)</doubt><p>Carpenter (1997) avoided rebracketing because of the plural through lexical type assignments to plural nouns and a phonologically null lexical entry to obtain differ­ent semantic effects of the plural. In our formulation, there is no lexical entry for inflected forms and no phonologically null type assignment to account for the dis­tinction in different types of plural modification; there is only one (phonologically realized) category for -PLU.<footnote anchor="17"/> The modifiers differ only in the kind and degree of mor-phosyntactic control. Strict control (Xl)on <i>four </i>disallows <i>four boy, </i>and flexible control (&lt;)on<i>green </i>also handles<i>greenbox. Fourgreenboxes </i>is interpreted as four(green(plu box)),</p><p>17 This is not to say that there is only one model-theoretic interpretation of plu. "Sets of sets" and "set of individuals" valuations of plu can be carried over the PAS.</p><page local="20" global="164"/><p>not as *four(plu(green box)), and <i>four toy guns </i>is interpreted as four(plu(toy gun)), not as *plu(four(toygun)). These derivations preserve the domain of the modifiers and the plural without rebracketing.</p></section><section title="6. Case Study: Turkish Morphosyntax"><p>There have been several computational studies to model morphology-syntax inter­action in Turkish. These unification-based approaches represent varying degrees of integration. Gungordu and Oflazer (1995) isolates morphology from syntax by having separate modules (a finite-state transducer for the former, and an LFG component for the latter), that is, the syntax is lexemic. The morphological component is expected to handle all aspects of morphology, including inflections and derivations. In Sehitoglu and Bozsahin (1999), lexical rules implement inflectional morphology, and derivations are assumed to take place in the lexicon. Hoffman's (1995) categorial analysis of Turk­ish is also lexemic; all lexical entries are fully inflected. Interpretive components of these systems face the aforementioned difficulties because of their commitment to lex­emic syntax. Inflectional morphology is incorporated into syntax in another categorial approach (Bozsahin and Gocmen 1995), but morphotactic constraints are modeled with nonmonotonic unification, such as nonexistence checks for features and overrides. The system cannot make finer distinctions in morphosyntactic types either. The result is an overgenerating and nontransparent integration of morphology and syntax because of the possibility of rebracketing and the unresolved representational basis of the lexicon.</p><p>In this section, we outline the application of the proposed framework to Turkish. We analyze a large fragment of the language, without any claims for a comprehensive grammar. The phenomena modeled here exhibit particular morphosyntactic problems described in the preceding sections. We assume the binding theory in Steedman (1996), which is predicated over the PAS. In each section, we provide a brief empirical observa­tion about the phenomenon, propose lexical type assignments, exemplify derivations of the parser, and briefly discuss the constraints imposed by morphosyntactic types. Because of space considerations, we sometimes use abbreviated forms in derivations such as the genitive affix's <i>(N/(N\N))\N </i>category for <i>(&lt; N/( $ Npn\ $ Npn))\ &lt; Npn, </i>but the parser operates on full morphosyntactic representations.</p></section><section title="6.1 Case Marking and Word Order"><p>Turkish is regarded as a free constituent order language; all permutations of the predicate and its arguments are grammatical in main clauses, being subject to con­straints on discourse and semantic properties such as definiteness and referentiality of the argument and topic-focus distinctions. The mapping of surface functions to grammatical relations is mediated by case marking. Word order variation has lesser functionality in embedded clauses because embedded arguments are less accessible to surface discourse functions like topic and focus. Embedded clauses are verb fi­nal.</p><p><b>6.1.1 Lexical Types. </b>We start with the lexical type assignments for the verbs. We use the abbreviations in (24a) when no confusion arises about the arguments' case or mor­phosyntactic type. Verb-final orders are regarded as basic, which suggests the category <i>S\NP\NP </i>for transitive verbs. But Janeway (1990) argued that such underspecification for verb-peripheral languages causes undesirable ambiguity. Grammatical relations of the arguments are determined not by directionality but by case in such languages.<page local="21" global="165"/> The category <i>S\NP„om\NPacc </i>resolves the ambiguity (24b-c).</p><doubt alpha="37.5" length="16" tooSmall="False" monospace="0.0">(24) a.IV = S\NP</doubt><doubt alpha="57.1" length="28" tooSmall="False" monospace="0.0">TV = S\NP\NP DV = S\NP\NP\NP</doubt><doubt alpha="66.7" length="6" tooSmall="False" monospace="0.0">sv f f</doubt><doubt alpha="58.2" length="55" tooSmall="False" monospace="0.0">b. sev (like) :=osev-&lt; S\ &lt; NP„om\ &lt; NPacc:Xx.Xy.likexy</doubt><doubt alpha="55.6" length="9" tooSmall="False" monospace="0.0">s v f f f</doubt><doubt alpha="60.6" length="66" tooSmall="False" monospace="0.0">c. ver (give) :=over-&lt; S\ &lt; NP„om\&lt; NPdat\&lt; NPacc:Xx.Xy.Xz.giveyxz</doubt><doubt alpha="37.5" length="8" tooSmall="False" monospace="0.0">v    f f</doubt><doubt alpha="52.5" length="40" tooSmall="False" monospace="0.0">d.&lt; S\ &lt; NP„om\ &lt; NPacc: Xx.Xy.likexy =&gt;</doubt><doubt alpha="65.6" length="32" tooSmall="False" monospace="0.0">&lt;S\&lt;NPtJcf\&lt; NP„om: Xy.Xx.likexy</doubt><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">a c o</doubt><doubt alpha="53.1" length="49" tooSmall="False" monospace="0.0">e. -ACC :=oi\i\u\ii\yi\yi\yu\yu-&lt; Nacc\&lt; N: Xf .f</doubt><doubt alpha="28.6" length="14" tooSmall="False" monospace="0.0">a a        a o</doubt><doubt alpha="50.0" length="48" tooSmall="False" monospace="0.0">f. -LOC :=ode\da\te\ta-( &lt;S/&lt;S)\ &lt; N: Xx.Xf.atfx</doubt><p>Gapping behavior seems to indicate that Turkish is verb final, not just SOV. SO and OS syntactic types must be distinguished to account for SO &amp; SOV, OS &amp; OSV, *SO &amp; OSV and *OS &amp; SOV. The OS &amp; OSV pattern requires the lexical category <i>S\NPacc\NP„om </i>for the verb (Bozsahin 2000b). SOV and OSV base orders can be cap­tured uniquely in the lexicon in set-CCG notation as <i>S\{NPacc,NP„om</i><i>}</i><i>. </i>Set-CCG is strongly equivalent to CCG (Baldridge 1999). We distinguish SOV and OSV lexically, however, because OSV requires referential objects (25a-b). OSV is generated from SOV by a lexical rule (24d). This is genuine lexical ambiguity, because the two related entries differ in semantics (referentiality).</p><doubt alpha="51.6" length="31" tooSmall="False" monospace="0.0">(25) a.Kitab-i      adam oku-du</doubt><p>Book-ACC man.NOM read-TENSE 'The man read the book.'</p><p>b. <i>*Kitapadam oku-du </i>Book man.NOM read-TENSE</p><p>Regarding the relationship between case and the specifiers, it is questionable whether Turkish has a discernible syntactic category for determiners. There is no lex­ical functor that takes an <i>N </i>and yields an NP. The only article, the indefinite <i>bir ('a'), </i>makes a distinction in discourse properties (26). Specifying case as a determiner (e.g., <i>NP\N) </i>does not alleviate the problem, either. Ignoring the problem of case stacking for a moment, zero marking of the surface subject and the indefinite object takes us back to where we started.</p><doubt alpha="66.7" length="48" tooSmall="False" monospace="0.0">(26)Cocuk yesil     bir elma/elma/elma-yi ye-mis</doubt><p>child.NOM   green   an apple/apple/apple-ACC eat-TENSE 'The child ate a green apple.' (indefinite but referential apple) 'The child ate green apple.' (indefinite and nonreferential apple) 'The child ate the green apple.' (definite and referential apple)<page local="22" global="166"/></p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">S</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&gt;</doubt><p>Making the nouns lexically ambiguous <i>(N </i>or NP) would also require that all func­tions onto nouns be ambiguous (N\N and <i>NP\NP </i>for inflections, <i>N/N </i>and <i>NP/NP </i>for adjectives, etc.). Redundancy of this kind in the lexicon is not desirable, since it is in­troduced purely for formal reasons with no distinction in meaning. We accommodate these concerns by positing a special case of type raising for Turkish (27). Similarly, contraposition turns <i>N</i>s into functors looking for <i>NP</i>s.</p><doubt alpha="59.7" length="62" tooSmall="False" monospace="0.0">(27) Type Raising for Turkish:Nagr:a==T/(T\ &lt; NPagr): Xf f [a]</doubt><doubt alpha="53.7" length="54" tooSmall="False" monospace="0.0">=T\(T/&lt; NPagr):Xf f [a] Te{S,S\NP,S\NP\NP, S\NP\NP\NP}</doubt><p>The noun that is type raised can be a syntactically derived noun (28). SO (and OS) constituency required for gapping is provided by &gt;T and &gt;B.</p><p>(28) <i>Mehmet kuquk ye§il kitab-i,        gocukda        yeni gelen dergi-yi oku-du </i>M.NOM      little green book-ACC child-COORD new come mag.-ACC read-TENSE</p><p><i>Nacc Nnomi Nacc S\NPmom\\NPacc </i>-&gt;T</p><doubt alpha="50.0" length="6" tooSmall="False" monospace="0.0">(S\NP)</doubt><doubt alpha="61.5" length="13" tooSmall="False" monospace="0.0">/(S\NP\NPacc)</doubt><doubt alpha="33.3" length="6" tooSmall="True" monospace="0.0">-&gt;B-&gt;B</doubt><p><i>S/(S\NP</i><i>nom</i><i>\NP</i><i>acc</i><i>)S/(S\NP</i><i>nom</i><i>\NP</i><i>acc)</i> 'Mehmet read the little green book, and the child, the newly arrived magazine'.</p><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&amp;</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">acc)</doubt><p>Our lexical type assignment to case morphemes (24e-f) departs from other CCG analyses of case (e.g., Steedman 1985, 1991a, Bozsahin 1998). These studies correlate morphological case with type raising of arguments, in the case of Bozsahin (1998), via a value-raised category assignment to case morphemes. Evidence from NP-internal case agreement and case stacking (Kracht 1999) challenges the type-raising approach. Agreement phenomena require that case (which can be marked on articles, adjectives, and nouns) be regulated as an agreement feature within the category N <i>before </i>the case-marked argument looks for the verb via type raising. Kracht observes that, in case stacking, there may be other morphemes between two case morphemes. Thus, treating the two cases as composite affixes for the purpose of type raising is not feasible. If the first case type-raises the noun to say, <i>T/(T\NP), </i>the second case would require a category, <i>(T/(T\NP))\(T/(T\NP)); </i>that is, it is endotypic. Hence, an endotypic category for case (like other inflections in the paradigm) subsumes the type-raising analysis of case provided that type raising is available in the grammar, not necessarily anchored to case.</p><p>We analyze case as an endotypic functor of type <i>N\N </i>(24e)—hence allow for phrase-internal agreement for languages that require it and provide type raising in grammar as in (27). Abandoning the type-raising analysis of case does not necessitate taking liberties in the directionality of the categories, such as the use of nondirectional slash (|) in multiset-CCG (Hoffman 1995). Contraposition and type raising in grammar can account for free word order and gapping facts with fully directional syntactic types (Bozsahin 2000a).<page local="23" global="167"/></p><doubt alpha="100.0" length="5" tooSmall="False" monospace="0.0">Nnoom</doubt><doubt alpha="33.3" length="3" tooSmall="True" monospace="0.0">-&gt;T</doubt><doubt alpha="63.6" length="11" tooSmall="False" monospace="0.0">S/(S\NPnom)</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">n</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&gt;</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">-ACC</doubt><doubt alpha="55.6" length="9" tooSmall="False" monospace="0.0">\ &lt; NPacc</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&lt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">b</doubt><doubt alpha="33.3" length="3" tooSmall="True" monospace="0.0">-&gt;T</doubt><p><b>6.1.2 Derivations. </b>The wide scope of case is captured by treating its argument type as non-case-marked N ( <i>&lt; N) </i>and the type of noun modifiers as functions onto non-casemarked nouns of a particular domain, for example, <i>&lt; N </i>for nonintersective adjectives and <i>&lt; N </i>for intersective adjectives (29a). The same strategy in type assignments to other nominal inflections allows them to outscope nominal modification, for exam­ple, (29b).</p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">o</doubt><p>(29) a.</p><p><i>Mehmet </i>M.NOM [[ <i>oyuncak araba </i>] toy car</p><doubt alpha="80.0" length="5" tooSmall="False" monospace="0.0">&lt;Nnom</doubt><doubt alpha="85.7" length="7" tooSmall="False" monospace="0.0">:mehmet</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">f</doubt><doubt alpha="50.0" length="14" tooSmall="False" monospace="0.0">S/(S\ &lt; NPnom)</doubt><doubt alpha="60.0" length="15" tooSmall="False" monospace="0.0">: Xf .f[mehmet]</doubt><doubt alpha="60.0" length="10" tooSmall="False" monospace="0.0">-lar] -PLU</doubt><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">-i</doubt><p><i>sev-er </i>like-TENSE <i>:</i><i> Xx.Xy. </i>like <i>xy</i> <i>&lt; N</i><i>acc:</i><i> </i>plu(toycar) <i>S:</i><i> </i>like(plu(toycar))mehmet 'Mehmet likes toy cars.'</p><doubt alpha="33.3" length="3" tooSmall="True" monospace="0.0">-&lt;B</doubt><doubt alpha="35.6" length="45" tooSmall="False" monospace="0.0">&lt;\N/&lt;N&lt;\N   &lt;N\ &lt;\N     &lt; Nacc\&lt;N&lt; S\ &lt; NPnom</doubt><doubt alpha="62.5" length="24" tooSmall="False" monospace="0.0">:Xx.toyx: car   :Xx.plux</doubt><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">:Xf.f</doubt><doubt alpha="63.6" length="11" tooSmall="False" monospace="0.0">&lt; N:toy car</doubt><doubt alpha="52.6" length="19" tooSmall="False" monospace="0.0">&lt; N:plu(toy car) -&lt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">c</doubt><doubt alpha="56.1" length="41" tooSmall="False" monospace="0.0">(S\NP)/(S\NP\ &lt; NPacc):Xg.g[plu(toy car)]</doubt><doubt alpha="61.1" length="36" tooSmall="False" monospace="0.0">&lt; S\ &lt; NPnom: Xy.like(plu(toy car))y</doubt><p>b. <i>Adam-in   </i>[<i>kucukkirmizi araba ]-si</i></p><p>Man-GEN little red car-POSS 'the man's little red car' = poss(little(red car))man</p><p>A word-based alternative for reconciling the semantic (wide) scope of inflections and their morphological (narrow) attachment to stems runs into difficulties even if we assume that morphemes carry type assignments—and hence have representational status—but that they always combine with stems first. We use syntactic types to show the problem. If -PLU and -ACC in (29a) combine with the stem first, only the narrow-scope reading of the plural and case is possible (30a). Plu(toycar) is not derivable with word-based modification.<page local="24" global="168"/> The morphosyntactic categories, however, are transparent to the scope of nominal modification (cf. (29a) and (30b)).</p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">n</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&gt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">S</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&lt;</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&lt;</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&gt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">c</doubt><doubt alpha="50.0" length="34" tooSmall="False" monospace="0.0">(30) a.oyuncak [[araba]   -lar] -i</doubt><doubt alpha="50.0" length="24" tooSmall="False" monospace="0.0">toy     car    -PLU -ACC</doubt><p><i>N/NNN\NN</i><i>acc</i><i>\N</i> : Ax.toy <i>x   </i>: car   : <i>Ax. </i>plu <i>x </i>: <i>Af.f</i> <i>N:</i><i> </i>plu car</p><p><i>Nacc: plu car</i></p><doubt alpha="62.5" length="16" tooSmall="False" monospace="0.0">N: *toy(plu car)</doubt><doubt alpha="51.8" length="56" tooSmall="False" monospace="0.0">b.     [yeçil[araba]-lar]-igreen      car      -PLU -ACC</doubt><doubt alpha="35.0" length="20" tooSmall="False" monospace="0.0">n n b n b        c o</doubt><doubt alpha="46.9" length="64" tooSmall="False" monospace="0.0">&lt; N/ &lt; N &lt; N &lt; N\ &lt; N &lt; Nacc\&lt; N:Ax.greenx: car    :Ax.plux:Af.f</doubt><doubt alpha="63.6" length="11" tooSmall="False" monospace="0.0">&lt; N:plu car</doubt><doubt alpha="66.7" length="18" tooSmall="False" monospace="0.0">&lt; N:green(plu car)</doubt><p><i>&lt; N</i><i>acc: </i>green(plucar)</p><p>Surface case annotations on categories enable the grammar to capture the correct PAS in all permutations of S, O, and V while maintaining the discourse-relevant dis­tinctions (31). Verb-final subordinate clauses are enforced by the directionality of the subordination morphemes in the lexicon.</p><doubt alpha="21.1" length="19" tooSmall="False" monospace="0.0">(31) a.       S O V</doubt><doubt alpha="59.2" length="49" tooSmall="False" monospace="0.0">S/(S\ &lt; NPnom)(S\NP)/(S\NP\ &lt; NPacc)S\NPnom\NPacc</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">S\NP</doubt><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">nom</doubt><doubt alpha="30.8" length="13" tooSmall="False" monospace="0.0">b.      O S V</doubt><doubt alpha="28.6" length="7" tooSmall="True" monospace="0.0">-&gt;T-&gt;T-</doubt><doubt alpha="59.2" length="49" tooSmall="False" monospace="0.0">S/(S\ &lt; NPacc)(S\NP)/(S\NP\ &lt; NPnom)S\NPacc\NPnom</doubt><doubt alpha="85.7" length="7" tooSmall="False" monospace="0.0">S\NPacc</doubt><doubt alpha="50.0" length="2" tooSmall="True" monospace="0.0">&gt;T</doubt><page local="25" global="169"/><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&lt;</doubt><doubt alpha="33.3" length="3" tooSmall="False" monospace="0.0">-&lt;B</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">S\NP</doubt><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">nom</doubt><doubt alpha="50.0" length="8" tooSmall="False" monospace="0.0">c. O V S</doubt><doubt alpha="30.0" length="10" tooSmall="False" monospace="0.0">-&gt;T - -&gt;XP</doubt><doubt alpha="60.0" length="50" tooSmall="False" monospace="0.0">(S\NP)/(S\NP\ &lt; NPacc)S\NPnom\NPaccS-t\(S\NPnom)-&gt;</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">S-t</doubt><doubt alpha="50.0" length="8" tooSmall="False" monospace="0.0">d. S V O</doubt><doubt alpha="61.7" length="47" tooSmall="False" monospace="0.0">(S\NP)/(S\NP\ &lt; NPnom)S\NPacc\NPnomS-t\(S\NPaœ)</doubt><doubt alpha="60.0" length="10" tooSmall="False" monospace="0.0">S\NPacc?-&lt;</doubt><doubt alpha="50.0" length="8" tooSmall="False" monospace="0.0">e. V S O</doubt><doubt alpha="36.4" length="11" tooSmall="False" monospace="0.0">- -&gt;XP -&gt;XP</doubt><p><i>S\NP</i><i>nom</i><i>\NP</i><i>acc </i><i>S</i>-t<i>\(S\NP</i><i>nom</i><i>)S</i>-t<i>\(S</i>-t<i>\NP</i><i>acc)</i> <i>S\NP</i><i>acc</i><i>\NP</i><i>nom </i><i>S</i>-t<i>\(S\NP</i><i>acc</i><i>)S</i>-t<i>\(S</i>-t<i>\NP</i><i>nom)</i> <i>S</i>-t<i>\NP </i><i>nom</i></p><doubt alpha="61.5" length="13" tooSmall="False" monospace="0.0">S-t\NPacc&lt;B-&lt;</doubt><doubt alpha="50.0" length="8" tooSmall="False" monospace="0.0">f. V O S</doubt></section><section title="6.2 Subordination"><p>Subordinate clauses can be classified as unmarked clauses (32a), infinitival clauses (32b), verbal nouns (32c), and nominalizations (32d). The latter two types require a genitive embedded subject, which agrees with the subordinate verb.</p><doubt alpha="55.6" length="45" tooSmall="False" monospace="0.0">(32) a.Mehmet[cocuk        ev-e git-ti]san-di</doubt><p>M.NOM child.NOM house-DAT go-TENSE assume-TENSE 'Mehmet assumed that the child went home.' child.NOM girl-DAT pen-ACC give-SUB1i -ACC forget-TENSE 'The child forgot to give the pen to the girl.' c. [ <i>Cocug-un  araba-da uyu-ma-si] Mehmet'i kiz-dir-di</i> child-GEN car-LOC sleep-SUB1g-POSS M-ACC anger-CAUS-TENSE 'Child's sleeping in the car made Mehmet angry.' d.<page local="26" global="170"/> <i>Deniz    </i>[ <i>gocug-un   uyu-dug-u </i>] <i>-na inan-m-iyor</i></p><doubt alpha="60.4" length="48" tooSmall="False" monospace="0.0">b.Cocuk[kiz-a      kalem-i    ver-me]-yi unut-tu</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">a</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">o</doubt><p>D.NOM child-GEN sleep-SUB2g-POSS -DAT believe-NEG-TENSE 'Deniz does not believe the child's sleeping.' (33) a. <i>Denizi   [kendisi-nini uyu-ma-dig-i]-ni soyle-di</i></p><p>D.NOM self-GEN   sleep-NEG-SUB2g-POSS-ACC2 say-TENSE 'Denizi said that hei did not sleep.' b. <i>*kendisii </i>[ <i>Deniz'ini uyu-ma-dig-i ]-ni soyle-di</i></p><p>c. <i>Deniz</i><i>i   </i><i>adam-i</i><i>j     </i>[ <i>kendi</i><i>i/j </i><i>arkada§-i-nin gor-dug-u ]-ne inan-iyor </i>D.NOM man-ACC self      friend-POSS see-SUB2g-POSS-DAT2 believe-TENSE 'Denizi believes that his/ friend saw the manj.'</p><p>d. <i>Denizi   adam-a</i><i>j     </i>[ <i>kendi</i><i>i/tj</i><i>kitab-i-ni oku-dug-u ]-nu soyle-di </i>D.NOM man-DAT self       book-POSS-ACC2 read-SUB2g-POSS-ACC2 say-TENSE 'Denizi told the manj that he read <i>his</i><i>i/„j </i>book.'</p><p><b>6.2.1 Lexical Types. </b>The asymmetries in (33) show that the obliqueness order in bind­ing relations is preserved in subordination. This suggests the following bracketing, in which the embedded clause's position in the PAS of the matrix predicate is determined by its grammatical function.</p><p><i>Matrix-Pred </i><i>...</i><i> Matrix-Argument</i><i>...</i><i> Embedded-Clause </i><i>...</i><i> Matrix-Argument</i></p><doubt alpha="53.1" length="32" tooSmall="False" monospace="0.0">(34)   -SUBli (-ma) (infinitive)</doubt><p>-SUBlg (-masi) (verbal noun) (nominalization)</p><doubt alpha="57.1" length="14" tooSmall="False" monospace="0.0">-SUB2g (-digi)</doubt><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">ma</doubt><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">masi</doubt><doubt alpha="50.0" length="20" tooSmall="False" monospace="0.0">&lt;N\( &lt;S\ &lt;NPnom):Xff</doubt><doubt alpha="51.7" length="29" tooSmall="False" monospace="0.0">°N\ &lt; NPagr\(&lt;S\ &lt; NPnom):Xff</doubt><doubt alpha="57.9" length="38" tooSmall="False" monospace="0.0">&lt;Ncase=obl\ &lt;NPagr\(&lt;S\ &lt; NPnom):Xf .f</doubt><p>The wide scope of case markers on subordinate clauses implies that the subor­dinate markers themselves must have phrasal scope as well. Since case is a nominal inflection, the category of a subordinate marker must be a function onto N. Its ar­gument is <i>IV </i>for infinitives and <i>NP</i><i>ag</i><i>r\IV </i>for others, which require genitive subjects (34). This yields two families of functors for subordination. The verb-final characteris­tics of the embedded clauses is ensured by the backward-looking main functor of the subordinate marker.</p><p>For morphosyntactic modality, the resulting nominalized predicate can receive only case, hence it has <i>&lt; N </i>control. Verbal nouns refer to actions, and nominaliza-tions refer to facts. Subordinate markers for the former are tenseless. A subordinate marker replaces the tense of the subordinate verb in nominalizations, yielding <i>&lt; S</i> control on the verb.<page local="27" global="171"/> For subject raising, the result may undergo any nominal inflection (&lt;N).</p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&gt;</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">-ACC</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">t f</doubt><doubt alpha="33.3" length="3" tooSmall="False" monospace="0.0">&lt; N</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&lt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">b</doubt><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">:Xf.f</doubt><doubt alpha="50.0" length="2" tooSmall="True" monospace="0.0">&gt;T</doubt><p>Word order variation within the subordinate clause is constrained by the subject on the left and the verb on the right. This constraint is achieved by categorizing the embedded subjects as <i>NPagr </i>and having a result category of <i>N </i>for all subordinate mark­ers. If there were any contraposed element <i>NP </i>in the embedded clause, the category of the clause would be <i>S\NP, </i>and the clause could not combine with the contraposed category such as <i>S-t\(S\NP) </i>on the right because the extraction category combines with a subordinate marker first, which is onto N, not <i>S\NP, </i>hence composition (&lt;B) could not take place.</p><p><b>6.2.2 Derivations. </b>Example (35a) is the derivation of subject raising (we use N as an abbreviation for a type-raised <i>N </i>when space is limited). We use Steedman's (1996) ana function to denote the binding of the embedded subject. Infinitive -SUBli has phrasal scope in this example; the <i>DV </i>must be reduced to an <i>IV </i>before the infinitive can apply. Hence the subordination of intransitive clauses is only a special case in which the morphological scope of the infinitive works without rebracketing. Subject raising and coindexation with the matrix subject are made explicit in the raising cate­gory of <i>unut. </i>The systematic relationship between the raised and nonraised category of such verbs can be captured by a lexical rule, for example, TV: Ax.Ay.forget <i>xy == </i>TV: <i>Xf </i>.Xy.forgetf [ana y])y.</p><p>(35b-c) contrast subject and nonsubject nominalizations. The difference is cap­<i>o</i> tured with the case distinction of the result type (<i>&lt; N) </i>for -SUBlg and -SUB2g. These examples also show the possibility of affix composition in the lexicon. For instance, we write <i>-masi </i>in (35b), which marks subordination and agreement together, instead of <i>-ma-si. </i>Otherwise, <i>-ma </i>(SUBlg) would have to look to the right as a functor to enforce agreement, and the verb-final property of subordination could not be as­sured.</p><doubt alpha="60.7" length="61" tooSmall="False" monospace="0.0">(35)   a.Cocuk        kiz-a kalem-ichild.NOM girl-DAT pen-ACC</doubt><doubt alpha="100.0" length="7" tooSmall="False" monospace="0.0">vergive</doubt><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">DV</doubt><p>: <i>X</i><i>f .f </i>[child] : Xg.g[girl] : <i>X</i><i>h.h</i><i>[pen]      </i>: <i>X</i><i>x</i><i>.X</i><i>y</i><i>.X</i><i>z.</i></p><doubt alpha="100.0" length="7" tooSmall="False" monospace="0.0">giveyxz</doubt><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">v f f</doubt><p><i>&lt; </i><i>S\ </i><i>&lt; </i><i>NPnom\ </i><i>&lt; </i><i>NPdat</i></p><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">vf</doubt><doubt alpha="50.0" length="12" tooSmall="False" monospace="0.0">&lt; S\ &lt; NPnom</doubt><p><i>-me </i>-SUBli : <i>X</i><i>f .f </i>: <i>X</i><i>f </i><i>.X</i><i>x.</i></p><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">-yi</doubt><doubt alpha="53.8" length="26" tooSmall="False" monospace="0.0">&lt;N\(&lt;S\fNPnom)  &lt;lNacc \&lt;N</doubt><doubt alpha="85.7" length="7" tooSmall="False" monospace="0.0">unut-tu</doubt><doubt alpha="100.0" length="6" tooSmall="False" monospace="0.0">forgot</doubt><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">TV</doubt><doubt alpha="66.7" length="6" tooSmall="False" monospace="0.0">&lt; Nacc</doubt><doubt alpha="50.0" length="22" tooSmall="False" monospace="0.0">(S\NP)/(S\NP\ &lt; NPacc)</doubt><doubt alpha="100.0" length="6" tooSmall="False" monospace="0.0">forget</doubt><doubt alpha="60.0" length="10" tooSmall="False" monospace="0.0">(f[anax])x</doubt><p><i>&lt; S: </i>forget(give girl pen(ana child))child 'The child forgot to give the pen to the girl.'<page local="28" global="172"/></p><doubt alpha="33.3" length="3" tooSmall="True" monospace="0.0">&gt; i</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">l7l</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&gt;</doubt><p>b. <i>Cocug-un  uyu     -masi    Mehmet'i kizdir-di </i>child-GEN sleep   -SUBlg    M-ACC anger-TENSE <i>N\NP</i><i>agr</i><i> </i>&lt; <i>IV &gt;</i></p><doubt alpha="62.5" length="32" tooSmall="False" monospace="0.0">NPag^IV   N\NPagr\IV   IV/TV TV&lt;</doubt><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">N&lt;</doubt><doubt alpha="66.7" length="6" tooSmall="False" monospace="0.0">s/iv&gt;t</doubt><p><i>S: </i>anger(sleep child)mehmet 'The child 's sleeping angered Mehmet. ' c. <i>*Cocug-un uyu-dugu     Mehmet'i kizdir-di</i> sleep-SUB2g</p></section><section title="6.3 The Morphosyntax of Control"><p>The control verb 's controlled argument is marked by the infinitive -ma, and the re­sulting nominalized embedded clause can undergo nominal inflections (36a-b). The infinitive <i>-ma </i>has the lexical type in (34). A potential conflict between an object con­trol verb 's subcategorization and PAS is resolved by case decoration: <i>zorla </i>' force ' and <i>tavsiye et </i>'recommend' differ in their case requirements and what is controlled (36b-c). <i>tavsiye et</i>'s infinitive complement is accusative, whereas <i>zorla</i>'s is dative.</p><doubt alpha="55.6" length="45" tooSmall="False" monospace="0.0">(36) a.Cocuk[kitab-i     oku-ma ]-ya gali§-ti</doubt><p>child.NOM book-ACC read-SUBli-DAT try-TENSE ' The childi tried [to_i read the book]. '</p><p>b. <i>Mehmet cocug-u     [kitab-i     oku-ma]-ya zorla-di </i>M.NOM child-ACC book-ACC read-SUBli-DAT force-TENSE 'Mehmety forced the childi [to_^ read the book]. '</p><p>c. <i>Mehmet cocug-a/*-u         [kitab-i      oku-ma]-yi/*-ya tavsiye et-ti </i>M.NOM child-DAT/ACC book-ACC read-SUBli-ACC/DAT recommend-TENSE ' Mehmet recommended the childi [to_i read the book]. ' <b>6.</b><b>3.1 Lexical Types. </b>Subject control verbs (e.g., <i>calis </i>'try'; <i>soz ver </i>'promise ') and object control verbs (e.g., <i>zorla; tavsiye et) </i>have the control property indicated in their PAS (37). The nonraising variety of these verbs is obtained via a lexical rule.</p><doubt alpha="52.1" length="48" tooSmall="False" monospace="0.0">(37)   calis :=◦ calis - TV: Xq.Xz.try(q[anaz])z</doubt><doubt alpha="64.2" length="53" tooSmall="False" monospace="0.0">soz ver :=◦ soz ver — DV: Xq.Xz.Xw.promisez(q[anaw])w</doubt><doubt alpha="63.8" length="47" tooSmall="False" monospace="0.0">zorla :=◦◦zorla — DV: Xz.Xq.Xw.force(q[anaz])zw</doubt><p>tavsiye et := <i>◦ tavsiye et — DV: XzXqXw.</i>recommend(q[ana <i>z])zw</i> <b>6.</b><page local="29" global="173"/><b>3.2 Derivations. </b>The types in (37), coupled with the raising category of the infinitive, yield the derivations in (38). These examples compose the infinitive complement before a case can be applied on the nominalized predicate. This is possible because of the phrasal scope of <i>-ma </i>and the case markers. (38b) shows that although there may be two accusative-marked NPs, the arguments of the infinitive complement are identifiable; the <i>IV </i>scope of <i>-ma </i>implies that any (di)transitive subordinate verb must find its nonsubject arguments before the matrix verb gets its arguments. This type assignment strategy handles word order variations inside the infinitive complement and the matrix clause transparently.</p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">N</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&lt;</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&gt;</doubt><doubt alpha="33.3" length="3" tooSmall="True" monospace="0.0">-&gt;T</doubt><doubt alpha="61.5" length="96" tooSmall="False" monospace="0.0">(38) a.Cocuk       kitab-i    oku -ma    -ya calis-tichild.NOM book-ACC read -SUB -DAT try-TENSE</doubt><doubt alpha="25.0" length="12" tooSmall="True" monospace="0.0">-&gt;T-&gt;T----&lt;B</doubt><doubt alpha="51.4" length="37" tooSmall="False" monospace="0.0">S/IV       IV/TV    TV N\IVNdat\NTV-&gt;</doubt><doubt alpha="100.0" length="2" tooSmall="False" monospace="0.0">IV</doubt><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">Ndat</doubt><doubt alpha="80.0" length="5" tooSmall="False" monospace="0.0">IV/TV</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">IV?</doubt><p><i>S: </i>try(read book(ana child))child The child tried to read the book.</p><doubt alpha="57.9" length="57" tooSmall="False" monospace="0.0">b.Mehmet   cocug-u     kitab-i    oku -ma    -ya zorla-di</doubt><p>M.NOM child-ACC book-ACC read -SUB -DAT force-TENSE</p><doubt alpha="26.7" length="15" tooSmall="True" monospace="0.0">-&gt;T-&gt;T-&gt;T----&lt;B</doubt><doubt alpha="50.0" length="46" tooSmall="False" monospace="0.0">S/IV     IV/TV      IV/TV    TV N\IVNdat\NDV-&gt;</doubt><doubt alpha="80.0" length="5" tooSmall="False" monospace="0.0">TV/DV</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">TV&gt;</doubt><p><i>S: </i>force(read book(ana child))child mehmet Mehmet forced the child to read the book.</p><page local="30" global="174"/><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">f</doubt></section><section title="6.4 Relativization"><p>There are two strategies for forming relative clauses: the subject participle strategy (SP) and the nonsubject participle strategy (OP). SP is realized by the affixes <i>-(y)An, -(y)AcAk, </i>and <i>-mis, </i>and OP by <i>-dIk- </i>and <i>-(y)AcAk-. </i>OP triggers agreement similar to that of possessive constructions between the subject and the predicate of the relative clause (39b).</p><p>(39) a. <i>kitab-i oku-yan adam </i>book-ACC read-SP man ' the man that read/reads the book' b. <i>adam-in oku-dug-u kitap</i></p><p>man-GEN(AGR) read-OP-POSS(AGR) book the book that the man read <b>6.</b><b>4.1 Lexical Types. </b>The categories in (40) make explicit the unbounded nature of relativization; type raising and composition can combine an indefinitely large sequence of constituents onto <i>S\NP.</i></p><doubt alpha="25.0" length="8" tooSmall="False" monospace="0.0">(40) -SP</doubt><doubt alpha="71.4" length="7" tooSmall="False" monospace="0.0">-OP.AGR</doubt><p>(argument) (adjunct)</p><doubt alpha="41.4" length="58" tooSmall="False" monospace="0.0">◦ yan - (NT/ &lt; N)\( &lt; S\ &lt; NPmm): XP.Xx.XQ.and(Q[x])(P[x])</doubt><doubt alpha="46.0" length="63" tooSmall="False" monospace="0.0">◦ digi - (NT/ &lt; N)\( &lt; S\ &lt; NPmse=ott):XP.Xx.XQ.and(Q[x])(P[x])</doubt><doubt alpha="49.0" length="49" tooSmall="False" monospace="0.0">◦ digi -(NT/&lt;N)\&lt;S: XP.Xx.XQ.and(Q[x])(at(P[x])x)</doubt><p>We present a formulation of relativization without any use of empty categories, traces, or movement. We follow the Montagovian treatment of relative clauses as noun restrictors of the semantic type AP.AQ.and(Q[x])(P[xj), where <i>P </i>is the semantics of the relative clause and <i>Q </i>is the semantics of the predicate taking the relativized noun <i>(x) </i>as the argument. Montagovian analysis assumes a generalized quantifier (GQ) category for the determiner; that is, <i>NP </i>is the functor and <i>VP </i>is the argument. The determiner takes the relativized noun (and its semantically type-raised category) as an argument as well. In a language with determiners, the functor category of the overall <i>NP </i>can be made explicit by lexically value-raising the determiner with GQ semantics from, for example, <i>NP/N </i>to <i>(S/(S\NP))/N = (S/VP)/N. </i>To achieve the same effect in a language that lacks determiners, we make <i>NP </i>the functor by lexically value-raising the relative participle from <i>(N/N)\(S\NP) </i>to (NT<i>/N)\(S\NP), </i>in which <i>N</i><i>/N</i><i> </i>denotes a value-raised noun, since N is a type-raised category. The category of the relative participle unfolds to <i>((S/(S\NP))/N)\(S\NP) </i>and <i>(((S\NP)/(S\NP\NP))/N)\(S\NP).</i></p><p>Relativization is strictly head final in Turkish. This implies that all relative par­ticiples are backward-looking functors that differ only in case requirements (cf. En­glish relatives, which require different directionality, e.g., <i>(N\N)/(S\NP) </i>for subjects and <i>(N\N)/(S/NP) </i>for nonsubjects). For morphosyntactic modality, the head noun has flexible control (<i>&lt; </i>N), because any further grammatical marking on the head must be shared (41).<page local="31" global="175"/></p><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&lt;</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&gt;</doubt><p>(41) <i>Adam-in   gor-dug-u     gocuk-lar uyu-du </i>man-GEN see-OP-POSS child-PLU sleep-TENSE 'The children that the man saw slept.' = and(sleep(plu child))(see(plu child)man) =*and(sleep(plu child))(see child man)</p><p>Morphologically, the agreement marker -POSS in OP strategy is a function over the -OP morpheme, but syntactically, the -OP morpheme triggers the agreement in the relative clause. Hence -OP-POSS can be treated as a lexically composite affix and glossed as -OP.AGR. This also ensures the verb-final property of the relativized clause by not positing a rightward-looking functor for -OP. As for attachment modality, rel­ative participles are bound morphemes that are affixed to the predicate.</p><p><b>6.4.2 Derivations. </b>(42a-d) show example derivations for subject, object, indirect object, and adjunct relativization. All nonsubject arguments are handled by a single -OP type (42b-c). Relativizing the specifier of an argument uses the same strategy as the argu­ment. This phenomenon calls for another well-regulated lexical assignment schema, for example, <i>(N/N)\(N\N)\IV </i>for the relativized specifier of the subject. (42e) is an ex­ample of relativizing the subject's specifier. Configurationality within the noun group is maintained by backward directionality of the categories.</p><doubt alpha="62.9" length="35" tooSmall="False" monospace="0.0">(42) a.kitab-i oku -yan adam uyu-du</doubt><doubt alpha="66.7" length="39" tooSmall="False" monospace="0.0">book-ACC       read -SP man sleep-TENSE</doubt><doubt alpha="56.5" length="23" tooSmall="False" monospace="0.0">IV/TV TV (N/N)\IV N IV&lt;</doubt><doubt alpha="64.1" length="64" tooSmall="False" monospace="0.0">:Xf.f[book]:Ax.Ay.readxy:XP.Xx.XQ.and(Q[x])(P[x]) :man:Xx.sleepx</doubt><p><i>IV: </i>Xy.read book<i>y</i></p><doubt alpha="66.7" length="30" tooSmall="False" monospace="0.0">N/N:Xx.XQ.and(Q[x])(readbookx)</doubt><doubt alpha="61.0" length="41" tooSmall="False" monospace="0.0">N^=S/(S\NP):XQ.and(Q[man])(read book man)</doubt><p><i>S: </i>and (sleep man)(read book man) 'The man who read the book slept.' man-GEN read -OP.AGR child sleep-TENSE <i>NP</i><i>agrK</i><i> </i><i>TV  (N /N)\IV</i><i>agr    </i><i>NIV</i><i> </i><i>&lt;</i></p><doubt alpha="55.8" length="43" tooSmall="False" monospace="0.0">b.adam-in   gor      -dugu     cocuk uyu-du</doubt><doubt alpha="100.0" length="5" tooSmall="False" monospace="0.0">IVagr</doubt><doubt alpha="40.0" length="5" tooSmall="False" monospace="0.0">N^/N&lt;</doubt><doubt alpha="50.0" length="8" tooSmall="False" monospace="0.0">N^=S/IV&lt;</doubt><p><i>S: </i>and (sleep child)(see child man) 'The child whom the man saw slept.' child-GEN book-ACC give -OP.<page local="32" global="176"/>AGR man sleep-TENSE <i>S:</i><i> </i>and (sleep man)(give man book child) 'The man to whom the child gave the book slept.' child-GEN sleep -OP.AGR car break-TENSE</p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">S</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&lt;</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&gt;</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">TV&gt;</doubt><doubt alpha="100.0" length="5" tooSmall="False" monospace="0.0">IVagr</doubt><doubt alpha="53.6" length="56" tooSmall="False" monospace="0.0">c.cocug-un     kitab-i    ver      -digi     adam uyu-du</doubt><doubt alpha="61.8" length="34" tooSmall="False" monospace="0.0">NPagr*TV/DV     DV (N/N)\IVagrNIV&lt;</doubt><doubt alpha="50.0" length="4" tooSmall="False" monospace="0.0">N/N&lt;</doubt><doubt alpha="57.1" length="7" tooSmall="False" monospace="0.0">N=S\IV?</doubt><doubt alpha="64.3" length="42" tooSmall="False" monospace="0.0">d.cocug-un   uyu    -dugu   araba bozul-du</doubt><doubt alpha="59.1" length="22" tooSmall="False" monospace="0.0">NPagr*IV (N/N)\SNIV&lt;-&lt;</doubt><doubt alpha="57.1" length="7" tooSmall="False" monospace="0.0">N=S/IV?</doubt><p><i>S: </i>and (break car)(at (sleep child) car) 'The car that the child slept in broke.'</p><doubt alpha="60.7" length="84" tooSmall="False" monospace="0.0">e.cocug    -u     uyu -yan adam kiz-dichild -POSS sleep          -SP man anger-TENSE</doubt><doubt alpha="51.7" length="29" tooSmall="False" monospace="0.0">NN\N\NIV(N^/N)\(N\N)\IV N IV&lt;</doubt><doubt alpha="37.5" length="16" tooSmall="False" monospace="0.0">N\N&lt;(N/N)\(N\N)&lt;</doubt><doubt alpha="57.1" length="7" tooSmall="False" monospace="0.0">N=S/IV&gt;</doubt><p><i>S: </i>and(sleep(poss child man))(anger man) 'The man whose child slept got angry.'</p><p>As these examples indicate, -SP and -OP do not range over the verb stem in semantic scope; they cover the entire relative clause. The wide scope of -SP and -OP resolves the inconsistency pointed out in the introduction (5b-c), which was mainly due to coindexation in unification accounts and the lexemic nature of the lexicon. Isolating the relative participle inflections in a morphological component undermines the transparency of derivations. Note also that -OP is categorially transparent to the arity of the verb; a <i>DV </i>must be reduced to an <i>IV </i>before -OP applies to the verb complex (42c). This is possible only when -OP has phrasal scope.</p><page local="33" global="177"/></section><section title="6.5 Ki -relativization"><p>Ki-relativization is a morphosyntactic process that can generate indefinitely long words of relative pronouns and relative adjectives. <i>-ki </i>can be attached to case-marked nouns whose case relation is one of possession, time, or place (i.e., the genitive and the locative). Its effect is to create a nominal stem on which all inflections can start again (43a-b). It produces relative pronouns (43c) and relative adjectives (43d) with the locative and relative pronouns with the genitive.</p><doubt alpha="55.6" length="18" tooSmall="False" monospace="0.0">(43) a.araba-da-ki</doubt><p>car-LOC-REL the one in the car b. <i>cocug-un ev-i-nde-ki-ler-in-ki</i> child-GEN house-POSS-LOC2-REL-PLU-GEN-REL lit. The one that belongs to the ones that are in the child s house</p><doubt alpha="65.0" length="40" tooSmall="False" monospace="0.0">c.Ben     ev-de-ki-ni hic kullan-ma-di-m</doubt><p>I.NOM house-LOC-REL-ACC2 never use-NEG-TENSE-PERS.1s</p><p>I never used the one at home.</p><p>d. <i>ev-de-ki hediye </i>house-LOC-REL present the present<i>i, </i>the one<i>i </i>at home <b>6.</b><b>5.1 Lexical Types.</b></p><doubt alpha="50.0" length="64" tooSmall="False" monospace="0.0">(44) a.    -PROki      :=0ki-MN\MNloc:Xx.Xf.and(at PROx)(f[PRO])</doubt><p>(locative) (genitive)</p><doubt alpha="52.6" length="57" tooSmall="False" monospace="0.0">b. -ADJki      :=0ki-(MN/&lt;N)\MNte:Xx.Xy.Xf.and(atxy)f[y])</doubt><doubt alpha="58.9" length="56" tooSmall="False" monospace="0.0">c. -PROki      :=0ki-MN\Ngen:Xx.Xf.and(poss PROx)f[PRO])</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">l</doubt><doubt alpha="60.3" length="63" tooSmall="False" monospace="0.0">d. sabahki      :=osabahki-MN/&lt; N: Xx.Xf.and(at morningx)(f[x])</doubt><doubt alpha="39.5" length="43" tooSmall="False" monospace="0.0">e. ki (that)     :=0ki-(N1\&lt;N)/(&lt;S\&lt; NPnom)</doubt><doubt alpha="52.0" length="25" tooSmall="False" monospace="0.0">:XP.Xx.XQ.and(Q[x])(P[x])</doubt><p><i>N</i><i>g en </i>is a shorthand for the <i>N/(N\N) </i>category of a type-raised genitive. In (43c), pronominal <i>one </i>(PRO) cannot be bound to <i>ev </i>(44a). Adjectival interpretation (43d) associates the relative adjective with the relativized noun (44b). For morphosyntac-tic modality, <i>ki</i>-marked nouns behave like possessive-marked nouns in case marking, which requires strict control over the possessive (<i>MM</i><i> </i>N). This presents a dilemma: Mor­phologically, <i>-ki </i>creates a nominal stem that can undergo all nominal inflections again, but, as (45a) indicates, the stem does not take the CASE (ACC, DAT, etc.) that is com­mon to nouns unmarked on the possessive. Thus CASE2 in (45c) must refer to another diacritic (n-relbase, or M) to eliminate (45b). This diacritic controls the result category of <i>-ki. </i>The value-raised varieties of (44a-c) are assigned a type similar to the type of relative participles. Inherently temporal nouns such as <i>sabah </i>('morning') can take <i>-ki</i> without the locative. They can be lexicalized without overgeneration with the help of the morphosyntactic modality M (44d).<page local="34" global="178"/></p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&gt;</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">-&lt;</doubt><doubt alpha="33.3" length="3" tooSmall="False" monospace="0.0">&lt; N</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&lt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">b</doubt><doubt alpha="42.9" length="21" tooSmall="False" monospace="0.0">(45)   a.*ev-de-ki-yi</doubt><p>house-LOC-REL-ACC c. <i>ev-de-ki-ni</i> house-LOC-REL-ACC2 house-ACC2 house-ACC <b>6.</b><b>5.2 Derivations, </b><i>-ki </i>ranges over the case-marked noun, which, as (46a-b) indicate, can be lexical or phrasal. In a lexemic analysis, the entire ki-marked noun would have to be rebracketed before the adjective <i>kuguk </i>can apply to its right scope (which is ev, not <i>gocuk).</i></p><doubt alpha="62.5" length="8" tooSmall="False" monospace="0.0">b.*ev-ni</doubt><doubt alpha="66.7" length="6" tooSmall="False" monospace="0.0">d.ev-i</doubt><doubt alpha="30.4" length="23" tooSmall="False" monospace="0.0">(46) a.ev       -de -ki</doubt><doubt alpha="65.0" length="20" tooSmall="False" monospace="0.0">house    -LOC -PROki</doubt><doubt alpha="42.9" length="7" tooSmall="False" monospace="0.0">&lt;N&lt;N\&lt;N</doubt><doubt alpha="80.0" length="5" tooSmall="False" monospace="0.0">&lt;Nloç</doubt><p>M <i>N\</i><i> </i>M <i>N</i><i>loç</i></p><p><i>M </i><i>N: Xf </i>.and(at PROhouse)(f [PRO]) 'the one that is in the house'</p><doubt alpha="48.6" length="35" tooSmall="False" monospace="0.0">b.kucuk      ev       -de -ki cocuk</doubt><p>little   house   -LOC -ADJki child</p><doubt alpha="53.8" length="26" tooSmall="False" monospace="0.0">&lt;N/&lt;N&lt;N&lt;N\&lt;N(MN/&lt;N\MNloC&lt;N</doubt><doubt alpha="66.7" length="6" tooSmall="False" monospace="0.0">&lt; Nloç</doubt><doubt alpha="60.0" length="5" tooSmall="False" monospace="0.0">MN/&lt;N</doubt><p>Ml <i>N: A</i>/.and(at(little house)child)(/[child\)</p><p>'the child<i>i,</i><i> </i>the one<i>i</i><i> </i>at the little house'</p><p>There is another <i>ki </i>in Turkish that forms nonrestrictive relative clauses as post-modifiers. It is a Persian borrowing and follows the Indo-European pattern of relative clause formation (47). It can be distinguished from the bound morpheme <i>-ki </i>lexically. Its attachment characteristic is also different than that of <i>-ki </i>(44e).</p><doubt alpha="61.9" length="21" tooSmall="False" monospace="0.0">(47)Adamki   hep uyur</doubt><p>man that always sleep-TENSE 'the man, who always sleeps'<page local="35" global="179"/></p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">o</doubt></section><section title="6.6 Possessive Constructions and Syntactic Compounds"><p>The grammatical marking of possession is realized through the genitive case on the possessor <i>(N</i><i>gen)</i><i> </i>and the possessive marker on the possessee <i>(N</i><i>poss). </i><i>N</i><i>gen </i>and <i>N</i><i>poss </i>must agree in person and number (48a), and the resulting noun group is configurational. Possessives can be nested (48c).</p><doubt alpha="65.5" length="58" tooSmall="False" monospace="0.0">(48) a.ev-in kapi-sib.* ev-inkapi/*ev-inkapi-lar(door-PLU)</doubt><p>house-GEN3 door-POSS3s 'the door of the house'</p><p>c. <i>ben-im arkadas-im-in ev-i-nin kapi-lar-i </i>I-GEN1 friend-POSS1s-GEN3 house-POSS3s door-PLU-POSS3s 'my friend's house's doors'</p><p>d. <i>ben-im arkada§-im-ini dost-u-nun</i><i>j</i><i> </i><i>kendisi</i><i>^/j</i><i> </i>I-GEN1 friend-POSS1s-GEN3 buddy-POSS3s-GEN3 self 'my friend's buddy himself'</p><p>e. <i>Her çalisan-in bazi hak-lar-i vardir </i>every worker-GEN3 some right-PLU-POSS3s exists <i>y </i><i>x</i><i>3</i><i>y ((worker (x) </i><i>A </i><i>right(y)) </i><i>— </i>has(x,y)) but not <i>3</i><i>yf</i><i>i</i><i>x(right(y) </i><i>A </i><i>(worker(x) </i><i>— </i><i>has(x,</i>y))) <b>6.</b><b>6.1 Lexical Types for Possessives. </b>Type assignments for the genitive and the pos­sessive can be schematized over person <i>(p) </i>and number <i>(n) </i>features, as in (49).</p><doubt alpha="59.3" length="86" tooSmall="False" monospace="0.0">(49) -GENpn:=Os-(&lt;N/(MNpn\ MNpn))\&lt;Npn:Xx.Xy.possyx-POSSpn:=Os-(MNpn\MNpn)\&lt; Npn:Xf .f</doubt><p>The possessive marker's result category is a functor because it enforces agreement with the type raised specifier.<footnote anchor="18"/> (48d-e) indicate that the genitive marker is a type raiser; the possessor scopes over the possessee. For morphosyntactic modality, the genitive marker can be attached to nouns that are inflected up to and including a possessive marker ( <i>&lt; </i>N). Moreover, nesting in possessives implies that the specifier may be a genitive. Hence, the stem's category must be <i>&lt; N.</i><i></i></p><p>But there is a finer control over the possessee argument's category, because it <i>must </i>be inflected with the possessive marker to signify relation of possession (cf. (48a-b)). Semantically, the possessive must outscope nominal modification. For instance, (50a) has the PAS as indicated, hence both markers must range over a noun group, not just</p><p>18 An "inert" category such as <i>N </i>may be motivated by the prodrop phenomenon, in which the specifier may be dropped under pragmatically conditioned circumstances. But this analysis disregards the point that binding relations (hence semantics) still require the coindexation of the specifier with some overt referent, which can be inferred from the discourse. Such an interface phenomenon seems to be better suited for handling by interactions in the components of a multidimensional grammar, rather than as a purely syntactic phenomenon.</p><page local="36" global="180"/><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&gt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">N</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&lt;</doubt><p>the stem. Binding relations require an organization of the type (poss possee possessor) (50b-c).</p><p>(50) a. <i>yasliadam-in kugilkkiz-i</i> old man-GEN3 little daughter-POSS3s 'old man's little daughter' =   poss(littledaughter)(old man) b. <i>adam</i><i>i</i><i>-in kendi</i><i>i</i><i>-si</i> man-GEN self-POSS 'the man himself' c. <i>*kendii adami-i</i> <b>6.</b><b>6.2 Derivation of Possessive Constructions. </b>Example (51) shows the wide scope of the genitive (51a) and nested genitives (51b).</p><doubt alpha="51.1" length="45" tooSmall="False" monospace="0.0">(51) a.yasli    adam -in kilgilk       kiz -i</doubt><doubt alpha="65.9" length="41" tooSmall="False" monospace="0.0">old     man -GEN little    daughter -POSS</doubt><doubt alpha="40.0" length="35" tooSmall="False" monospace="0.0">&lt;N/&lt;N&lt;N&lt;N/(&amp;N\MN)\&lt;N&lt;N/&lt;N&lt;N&amp;N\&amp;N\&lt;N</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">b b</doubt><doubt alpha="28.6" length="7" tooSmall="False" monospace="0.0">&lt; N &lt; N</doubt><p><i>&lt; N: </i>poss(littledaughter)(old man) 'old man's little daughter'</p><doubt alpha="42.2" length="109" tooSmall="False" monospace="0.0">b.ben      -im      arkadas   -im        -in        ev -iI      -GEN     friend -POSS     -GEN    house -POSS</doubt><doubt alpha="54.8" length="31" tooSmall="False" monospace="0.0">NN/(N\N)\NNN\N\NN/(N\N)\NNN\N\N</doubt><doubt alpha="53.8" length="13" tooSmall="False" monospace="0.0">N/(N\N)N\NN\N</doubt><doubt alpha="37.5" length="8" tooSmall="False" monospace="0.0">N/(N\N)&lt;</doubt><p>N: poss house(possfriend i) 'my friend's house' <b>6.</b><page local="37" global="181"/><b>6.3 Lexical Types for Compounds. </b>Syntactic compounds exhibit syntactic patterns similar to possessive constructions, but they signify semantic relations of a different kind. In what follows, we use the function comp to signify that the arguments in the PAS form a compound but say nothing about the range of productivity of this function. The lexical semantics of the arguments and a qualia structure (Pustejovsky 1991) may indicate the function's range of applicability. Lexical type assignments for compound markers are as in (52).</p><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">b</doubt><doubt alpha="48.6" length="37" tooSmall="False" monospace="0.0">(52) -COMP :=0s-MN\&lt;N\MN:\x.\y.compxy</doubt><doubt alpha="22.7" length="22" tooSmall="False" monospace="0.0">a m       m        n b</doubt><doubt alpha="63.8" length="47" tooSmall="False" monospace="0.0">-COMP2 :=os-MN\MN\&lt;N\MN: Ax.Ay.Az.comp(compxy)z</doubt><p>(nested comp)</p><p>Syntactic compounds are formed by means of compound markers that are attached to the head of the compound. For morphosyntactic modality, nonreferentiality of the head implies no inflection (M N) or modification (53a-b). The left component can be a noun group (53c) in which there is ambiguity in the scope of modification. This is regulated by typing, for example, the intersective adjectives ambiguous as noun modifiers ( <i>&lt;</i><i> </i><i>N/</i><i> &lt; N) </i>and compound modifiers (M <i>N/</i><i> </i>M N).<footnote anchor="19"/> The overall compound may be inflected only for case (see, e.g., (53d) and (53e)).</p><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">nnmm</doubt><p>(53) <i>a.otobus bilet-i b.*otobus ye§il bilet-i</i> bus   ticket-COMP green 'bus ticket'</p><p><i>c. </i><i>yesil   otobus bilet-i </i>green bus ticket-COMP green(comp ticket bus)</p><p>or comp(ticket(green bus))</p><p><i>d. otobus bilet-i-ni </i>e.*otobus <i>bilet-i-si</i> ticket-COMP-ACC2 ticket-COMP-POSS</p><p>Compound markers serve the dual function of compounding and agreement in possessive constructions; double marking of the possessive is suppressed (cf. 54a-b). The -COMP2 type assignment in (52) handles nested compounds.</p><doubt alpha="66.7" length="57" tooSmall="False" monospace="0.0">(54)a.banka-nin faiz     oran-i b.*banka-ninfaizoran-i-si</doubt><p>bank-GEN interest rate-COMP.POSS rate-COMP-POSS 'interest rate of the bank'</p><p>We claim that plural compounds are lexically composite functions in a similar vein. This claim has some empirical support from the lexicalization of <i>-leri </i>as a third person plural possessive marker; see (55b-c). It follows that <i>-leri </i>has the lexi-</p><p>19 I am grateful to the anonymous reviewer who proposed this alternative.</p><page local="38" global="182"/><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&gt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">N</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&lt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">b</doubt><p>cal types of -COMP and -COMP2 with plural and possessive composition: <i>Xx.Xy</i>.plu (comp xy).</p><p>(55)   <i>a.otobus bilet-leri b.onlar-in ev-leri</i> bus   ticket-COMP.PLU they-GEN3 house-POSS3p</p><p>'bus tickets' 'their house' c<i>.onlar-in ev-ler-i</i> they-GEN3 house-PLU-POSS3s 'their houses' <b>6.</b><b>6.4 Derivation of Compounds. </b>(56) exemplifies derivations with the type assign­ments in (52). (56a-b) show that both the narrow and the wide scope of the modifier can be accounted for. (56c-d) show that the compound marker interacts with the pos­sessive. Hence, it must carry both poss and comp in possessive constructions involving compounds. (56e-f) are examples of nested compounds. (56f-g) show the effect of strict control (   <i>N) </i>over the compound's head.</p><doubt alpha="58.1" length="31" tooSmall="False" monospace="0.0">(56) a.yesil    otobüs bilet -i</doubt><p>green     bus ticket -COMP</p><doubt alpha="63.6" length="11" tooSmall="False" monospace="0.0">n n b bmn b</doubt><doubt alpha="55.6" length="18" tooSmall="False" monospace="0.0">&lt;N/&lt;N&lt;N&lt;NMN\&lt;N\MMN</doubt><doubt alpha="57.1" length="7" tooSmall="False" monospace="0.0">&lt;NMN\&lt;N</doubt><p><i>m </i><i>N: </i>comp ticket(green bus) b.     <i>yesil    otobüs bilet -i</i></p><doubt alpha="41.7" length="12" tooSmall="False" monospace="0.0">m       mbmn</doubt><doubt alpha="57.1" length="14" tooSmall="False" monospace="0.0">MN/MN&lt;NMN\&lt;N-&lt;</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">m</doubt><p>1N: green (comp ticket bus)</p><doubt alpha="48.6" length="37" tooSmall="False" monospace="0.0">c.banka     -nin       faiz   oran -i</doubt><p>bank   -GEN   interest rate -COMP.POSS <i>N:</i><i> </i>poss(comp rate interest)bank 'interest rate of the bank' bank    -GEN    interest rate -COMP.<page local="39" global="183"/>POSS.PLU</p><doubt alpha="57.9" length="19" tooSmall="False" monospace="0.0">NN/(N\N)\NNNN\N\N\N</doubt><doubt alpha="50.0" length="12" tooSmall="False" monospace="0.0">N/(N\N)N\N\N</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">N\N</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&gt;</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">&lt;</doubt><doubt alpha="66.7" length="3" tooSmall="False" monospace="0.0">N\N</doubt><doubt alpha="52.5" length="40" tooSmall="False" monospace="0.0">d.banka     -nin       faiz   oran -lari</doubt><doubt alpha="47.2" length="36" tooSmall="False" monospace="0.0">NN/(N\N)\NNNN\N\N\N N/(N\N)  * N\N\N</doubt><p><i>N: </i>poss(plu(comp rate interest))bank 'interest rates of the bank'</p><doubt alpha="64.9" length="77" tooSmall="False" monospace="0.0">e.kredi kart     -i      faiz   oran -icredit card -COMP interest rate -COMP2</doubt><doubt alpha="64.7" length="17" tooSmall="False" monospace="0.0">NNN\N\NNN N\N\N\N</doubt><doubt alpha="50.0" length="10" tooSmall="False" monospace="0.0">N\N&lt;N\N\N&lt;</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">NN\N</doubt><p>N: comp(comp rate interest)(comp card credit) credit card interest rate f.<i>kredi kart-i yillik   faiz oran-i g.*kredi kart-ifaiz yillik oran-i</i> 'credit card annual interest rate'</p><doubt alpha="100.0" length="6" tooSmall="False" monospace="0.0">annual</doubt></section><section title="7. Conclusion"><p>Theoretical and computational commitment to word-based grammar—and to regard inflectional morphology as a word-internal process—puts artificial limits on specify­ing the syntactic and semantic domains of all meaning-bearing elements and on the transparent projection of scope from the lexicon. Designating words as minimal units of the lexicon is too constraining for many languages. This traditional notion is also challenged in current linguistic theorizing (e.g., Jackendoff 1997 and Keenan and Sta­bler 1997). Marslen-Wilson (1999) argues on psycholinguistic grounds that the lexicon must be morphemic even for morphologically simpler languages such as English.</p><p>We have argued in this article that the key to the integration of inflectional mor­phology and syntax is granting representational status to morphemes, which, in a computational system, requires certain precautions. What we propose is enriching the expressive power of the combinatory morphemic lexicon to factor in morphosyntactic types and attachment modalities. Coupled with flexible constituency in the grammar and directionality information coming from the lexicon, these extensions provide the grammar with the information it requires to compute the transparent semantics of morphosyntactic phenomena. This flexibility causes neither inefficiency in parsing nor uncontrolled expressivity. The extensions do not affect the polynomial worst-case com­plexity results, and category unity is preserved by lattice consistency. The result is a morphemic grammar-lexicon with computationally desirable features such as mod­ularity and transparency. The system is available at ftp://ftp.lcsl.metu.edu.tr/pub/ tools/msccg.</p><page local="40" global="184"/></section><section title="Acknowledgments"><p>I am very grateful to four anonymous <i>CL </i>reviewers for extensive commentary and suggestions. Thanks to Wolf Konig and Stefan Muller for the data; and to Jason Baldridge, Gann Bierner, Aysenur Birturk, Ruken Cakici, Nissim Francez, Stasinos Konstantopoulos, Markus Kracht, Geert-Jan Kruijff, Alan Libert, Mark Steedman, limit Turan and Deniz Zeyrek for comments, advice, and criticism.</p></section><references><p>Aho, Alfred V. and Jeffrey D. Ullman. 1972.</p><p><i>The Theory of Parsing, Translation, and Compiling. </i>Vol. 1. Prentice-Hall.</p><doubt alpha="64.7" length="34" tooSmall="False" monospace="0.0">Anderson, Stephen R. 1982. Where s</doubt><p>morphology? <i>Linguistic Inquiry, </i>13(4):571-612.</p><p>Bach, Emmon. 1983. On the relationship between word-grammar and phrase-grammar. <i>Natural Language and Linguistic Theory, </i>1:65-89.</p><p>Baldridge, Jason. 1999. Strong equivalence of CCG and Set-CCG. Unpublished manuscript, University of Edinburgh.</p><p>Bar-Hillel, Yehoshua, Chaim Gaifman, and Eliyahu Shamir. 1960. On categorial and phrase structure grammars. <i>Bulletin of the Research Council of Israel, </i>9F:1-16.</p><doubt alpha="66.7" length="33" tooSmall="False" monospace="0.0">Bozsahin, Cem. 1998. Deriving the</doubt><p>Predicate-Argument structure for a free word order language. In <i>Proceedings of COLING-ACL 1998, </i>Montreal, pages 167-173.</p><p>Bozsahin, Cem. 2000a. Directionality and the lexicon: Evidence from gapping. Unpublished manuscript, Middle East Technical University, Ankara.</p><p>Bozsahin, Cem. 2000b. Gapping and word order in Turkish. In <i>Proceedings of the 10th International Conference on Turkish Linguistics, </i>Istanbul.</p><p>Bozsahin, Cem and Elvan Gocmen. 1995. A categorial framework for composition in multiple linguistic domains. In <i>Proceedings of the 4th International Conference on Cognitive Science ofNLP, </i>Dublin.</p><p>Bresnan, Joan. 1995. Lexical-Functional syntax. Course notes. Seventh European Summer School in Logic, Language, and Information, Barcelona.</p><p>Calder, Jonathan, Ewan Klein, and Henk Zeevat. 1988. Unification categorial grammar. In <i>Proceedings of the 12th International Conference on Computational Linguistics, </i>Budapest, pages 83-86.</p><p>Carpenter, Bob. 1992. Categorial Grammar, lexical rules, and the English predicative. In R. Levine, editor, <i>Formal Grammar: Theory and Application. </i>Oxford University</p><p>Press, pages 168-242.</p><p>Carpenter, Bob. 1997. <i>Type-Logical Semantics.</i></p><doubt alpha="63.9" length="36" tooSmall="False" monospace="0.0">MIT Press. Carpenter, Bob. 1999. The</doubt><p>Turing-completeness of Multimodal Categorial Grammars. In <i>Papers Presented to Johan van Benthem in Honor of his 50th</i> <i>Birthday.</i><i> </i>ESSLLI, Utrecht.</p><p>Carpenter, Bob and Gerald Penn. 1994. <i>The Attribute Logic Engine User's Guide, Version 2.0. </i>Carnegie Mellon University.</p><p>Chomsky, Noam. 1970. Remarks on nominalization. In R. Jacobs and P. Rosenbaum, editors, <i>Readings in English Transformational Grammar. </i>Ginn, Waltham,</p><p>MA, pages 184-221.</p><p>Chomsky, Noam. 1995. <i>The Minimalist</i></p><p><i>Program. </i>MIT Press. Collins, Michael. 1997. Three generative, lexicalised models for statistical parsing.</p><p>In <i>Proceedings of the 35th Annual Meeting of</i> <i>the ACL.</i><i></i></p><p>Creider, Chet, Jorge Hankamer, and Derick Wood. 1995. Preset two-head automata and natural language morphology. <i>International Journal of Computer</i> <i>Mathematics, </i>58:1-18.</p><p>Dalrymple, Mary and Ronald M. Kaplan. 2000. Feature indeterminacy and feature resolution. <i>Language, </i>76:759-798.</p><p>Dowty, David. 1979. <i>Word Meaning and Montague Grammar. </i>Kluwer, Dordrecht.</p><p>Dowty, David. 1991. Toward a minimalist theory of syntactic structure. In <i>Tilburg Conference on Discontinuous Constituency,</i></p><p>January 1989.</p><p>Dowty, David. 1996. Non-constituent coordination, wrapping, and Multimodal Categorial Grammars. In <i>International Congress ofLogic, Methodology, and Philosophy, </i>Florence, August.</p><p>Eisner, Jason. 1996. Efficient normal-form parsing for Combinatory Categorial Grammar. In <i>Proceedings ofthe 34th Annual Meeting ofthe ACL, </i>pages 79-86.</p><p>Fong, Sandiway. 1991. <i>Computational Properties ofPrinciple-Based Grammatical Theories. </i>Ph.D. dissertation, MIT.</p><p>Gungordu, Zelal and Kemal Oflazer. 1995. Parsing Turkish using the Lexical-Functional Grammar formalism.</p><p><i>Machine Translation, </i>10:293-319.</p><p>Hankamer, Jorge. 1989. Morphological parsing and the lexicon. In W. Marslen-Wilson, editor, <i>Lexical Representation and Process. </i>MIT Press.</p><page local="41" global="185"/><p>Hepple, Mark. 1990a. <i>The Grammar and Processing of Order and Dependency: A Categorial Approach. </i>Ph.D. dissertation, University of Edinburgh.</p><p>Hepple, Mark. 1990b. Normal form theorem proving for the Lambek Calculus. In <i>Proceedings of COLING 1990.</i></p><p>Hepple, Mark and Glyn Morrill. 1989. Parsing and derivational equivalence. In <i>Proceedings of the 4th EACL, </i>Manchester.</p><p>Heylen, Dirk. 1997. Underspecification in Type-Logical Grammars. In <i>Logical Aspects of Computational Linguistics (LACL), </i>Nancy.</p><p>Heylen, Dirk. 1999. <i>Types and Sorts: Resource Logic for Feature Checking. </i>Ph.D. dissertation, Utrecht University.</p><p>Hockenmaier, Julia, Gann Bierner, and Jason Baldridge. 2000. Providing robustness for a CCG system. Unpublished manuscript, University of Edinburgh.</p><p>Hoeksema, Jack. 1985. <i>Categorial Morphology. </i>Garland, New York.</p><p>Hoeksema, Jack and Richard D. Janda. 1988. Implications of process-morphology for Categorial Grammar. In Richard T. Oehrle, Emmon Bach, and Deirdre Wheeler, editors, <i>Categorial Grammars and Natural Language Structures. </i>D. Reidel, Dordrecht, pages 199-247.</p><p>Hoffman, Beryl. 1995. <i>The Computational Analysis of the Syntax and Interpretation of "Free" Word Order in Turkish. </i>Ph.D. dissertation, University of Pennsylvania.</p><p>Jackendoff, Ray. 1997. <i>The Architecture of the Language Faculty. </i>MIT Press.</p><p>Jacobson, Pauline. 1996. The syntax/ semantics interface in Categorial Grammar. In Shalom Lappin, editor, <i>The Handbook of Contemporary Semantic Theory.</i></p><p>Blackwell, 89-116.</p><p>Janeway, Roger. 1990. Unacceptable ambiguity in Categorial Grammar. In <i>Proceedings of the Ninth West Coast Conference on Formal Linguistics, </i>pages 305-316.</p><p>Johnson, Mark. 1988. Deductive parsing with multiple levels of representation. In <i>Proceedings of the 26th Annual Meeting of the</i> <i>ACL, </i>pages 241-248.</p><p>Johnson, Mark and Sam Bayer. 1995. Features and agreement in Lambek Categorial Grammar. In <i>Proceedings ofthe 1995 ESSLLI Formal Grammar Workshop,</i> pages 123-137.</p><p>Jurafsky, Daniel and James H. Martin. 2000. <i>Speech and Language Processing. </i>Prentice-Hall.</p><p>Karttunen, Lauri. 1989. Radical lexicalism. In Mark Baltin and Anthony Kroch, editors, <i>Alternative Conceptions of Phrase Structure. </i>University of Chicago Press,</p><p>pages 43-65. Keenan, Edward L. and Edward Stabler.</p><p>1997. Bare grammar. Course notes, Ninth</p><p>European Summer School on Logic,</p><p>Language, and Information,</p><p>Aix-en-Provence. Komagata, Nobo. 1997. Efficient parsing for</p><p>CCGs with generalized type-raised categories. In <i>Proceedings of the Fifth Int.</i></p><p><i>Workshop on Parsing Technologies, </i>pages 135-146.</p><p>Konig, Esther. 1989. Parsing as natural deduction. In <i>Proceedings of the 27th Annual</i> <i>Meeting ofthe ACL, </i>pages 272-279.</p><p>Kracht, Markus. 1999. Referent systems, argument structure, and syntax. ESSLLI</p><p>lecture notes, Utrecht. Kruijff, Geert-Jan M. and Jason M.</p><p>Baldridge. 2000. Relating categorial type logics and CCG through simulation.</p><p>Unpublished manuscript, University of</p><p>Edinburgh. Lambek, Joachim. 1958. The mathematics of sentence structure. <i>American Mathematical</i> <i>Monthly, </i>65:154-170.</p><p>Marslen-Wilson, William. 1999. Abstractness and combination: The morphemic lexicon. In Simon Garrod and Martin J. Pickering, editors, <i>Language Processing. </i>Psychology Press, East Sussex, UK, pages 101-119.</p><p>Miller, Philip H. and Ivan A. Sag. 1997.</p><p>French clitic movement without clitics or movement. <i>Natural Language and Linguistic</i> <i>Theory, </i>15:573-639.</p><p>Moortgat, Michael. 1988a. <i>Categorial Investigations: Logical and Linguistic Aspects ofthe Lambek Calculus. </i>Foris, Dordrecht.</p><p>Moortgat, Michael. 1988b. Mixed composition and discontinuous dependencies. In Richard T. Oehrle, Emmon Bach, and Deirdre Wheeler, editors, <i>Categorial Grammars and Natural Language Structures. </i>D. Reidel, Dordrecht, pages 319-348.</p><p>Moortgat, Michael and Richard T. Oehrle.</p><p>1994. Adjacency, dependency and order.</p><p>In <i>Proceedings ofthe Ninth Amsterdam</i></p><p><i>Colloquium. </i>Morrill, Glyn V. 1994. <i>Type Logical Grammar:</i></p><p><i>Categorial Logic ofSigns. </i>Kluwer.</p><doubt alpha="64.7" length="34" tooSmall="False" monospace="0.0">Morrill, Glyn V. 1999. Geometry of</doubt><p>lexico-syntactic interaction. In <i>Proceedings</i> <i>ofthe Ninth EACL, </i>Bergen.</p><p>Müller, Stefan. 1999. <i>Deutsche Syntax deklarativ. Head-Driven Phrase Structure Grammar für das Deutsche. </i>Linguistische Arbeiten 394. Max Niemeyer Verlag, Tübingen.</p><p>Oflazer, Kemal. 1994. Two-level description of Turkish morphology. <i>Literary and Linguistic Computing, </i>9(2).</p><page local="42" global="186"/><p>Oflazer, Kemal. 1996. Error-tolerant finite-state recognition with applications to morphological analysis and spelling correction. <i>Computational Linguistics,</i> 22:73-89.</p><p>Oflazer, Kemal, Elvan Gocmen, and Cem Bozsahin. 1994. An outline of Turkish morphology. Report to NATO Science</p><p>Division SfS III (TU-LANGUAGE),</p><p>Brussels.</p><p>Oflazer, Kemal, Bilge Say, Dilek Zeynep Hakkani-Tur, and Gokhan Tiir. 2001. Building a Turkish treebank. In Anne Abeille, editor, <i>Building and Exploiting Syntactically-Annotated Corpora. </i>Kluwer.</p><p>Pareschi, Remo and Mark Steedman. 1987. A lazy way to chart-parse with Categorial Grammars. In <i>Proceedings ofthe 25th Annual Meeting ofthe ACL, </i>pages 81-88.</p><p>Pereira, Fernando C. N. and Stuart M. Shieber. 1987. <i>Prolog and Natural-Language Analysis. </i>CSLI, Stanford, CA.</p><p>Pollard, Carl and Ivan A. Sag. 1994.</p><p><i>Head-Driven Phrase Structure Grammar.</i></p><p>University of Chicago Press. Pulman, Stephen G. 1996. Unification encodings of grammatical notations.</p><p><i>Computational Linguistics, </i>22:295-327. Pustejovsky, James. 1991. The generative lexicon. <i>Computational Linguistics,</i> 17(4):409-441.</p><p>Sehitoglu, Onur. 1996. A sign-based phrase structure grammar for Turkish. Master's thesis, Middle East Technical University.</p><p>Sehitoglu, Onur and Cem Bozsahin. 1999. Lexical rules and lexical organization: Productivity in the lexicon. In Evelyne Viegas, editor, <i>Breadth and Depth of Semantic Lexicons. </i>Kluwer, pages 39-57.</p><p>Steedman, Mark. 1985. Dependency and coordination in the grammar of Dutch and English. <i>Language, </i>61(3):523-568.</p><p>Steedman, Mark. 1987. Combinatory grammars and parasitic gaps. <i>Natural Language and Linguistic Theory, </i>5:403-439.</p><p>Steedman, Mark. 1988. Combinators and grammars. In Richard T. Oehrle, Emmon Bach, and Deirdre Wheeler, editors, <i>Categorial Grammars and Natural Language Structures. </i>D. Reidel, Dordrecht, pages 417-442.</p><p>Steedman, Mark. 1991a. Structure and intonation. <i>Language, </i>67:260-298.</p><p>Steedman, Mark. 1991b. Type raising and directionality in Combinatory Grammar. In <i>Proceedings of the 29th Annual Meeting of</i> <i>the ACL, </i>pages 71-78.</p><p>Steedman, Mark. 1996. <i>Surface Structure and</i></p><p><i>Interpretation. </i>MIT Press. Steedman, Mark. 2000a. <i>The Syntactic</i></p><p><i>Process. </i>MIT Press. Steedman, Mark. 2000b. Information structure and the syntax-phonology grammar. Unpublished manuscript,</p><doubt alpha="63.5" length="85" tooSmall="False" monospace="0.0">interface.Linguistic Inquiry,31(4): 649-689. Szabolcsi, Anna. 1983. ECP in categorial</doubt><p>Max-Planck Institute. Szabolcsi, Anna. 1987. Bound variables in syntax: Are there any? In <i>Proceedings ofthe</i></p><p><i>6th Amsterdam Colloquium, </i>pages 331-350. Tomita, Masaru. 1988. <i>The Generalized LR</i> <i>Parser/Compiler.</i><i> </i>Technical report, Center for Machine Translation, Carnegie Mellon</p><p>University.</p><p>Vijay-Shanker, K. and David J. Weir. 1993.</p><p>Parsing some constrained grammar formalisms. <i>Computational Linguistics,</i> 19:591-636.</p><p>Wechsler, Stephen and Larisa Zlatic. 2000. A theory of agreement and its application to Serbo-Croatian. <i>Language, </i>76:799-832.</p><p>Whitelock, Pete J. 1988. A feature-based categorial morpho-syntax for Japanese. In Uwe Reyle and C. Rohrer, editors, <i>Natural Language Parsing and Linguistic Theories</i>.D.</p><p>Reidel, pages 230-261.</p><p>Williams, Edwin. 1981. On the notions "lexically related" and "head of a word." combinators. In <i>Proceedings of the 25th Annual Meeting ofthe ACL, </i>pages 73-79.</p><doubt alpha="60.3" length="68" tooSmall="False" monospace="0.0">Linguistic Inquiry,12(2):245-274. Wittenburg, Kent. 1987. Predictive</doubt></references></body></article>