<?xml version="1.0"?><!DOCTYPE article SYSTEM "/project/take/software/searchbench_offline_processing/paperxml_generator/aclextractor/src/python/../resource/dtd/paperxml.dtd"><article><header><firstpageheader><page local="1" global="371"/><title>The application of two-level morphology to non-concatenative German morphology</title><author surname="Trost" givenname="Harald"><org  name="FRG Email:"/></author></firstpageheader><frontmatter><p><b>The application of two-level morphology to non-concatenative German morphology</b></p><p><b>Harald</b><b> </b><b>Trost</b></p><p>Deutsches Forschungszentrum für Künstliche Intelligenz (DFKI)l Stuhlsatzenhausweg 3, D-6600 Saarbrücken 11, FRG Email: htrost@sbuvax.campus.uni-sb.de</p></frontmatter><abstract>In this paper<footnote anchor="2"/> we describe a hybrid system for morphological analysis and synthesis. We call it hybrid because it consists of two separate parts interacting with each other in a well-defined way. The treatment of morphonology and non-concatenative morphology is based on the two-level approach originally proposed by Koskenniemi (1983). For the concatenative part of morphosyntax (i.e. affixation) we make use of a grammar based on feature-unification. Both parts rely on the same morph lexicon. Combinations of two-level morphology with feature-based morphosyntactic grammars have already been proposed by several authors (cf. Bear 1988a, Carson 1988, Görz &amp; Paulus 1988, Schiller &amp; Steffens 1990) to overcome the shortcomings of the continuation-classes originally proposed by Koskenniemi (1983) and Karttunen (1983) for the description of morphosyntax. But up to now no linguistically satisfying solution has been proposed for the treatment of non-concatenative morphology in such a framework. In this paper we describe an extension to the model which will allow for the description of such phenomena. Namely we propose to restrict the applicability of two-level rules by providing them with filters in the form of feature structures. We demonstrate how a well-known problem of German morphology, so-called "Umlautung", can be described in our approach in a linguistically motivated and efficient way. <b><footnote anchor="1"/>  Work on this project has begun while I was working for the Austrian Research Institute for Artificial Intelligence in Vienna, Austria</b> <b>^ I want to thank my collègues Greg Dogil, Wolfgang Heinz, Tibor Kiss and Günter Neumann for fruitful discussions and helpful comments on earlier versions of this paper.</b> </abstract></header><body><section title="Introduction"><p>Conventional morphosyntactic grammars do not allow to describe non-concatenative parts of morphology declaratively. Two-level morphology on the other hand can deal with some of these phenomena like vowel or consonant change, but there is no sound way to transmit information to the morphosyntactic grammar. This leads to quite unnatural solutions like the use of diacritics for the representation of morphosyntactic phenomena.</p><p>While German morphology is mainly based on concatenation, some non-concatenative phenomena do exist. The most prominent example is so-called "Umlautung". Umlautung means that in the surface form the original stem vowel is replaced by another vowel in a systematic manner. Possible transformations are a=&gt;ä, au=&gt;äu, o=&gt;ö, u=&gt;ü, and - in some cases - e=&gt;i.</p><p>Umlautung in German realizes quite different morphological features. With nouns it can mark the plural either by itself (e.g. Mutter -&gt; Mütter) or in combination with an explicit plural-ending (e.g. Mann =&gt; Männer), depending on the inflection class. With adjectives it is used to mark comparative forms (groß =&gt; größer =&gt; am größten), again in combination with an ending, for verbs following strong conjugation it marks the subjunctive II and 2nd and 3rd person singular of the indicative present tense. Umlautung also occurs in derivation in combination with a number of derivational particles, e.g. <i>-lich </i>(klagen =&gt; kläglich). In contrast to its use in inflection, umlautung provides for no extra morphosyntactic information in derivational forms. At last, it appears in compounding in combination with some "Fugenelement" (joining element) (e.g. <i>Männerchor - </i>male chorus).</p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">1</doubt><page local="2" global="372"/><p>There are two common ways to cope with umlautung in conventional morphological components for German. One is to treat all forms created by umlautung as suppletions, i.e. these forms are explicitly entered into the lexicon. This is linguistically inadequate, because it obscures the phonological similarity of the two forms. From a more practical point of view it has the drawback that in a few cases, e.g. forming of the diminutive with the derivational <i>-chen,</i><i> </i>umlautung is still productive, and cannot therefore be lexicalized.</p><p>The other solution is a special function replacing (and interpreting) or generating the umlaut in all stems which are marked for umlautung required by the morphosyntactic context (cf. Trost &amp; Dorffner 1987). This makes umlautung a special case neglecting its status as a regular means of morphosyntactic marking.</p><p>Solutions within the two-level approach have also been proposed. They rely on the idea to represent stem vowels which exhibit umlautung with special characters (diacritics) (e.g. <i>A)</i><i> </i>at the lexical level. These characters are then realized as either the regular vowel (e.g. <i>a)</i><i> </i>or the corresponding umlaut (e.g. <i>a)</i><i> </i>at the surface level. The idea behind is that these stem vowels are lexically somewhat underspecified. To trigger the appropriate substitution, Görz &amp; Paulus (1988) use a separate data structure to control for each word form which of the two possible rules is applied to create the surface structure. Schiller &amp; Steffens (1989) use still another diacritic symbol for this task. Flexional endings triggering umlautung start with the diacritic $ (realized as 0 at the surface level). The context to the right of the substitution of all umlaut rules requires the occurence of that $. Therefore the umlaut rule would fail if no such affix follows the stem. As a consequence, the null morph must be explicitly represented by $ in lexical strings where morphosyntactic information is expressed by umlautung only (e.g. <i>Mutter =&gt; Mutter).</i></p><p>Although both solutions certainly do work, at least for flexional morphology, they provide no clean and general solution for the integration of umlautung in the framework of two-level morphology. The use of a separate data structure is contrary to the intuition that umlautung is a regular phenomenon of German morphology, the treatment of which should require no extra mechanism. And the use of the diacritic $ places a burden on morphonology which clearly belongs to morphosyntax.</p><p>The handling of non-concatenative morphological phenomena within the two-level approach imposes two new requirements:</p><p>• Information about the application of a rule needs to be transferred to the morpho­syntactic grammar.</p><p>• It must be possible to restrict the application of two-level rules to certain classes of rnorphs.</p><p>Accordingly, we propose an approach where umlautung requires no extra mechanism at all and where no diacritics are (mis)used to describe morphosyntactic features. The basic idea is to provide two-level rules with a filter in form of a feature structure which controls its applicability. This feature structure has to be unified with the feature structure of the morph found in the lexicon to which the rule applies. In case of failure the two-level rule may not be applied. If unification succeeds information is transferred that way from the two-level part to the associated morphosyntactic grammar. This is crucial for the treatment of umlautung because, as mentioned above, its application conveys morphosyntactic meaning.</p><p>In the following we will describe the parts of our system in some detail and explain how umlautung can be handled using that framework. (Basic knowledge of the two-level approach and feature-unification is presupposed.) We will also argue that extending the two-level rules with filters facilitates the description of certain morphonological phenomena as well.</p><p>Hie Two<b>-Level</b><b> </b><b>Part</b></p><p>Our implementation of the two-level part is similar to the one proposed by Bear (1988a, b), i.e. rules are interpreted directly and not compiled into automata. Rules consist of a left context, a right context and a substitution. Left and right contexts are regular expressions over pairs of lexical and surface symbols. A substitution consists of exactly one such pair. Rules may be optional or obligatory (i.e. in contrast to Bear there are no disallowed rules). By definition, all default pairs are regarded as optional rules with empty contexts.</p><p>The pair of strings (lexical and surface) is processed from left to right. If more than one optional rule is applicable at a time this shows an ambiguity, i.e. there are as many continuations as there are different substitutions.<page local="3" global="373"/> Obligatory rules supercede all optional ones (thereby pruning the tree of continuations). If more than one obligatory rule is applicable at the same time (enforcing different substitutions) the whole mapping must be discarded. The same is true if no rule applies at all.</p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">2</doubt><p>The major difference from other two-level approaches is the possibility to provide the rules with a filter. A filter is an arbitrary feature structure. A rule may only be applied if the filter unifies with the feature-structure of the actual morph, i.e. the morph to which the substitution applies. Filters are used to restrict the application of a rule to certain classes of morphs. This is in contrast to the original view of Koskenniemi that morphonological rules are to be applied over the whole lexicon regardless of morphosyntactic considerations. This is certainly true of post-lexical rules. But there is evidence that it is not even true for all morphonological rules. Take e.g. the verb <i>senden </i>(to send), which can form two different past tenses <i>send-e.-te </i>and <i>sand-te, </i>the former being regular weak conjugation, the latter a strong stem with weak inflection ending. The epenthesis of schwa (or <i>e </i>in orthography) depends on the morphological class of the stem (weak or strong). Or take the adjective <i>dunkel, </i>where the nominalization <i>im Dunk-e-ln </i>(in the dark) is different from the attributive use <i>den dunkl-e.-n Mantel </i>(the dark coat) (cf. Gigerich 1987). Here nominalization requires schwa epenthesis in the stem, not at the morph boundary like the adjective.</p><p>If we want to use two-level rules for the description of non-concatenative morphology, such filters are necessary anyway. Because, as mentioned above, we do need some means to convey information from the two-level part to the morphosyntactic grammar. In the case of umlautung we suppose that it is triggered by the concatenation of a stem which is lexically marked for umlaut (by the occurrence of a diacritical character <i>A, </i><i>O,</i><i> </i><i>U</i><i> </i>or <i>E)</i><i> </i>with an affix allowing for umlautung (i.e. carrying the feature [umlautung: +]). Therefore the filter for all rules concerning umlautung basically contains the feature-value pair which marks affixes [umlautung: +/-].</p><p>Umlautung must only be performed if the stem allows for umlautung and that feature has the value +. Accordingly, all two-level rules substituting a vowel by its umlaut have the filter [umlautung +]. Corresponding rules are needed which keep the original vowel in the surface form. They have the filter [umlautung: -J. All the above-mentioned rules are obligatory, and exactly one of them applies to every occurence of a stem marked for umlaut (see figure 1).</p><p>Rule:  Rule Status:</p><p>Rule Filter:</p><doubt alpha="64.8" length="71" tooSmall="False" monospace="0.0">head [cat: stem] agr [umlautung: -] head [cat: stem] agr [umlautung: +]</doubt><p>Fig. 1 : Two-level rules for lexical <i>a</i></p><p>To select exactly one rule (in order to prevent a deadlock between two obligatory rules) the feature <i>umlautung </i>must always have a value. But, as stated above, stems are not lexically marked for this feature. The marking is effected by the morphosyntactic grammar, which unifies the agreement features of affix and stem, thereby transferring the feature <i>umlautung </i>to the feature structure of the stem (see figure 5).</p><p>What is important to note here, is that the two-level rules themselves contain no information about the morphosyntactic interpretation of umlautung. This is only specified in the grammar rules. Therefore the same two-level rules can be used for all the different places, where umlautung occurs. We will now shortly describe the morphosyntactic part of our system.</p><p>Feature-Based Morphosyntax</p><p>Concatenative morphology is described in grammar rules following X-bar theory. A head-driven approach is adopted. The basic structure consists of a head, usually some sort of affix, and one or more complements, one of which must be some type of stem. We will not go into any detail concerning the exact format of the grammar rules here, because it is irrelevant for the treatment of umlaut. For the purpose of this paper it suffices to give just one example.</p><p>We will describe the overall structure of the grammar using the noun <i>Mann</i><i> </i>(man) as an example. Nouns are constructed from stem, number marker, and case marker. The number marker forms the head and subcategorizes for a stem and a case marker. The relevant syntactic information is collected in the agreement feature which is passed upwards from the daugthers.</p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">3</doubt><table class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>(A—&gt;a) obligatory</p></td><td class="cell"><p>syn</p></td><td class="cell"><p>loc</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p>{A-»a) obligatory</p></td><td class="cell"><p>syn</p></td><td class="cell"><p>loc</p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><page local="4" global="374"/><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">syn</doubt><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">loc</doubt><p>Figure 2 shows (a simplified version of) the number markers for [flex-class: er]. We can see that the plural marker triggers umlautung, while the singular marker does not. Both subcategorize for a stem and a case marker.</p></section><section title="morph: end*"><doubt alpha="63.2" length="19" tooSmall="False" monospace="0.0">"bar[max:-lLmin: +J</doubt></section><section title="cat: noun-stem]"><doubt alpha="62.5" length="8" tooSmall="False" monospace="0.0">"num: sg</doubt></section><section title="person: 3"><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">head</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">agr"</doubt></section><section title="umlautung:"></section><section title="subcat"></section><section title="first"><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">rest</doubt><doubt alpha="54.5" length="11" tooSmall="False" monospace="0.0">bar [max:+]</doubt></section><section title="headf st . em  1 Lilex -class: erJ"></section><section title="Lagr'"></section><section title="syn loc"><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">■©</doubt><p><b>noun-flex.</b></p><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">heai</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">agr'</doubt><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">©</doubt><doubt alpha="58.3" length="12" tooSmall="False" monospace="0.0">.rest: *end*</doubt><doubt alpha="66.7" length="36" tooSmall="False" monospace="0.0">"morph: [firstt:#er. JrLrest: *end*J</doubt><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">&lt;D</doubt><doubt alpha="0.0" length="7" tooSmall="False" monospace="0.0">.....;]</doubt></section><section title="cat: noun-stem] num: pi person: 3 _umlautung: +_"></section><section title="cat: stem flex-class: er-"><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">agr</doubt><doubt alpha="50.0" length="2" tooSmall="False" monospace="0.0">(D</doubt><doubt alpha="66.7" length="21" tooSmall="False" monospace="0.0">icat:. 1t. noun-tlexJ</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">agr®</doubt><p>Fig.2: Number markers for nouns with plural -er (unmarked singular and plural #er)</p><p>Figure 3 shows the lexical entry for the stem <i>Mann,</i><i> </i>which may take an umlaut (its stem vowel is <i>A).</i><i> </i>The number marker takes that stem as a complement. The agreement featuresare shared between head and complement. As one result, the <i>ic</i><i>&amp;tuvQumlautung </i>is transferred from the number marker to the stem. It is now locally available to trigger the correct umlaut rule.</p><doubt alpha="100.0" length="5" tooSmall="False" monospace="0.0">morph</doubt><p>first: mAnn .rest: *end* bar head agr cat: stem root: mannflex-class: er genus: n</p><doubt alpha="58.3" length="12" tooSmall="False" monospace="0.0">lax:+1lin:+J</doubt><p>subcat: *end* Fig.3: Lexical entry for Mann (man)</p><p>Figure 4 shows two different case markers for the unmarked case and for dative plural. After combining with a stem, the number marker may now take a case marker as its second complement.</p><doubt alpha="80.0" length="5" tooSmall="False" monospace="0.0">rest:</doubt><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">bar</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">agr©</doubt><doubt alpha="66.7" length="12" tooSmall="False" monospace="0.0">morph: *end*</doubt><doubt alpha="47.6" length="21" tooSmall="False" monospace="0.0">casus:{1,3,4}.num: sg</doubt><doubt alpha="41.7" length="24" tooSmall="False" monospace="0.0">casus: {1, 2, 4} num: pi</doubt><p>Fig.4: Case morphs for the unmarked case and for dative plural</p><p>The grammar fragment sketched in this section must be viewed with care. German inflectional endings often combine different morpho­syntactic information, e.g. with nouns ease and number information is sometimes expressed by a single morph. Explaining the unmarked case as a combination of stem with nullmorph is also somewhat problematic. A more relistic grammar would probably collapse parts of the tree into a single structure, i.e. by using case&amp;number markers, which subcategorize only for a stem. This would, for example, eliminate the problem which is posed to the parser by allowing for the occurence of more than one null morphs in one position (as is the case for 1st, 3rd and 4th singular where both number and case marker are realized by the null morph). Nevertheless, with regard to the handling of umlautung in our approach these problems are not relevant.</p></section><section title="The  Integration  into  the  Grammar"><p>We will now show how the two parts of our system work together. Take e.g. the dative plural of <i>Mann</i><i> </i>(man), <i>Männern.</i><i> </i>For generation, the grammar part constructs the lexical string <i>$mAnn#er#n$ </i>(# marks a morph boundary and $ a word boundary), which is given to the two-level part. The relevant lexical information for the purpose of umlautung is the stem vowel <i>A </i>in <i>mAnn </i>, and the feature [umlautung: +] in <i>iter. </i>As described in the last chapter, by structure sharing this information</p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">4</doubt><doubt alpha="54.5" length="11" tooSmall="False" monospace="0.0">first: #n"1</doubt><doubt alpha="66.7" length="6" tooSmall="False" monospace="0.0">*end*J</doubt><doubt alpha="37.5" length="8" tooSmall="False" monospace="0.0">[max: +]</doubt><doubt alpha="66.7" length="30" tooSmall="False" monospace="0.0">Lmin: +J head [cat: noun-flex]</doubt><doubt alpha="50.0" length="10" tooSmall="False" monospace="0.0">[casus: 3]</doubt><doubt alpha="66.7" length="9" tooSmall="False" monospace="0.0">Lnum: pi]</doubt><doubt alpha="42.9" length="7" tooSmall="False" monospace="0.0">[max: +</doubt><doubt alpha="63.3" length="30" tooSmall="False" monospace="0.0">Lmin: + head [cat: noun- flex]</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">cat:</doubt></section><section title='"bar[ m Lm'><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">m</doubt><doubt alpha="100.0" length="1" tooSmall="False" monospace="0.0">m</doubt><table class="main" frame="box" rules="all" border="1" regular="False"><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p><b>first</b></p></td><td class="cell"><p><b>syn</b></p></td><td class="cell"><p><b>loc</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p><b>t</b></p></td><td class="cell"><p></p></td><td class="cell"><p></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"><p><b>-rest:</b></p></td><td class="cell"><p><b>*end*</b></p></td><td class="cell"></td></tr><tr class="row"><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td><td class="cell"></td></tr></table><page local="5" global="375"/><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">syn</doubt><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">loc</doubt><doubt alpha="100.0" length="4" tooSmall="False" monospace="0.0">head</doubt><doubt alpha="75.0" length="4" tooSmall="False" monospace="0.0">agr"</doubt><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">agr</doubt><doubt alpha="100.0" length="3" tooSmall="False" monospace="0.0">bar</doubt><p>has already been enriched by the generation process providing <i>mAnn</i><i> </i>with the feature [umlautung: +]. When reaching the stem vowel <i>A </i>the rules try to unify their filters with the feature structure of <i>mAnn. </i>Only the umlaut rule succeeds, generating the correct surface form <i>$männern$.</i></p><p>Now one can also see why the (incorrect) form <i>Manner</i><i> </i>will not be accepted by the parser. The filter of the obligatory rule <i>A =&gt; a </i>would add the feature [umlautung -] to the feature structure of <i>mAnn.</i><i> </i>This inhibits the unification with the feature structure of <i>#er.</i></p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">0</doubt><doubt alpha="52.1" length="48" tooSmall="False" monospace="0.0">cat: noun person: 3. max: +] .min: - J casus [ ]</doubt><doubt alpha="56.2" length="32" tooSmall="False" monospace="0.0">gender [ ] num [ ] umlautung [ ]</doubt><doubt alpha="0.0" length="2" tooSmall="False" monospace="0.0">•0</doubt><doubt alpha="63.5" length="63" tooSmall="False" monospace="0.0">cat: noun] person: 3] max:-1jriin: -J gender[ ]num[]umlautung [</doubt><doubt alpha="56.5" length="46" tooSmall="False" monospace="0.0">head [cat: stem] bar [max: +] agr ©[gender [ ]</doubt><doubt alpha="60.0" length="10" tooSmall="False" monospace="0.0">&lt;rj"numC ]</doubt><p>umlautung [ ]</p><figure caption="Figure 5: Basic tree structure created by the grammar for nouns"></figure><p>The analysis starts with the surface form <i>$männern$. </i>Because no morph has been recognized yet, both <i>mAnn</i><i> </i>and <i>mann</i><i> </i>are derivable (because of the default rule <i>ä</i><i> </i><i>~&gt;</i><i> </i><i>a).</i><i> </i>At that point the morph <i>mAnn </i>is found in the lexicon. The filter is successfully unified with the lexical entry, transferring the information that umlautung has taken place. Now a morph boundary must be created. A <i>0 </i>is inserted in the surface form which is mapped to # in the lexical form. (There still remains the other possibility to look for a longer morph. This hypothesis can only be discarded when the end of the surface form is reached without finding a corresponding morph in the lexicon.) Next <i>iter </i>is found in the lexicon. Since that ending allows for umlautung (having the feature [umlautung: +]), the morphosyntactic grammar can combine it with the stem which has already been unified with the rule filter. Next the morph <i>Un</i><i> </i>is recognized, completing the analysis path.</p><p>At this point we want to remind you of the fact that the morphosyntactic grammar uses the notion of null morphs for all unmarked forms (e.g. in our example the singular marker). Null morphs are not included in the lexical string though. They operate solely at the level of the morphosyntactic grammar. Take e.g. the generation of <i>Mann </i>(nom sg). Although only <i>$mAnn$ </i>is created as lexical string, the null morph has enriched the associated feature structure with [umlautung: -] enforcing the generation of the surface string <i>Mann</i><i> </i>by blocking the umlaut rule.</p><p>Analysis works in a similar way. When <i>$mann$ </i>is input as surface string, it is mapped to the lexical string unchanged. It is now associated to the morph <i>mann</i><i> </i>the feature structure of which has been unified with the rule filter providing it with [umlautung: ]. To create a legal word form it must now be</p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">5</doubt><doubt alpha="64.3" length="42" tooSmall="False" monospace="0.0">head [cat: noun-flex] syn loc bar [max: +]</doubt><doubt alpha="41.7" length="12" tooSmall="False" monospace="0.0">[casus [ ] "</doubt><doubt alpha="33.3" length="9" tooSmall="False" monospace="0.0">num [ ] _</doubt><p>cat: noun<page local="6" global="376"/></p><doubt alpha="56.5" length="23" tooSmall="False" monospace="0.0">person: 3 max: -1min:+J</doubt><p>combined with number and case markers. These can only be null morphs and their agreement features must unify, which leads to the correct interpretation.</p><p>Another example from derivation shall demonstrate how well the marking of stem vowels and the feature <i>umlautung</i><i> </i>work together to define the occurence of umlautung. The verb <i>klagen </i>(to mourn) shows no umlaut in any of its forms. The same is true for the nominalization <i>Klage. </i>But the derived adjective <i>kläglich </i>surprisingly exhibits an umlaut. A closer look shows that this behaviour conforms to what our system predicts. The morph <i>klAg </i>is marked as a stem which may take umlaut. Since all endings of weak verb conjugation are marked with [umlautung -] no umlautung takes place for any of the verb forms. The same is true for the noun plural ending <i>itn. </i>But <i>Mich</i><i> </i>comes with the feature [umlautung +] triggering the umlaut-rule to produce the surface form <i>kläglich.</i></p><p>Unfortunately in derivation and composition there are exceptions to the rule. Contrary to our expectations we find the adjective <i>handlich </i>derived from the noun <i>Hand.</i><i> </i>Since the plural form of the noun is <i>Hände</i><i> </i>the morph must clearly be stored as <i>hAnd </i>in the lexicon which would yield <i>händlich </i>which is incorrect. There are two solutions to this problem. One can take the stance that in such cases derivation is no longer transparent and that these words should be entered into the lexicon as a whole.</p><p>The other solution would be to introduce exception markers with such morphs which block the application of the umlaut rule (say [flex-uml-poss: -] for flexion and [deriv-uml-poss: -] for derivation). Instead of the single feature <i>umlautung </i>for all affixes we then need to mark flexional endings and derivational endings with the features <i>flexional-umlautung </i>and <i>derivational-umlautung </i>respectively. The rule filters become more complex too. Umlaut rules are equipped with the following filter: <b>I </b>[flexional-umlaut: + ] [flex-uml-poss: +] I I [derivational-umlaut: +] [deriv-uml-poss: +] { the corresponding no-umlaut rules get an according one. All morphs not explicitly marked will behave like before, i.e. take umlautung in both cases.</p><p>Conclusion</p><p>We have shown a hybrid system for morphological analysis and synthesis, based on two-level morphology and unification-based grammar rules. By providing two-level rules with a filter in the form of a feature structure the application of these rules can be controlled by the morphosyntactic grammar in a consistent way. The filters are also used to transfer morphosyntactic information from the two-level part to the grammar. This allows the description of non-concatenative morphological phenomena using such rules without the use of (phonologically) unmotivated diacritics.</p><p>As an example, we have shown how our system can handle German umlautung in a linguistically satisfactory manner. Translation of the umlaut is performed by a two-level rule which is filtered by a feature <i>umlautung. </i>The morphosyntactic interpretation of the umlaut is only performed at the level of the grammar rules.</p><p>The proposed method can be applied to other non-concatenative phenomena as well. The idea of filters seems also to be a promising solution for morphonological phenomena which are restricted to certain classes of morphs (or words).</p></section><references><p>Bear J. (1988a): A Morphological Recognizer with</p><p>Syntactic and Phonological Rules, COLING-86,</p><p>Bonn, BRD. Bear J. (1988a): Generation and Recognition of</p><p>Inflectional Morphology, in: H.Trost (ed.), 4.Österreichische Artificial Intelligencc-Tagung,</p><p>Springer, Berlin, 3-7. Carson J. (1988): Unification and transduction in</p><p>Computational Phonology, COLING-88, Budapest, 106-111.</p><doubt alpha="66.7" length="42" tooSmall="False" monospace="0.0">Gigerich H. (1987): Zur Schwa-Epenthese im</doubt><p>Standarddeutschen, Linguistische Berichte 112,449­469.</p><p>Görz G., Paulus D. (1988): A Finite State Approach to German Verb Morphology, COLÏNG-88, Budapest, 212-215.</p><doubt alpha="62.2" length="37" tooSmall="False" monospace="0.0">Karttunen L. (1983): KIMMO: A General</doubt><p>Morphological Processor, Texas Linguistic Forum 22, 167-186.</p><p>Koskenniemi K. (1983): Two-level Model for Morphological Analysis, IJCAI-83, Karlsruhe, BRD, 683-685.</p><p>Schiller A., Steffens P. (1990): A Two-Level Morphology for a German natural language understanding system, IBM Stuttgart, manuscript.</p><doubt alpha="59.5" length="42" tooSmall="False" monospace="0.0">Trost H., Dorffner G. (1987): A System for</doubt><p>Morphological Analysis and Synthesis of German Texts, in: D. Hainline (ed.), New Developments in Computer-Assisted Language Learning, Grooms Helm, London.</p><doubt alpha="0.0" length="1" tooSmall="False" monospace="0.0">6</doubt></references></body></article>