<?xml version="1.0"?><!DOCTYPE article SYSTEM "/project/take/software/searchbench_offline_processing/paperxml_generator/aclextractor/src/python/../resource/dtd/paperxml.dtd"><article><header><firstpageheader><page local="1" global="297"/><title>THE (C,A,),T FRAMEWORK IN EUROTRA: A THEORETICALLY COMMITTED NOTATION FOR MT</title><author surname="Arnold" givenname="D.J."><org  name="THE &lt;C" country="USA"/></author><author surname="Varile" givenname="G.B."><org  name="THE &lt;C" country="USA"/></author><author surname="Tombe" givenname="L. des"><org  name="THE &lt;C" country="USA"/></author><author surname="M.Rosner"><org  name="THE &lt;C" country="USA"/></author><author surname="S.Krauwer"><org  name="THE &lt;C" country="USA"/></author></firstpageheader><frontmatter><p>THE &lt;C.A&gt;,T <u>FRAMEWORK IN EUROTRA :  ft THEORETICALLY COMMITTED NOTATION FOR HT</u></p></frontmatter><abstract></abstract></header><body><section title=""><p>D.J. Arnold: University of Essex, Colchester, Essex, C04 3SQ, UK.</p><p>S.Krauwer: University of Utrecht, Trans 14, 3512 JK   Utrecht, NL.</p><p>M.Rosner: ISSCO, 54, Route des Acacias,  1227 Geneva, Switzerland.</p><p>L. des Tombe: University of Utrecht, Trans 14, 3512 JK   Utrecht, NL.</p><p>G.B. Varile: Commission of the European Communities, L-2920 Luxembourg.</p><doubt alpha="58.3" length="12" tooSmall="False" monospace="0.0">0_bsj_ra_tt-</doubt><p>This paper describes a model for MT, developed within the Eurotra Ml project, based on the idea of <u>composi-tional transl</u>a<u>tion</u>, by describing a basic, experimen­tal notation which embodies the idea. The introduc­tion provides background, section 1 introduces the basic ideas and the notation, and section 2 discusses some of the theoretical and practical implications of the model, including some concrete extensions, and some more speculative discussion.</p><p>0;__Introduction,_aims and background.</p><p>As Kay (19Ü5) has emphasised, machine translation today is always experimental in nature. We think a number of things follow from this, among them the need for clear and rather strong theoretical princip­les which can be treated as hypotheses for testing. The idea is, of course, that such testing is revea­ling irrespective of the confirmation or disconfirma­tion of the hypotheses. Furthermore, especially where projects of considerable size are concerned, clear and explicitely stated theoretical principles are necessary for the çommoiL under standi ng of the problem.</p><p>We assume that it is possible to distinguish a number of different levels of description for MT theories (programmes, systems, etc), where in general, the relation between levels is that lower levels are motivated, or evaluated with respect to higher le­vels. The aim of this stratification is to introduce a kind of modularity, so that it is possible to preserve stability while responding to changing per­ceptions of 'the MT problem'. We distinguish the fol­lowing levels of description:</p><p>NuB; A set of executable programs, and descriptions of a set of languages and the relations between them. Hul; A set of substantive theories of representation, and a set of languages in which linguistic descrip­tions are expressed.</p><p>Mu2: The basic theory of translation, general theory of (linguistic) representation Si computational appa­ratus.</p><p>M_u3_: The basic principles, aims, goals, characteris­tic assumptions,    the 'spirit' of the enterprise.</p><p>The purpose of the paper is to discuss the Mu2 level, concentrating on the basic theory or model of trans­lation. The approach will be to describe a family of abstract, special purpose 'MT machines' by describing the syntax and semantics of a very basic notation for MT. We will make some assumptions about representa­tion and user languages for concreteness, but they will be simplified, and unrealistic in the main. Next we examine the model of translation by making expli­cit    some of the theoretical commitments implicit in the notation, discussing some its attractions, and weaknesses, and sketching some possible remedies for the latter. Section 1 will present the notation, with some relevant background, section 2 will discuss the commitment the notation makes.</p><p>iiJl!Li._tÊ.i.â&gt;L_!!LDiâL^i_yia.naLatiSi'</p><p>The ideas described in this section were first pre­sented at the the Colgate Conference on Methodologi­cal Issues in MT (Arnold et al, Des Tombe et al, 1985). A key idea in what follows is an interpreta­tion of the idea that translation is a 'compositio­nal' process. To our knowledge, the first applica­tion of this idea in MT is in the work of Jan Lands­bergen in the Rosetta project  (cf Landsbergen 19B4).</p><p>The fundamental problem of MT is to find a notation, with an associated interpreter, for describing the translation relations between texts in different languages in a 'natural' way. Since it seems impos­sible to provide such a notation for relating texts to texts directly, the standard response is to decom­pose the problem 'horizontally' into a sequence of steps,  as in :</p><doubt alpha="45.7" length="35" tooSmall="False" monospace="0.0">(1)  TLg — RL i—RL2—RL3...  RLn—TLT</doubt><p>where TLg and TLt are source and target text lan­guages, and the RLj are representation languages (or <u>levels</u> Df representation, as they are often called) of some sort. (Notice that in what follows we will systematically use the term 'language' ambiguously for both natural languages and representation lan­guages). Given such a picture one naturally thinks of the languages that are input and output of trans­fer, but for the purpose of this discussion they could be any pair of representation languages at all. What is crucial to this discussion is the as­sumption that MT characteristically involves more than one representation language. This paint is worth stressing. The following discussion will be couched in terms of a representational theory (Mul) that involves several specific levels of representation. We believe that this is motivated, but the interest of the general model in no way depends on the ex­istence of these levels.</p><p>Given this, one is obviously lead to consider (a) the nature of the representation languages, and (b) the nature of the relations between the representation languages themselves, and between the representation languages and the text languages.</p><p>We will not discuss the nature of the representation languages here in any detail (see Arnold et al, January 19B5, for detailed discussion), but it seems important that they should be:</p><page local="2" global="298"/><p>Di f f er eut i <u>at.</u><u> i</u> nq i This is required if the system is to preserve whatever properties are preserved under 'currect translation'. If two (unambiyuous) texts are not translation equivalents i.e. if they differ with respect to these properties, then the representation languages must be rich enough to pro­vide different representations for them.</p><p>(ii) L&amp;iLQ-âÎ!_Lfi. (specific and independently defi­nable): By 'learnable' we mean simply that it must be possible for linguists (who must state the relations hetween a language and its neighbours, ultimately in the form of an executable description) to be able to UlLiiLLit_ajid. that relation: for any given text they should be able to determine the appropriate repre­sentation, and vice versa: the intuitive semantics of the representation language should be accessible to them. This normally means that the representational theory should be rather specific and constrained. It also means that an independent specification of the language should be available. To take the most pro­blematic case, consider the writer of the generation component in a multilingual HT system, where the only definition of the input representation language is that provided by the transfer components themselves. The task seems clearly impossible.</p><p>(iii) Si mply _ r el at able: This is the most straightfor­ward, and the most commonly appealed to requirement for the adequacy of representation languages. It must at least be easier to relate the representation languages than it i r» to relate the text languages. It is this requirement that usually rules out natural languages (e.g. Latin) as representation languages, and motivates representation languages which are highly structured.</p><p>We will not be concerned with particular representa­tion languages that have been proposed in Eurotra but, for the purposes of exposition, it is worth a brief description of the kind of representation lan­guages (levels) we will assume in what follows. We will assume there are three such languages (apart from levels such as actual, and normalised text):</p><p>(a) A surface constituent/morphological structure level of a rather standard kind, allowing more than one x-bar projection of the major lexical categories (see Arnold et al, January 1985). This level is usually called ECS (Eurotra Constituent Structure).</p><p>(b) A level which represents syntactic relations or dependency, and information about syntactic category. This level involves structures where each (non-co­ordinate) construction contains a primitive item which is the head, or governor ('gov') of the other elements of the construction (this amounts to a sti­pulation that there can be no more than one projec­tion of lexical categories: X' syntax, in effect). This level is usually called ERS (Eurotra Relational Structure).</p><p>(c) A level which represents semantic relations/de­pendency, and thus contains an indication of semantic relations (case or theta roles). Like the syntactic relational level, we will take this to be an X' level. It is intended to abstract away from surface syntactic phenomena that are not relevant to transla­tion, and to re-interpret some syntactic characteris­tics semantically (e.g. replacing tense marking by an indication of time reference). This level is normally called Interface Structure (IS), since it provides an interface between analysis and transfer, and transfer and generation components.</p><p>Turning now to the relations between these languages, as regards the text&lt;-/representation language rela­tion, there is very little to say in an MT context that does not arise elsewhere in computational lin­guistics, where parsing and generation are two of the major areas of research. It is the relation between the representation languages that concerns MT speci­fically. We think the following three conditions are i mportant :</p><p>(i) compositionality (ii) directness (primitiveness ,  'one-shut-ness') (iii) simplicity (statable in a simple way)</p><p>Idioms apart, the translation of complex expressions is normally based in some systematic way on the translations of their parts — normal translation is in some sense 'compositiona1'. Of course, it is only because of this that the translation relation is productive at all, and if one is looking for princip­les, it seems reasonable to require something like this of the relation between the representational languages also. The following gives a slightly more precise sense to this idea:</p><p>Translation is compositional when the translation of a complex expression is some (reasonably straightforward! function of the translation of the basic expressions it contains, plus the translation of their mode of combination.</p><p>Condition (ii) 'directness' or 'priaitiveness' is simply intended to exclude illicit representational levels between those officially sanctioned: what it says is that the RLj-RLj relation must be direct, or primitive, and cannot be mediated by other undeclared representational levels. Such a condition helps to maintain clarity (and 1earnabi1ity), and gives con­tent to the other conditions.</p><p>A more formal reconstruction of these ideas (along the lines suggested in Montague (1974)) might be as foilows.</p><p>We begin by defining compositional translation as a relation between 'grammars' (generative devices) specifying languages, rather than languages directly, thus making the RLj-RLj relationship parasitic on the relation between the corresponding 'grammars'. In­stead of (1), we are thinking of a picture like (2), where a 'vertical' dimension has been added</p><doubt alpha="19.2" length="26" tooSmall="False" monospace="0.0">&lt;2) Gi ___ G2..._...„__ Gn</doubt><doubt alpha="46.7" length="30" tooSmall="False" monospace="0.0">TLg --- RLi       RL2RLn---TLt</doubt><p>Compositional translation of two representation lan­guages RLj and RLj is then defined by a pair of relations T and T"* ('translators') between the Gj and 0j (generative devices, grammars) specifying (generating, enumerating) RLi and RLj.</p><p>We take a generative device S to be a pair, &lt;C,A&gt; where C is a finite set of constructors ('rules' defining the class of complex expressions), and A is a finite set of atoms (basic expressions).</p><page local="3" global="299"/><p>We say that translation from Lj to Lj is st_r_i_ç_tl_y_ composit<u>iona</u>l if there is mapping T from 6j=&lt;Cj &gt; to Gj =&lt;Cj,flj&gt; such that:</p><doubt alpha="65.4" length="26" tooSmall="False" monospace="0.0">(i) T maps Aj into Aj, and</doubt><doubt alpha="61.0" length="77" tooSmall="False" monospace="0.0">(ii) there is a mapping t from Cj into Cj such that if     uc&lt;c : ui,...,un&gt;,</doubt><doubt alpha="46.9" length="32" tooSmall="False" monospace="0.0">then ï(u)c&lt;tc ! T(ui),...,Ï(un)&gt;</doubt><p>As will become apparent, these definitions imply a very restricive theory of translation, one which is much too strong to be usable. However, before dis­cussing its inadeguacy, we will make the ideas invol­ved more concrete by describing a notatiun for Bs and Ts which is strongly committed to these ideas, and by discussing a very simple example of the use nf the notation. The description is rough and not precisely formalised, but should give an idea of the issues i nvolved.</p><p>Notation for constructors and atoms:</p><p>atom ::-  (name,  feature description) constructor::= (name,feature description)[argspec*!</p><p>argspec::= (name, feature description) (In fact, atoms are simply constructors with arity 0, but we will preserve the intuitive distinction here). The feature theory we assume here is extremely simple: a feature description is a set of attrihute-value pairs. The 'name' is just a distinguished fea­ture representing the intuitive linguistic basis Df the language being described (thus, it might be a syntactic category, a syntactic relation, or a seman­tic relation as appropriate). Notice that this name need not be unique. Each constructor lias in addition a unique abbreviatory constructor name which is used by the T-rules.</p><p>The language L generated by a G is a set of well-formed abject  (wfo's) such that:</p><p>Every atom is a wfo; and if c„ is a constructor of arity n,    and each of uj, un is a wfo, then cn i ui, ..., <i>an </i>is a wfo.</p><p>This very simple syntax for Bs will lead to over-generation ~- for example, it will allow up construc­tors with two arguments as a wfo, even if the first argument is a verb, and the second is a preposition. For this reason we supplement the purely syntactic description with a semantics based on applying con­structors to arguments. We will thus normally be concerned only with the subset wfns that are also const<u>ruct</u>s, in the following sense:</p><p>Every atom is a construct! a constructor applied to some arguments yields a construct providing the argu-m e n t s u,ni_f _y_ with the appropriate argspecs of the constructor.</p><p>Examples (fnr a constituent structure language): atoms: aBKampie = (example, {cat^n, num=5ing})</p><doubt alpha="58.6" length="58" tooSmall="False" monospace="0.0">ath iSi"       (this,  {cat = det, num-sing}) constructor:</doubt><doubt alpha="49.0" length="51" tooSmall="False" monospace="0.0">Gnp= (np,  fnum=X, per=3})  C&lt;_,  fcat-det, num=X&gt;)</doubt><doubt alpha="46.7" length="92" tooSmall="False" monospace="0.0">( _ ,   (cat = n ,  num-'X } )1construct      Cnp ! a+his ^example -(np,  £num=sing, per=3&gt;)</doubt><doubt alpha="58.1" length="62" tooSmall="False" monospace="0.0">[(this,  fcat-det, nun~sing}), (example,   {cat=n, num-sing})!</doubt><p>The syntax and semantics of Ts is roughly as follows. Syntactically, a T-rule is of the form: Ihs ==&gt; rhs, where lhs and rhs are atoms or constructors of source and target language Gs respectively. For example, the following might be T-rules relating a level based on syntactic relations with one based on semantic rela­tions for the atoms corresponding to the verbs <u>1 ike </u>anci IlLL- (They are both assumed to assign subject and object relations to their dependents syntactically. They are assumed to assign respectively, experience!-and patient, and agent and patient to their semantic dependents. Since the leading linguistic idea at both levels is relational, and the nature of the relation cannot be determined for constructs in iso­lation, the name feature of atoms and constructors is 'blank' at these levels).</p><doubt alpha="43.4" length="53" tooSmall="False" monospace="0.0">(3) (, fword-11ke, cat- v, f rame = sub j-ob j} ) =-&gt;</doubt><doubt alpha="57.1" length="42" tooSmall="False" monospace="0.0">(..,  {word=like, cat=v, f rame=exp-patl )</doubt><doubt alpha="50.0" length="46" tooSmall="False" monospace="0.0">(4) (  ,(word-hit, cat=v, frame=subj-obj}) ==&gt;</doubt><doubt alpha="61.0" length="41" tooSmall="False" monospace="0.0">(  ,  (word-hit, cat=v, frame-agent-pati)</doubt><p>The semantics of this is that all source language atoms which unify with the lhs are translated to all target language atoms which unify with the rhs.</p><p>The following might be <b>a </b>constructor to constructor T-rule for the same two levels :</p><p>15)       c5ubj-ubj cexp-pat meaniny that any source language construction built by applying c5Ubj — obj t° some arguments uj, ... u„ is translated by applying cEXp_j,at to the translations of ui.  •■■, "n</p><p>This syntax and semantics for T-rules implements the idea of strict compositionality defined above.</p><p>This model is elegant, but inadequate, given the way natural languages appear to be. What strict composi-tionality requires is at least a rather strong homo-morphi5m hetween the languages related by a T. It is easy    to find examples where this looks implausible.</p><p>For example, consider the common need to re-Drder members of a construction in translation; or the need to eliminate 'formal' items which are <b>a </b>part of constructions in one language (one level) but not in another- (perhaps re-expressing some information they carry as part of <b>a </b>feature), as in (6)j or the kind of simple structural change involved in going from <b>a </b>level which has both S and VP constructions to one which has verb, subject, and object as members of <b>a </b>single construction (7); or the need to re-analyse <b>an </b>item which is part of une construction in one lan­guage, as part of another construction in translation (B).</p><doubt alpha="59.5" length="42" tooSmall="False" monospace="0.0">(o)      Eg for jules to understand it1 "&gt;</doubt><p>'■Sl-finitel jules understand it <b>1</b> t rely-on Cnp sandy <i>11</i></p><doubt alpha="55.2" length="58" tooSmall="False" monospace="0.0">(7)    Eg    jules tVp hit sandy11 ==&gt;[ g hit jules sandy1</doubt><doubt alpha="43.8" length="32" tooSmall="False" monospace="0.0">(B)    [ rely [ppon sandy 11 -=&gt;</doubt><p>Uf course, one could easily vary assumptions about representations so that these examples disappear, but other examples conflicting with the new assumptions will    be just as easy to find.      Notice that though, for simplicity, we have chosen examples that are close to the surface of one language (English), there are many examples of this kind between languages:<page local="4" global="300"/></p><p>(9)  Cjules zwemt graag] "HjuleBj likeslei to swimll 'Lexical holes' such as English exhibits with respect to Dutch (English has no adverb 'likingly' to trans­late graag) , and idioms such as <u>give a hand</u> will normally give rise to the need for non-strictly com­positional translations, for obvious reasons. As a more general example, it is often the case that what is expressed lexically or syntactically in one lan­guage is expressed morphologically in another. Thus, modality is often expressed by inflection in Romance languages, and by combinations of separate lexical items in Germanic languages, and correspon­dences between compounds in Germanic languages, and syntactic constructions in Romance languages (as in (111) are very common. Treating this kind of thing will certainly lead to non-strictly compositional translations somewhere.</p><doubt alpha="60.3" length="68" tooSmall="False" monospace="0.0">jules swims likingly' (IB)  tgiveta hand to X33 ==&gt; [(helpen ( }),X3</doubt><doubt alpha="64.7" length="51" tooSmall="False" monospace="0.0">(11) Cnapple seller] ==&gt; [npvendeur Cpp de pommes33</doubt><p>The solution to this problem adopted by Landsbergen (1984) in Rosetta is to 'tune' the Bs to each other, thereby ensuring that they are homomorphic, and that something close to strict compositionality can be preserved. (In fact, Landsbergen requires the trans­lation relation to be symmetric, so the grammars turn out to be isomorphic). This preserves the elegance of the model, but at the expense of the elegance of the linguistic theories and descriptions (the Mul and MuB) , which become extremely complex, and potentially unusable. For example, it requires <u>give a hand to </u>and <u>helpen</u>. and <u>graag</u>. and <u>like to</u> to be treated alike. Providing a systematic and general character­isation of a theory of representation which allows this seems highly problematic. What one expects is that the representational theory will become unlearn-able in the sense described above. A second objec­tion to this approach is the obvious one that it eliminates the modularity that is potentially avail­able with this model (each G can be thought of as a module, e.g.). This reduces its attractiveness from a developmental point of view, particularly where multi-1ingual MT involving large numbers of lan­guages, or wide coverage (hence collaboration of large numbers of individuals) is envisaged.</p><p>For these reasons, we have preferred to explore an alternative approach, which involves allowing some <u>relaxations</u> of strict compositionality. The fol­lowing three relaxations have been proposed :</p><p>(i) To allow variables on either side of T rules :</p><doubt alpha="9.5" length="42" tooSmall="False" monospace="0.0">(12) e.g.      c27[1,2,33==&gt; c38 [ 2,  3 1</doubt><p>with the meaning: translate any expression formed by applying c27 to three arguments by an expression formed by applying c3B to the translation of the second and third arguments. This relaxation allows for re-orderings, deletions, and reduplications by T-rules, and seems an entirely natural extension of strict compositionality.</p><p>(ii) To allow functions made up of constructors, atoms,    and variables of the appropriate Gs on either side of T-rules, e.g.</p><doubt alpha="34.0" length="50" tooSmall="False" monospace="0.0">(13) e.g. csCI, cvp[2, 311 «&gt; cSUDj/0bj E2,  1, 33</doubt><p>Notice that since the output of such a translation rule is still an expression in the target language (i.e. an expression built by applying target G con­structors to target constructs), this relaxation still yields 'one shot' translation.</p><p>(iii) To allow the choice of the target constructor (function) to be dependent on properties of the argu­ments involved. For example, one does not want all Cv pp] constructions to be treated like <u>rely on</u> in (8), and the exceptional translation behaviour of idioms, and constructions involving lexical holes is clearly dependent on the presence of particular pro­perties within constructions (e.g. the presence of particular lexical items):</p><doubt alpha="20.3" length="59" tooSmall="False" monospace="0.0">(14) e.g.  c35 11, 2/fl = vl, 3] = = &gt; c4i U, 2, 3/f2 = v23</doubt><p>with the meaning: translate constructions formed by applying c35 to three arguments by constructions formed by applying c46 to their translations, provi­ding the second argument of c35 unifies with a fea­ture description where the attribute fl has the value vl, and the translation of the third argument unifies with a feature description where the attribute f2 has the value v2.</p><p>Though there is no provision for wild notational devices such as path variables, these relaxations greatly increase the power of the notation, to an extent which is problematic, given our methodology. We would naturally like to impose restrictions, so that we can preserve the idea that in compositional translation the translation of a whole is some 'rea­sonably straightforward' function of the translation □f its parts. One possibility is to impose special restrictions (or alternatively restrict some relaxa­tions) to certain translators (e.g. one would like the transfer translators to he as near as possible restricted to some kind of atom-atom translator). More generally, one might require that at most one side of a T-rule be a function (in the sense of relaxation (b)), or to require that context sensitive T-rules may only refer to attributes of particular arguments (e.g. attributes of the heads of construc­tions, perhaps, or to only allow them to test for the presence of particular lexical items among their arguments). There are interesting methodological and empirical problems involved in trying <i>to </i>find approp­riate restrictions, but we will not pursue them here, since (as will appear in the fallowing section), the notation is still restrictive enough for there to be a theoretical commitment which deserves discussion.</p><p>2. The theoretic<u>al co</u>mmi<u>tments pf</u> the model.</p><p>The attractiveness of our model as a framework for practical and theoretical MT derives from its modula­rity and its orderliness in the main. Practically, it ensures that translation proceeds via a series of representations which are described explicitly, and which therefore have to be capable of systematic description, and it ensures that the language gene­rated by applying a sequence of translators is always a subset of a language that has been explicitly described.      It   thus   comes as close as possible to excluding 'hybrid' representations, and ensuring that representations languages will be 'learnable'.<page local="5" global="301"/> More­over, the separation of Gs and Ts, and the use of a semantics based on unification provides a high degree of declarativeness, and the homogeneity and uniformi­ty of the model may be of practical benefit. The separation of Gs and Ts also provides a high degree of modularity, so, e.g. different Gs can in principle be developed in parallel, and the effects of modifi­cations may often be localised to one G and the adjacent Ts.    This is developmentally attractive.</p><p>Methodologically and theoretically the model is at­tractive in a number of ways. The complexity of T-rules provides a very simple and effective evaluation metric against which to judge competing proposals about representational levels (so it is relatively easy to find arguments why there should or should not be intermediate representations of a certain sort). And it provides a level of abstraction at which linguists and implementers can communicate easily. However, perhaps the most important advantage of the model is that it decomposes the 'problem of MT', and provides a framework for investigating some interes­ting and apparently manageable sub-problems. Some of these are discussed in a preliminary way here.</p><p>This notation, and hence the model that it instan­tiates, in effect provides a context free grammar notation augmented by a simple feature theory based on unification, and (via the T rules) the capacity for certain transformations. We have no demonstra­tion of the weak genenerative capacity of the nota­tion, but one suspects it is at least as powerful as the notations of LFG (Kaplan &amp; Bresnan 1982), or FUG (Kay 19B5). Taking full advantage of relaxations (a)-(c) below may well yield Turing machine capacity. While this makes it likely that the notation provides some treatment of all translational1 y relevant pheno­mena, it is still rather restricted as regards des­criptive or expressive capacity, and there is no guarantee that the treatment will be 'natural', ap­propriate,  or even practically usable.</p><p>One approach to the issue of usability is the provi­sion of user friendly abbreviations (e.g.), and it is fairly easy to imagine some conventions which would take this basic (Mu2) notation and make it more usable as a programming language for linguists (i.e. a Mul  'user language').</p><p>Some of the major modifications to the model which have been proposed include:</p><p>(a) The introduction of special versions of construc­tor application in place of unification, for example, in the treatment of co-ordinate constructions. The properties of co-ordinate constructions are partly determined by the fact that they inherit the common features of their elements, 50 the feature descrip­tion of the construction should be the genera<u>lisa</u>tion (roughly self consistent intersection of the feature descriptions) of the elements, rather than their unification.</p><p>(b) The introduction of Kleene star to avoid deeply recursive structures in the treatment of construc­tions which allow arbitrary numbers of arguments (e.g. most constructions can include an indefinitely large number of PP modifiers). Since the syntax of Gs   requires specific reference to the arity of constructors, the obvious way of dealing with this phe­nomena in the basic notation is to have recursive constructors (e.g. a constructor that combines an np and a pp to form an np), yielding structures such as:</p><doubt alpha="28.6" length="7" tooSmall="False" monospace="0.0">(15) np</doubt><doubt alpha="80.0" length="5" tooSmall="False" monospace="0.0">np pp</doubt><p>This treatment is not obviously incorrect, but it is not necessarily the most intuitively satisfactory treatment either, and it can have the undesriable effect of burying the lexical heads of constructions arbitrarily far down inside them.</p><p>(c) A closely related point is that the model des­cribed is committed to representation languages where members of constructions are strictly ordered (CC32: a,bl   is a different abject from Cc32! -- e.g.</p><p>the latter may fail to unify to give aconstruct, while the former succeeds). This may not always be very natural, especially where relational languages are concerned: since the elements of constructions are distinguished by their roles, they do not also need to be distinguished positional1 y.</p><p>A number of modifications along these lines are being discussed in the project. They are not unproblematic (or even obviously correct), for example, (b) and (c) above both suggest that constructors be treated as operations on sets, rather than lists of arguments. Apart from changing the formal nature of construc­tors, a problem will arise in going from unordered representation languages to ones which are ordered, motivating an extension to the T-rule notation. Ne­vertheless, they seem to within a reasonable distance of (and hence compatible with) the essentials of the basic model.</p><p>A consequence of the CFG basis of the model is that constructs are always hierarchical objects similar to tree structures (each application of a constructor yields a new level of structure, intuitively). The model is most naturally applied in the description of linguistic phenomena that can be thought of hierar­chically, and in translating between languages that capture such phenomena. Thus, it is naturally ap­plied in the description of phrase and relational structures (though cf above), and given the unifica­tion based semantics, in dealing with phenomena such as agreement between members of constructions.</p><p>Moreover, though the 'naturalness' of the treatment is perhaps more questionable, it provides interes­ting, and apparently workable accounts of a number of phenomena that are not obviously hierarchical. For example: it is reasonably easy to see how the relaxa­tions of strict compositionality allow a treatment of functional control and unbounded syntactic dependen­cies (Arnold et al, 1985 sketches a crude, but straightforward treatment exploiting the possibility Df having functions composed of target G constructors and variables in T-rules).</p><page local="6" global="302"/><p>One can also envisage a treatment of pronominal refe­rence (which is naturally thought of in terms of co-indexation across trees, rather than in hierarchical terms) within this model along the following lines. Suppose every construct is assigned a unique index, and every non-atomic construct is augmented by two lists (which we will refer to jointly as the a/a lists)s (i) an antecedent list, of the indices of the poten­tial antecedents the construct contains!</p><p>(ii) an anaphor list, of the indices <b>of </b>the potentially anaphoric items the construct contains.</p><p>We will say that <b>a </b>member of the antecedent list and <b>a </b>member of the anaphor list are 'compatible' provi­ding they do not differ with respect to the relevant inherent linguistic properties (such as number and gender). Every time <b>a </b>constructor is applied to some arguments, the a/a lists of the arguments <b>are </b>inspec­ted!</p><p>(i) if two arguments have compatible items on their lists, then an indication that these two items are bound to each other is added to the construct that results; (ii) the indices of the arguments, and members of the arguments' a/a lists are used to form the relevant a/a lists of the construct.</p><p>Apart from testing for inherent properties of antece­dents and anaphors, structural conditions can be imposed, e.g. the c-command condition can be imposed by allowing members of antecedent lists to be trans-fered 'upwards' to only one construct. This seems to provide a basic method for expressing all the antecedent-anaphor relations, in so far as they are grammatically determined, at least.</p><p>It is a natural consequence of the CFG basis of the notation that, under this treatment of antecedent-anaphor relations, objects do not themselves contain an indication of their antecedents (or anaphors). Instead, this information is present in the construct that contains them. For the same reasons, the nota­tion involves an interesting commitment in some kinds of context senstive translation.</p><p>Consider, for example, the translation of the verb know into French, which <b>(for </b>simplicity) we will take to be savoir if there is <b>a </b>sentential complement, and connaître if there is a nominal complement. i.e. in standard rewrite notation:</p><doubt alpha="55.3" length="38" tooSmall="False" monospace="0.0">(Hi)a.know -&gt; connaître     /__ np-obj</doubt><doubt alpha="55.6" length="27" tooSmall="False" monospace="0.0">b. know -&gt; savoir /__ s-obj</doubt><p>Our notation has no direct analogue to this sort of statement: the context, sensitivity has to be taken care of in the translation of the construction con­taining the verb. Suppose C32 is tne English con­structor which builds the IS representation of JjlLes. <u>knows S</u>, and suppose that Eng 1 ish-Frencli transfer contains the following T-rules:</p><doubt alpha="48.3" length="29" tooSmall="False" monospace="0.0">(17)    a. know ==&gt; connaitre</doubt><doubt alpha="64.7" length="17" tooSmall="False" monospace="0.0">b. know=•-&gt;savoir</doubt><doubt alpha="28.6" length="14" tooSmall="False" monospace="0.0">c. c32=-&gt; Cf36</doubt><doubt alpha="57.3" length="89" tooSmall="False" monospace="0.0">ThenC32:know, Jules, S     will produce both of: (IS)      (1)    Cf36!connaitre Jules S'</doubt><doubt alpha="64.0" length="25" tooSmall="False" monospace="0.0">(ii) Cf3&amp;!savoir Jules S'</doubt><p>and what one expects is that unification in (i) (or some later translation of (i)) will fail to produce a construct (e.g. because cf36 checks the Syntactic category of its third argument). This seems a very natural account for cases such as these, where the target S contains the information for making the right choice. But one cannot expect this always to be so (e.g. where the information required is only part of the source language), and in such cases context sensitive T-rules will be required (cf relaxation (c) above). It is clear that this notation is committed to such cases heing less common, at least.</p><p>As already noted, this model provides for a high degree of modularity in principle. In fact, the degree of modularity is rather extreme: not only are individual Gs modules, but individual constructors and atoms are modules also. Though the use of a feature theory allows some generalisations to be captured, the degree of modularity means that many generalisations that hold 'horizontally' (across lan­guages), and 'vertically' (within languages) are missed.</p><p>The most obvious case of horizontal generalisations are 'invariances' and default translations across languages. For example, one does not expect the value of the attribute which identifies individual lexical items to change normally during analysis and generation, and the simplest and most restrictive view of transfer would be that only this attribute changes. Similarly, one knows that syntactic sub­jects normally correspond to semantic agents, and vice versa.</p><p>Within individual languages some capacity to capture generalisations across constructors is a prerequisite of some of the modifications mentioned above (in particular, CF type treatments of unbounded dependen­cies depend on some such mechanism if massive redun­dancy is to be avoided). More generally, one would like to be able to state conventions (about e.g. the percolation of attributes from heads of constructions to constructions) once and for all, rather than ha­ving to state them separately in each constructor, and there are various defaults which could make con­struction of Gs easier (e.g. the default case is that verbs have regular morphology).</p><p>Here we will briefly describe a fairly simple exten­sion of the basic notation which is capable of dea­ling    with these phenomena:    injection__LUljL*Li which describe (relations between) classes of constructs by stating partial descriptions of (pairs of) con­structs.     The following are examples:</p><doubt alpha="55.3" length="38" tooSmall="False" monospace="0.0">(19) (_, fcat=v, morph-form=regular })</doubt><doubt alpha="22.4" length="49" tooSmall="False" monospace="0.0">(20) (_,  (cat=X,  ...})[  (gov, (cat=X,  ...)) ]</doubt><doubt alpha="32.5" length="40" tooSmall="False" monospace="0.0">(21) (~,  (cat = s, tense = vll)   U■==&gt;</doubt><doubt alpha="29.4" length="17" tooSmall="False" monospace="0.0">(_,  £time=v21)11</doubt><p>The idea being that such rules can be used to 'in­ject" generalisations into existing constructors, atoms, or T-rules. The normal problem with such 'meta' devices as these is controlling their interac­tion. A very simple way of avoiding this would be to adopt the following semantics: applying an injection rule I to a rule (atom, constructor, or T-rule) R succeeds    if    I    unifies with R,    in which    case the unification replaces N.<page local="7" global="303"/> Tins semantics means that injection rules cannot affect the cardinality tif the rule set, but it gruatly simplifies the form and content of the rules that must be written, and pro­vides u perspicuous way of stating certain générali­sai: i uns.</p><p>As stated, injection rule (19) is intended to unify with all atoms that have cat:-v, and to add in the informâtiun that they have regular morphology. No­tice that thin injection rule will fail to apply to any atom that already has a different specification for morph-fnan, so there is a straightforward way of treating exceptions such as irregular verb morpholu-gy. (20) is a very simple example of a percolation injection- When applied to the constructors of a relational level K, it will ensure that the category of the head (guv) of a construct is percolated to the construction. Again, exceptions can be stipulated in individual constructors. (21) is intended to state a correspondence between tense=vl, and time-:v2, and will inject this relationship into all T-rul.es that translate sentences.</p><p>It is appropriate to end by mentioning the most obvious open questions, since they suggest the direc­tion which future work should take.</p><p>(i) We have nut yet investigated the implications of the model for robustness, and while the model has been set up so that T-rules should he reversible to a large extertt.s we have insufficient practical ex­perience with it to know huw tar this potential can lie exploited.</p><p>(ii) Perhaps the most obvious theoretical commitment is that the notation is linguistic in nature, de­signed fur representing linguistic knowledge (it would nut be a very natural method fur representing more general 'real world' knowledge). We think this is appropriate in MT, which is fundamentally about relating lingui<u>stic</u> objects. However, as many exam-pies indicate, there is an important role for general knowledge in MT, and this must be accommodated some­where. No doubt various compromises are possible, ami there is certainly room in the model fur such quasi-1 inyuif.tic entities as semantic features, but taking the model seriously involves rejecting know­ledge representation languages as levels per so. The role of general knowledge representations cannot therefore be a step in translation, and can only be to provide a method of choosing between alternative representations at linguistic levels.</p><p>(iii) This leads directly tu another point; the framework provides a number nf ways of coping with null-determinism (filtering, by target Gs, context sensitive T-rules, e.g.), but there is no method far the explicit comparison of competing representations (e.g. as in 'preference semantics' Wilks (197B)) , and to provide such a method seems beyond the scope of the model we have described. Practically, it is not clear whether this is a problem or nut, however, an approach which is consistent with the general spirit of the model might he to define a number of 'choice' levels, at which choices between alternatives would be made (IS is the obvious candidate). We would require G and T rules to be set up so that all alter­native representations at these levels would lie transi ationally equivalent, so that chuice cuuld be arbitrary.</p><p>(iv) We have assumed an (&gt;xtreiiiely simple feature theory (e.g. tit; have not alluwed attributes to take features as values). It is clear that a more sophis­ticated theory is desirable, and some work has been done in this direction within the model. However, what is not clear is how the extra descriptive power of an extended feature theory affects the 'pragwa--tics' nf the model --- the way the model should be used for linguistic description.</p><p>Investigation of this model is still at an early stage, and much of the ahove is speculative or sche­matic. However, despite its preliminary status, we feel the approach described here is promising, and we hope we have said enuugh tu show why we feel it is worthy of attention.</p><p>More information on the Eurotra project can be found in e.g.  King &amp; Porschke (1982) and King et al (19ÜH).</p><p>BïLfe.CRUt.S'i.</p><p>D.J.Arnold, L.Jaspaert &amp; L.des Tombe "EL8-3t Eurotra Linguistic Specifications (January 1985)" Eurotra Report ETL-5, DGXIII, CEC Luxembourg.</p><p>1).J.Arnold, L.Jaspaert, R.Johnson, ö.Krauwer, H.Ron­ner, L, des Tombe, G.U. Varile, <i>h</i><i> </i>H. Warwick "A Hill View nf the &lt;C,A&gt;, T Framework in Eurotra" in Prrn:ee_-«•iüBSJaf ±h.eJÇpji féroce_9JLXtUtfiQdÜLI^ JOsîiiPdP 1 <i>ogi-</i>ca.l ._S.ssues___ in_H.achinei Tran^L^ion.._pj NaUuv-U _Lan_-auages, Colgate Univ., Hamilton, N.Y.  19Gb, pp. 1.-14.</p><p>R.Kaplan &amp; J. Dresnnn "Lexical Functional Grammari a Formal    System for  Grammatical  Représentation"  in The</p><p>ÔSBiâJL___Repr esentq j; ion ofGr amwat i_c at______Re 1 at i on s ,    <i>J.</i></p><p>Dresnan  (ed),  HIT Press, 19112.</p><p>H.Kay "Functional Unification Grammars a Formalism for HT" in Proceeding*. of_gjy.NB„Bi Stanford, Cali­fornia, 1984, pp. 75-78." "</p><p>M. King, Ii. L. J ohn son ï&lt; L.des Tymbe "Eurotra! a Multi­lingual system under Development" in Cqmjut.atinnaj LiJUUliSiii<b>&amp;i</b><b> </b>llsü-3,  198S, pp. lbS-169.</p><p>H.King ?! G.Perschke "Eurotra &amp; its Objectives" in 8u!JillMUa<b>.i</b><b> </b>I:1i 1982, pp.27-32.</p><p>J.Landsburgen "Isomorphic Grammarn and their Use in the Rusettn Translation System" to appear in Hai:hiji(!_ IrjyiäiMiPJiJ...^ H.King (ed), Edinburgh Univ. Press.</p><p>R. Hnntague Formal JMiiJLas.o.B hjLi Yale University Press, 1974.</p><p>L■ des Tombe, D.J.Arnold, L.Jaspaert, R,Johnson, S.Krauwer, M.Rosner, G.B. Varile, &amp; 8. Warwick "A Preliminary Linguistic Framework for Eurotra (June 1985) " in Proceedings of the Conferen<u>ce on</u> Theoroti-cal <b>8i</b><b> </b>Methodological Issues in Machine Transi at i un of</p><p>Natural_____Languages.    Colgate   Univ.,    Hamilton, N.Y.</p><p>1905,  pp. 2B3-208.</p><p>Y.Wilks "Making Preferences More Active" in AlJqur-nal_,  11 s 3,  197B, pp. 197223.</p></section></body></article>