README: GG (June 2006)

This is the first public release of GG, an HPSG for German developed
at the DFKI Language Technology Lab, Saarbrücken, Germany. 

Development was funded by the German Ministry for Education, Science,
Research and Technology ("Bundesministerium für Bildung und Forschung
(BMBF)"), as part of the projects DISCO (ITW 9002 0), Verbmobil (01
701 V0), Whiteboard (01 IW 002), Quetal (01 IW C02), and Collate (01
IN A01 B).

The grammar is released under the LGPLLR, a version of the LGPL adapted to
linguistic resources. See LICENSE for details. 

The grammar is intended to be used with systems that can run TDL-style
grammars, such as the LKB (Copestake 2001) or Pet (Callmeier 2000),
and can be used for parsing (LKB & Pet) and generation (LKB).

Documentation for the grammar can be found in the scientific
literature. Publications particularly relevant to this version of the
grammar are:

Müller & Kasper (2000)
Crysmann (2003)
Crysmann (2005)


The grammar uses revised German spelling conventions, at least as far as
the ss/ß rules are concerned. Support for the old spelling conventions
will be added in the future. The grammar and the lexicon are encoded
in UTF-8.

In the LKB, you can load the grammar via the script file lkb/script.
We encourage the use of the lexical database (lexdb) by Benjamin
Waldron, which is distributed as part of the LKB. See the DELPH-IN
Wiki for installation instructions. I recommend the following settings
for *lexdb-params*: ((:dbname "gg") (:mode "gg")), to disable sortal
restrictions.


The grammar files come with a Makefile to facilitate installation:

Run `make' to compile .grm files for the Pet runtime parser cheap(1).
Make sure the `flop(1)' preprocessor is in your PATH. 

german-unfill.grm 
	binary grammar file with unfilling (no packing) 
	suitable for HoG

german-pack.grm
	packing, no unfilling
	suitable for HoG, but interoperability is not that well tested
	symbolic link to german.grm, loads definition for packing restrictor in pet/german-pack.set

german.grm 
	no packing, no unfilling 
	slow, suitable for HoG



KNOWN ISSUES:

Owing to a bug in Allegro CLIM, direct input of 8bit characters from
the LKB GUI is not possible.  As a workaround, you can use either TeX
or HTML syntax, or invoke the parser from the Lisp prompt
'(do-parse-tty "Am Anfang war das Wort.")'. 

Alternatively, you can use the Trollet GUI by Pavel Mihalyov, which
fully supports UTF-8.

See TODO for grammar related known issues. 



Berthold Crysmann, Fefor, Norway, 17th June 2006


References:

@InCollection{mueller_s-kasper_w00,
  author =       {Stefan Müller and Walter Kasper},
  title =        {{HPSG} Analysis of {German} },
  booktitle =    {Verbmobil: Foundations of Speech-to-Speech Translation},
  pages =        {238--253},
  publisher =    {Springer},
  year =         2000,
  editor =       {Wolfgang Wahlster},
  address =      {Berlin}
}
@Article{crysmann_b04rlc,
  author =       {Berthold Crysmann},
  title =        {Relative Clause Extraposition in {German}: An Efficient and Portable Implementation},
  journal =      {Research on Language and Computation},
  volume = {3},
  number = {1},
  pages = {61--82},
  year =         {2005}
}

@InProceedings{crysmann_b03ranlp,
  author =       {Berthold Crysmann},
  title =        {On the Efficient Implementation of {German} Verb Placement in
{HPSG}},
  booktitle = {Proceedings of RANLP 2003},
  address = {Borovets, Bulgaria},
  pages = {112--116},
  year =         2003
}

@InProceedings{crysmann_b05hpsg,
  author =       {Berthold Crysmann},
  title =        {Syncretism in {German}: a Unified  Approach to
  Underspecification, Indeterminacy, and likeness of Case},
  booktitle = {Proceedings of the 12th Int'l Conference on Head-driven Phrase Structure Grammar (HPSG), Aug 22--24, Lisbon},
  year =         {to appear},
  address =      {Stanford},
  publisher = {CSLI publications}
}