DeepBankDE

This resource comprises 15,000 sentences from the German TIGER treebank, parsed with the Cheetah grammar for German (Cramer 2011) using the PET parser. The corpus consists of files containing trees and MRS (Minimal Recursion Semantics) representations.

It contains syntactic and semantic representations of sentences and phrases. STTS tags from the original TIGER corpus are preserved in the Derivation Trees.

The data is structured as TAB-delimited fields: ID, Derivation Tree, Tree, MRS.
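The field layout above can be read with a few lines of Python. This is a minimal sketch, assuming one record per line and exactly four TAB-separated fields. The names `Record` and `parse_records` and the sample line are hypothetical illustrations and are not part of the resource; the placeholder values do not reflect actual corpus content.

```python
from typing import Iterable, Iterator, NamedTuple


class Record(NamedTuple):
    """One corpus entry; field names are illustrative, not official."""
    sent_id: str
    derivation_tree: str
    tree: str
    mrs: str


def parse_records(lines: Iterable[str]) -> Iterator[Record]:
    """Yield one Record per non-empty line (assumes one record per line)."""
    for line_no, line in enumerate(lines, start=1):
        line = line.rstrip("\r\n")
        if not line.strip():
            continue  # skip blank lines
        fields = line.split("\t")
        if len(fields) != 4:
            raise ValueError(
                f"line {line_no}: expected 4 TAB-delimited fields, got {len(fields)}"
            )
        yield Record(*fields)


# Placeholder line standing in for real data (hypothetical content).
sample = ["s1\t<derivation tree>\t<tree>\t<mrs>\n"]
records = list(parse_records(sample))
print(records[0].sent_id)
```

To read from a file instead, pass an open file handle (for example `open(path, encoding="utf-8")`) to `parse_records`; the encoding is an assumption and should be checked against the distributed files.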

The resource can only be accessed if the requester has a license for the original TIGER treebank.

Useful references:

Brants, Sabine, Stefanie Dipper, Peter Eisenberg, Silvia Hansen, Esther König, Wolfgang Lezius, Christian Rohrer, George Smith, and Hans Uszkoreit. 2004. TIGER: Linguistic Interpretation of a German Corpus. Journal of Language and Computation, 2004 (2), 597-620.

Ann Copestake, Dan Flickinger, Carl Pollard and Ivan A. Sag. 2005. Minimal Recursion Semantics: An Introduction. Research on Language and Computation, Springer, 3, 281–332.

Bart Cramer. 2011. Improving the feasibility of precision-oriented HPSG parsing. PhD thesis, Universität des Saarlandes.

Bart Cramer and Yi Zhang. 2009. Construction of a German HPSG grammar from a detailed treebank. In: Proceedings of the ACL 2009 Grammar Engineering across Frameworks workshop, pages 37-45, Singapore, Singapore.

Bart Cramer and Yi Zhang. 2010. Constraining robust constructions for broad-coverage parsing with precision grammars. In: Proceedings of COLING-2010.
