Grammar Development for Czech Syntactic Parser with Corpus-based Techniques

Investor logo

Warning

This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

KOVÁŘ Vojtěch KADLEC Vladimír HORÁK Aleš

Year of publication 2006
Type Article in Proceedings
Conference Proceedings of Corpus Linguistic 2006
MU Faculty or unit

Faculty of Informatics

Citation
Field Informatics
Keywords parsing grammar czech corpus
Description In the paper, we present the description of the Czech syntactic parser synt developed at FI MU NLP laboratory. The presented system is based on the meta-grammar formalism with a head-driven chart parser. The parsing technique provides fast analysis of the context free backbone with successive evaluation of the contextual constraints using so called ``forest of values.'' The meta-grammar formalism allows to capture complicated grammatic relations with a maintainable number of rules. Besides the description of the synt system, we display the process of the meta-grammar development. One of the first phases is formed by construction of corpus data for testing. In the paper, we demonstrate the exploitation of the corpus on testing a method for detection of the ``best analysis'' selection with the results of testing the synt analysis on Czech corpus.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.