Building Big Czech Corpus : Collecting and Converting Czech Corpora

Warning

This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

HANČAR Pavel

Year of publication 2008
Type Article in Proceedings
Conference RASLAN 2008
MU Faculty or unit

Faculty of Informatics

Citation
Web https://nlp.fi.muni.cz/raslan/2008/papers/11.pdf
Field Linguistics
Keywords corpus; desamb; vertjoin;
Description This paper describes a creating of a big Czech corpus from many Czech corpora kept on the NLP Centre server. It describes new tools developed for this purpose, difficulties which may come up and a way how solve them.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.