Lhoucí nebo lžoucí ? Slovotvorné dublety procesuálních adjektiv na -oucí/-ící v českých korpusech
Title in English | Lhoucí or lžoucí ? Variants of Czech gerund (verbal adjectives on -oucí/-ící) |
---|---|
Authors | |
Year of publication | 2014 |
Type | Article in Proceedings |
Conference | Přednášky a besedy ze XLVII. běhu LŠSS |
MU Faculty or unit | |
Citation | |
Field | Linguistics |
Keywords | Corpus; corpus driven; corpus based; word formation; gerund |
Description | Our goal is to present a corpus-driven study of the Czech gerund (verbal adjectives in -oucí/-ící). The link between inflectional and word-formation variants is demonstrated on material from the SYN corpus (2.6 billion tokens of written Czech) and the large web corpus czTenTen12 (5.2 billion tokens of Czech text from the internet, cleaned and deduplicated). Using practical examples, we show the overgeneration of word-formation rules, which is a problem for both NLP and second-language acquisition. Adjectives in -oucí/-ící are regularly derived from verbs and are therefore not usually listed in Czech monolingual dictionaries. Consequently, text corpora are a significant source of knowledge about their formation and usage, which can be important for non-native speakers. |