Stylometrická analýza církevněslovanských textů české provenience
Title in English | Stylometric Analysis of the Church Slavonic Texts of Czech Origin |
---|---|
Authors | |
Year of publication | 2023 |
Type | Article in Periodical |
Magazine / Source | SLAVIA-časopis pro slovanskou filologii |
MU Faculty or unit | |
Citation | |
Web | https://www.slu.cas.cz/cs/slavia |
Doi | http://dx.doi.org/10.58377/slav.2023.5.06 |
Keywords | cluster analysis; Czech Church Slavonic; lexical diversity; stylometric analysis; token length |
Description | The paper presents a pilot study of stylometric analysis of Czech Church Slavonic texts. The aim of the study is to measure similarities / differences among texts based on selected quantitative characteristics. Specifically, the average token length (ATL), moving average type-token ratio (MATTR), and text distances determined by normalized frequencies of the most frequent words (MFW) are applied. For the analysis, we used a corpus of twelve Church Slavonic literary writings attributed (with various probability) to Czech authors in the 10th and 11th centuries. In addition, two more textual sources were added (Codex Suprasliensis and the Life of St. Methodius) to compare the results and get a more complex view of relationships among texts. The results show the plausibility of the application of methods on this specific sample of texts. |