Development of the NVH Schema Format for Lexicographic Purposes
Authors | |
---|---|
Year of publication | 2023 |
Type | Article in proceedings |
Conference | Recent Advances in Slavonic Natural Language Processing, RASLAN 2023 |
MU Faculty or unit | |
Citation | |
www | Article in proceedings |
Keywords | NVH, XML, Name-Value Hierarchy, Lexonomy, Sketch Engine |
Attached files | |
Description | A unified e-dictionary entry format is one of the most important things to consider when building a new dictionary. In the Lexonomy tool, where the new lightweight markup language NVH is used to store dictionary data, each dictionary is assigned an NVH schema that specifies the NVH structure of every dictionary entry. Until now, the schemata used in Lexonomy were quite limited: they constrained only the position of a node in the NVH hierarchy and the arity of its occurrence. In recent development, we identified a need for a more fine-grained restriction mechanism and therefore extended the NVH schema format so that it can also check the value of each node against its type and against a predefined regular expression. |
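
The three kinds of constraint named in the description (position in the hierarchy, arity, and value checks by type and regular expression) can be illustrated with a minimal Python sketch. It does not reproduce the NVH schema syntax or Lexonomy's implementation: the `Node` and `Rule` classes, the type names `string`, `int` and `bool`, and the sample entry are all hypothetical and exist only for illustration.

```python
import re
from collections import Counter
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    """One name-value pair with nested children (hypothetical in-memory form)."""
    name: str
    value: str = ""
    children: List["Node"] = field(default_factory=list)


@dataclass
class Rule:
    """Constraints on one node name at one position (hypothetical, not NVH syntax)."""
    min_count: int = 0
    max_count: Optional[int] = None   # None means unbounded
    value_type: str = "string"        # assumed type names, see TYPE_CHECKS
    pattern: Optional[str] = None     # regular expression the whole value must match
    children: Dict[str, "Rule"] = field(default_factory=dict)


# Assumed value types; a real schema format may define different ones.
TYPE_CHECKS = {
    "string": lambda v: True,
    "int": lambda v: re.fullmatch(r"-?\d+", v) is not None,
    "bool": lambda v: v in ("true", "false"),
}


def validate(node: Node, rule: Rule, path: str = "") -> List[str]:
    """Return a list of human-readable violations; an empty list means valid."""
    errors = []
    here = f"{path}/{node.name}"

    # Value checks: first the declared type, then the optional regular expression.
    if not TYPE_CHECKS[rule.value_type](node.value):
        errors.append(f"{here}: {node.value!r} is not of type {rule.value_type}")
    if rule.pattern is not None and re.fullmatch(rule.pattern, node.value) is None:
        errors.append(f"{here}: {node.value!r} does not match {rule.pattern!r}")

    # Arity check: count how often each allowed child name occurs.
    counts = Counter(child.name for child in node.children)
    for name, child_rule in rule.children.items():
        n = counts.get(name, 0)
        too_many = child_rule.max_count is not None and n > child_rule.max_count
        if n < child_rule.min_count or too_many:
            errors.append(f"{here}: {n} occurrence(s) of {name!r} violate its arity")

    # Position check: a child is only allowed where the schema lists it.
    for child in node.children:
        child_rule = rule.children.get(child.name)
        if child_rule is None:
            errors.append(f"{here}: unexpected child {child.name!r}")
        else:
            errors.extend(validate(child, child_rule, here))
    return errors


# Hypothetical schema: exactly one "pos" from a closed set, at most one integer "freq".
schema = Rule(min_count=1, max_count=1, children={
    "pos": Rule(min_count=1, max_count=1, pattern=r"noun|verb|adj"),
    "freq": Rule(min_count=0, max_count=1, value_type="int"),
})

good = Node("entry", "dog", [Node("pos", "noun"), Node("freq", "12")])
bad = Node("entry", "dog", [Node("pos", "nuon"), Node("freq", "many")])

print(validate(good, schema))  # no violations
for problem in validate(bad, schema):
    print(problem)
```

In this sketch the first entry passes, while the second is rejected twice: once because `nuon` fails the regular expression for `pos`, and once because `many` is not an integer. A schema limited to position and arity would have accepted both.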