Klasifikace XML dokumentů

Title in English XML documents classification
Authors PROCHÁZKA Martin, BLAŤÁK Jan

Year of publication 2005
Type Article in Proceedings
Conference Znalosti 2005, sborník příspěvků
MU Faculty or unit Faculty of Informatics

Field Informatics
Keywords XML; XML mining; schemaless XML classification
Description We present a new method for classifying XML documents that does not require any additional information such as an XML schema or a DTD. We introduce a new method for transforming XML data into a single table. We report results of experiments with data from the Internet Movie Database (IMDb) and show that our method outperforms previous work in terms of accuracy and F1 measure.
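The record does not describe the transformation itself, so the following is only a minimal sketch of the general idea of turning schemaless XML into one table. It is not the authors' method. It assumes one possible encoding, a row per document and a column per root-to-element tag path with occurrence counts as values. The function names `path_counts`, `to_table` and `f1_score` are hypothetical, and the sample documents are invented for illustration. The F1 helper uses the standard definition, the harmonic mean of precision and recall.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def path_counts(xml_text):
    """Count how often each root-to-element tag path occurs in one document."""
    counts = Counter()

    def walk(elem, prefix):
        path = f"{prefix}/{elem.tag}"
        counts[path] += 1
        for child in elem:
            walk(child, path)

    walk(ET.fromstring(xml_text), "")
    return counts


def to_table(documents):
    """Build one table: a row per document, a column per observed tag path.

    No schema or DTD is consulted; columns come only from the data itself.
    """
    per_doc = [path_counts(doc) for doc in documents]
    columns = sorted(set().union(*per_doc))
    rows = [[counts[col] for col in columns] for counts in per_doc]
    return columns, rows


def f1_score(true_labels, predicted_labels, positive):
    """F1 for one class: harmonic mean of precision and recall."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Invented sample documents, not taken from the IMDb data set.
    docs = [
        "<movie><title>A</title><actor/><actor/></movie>",
        "<movie><title>B</title><director/></movie>",
    ]
    columns, rows = to_table(docs)
    print(columns)
    for row in rows:
        print(row)
```

The resulting rows can be passed to any attribute-value learner. Path counts discard element text and sibling order, so a real system would likely need richer features.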