Word Sense Induction Using Word Sketches


Authors

HERMAN Ondřej, KOVÁŘ Vojtěch, JAKUBÍČEK Miloš, RYCHLÝ Pavel

Type: Article in Proceedings
Conference: Proceedings of the 7th International Conference on Statistical Language and Speech Processing
MU Faculty or unit: Faculty of Informatics

Citation
DOI: http://dx.doi.org/10.1007/978-3-030-31372-2_7
Keywords: Word sense induction; Word sketch; Collocations; Word embeddings
Description: We present three methods for word sense induction based on Word Sketches. The methods are being developed as part of a semi-automatic dictionary creation system that provides annotators with a summary of a word's semantic behavior. Two of the methods assume that a word has a single sense per collocation. The first method clusters the Word Sketch based collocations by their co-occurrence behavior. The second method clusters the collocations using a word embedding model. The third method is based on clustering Word Sketch thesauri. We evaluate the methods and demonstrate their behavior on representative words.
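
The idea behind the first method, grouping collocations by how they co-occur, can be illustrated with a small sketch. This is not the authors' algorithm: the description above does not name a clustering procedure, so the Jaccard similarity, the single-link merging, the threshold value, the function names `jaccard` and `induce_senses`, and the toy data are all assumptions chosen for illustration. Each collocate is represented by the set of contexts it appears in, and collocates with overlapping context sets are merged into one cluster, which stands for one induced sense under the one-sense-per-collocation assumption.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard overlap of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def induce_senses(contexts, threshold=0.2):
    """Group collocates whose context sets overlap; one group per sense.

    contexts: dict mapping collocate -> set of context ids (hypothetical input).
    Uses single-link merging via union-find; the threshold is an arbitrary
    assumption, not a value taken from the paper.
    """
    parent = {c: c for c in contexts}

    def find(x):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge every pair of collocates that is similar enough.
    for a, b in combinations(contexts, 2):
        if jaccard(contexts[a], contexts[b]) >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for c in contexts:
        clusters.setdefault(find(c), set()).add(c)
    # Sort clusters for a deterministic result.
    return sorted(clusters.values(), key=lambda s: sorted(s))


# Invented collocates of "bank" with invented context ids.
toy = {
    "river": {1, 2, 3},
    "muddy": {2, 3, 4},
    "account": {10, 11, 12},
    "loan": {11, 12, 13},
}
senses = induce_senses(toy)
print(senses)
```

On this toy input the collocates fall into two groups, one for the riverside sense and one for the financial sense. The second method could be sketched the same way by replacing the Jaccard overlap with a similarity between collocate embedding vectors, again as an assumption rather than a description of the published system.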