Building the Ultimate Math Search Engine

Warning

This publication does not fall under the Faculty of Arts but under the Faculty of Informatics. The official publication page is on the muni.cz website.
Authors

LÍŠKA Martin, SOJKA Petr

Year of publication 2015
Type Special-purpose publication
MU Faculty or unit

Faculty of Informatics

Description Mathematics information retrieval (MIR) is a domain-specific branch of information retrieval. MIR aims at searching for information in documents with a significant amount of mathematical content in the form of expressions and formulae. Judging by the newly established international MIR evaluation forum and the number of MIR-related research groups around the world, the field is clearly on the rise. In this work I summarize and compare different approaches to math-aware search systems. A more detailed description is given of Math Indexer and Searcher (MIaS), the system created at the Faculty of Informatics, Masaryk University, and primarily designed and developed by me. MIaS is currently reported as the best-performing MIR system in terms of effectiveness. I also propose several topics that are the main research interests of my studies. The topics correspond to possible features that can improve the effectiveness of MIR systems: math formula subtree unification, integration of algebraic computational power into both the indexing and the searching phase, query expansion as a way of increasing recall, query variables, combination of multiple approaches within one system, and combined text and math search. One topic that spans all the others is evaluation, which is a necessity in the process of continuously improving effectiveness.