WebBootCat: a Web Tool for Instant Corpora

Warning

This publication doesn't include Faculty of Arts. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

BARONI Marco KILGARRIFF Adam POMIKÁLEK Jan RYCHLÝ Pavel

Year of publication 2006
Type Article in Proceedings
Conference Proceeding of the EuraLex Conference 2006
MU Faculty or unit

Faculty of Informatics

Citation
Field Informatics
Keywords text corpora; web application; automatic text corpus creation
Description We present a web service for quickly producing corpora for specialist areas, in any of a range of languages, from the web. The underlying BootCaT tools have already been extensively used: here, we present a version which is easy for non-technical people to use as all they need do is fill in a web form. The corpus, once produced, can be either downloaded or loaded into the Sketch Engine, a corpus query tool, for further exploration. Reference corpora are used to identify the key terms in the specialist domain. The service is freely available to all on a trial basis.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.