GlossExtractor: a web application to automatically create a domain glossary
Roberto Navigli
The 10th Congress of the Italian Association for Artificial Intelligence (AIIA 2007)
Roma, Italy, September 10-13, 2007
Abstract
We describe a web application, GlossExtractor, that receives in input the output of a terminology extraction web application, TermExtractor, or a user-provided terminology, and then searches on several repositories (on-line glossaries, web documents, user-specified web pages) sentences that are candidate definitions for each of the input terms. Candidate definitions are then filtered using statistical indicators and machine-learned regular patterns. Finally, the user can inspect the acquired definitions and perform an individual or group validation. The validated glossary is then downloaded in one of several formats.