TY - GEN
T1 - Building Ontologies from Collaborative Knowledge Bases to Search and Interpret Multilingual Corpora
AU - Genc, Yegin
AU - Lennon, Elizabeth A.
AU - Mason, Winter
AU - Nickerson, Jeffrey V.
N1 - Publisher Copyright:
© 2013 Proceedings of the Annual Meeting of the Association for Computational Linguistics. All rights reserved.
PY - 2013
Y1 - 2013
N2 - Tools and techniques that automate the interpretation of multilingual corpora are useful on many fronts; scholars, as an example, could use such tools to more readily pinpoint relevant articles from journals in a wide variety of languages. This work describes techniques to build and characterize ontologies using collaborative knowledge bases, e.g., Wikipedia. These ontologies can then be used to search and classify texts. Originally developed for monolingual corpora, we extend the approach to multilingual texts and test the methods with Mandarin scientific abstracts. The presented techniques provide a novel and efficient mechanism to obtain contextually rich ontologies and measure document relevancy within multilingual corpora.
AB - Tools and techniques that automate the interpretation of multilingual corpora are useful on many fronts; scholars, as an example, could use such tools to more readily pinpoint relevant articles from journals in a wide variety of languages. This work describes techniques to build and characterize ontologies using collaborative knowledge bases, e.g., Wikipedia. These ontologies can then be used to search and classify texts. Originally developed for monolingual corpora, we extend the approach to multilingual texts and test the methods with Mandarin scientific abstracts. The presented techniques provide a novel and efficient mechanism to obtain contextually rich ontologies and measure document relevancy within multilingual corpora.
UR - http://www.scopus.com/inward/record.url?scp=85121828255&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85121828255&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85121828255
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 87
EP - 94
BT - 6th Workshop on Building and Using Comparable Corpora, BUCC 2013 at the 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013 - Proceedings
A2 - Sharoff, Serge
A2 - Zweigenbaum, Pierre
A2 - Rapp, Reinhard
A2 - Rapp, Reinhard
T2 - 6th Workshop on Building and Using Comparable Corpora, BUCC 2013 at the 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013
Y2 - 8 August 2013
ER -