[euralex] UBY - A Large-Scale Unified Lexical-Semantic Resource (UBY 1.0) released

  • From: Christian Meyer <meyer@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
  • To: "euralex@xxxxxxxxxxxxx" <euralex@xxxxxxxxxxxxx>
  • Date: Fri, 30 Mar 2012 16:14:39 +0000

(apologies for X-posting)

We are pleased to announce the release of UBY 1.0 -
a large-scale lexical-semantic resource for natural language processing (NLP)
based on the ISO standard Lexical Markup Framework (LMF):

 http://www.ukp.tu-darmstadt.de/data/uby/

UBY combines a wide range of information from expert-constructed and 
collaboratively constructed resources for English and German.
Currently, UBY holds structurally and semantically interoperable versions of 
nine resources in two languages:

 * English WordNet, Wiktionary, Wikipedia, FrameNet and VerbNet,
 * German Wikipedia, Wiktionary and GermaNet, and multilingual OmegaWiki.

A subset of these resources is linked at the word sense level.
There are monolingual sense alignments between VerbNet–FrameNet and 
VerbNet–WordNet as well as between WordNet–Wikipedia and WordNet–Wiktionary.
In addition, UBY provides cross-lingual sense alignments between WordNet and 
German OmegaWiki, also including the inter-language links given in Wikipedia 
and OmegaWiki.

All resources in UBY are represented according to our LMF lexicon model, 
UBY-LMF.
UBY-LMF captures lexical information at a fine-grained level by employing a 
large number of Data Categories from ISOCat.


Highlights of UBY:

 * The union of a wide range of heterogeneous resources in a single, 
standardized resource.
 * The linking at the word sense level between a subset of the resources.

UBY is complemented by a Java API, the UBY-API, and conversion tools (e.g., for 
converting the resources to UBY-LMF).
The UBY API and conversion tools are available at Google Code:

  http://code.google.com/p/uby/

Highlights of the UBY-API:

 * Unified access to the various information types in the nine resources.
 * Easy cross-resource access to the various information types in the resources.

A tutorial showing the use of the UBY-API can be found at

 http://code.google.com/p/uby/wiki/ApiTutorial

A Web Interface for exploring and visualizing UBY is currently being developed 
and will soon be available
at http://www.ukp.tu-darmstadt.de/data/uby/.

This project was initiated under the auspices of Prof. Dr. Iryna Gurevych, 
Ubiquitous Knowledge Processing Lab (UKP), Technische Universität Darmstadt.
We are grateful for the generous financial support from the Volkswagen 
Foundation and the German Research Foundation.

Please direct any questions or suggestions to

  uby-users@xxxxxxxxxxxxxxxx<mailto:uby-users@xxxxxxxxxxxxxxxx>

--
Christian M. Meyer, M.Sc.
Doctoral Researcher
Ubiquitous Knowledge Processing (UKP Lab)
FB 20 Computer Science Department
Technische Universität Darmstadt
Hochschulstr. 10, D-64289 Darmstadt, Germany
Phone [+49] (0)6151 16-7477, fax -5455, room S2/02/B117
meyer@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx<mailto:meyer@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
www.ukp.tu-darmstadt.de<http://www.ukp.tu-darmstadt.de/>
Web Research at TU Darmstadt (WeRC) 
www.werc.tu-darmstadt.de<http://www.werc.tu-darmstadt.de/>

Other related posts:

  • » [euralex] UBY - A Large-Scale Unified Lexical-Semantic Resource (UBY 1.0) released - Christian Meyer