Deep Neural Networks for Disambiguation of Words and Named Entities

keywords: 
Deep learning, graph-based approaches, Named Entity Disambiguation, Word Sense Disambiguation.
Description: 
In natural-language understanding, addressing word and name ambiguity is a key issue nowadays. Some words and name mentions could refer to more than hundreds of concepts or entities. The ambiguity is resolved using the context words surrounding the target word or name mention. Deep learning based approaches take advantage of large annotated corpora like Wikipedia in order to build representations that encode the semantic meaning of the context. In the other hand, graph-based approaches exploit the relations among concepts and entities in the Knowledge Bases such as WordNet or BabelNet. Graph-based systems look for the coherence among concepts and entity candidates given a context.
The thesis proposed here will explore the optimal method for combining representation of context given by deep neural networks and graph-based approaches. Giving as a result a model that should disambiguate word and name mentions all together.
Objectives: 
The main objective of the thesis is to build a supervised model that correctly disambiguates both words and name mentions occurring in documents. Resulting system should face two main problems of natural-language understanding, Word Sense Disambiguation (WSD) and Named Entity Disambiguation (NED). In addition, the system should be able to disambiguate multilingual documents.
Task: 
Review the state of the art for Deep Neural Networks and Graph-based approaches.
Build a Deep Neural model able to represent context for WSD and NED tasks.
Explore graph based approaches to WSD and NED.
Combine both sources.
References: 
Dayu Yuan, Julian Richardson, Ryan Doherty, Colin Evans, Eric Altendorf.
Semi-supervised Word Sense Disambiguation with Neural Models.
https://arxiv.org/abs/1603.07012
Zhaochen Guo and Denilson Barbosa.
Robust Named Entity Disambiguation with Random Walks
http://www.semantic-web-journal.net/system/files/swj1511.pdf
Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji.
Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation.
https://arxiv.org/pdf/1601.01343.pdf
Alessandro Raganato, Claudio Delli Bovi and Roberto Navigli
Neural Sequence Learning Models for Word Sense Disambiguation
https://aclanthology.info/pdf/D/D17/D17-1121.pdf
Team: 
Eneko Agirre, Oier Lopez de Lacalle, Ander Barrena, Josu, Mikel, Gorka, Aitor Soroa
Profile: 
Computer scientist
File: 
contact: 
ander.barrena[abildua|at]ehu.eus
Date: 
2017