Academia.eduAcademia.edu
paper cover icon
Towards a Robust Understanding of Arabic Word Sense

Towards a Robust Understanding of Arabic Word Sense

2019
Sudhanshu Semwal
Abstract
Word Sense Disambiguation (WSD) is a task which aims to identify the meaning of a word given its context. This problem has been investigated and analyzed in depth in English. However, work in Arabic has been limited despite the fact that there are half a billion native Arabic speakers. In this work, we present multiple approaches for the problem of WSD in Arabic, inspired by recent developments and successes in learning word embeddings with approaches such as GloVe, and Word2vec. The primary shortcoming of word embeddings is the single vector representation of a word’s meaning, although many words are polysemous. Our first contribution in this work is to computationally obtain an embedding for each sense, using an Arabic WordNet (AWN) to overcome the problem of WSD. We also compute word semantic similarity giving thought to multiple Arabic stemming algorithms. We tackle WSD from a different point of view that solves WSD by restoring Arabic diacritics to reduce the complexity of word...

Sudhanshu Semwal hasn't uploaded this paper.

Create a free Academia account to let Sudhanshu know you want this paper to be uploaded.

Ask for this paper to be uploaded.