Aligning the IndoWordNet with the Princeton WordNet

Abstract

IndoWordNet is a lexical resource for Indian languages. The project started with the Hindi WordNet, which was built manually from various resources, with a preference for culture-specific synsets. The other WordNets in IndoWordNet were then translated from the Hindi WordNet. The development approach used in IndoWordNet is very similar to that of the Princeton WordNet (PWN). PWN forms a semantic network in which English synsets are nodes and semantic relations are the edges connecting them. Owing to the popularity of PWN, IndoWordNet also connected the Hindi and English languages through direct and hypernymy linkages between their synsets. These linkages generate three types of mappings between IndoWordNet and PWN. This paper proposes to align IndoWordNet with PWN using a large-scale lexical-semantic resource called the Universal Knowledge Core (UKC), which forms a semantic network whose nodes are language-independent concepts. In the UKC, semantic relations connect concepts, not synsets.
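
To make the concept-level design concrete, the following minimal Python sketch shows one way a network with language-independent concepts as nodes could be represented. It is an illustration only, not the UKC, PWN, or IndoWordNet data model or API: the class names (`Synset`, `ConceptGraph`), the method names, the concept identifiers, and the toy entries are all hypothetical and are not taken from any of these resources.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Synset:
    """A language-specific lexicalization of a concept (hypothetical model)."""
    language: str
    lemmas: tuple
    concept_id: str


@dataclass
class ConceptGraph:
    """Toy network: nodes are concept ids, edges are hypernymy between concepts."""
    hypernym_of: dict = field(default_factory=dict)  # concept id -> parent concept id
    synsets: list = field(default_factory=list)

    def add_synset(self, synset):
        self.synsets.append(synset)
        return synset

    def lexicalizations(self, concept_id):
        """Map each language to the lemmas that lexicalize the given concept."""
        return {s.language: s.lemmas for s in self.synsets
                if s.concept_id == concept_id}

    def aligned(self, synset, target_language):
        """Direct linkage: target-language synsets sharing the same concept."""
        return [s for s in self.synsets
                if s.concept_id == synset.concept_id
                and s.language == target_language]

    def hypernym_synsets(self, synset, target_language):
        """Hypernymy linkage: target-language synsets of the parent concept."""
        parent = self.hypernym_of.get(synset.concept_id)
        if parent is None:
            return []
        return [s for s in self.synsets
                if s.concept_id == parent and s.language == target_language]


# Toy data (assumed for illustration, not drawn from any real resource).
graph = ConceptGraph(hypernym_of={"c_dog": "c_animal", "c_x": "c_animal"})
en_animal = graph.add_synset(Synset("en", ("animal",), "c_animal"))
en_dog = graph.add_synset(Synset("en", ("dog",), "c_dog"))
hi_dog = graph.add_synset(Synset("hi", ("कुत्ता",), "c_dog"))
# A Hindi synset whose concept has no English lexicalization in this toy graph.
hi_x = graph.add_synset(Synset("hi", ("<hindi-lemma>",), "c_x"))

direct = graph.aligned(hi_dog, "en")              # same concept, other language
fallback = graph.hypernym_synsets(hi_x, "en")     # parent concept, other language
```

Because the hypernymy edge is stored once between concepts, synsets in any language inherit it through their concept, so no relation has to be duplicated per language. A synset with no equivalent in the target language, such as `hi_x` here, can still be related to that language through its parent concept.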