Academia.eduAcademia.edu

Statistical Physics for Natural Language Processing

Abstract

In this paper we study the Enertex model that has been applied to fundamental tasks in Natural Language Processing (NLP) including automatic document summarization and topic segmentation. The model is language independent. It is based on the intuitive concept of Textual Energy, inspired by Neural Networks and Statistical Physics of magnetic systems. It can be implemented using simple matrix operations and on the contrary of PageRank algorithms, it avoids any iterative process.