Academia.eduAcademia.edu

Text classification with the support of pruned dependency patterns

2010, Pattern Recognition Letters

Abstract
sparkles

AI

Text classification is enhanced using a modified bag-of-words approach that incorporates lexical dependency patterns and a pruning strategy. By adding grammatical relations between words as features and removing less informative ones, the proposed method significantly outperforms traditional text classification techniques on multiple datasets. Experimental results demonstrate the effectiveness of using both word pruning and dependency features, paving the way for more accurate document categorization.