Academia.eduAcademia.edu

International Journal of Web Information Systems

2008, Information Systems

Abstract
sparkles

AI

The paper discusses the increasing need for automatic text categorization methods in the context of large-scale digital document collections. It elaborates on the limitations of human categorization and highlights the evolution of text categorization methods, particularly the naïve Bayes approach and its extensions. The research presents various statistical techniques, including SVM and LR models, to evaluate performance across different domains, providing empirical results that demonstrate improvements in accuracy and reductions in false positive rates.