Text categorization is an important application of machine learning to the field of document information retrieval. Most machine learning methods treat text documents as a feature vectors.We report text categorization accuracy for different types of features and different types of feature weights. The comparison of these classifiers shows that stemmed or un-stemmed single words asfeatures give better classifier performance compared with other types of features, and LOG(tf)IDF weight as feature weight gives better classifier performance than other types of feature weights . Text categorization is a conventional classification problem applied to the textual domain. It solves the problem of assigning text content to predefined categories
đang được dịch, vui lòng đợi..
