Naoto Sato, Kanako Komiya, Koji Fujimoto, Y. Kotani
{"title":"Categorization of product pages depending on information on the Web","authors":"Naoto Sato, Kanako Komiya, Koji Fujimoto, Y. Kotani","doi":"10.1109/JCSSE.2011.5930153","DOIUrl":null,"url":null,"abstract":"In this paper, the authors categorize product pages on the Web depending on their information. We used naive Bayes and the complement naive Bayes classifier, and tried four kinds of features to categorize them: all the words of the titles of the product pages, the nouns extracted from the titles, all the words of the titles and the descriptions of the product pages, and the nouns extracted from them. The experiments show that the product pages can be classified most correctly depending on only the nouns of the titles of the product pages. Moreover the complement naive Bayes classifier outperformed the naive Bayes classifier.","PeriodicalId":287775,"journal":{"name":"2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCSSE.2011.5930153","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, the authors categorize product pages on the Web depending on their information. We used naive Bayes and the complement naive Bayes classifier, and tried four kinds of features to categorize them: all the words of the titles of the product pages, the nouns extracted from the titles, all the words of the titles and the descriptions of the product pages, and the nouns extracted from them. The experiments show that the product pages can be classified most correctly depending on only the nouns of the titles of the product pages. Moreover the complement naive Bayes classifier outperformed the naive Bayes classifier.