{"title":"A Clique Based Web Page Classification Corrective Approach","authors":"Belmouhcine Abdelbadie, Benkhalifa Mohammed","doi":"10.1109/WI-IAT.2014.135","DOIUrl":"https://doi.org/10.1109/WI-IAT.2014.135","url":null,"abstract":"Nowadays, the web is the most relevant data source. Its size does not stop growing day by day. Web page classification becomes crucial due to this overwhelming amount of data. Web pages contain many noisy contents that bias textual classifiers and lead them to lose focus on their main subject. Web pages are related to each other either implicitly by users' intuitive judgments or explicitly by hyperlinks. Thus, the use of those links in order to correct a class assigned by textual classifier to a web page can be beneficial. In this paper, we propose a post classification corrective approach called Clique Based Correction (CBC) that uses the query-log to build an implicit neighborhood, and collectively corrects classes assigned by a textual classifier to web pages of that neighborhood. This correction helps improve text classifier's results by correcting wrongly assigned categories. When two web pages are linked to each other, they may share the same topic, but when more web pages (three for example) are all related to each other, the probability that those web pages share the same subject becomes stronger. The proposed method operates in four steps. In the first step, it builds a graph called implicit graph, whose vertices are web pages and edges are implicit links. In the second step, it uses a text classifier to determine classes of all web pages represented by vertices in the implicit graph. In the third step, it extracts cliques of web pages from the implicit graph. In the fourth step, it assigns a class to every clique using a voting process. Each web page will be labeled using the class of its clique. This adjustment leads to improvements of results provided by the text classifier. We conduct our experiments using three classifiers: SVM (Support Vector Machine), NB (Naïve Bayes) and KNN (K Nearest Neighbors), on two subsets of ODP (Open Directory Project). Results show that: (1) when applied after SVM, NB or KNN, CBC helps bringing improvements on results. (2) The number of unrelated web pages must be low in order to have significant improvement.","PeriodicalId":120608,"journal":{"name":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","volume":"131 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121375225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"WI 2014 - II Publisher's Information","authors":"Evan Butterfield, Lynne Harris, P. Kellenberger","doi":"10.1109/wi-iat.2014.213","DOIUrl":"https://doi.org/10.1109/wi-iat.2014.213","url":null,"abstract":"IEEE Computer Society Conference Publishing Services (CPS) The IEEE Computer Society produces conference publications for more than 300 acclaimed international conferences each year in a variety of formats, including books, CD-ROMs, USB Drives, and on-line publications. For information about the IEEE Computer Society’s Conference Publishing Services (CPS), please e-mail: cps@computer.org or telephone +1-714-821-8380. Fax +1-714-761-1784. Additional information about Conference Publishing Services (CPS) can be accessed from our web site at: http://www.computer.org/cps","PeriodicalId":120608,"journal":{"name":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122653751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Abe, R. Amadini, Aijun An, Malik Anupama, A. Appice, N. Baptiste, J. Bazan, Khalil Ben Mohamed, S. Yahia, Petr Berka, Lilian Berton, Jerzy Błaszczyński, Desamparados Blazquez, Szymon Bobek, Niladri Chatterjee, Wanxiang Che, Ruoying Chen, Karthikeyani Chitrambalam, Sergio Davalos, Yihua Ding, Pan Du, Maciej Durzewski, G. Dziczkowski, Haruka Eigen, Johannes Fähndrich, M. Fragoulis, S. Giallorenzo, G. Guo, J. Guo, A. H. Mohamad, Mellah Hakima, Liang He, T. Ho, K. Jackowski, A. Jawdat, W. Jaworski, Yuxiang Jia, Shu-yi Jiang, Peng Jin, Shuyuan Jin, Xiaolong Jin, Manish Joshi, M. Junghans, A. Jurek, Andisheh Keikha, Yanyan Lan, Jun Lang, Mark Last, Raymond Y. K. Lau, A. Ławrynowicz, Ai-hua Li, Bin Li, Jianping Li, Xingsen Li, Zhixing Li, Huizhi Liang, Zheng Lin, Pengyuan Liu, Y. Liu, Erick López-Ornelas, Guowei Ma, C. Makris, Adolfo Martínez-Usó, Stuart Middleton, M. Moreno, G. Nie, M. Pavlidou, Filipa Peleja, Gianvito Pio, Laura Po, R. Porrini, P. Portier, Achim Rettinger, E. Ritacco, G. Semeraro, F. Serafi
{"title":"WI 2014 Non-Program Committee Reviewers - II","authors":"A. Abe, R. Amadini, Aijun An, Malik Anupama, A. Appice, N. Baptiste, J. Bazan, Khalil Ben Mohamed, S. Yahia, Petr Berka, Lilian Berton, Jerzy Błaszczyński, Desamparados Blazquez, Szymon Bobek, Niladri Chatterjee, Wanxiang Che, Ruoying Chen, Karthikeyani Chitrambalam, Sergio Davalos, Yihua Ding, Pan Du, Maciej Durzewski, G. Dziczkowski, Haruka Eigen, Johannes Fähndrich, M. Fragoulis, S. Giallorenzo, G. Guo, J. Guo, A. H. Mohamad, Mellah Hakima, Liang He, T. Ho, K. Jackowski, A. Jawdat, W. Jaworski, Yuxiang Jia, Shu-yi Jiang, Peng Jin, Shuyuan Jin, Xiaolong Jin, Manish Joshi, M. Junghans, A. Jurek, Andisheh Keikha, Yanyan Lan, Jun Lang, Mark Last, Raymond Y. K. Lau, A. Ławrynowicz, Ai-hua Li, Bin Li, Jianping Li, Xingsen Li, Zhixing Li, Huizhi Liang, Zheng Lin, Pengyuan Liu, Y. Liu, Erick López-Ornelas, Guowei Ma, C. Makris, Adolfo Martínez-Usó, Stuart Middleton, M. Moreno, G. Nie, M. Pavlidou, Filipa Peleja, Gianvito Pio, Laura Po, R. Porrini, P. Portier, Achim Rettinger, E. Ritacco, G. Semeraro, F. Serafi","doi":"10.1109/wi-iat.2014.222","DOIUrl":"https://doi.org/10.1109/wi-iat.2014.222","url":null,"abstract":"Akinori Abe Roberto Amadini Aijun An Malik Anupama Annalisa Appice Nadine Baptiste Jan Bazan Khalil Ben Mohamed Sadok Ben Yahia Petr Berka Lilian Berton Jerzy Błaszczyński Desamparados Blazquez Szymon Bobek Isaac-Bernardo CaceidoCastro Niladri Chatterjee Wanxiang Che Ruoying Chen Karthikeyani Chitrambalam Sergio Davalos Yihua Ding Pan Du Maciej Durzewski Grzegorz Dziczkowski Haruka Eigen Johannes Fähndrich Manos Fragoulis Saverio Giallorenzo Guibing Guo Jiafeng Guo Abdul Hadi Mohamad Mellah Hakima Liang He Tin Ho Konrad Jackowski Ahmad Jawdat Wojciech Jaworski Yuxiang Jia Shuqiang Jiang Peng Jin Shuyuan Jin Xiaolong Jin Manish Joshi Martin Junghans Anna Jurek Andisheh Keikha Yanyan Lan Jun Lang Mark Last Raymond Y. K. Lau Agnieszka Ławrynowicz Aihua Li Bin Li Jianping Li Xingsen Li Zhixing Li Huizhi Liang Zheng Lin Pengyuan Liu Ying Liu Erick Lopez-Ornelas Guowei Ma Christos Makris Adolfo Martínez-Usó Stuart Middleton Maria Moreno Guangli Nie Maria Pavlidou Filipa Peleja Gianvito Pio Laura Po Riccardo Porrini Pierre-Edouard Portier Achim Rettinger Ettore Ritacco Giovanni Semeraro Francesco Serafino Yanqiu Shao Hualei Shen Huawei Shen Andrey Sherbakov Xuebo Song Bouchra Soukarieh Qi Su Shanu Sushmita Wojciech Świeboda Frank Takes Xuri Tang Yingjie Tian Endang Tjhwa Giorgio Valentini Maurizio Vincini Peter Vojtas Bo Wang Gang Wang Meng Wang Xinglong Wang Zhefeng Wang Zhigang Wang Bunthit Watanapa Sanjaya Wijeyratne Fei Wu Yunqing Xia Jianpeng Xu Tianbing Xu Xueke Xu Yue Xu Norio Yoshimoto Hong Yu Jingsong Yu Xiaodan Yu Mohammed Zaki Lei Zhang Peng Zhang Wenjuan Zhang Xiaofeng Zhang Xiuzhen Zhang Yuhao Zhang Xingming Zhao Yong Zheng Xiaofei Zhou","PeriodicalId":120608,"journal":{"name":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123407761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"WI 2014 - II Title Page iii","authors":"A. Skowron, Lipika Dey, Adam Krasuski, Yuefeng Li","doi":"10.1109/wi-iat.2014.203","DOIUrl":"https://doi.org/10.1109/wi-iat.2014.203","url":null,"abstract":"","PeriodicalId":120608,"journal":{"name":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114800348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"WI 2014 - II Copyright Page","authors":"M. Bartosik","doi":"10.1109/wi-iat.2014.207","DOIUrl":"https://doi.org/10.1109/wi-iat.2014.207","url":null,"abstract":"The papers in this book comprise the proceedings of the meeting mentioned on the cover and title page. They reflect the authors’ opinions and, in the interests of timely dissemination, are published as presented and without change. Their inclusion in this publication does not necessarily constitute endorsement by the editors, the IEEE Computer Society, or the Institute of Electrical and Electronics Engineers, Inc.","PeriodicalId":120608,"journal":{"name":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117240034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"WI 2014 - II Author Index","authors":"","doi":"10.1109/wi-iat.2014.211","DOIUrl":"https://doi.org/10.1109/wi-iat.2014.211","url":null,"abstract":"","PeriodicalId":120608,"journal":{"name":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134246008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"WI 2014 - II Title Page i","authors":"","doi":"10.1109/wi-iat.2014.201","DOIUrl":"https://doi.org/10.1109/wi-iat.2014.201","url":null,"abstract":"","PeriodicalId":120608,"journal":{"name":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","volume":"189 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126797314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chi-Chih Yao, Karl J. Friston, H. Skarżyńśki, S. Decker, Robert Kowalski, Sadaaki Miyamoto, Yi Pan, J. Sowa, Marcin S. Szczuka, A. Skowron, Ning Zhong
{"title":"WI 2014 Preface - II","authors":"Chi-Chih Yao, Karl J. Friston, H. Skarżyńśki, S. Decker, Robert Kowalski, Sadaaki Miyamoto, Yi Pan, J. Sowa, Marcin S. Szczuka, A. Skowron, Ning Zhong","doi":"10.1109/wi-iat.2014.224","DOIUrl":"https://doi.org/10.1109/wi-iat.2014.224","url":null,"abstract":"This volume contains the papers selected for presentation at the 2014 IEEE/WIC/ACM International Conference on Web Intelligence (WI'14), held as part of the 2014 Web Intelligence Congress (WIC'14) at the University of Warsaw, Warsaw, Poland, from 11 to 14 in August, 2014. The conference was sponsored and co-organized by the IEEE Computer Society, the Web Intelligence Consortium (WIC), Association for Computing Machinery (ACM), the University of Warsaw, Polish Mathematical Society and Warsaw University of Technology.","PeriodicalId":120608,"journal":{"name":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126158558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","authors":"","doi":"10.5555/2682648","DOIUrl":"https://doi.org/10.5555/2682648","url":null,"abstract":"","PeriodicalId":120608,"journal":{"name":"Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02","volume":"182 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132889733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}