{"title":"基于深度学习的网页功能分类","authors":"Caner Balim, Kemal Özkan","doi":"10.1109/SIU.2019.8806240","DOIUrl":null,"url":null,"abstract":"Automatic processing of websites is of great importance for applications such as search engine that extract information from web pages. Search engines use meta tag values when classifying pages of websites. Meta tag names can change for different languages. For example, for login page, entries such as login, login page or giris, giris sayfası may change from language to language. When the websites are examined, it can be seen that each of the pages created for the same purpose has similar designs. In this study, a deep learning based model was proposed for functional classification of web pages, regardless of language. Transfer learning was used to reduce the cost during the feature extraction process from recorded web page images. Finally, the results of two different experiments are presented for show the effectiveness of the proposed method in the classification of web pages according to their functions.","PeriodicalId":326275,"journal":{"name":"2019 27th Signal Processing and Communications Applications Conference (SIU)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Functional Classification of Web Pages with Deep Learning\",\"authors\":\"Caner Balim, Kemal Özkan\",\"doi\":\"10.1109/SIU.2019.8806240\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic processing of websites is of great importance for applications such as search engine that extract information from web pages. Search engines use meta tag values when classifying pages of websites. Meta tag names can change for different languages. For example, for login page, entries such as login, login page or giris, giris sayfası may change from language to language. When the websites are examined, it can be seen that each of the pages created for the same purpose has similar designs. In this study, a deep learning based model was proposed for functional classification of web pages, regardless of language. Transfer learning was used to reduce the cost during the feature extraction process from recorded web page images. Finally, the results of two different experiments are presented for show the effectiveness of the proposed method in the classification of web pages according to their functions.\",\"PeriodicalId\":326275,\"journal\":{\"name\":\"2019 27th Signal Processing and Communications Applications Conference (SIU)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 27th Signal Processing and Communications Applications Conference (SIU)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SIU.2019.8806240\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 27th Signal Processing and Communications Applications Conference (SIU)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIU.2019.8806240","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Functional Classification of Web Pages with Deep Learning
Automatic processing of websites is of great importance for applications such as search engine that extract information from web pages. Search engines use meta tag values when classifying pages of websites. Meta tag names can change for different languages. For example, for login page, entries such as login, login page or giris, giris sayfası may change from language to language. When the websites are examined, it can be seen that each of the pages created for the same purpose has similar designs. In this study, a deep learning based model was proposed for functional classification of web pages, regardless of language. Transfer learning was used to reduce the cost during the feature extraction process from recorded web page images. Finally, the results of two different experiments are presented for show the effectiveness of the proposed method in the classification of web pages according to their functions.