{"title":"搜索引擎查询中的成人内容检测","authors":"Levent Soykan, Cihan Karsak, Ilknur Durgar El-Kahlout","doi":"10.1109/SIU55565.2022.9864759","DOIUrl":null,"url":null,"abstract":"It is important to detect adult content in search engine queries in order to filter adult sites depending on the safe internet choices of the user. In this study, we investigate adult content classification in search engine entries for Turkish. Firstly, we collected and labeled data, and then we carried out classification experiments with both machine learning and deep learning methods. As a result of the experiments, we observed that deep learning methods performs better than machine learning methods. We obtained the best accuracy scores with the transformer based Electra model with 0.94 F1 score.","PeriodicalId":115446,"journal":{"name":"2022 30th Signal Processing and Communications Applications Conference (SIU)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adult Content Detection in Search Engine Queries\",\"authors\":\"Levent Soykan, Cihan Karsak, Ilknur Durgar El-Kahlout\",\"doi\":\"10.1109/SIU55565.2022.9864759\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It is important to detect adult content in search engine queries in order to filter adult sites depending on the safe internet choices of the user. In this study, we investigate adult content classification in search engine entries for Turkish. Firstly, we collected and labeled data, and then we carried out classification experiments with both machine learning and deep learning methods. As a result of the experiments, we observed that deep learning methods performs better than machine learning methods. We obtained the best accuracy scores with the transformer based Electra model with 0.94 F1 score.\",\"PeriodicalId\":115446,\"journal\":{\"name\":\"2022 30th Signal Processing and Communications Applications Conference (SIU)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 30th Signal Processing and Communications Applications Conference (SIU)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SIU55565.2022.9864759\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 30th Signal Processing and Communications Applications Conference (SIU)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIU55565.2022.9864759","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
It is important to detect adult content in search engine queries in order to filter adult sites depending on the safe internet choices of the user. In this study, we investigate adult content classification in search engine entries for Turkish. Firstly, we collected and labeled data, and then we carried out classification experiments with both machine learning and deep learning methods. As a result of the experiments, we observed that deep learning methods performs better than machine learning methods. We obtained the best accuracy scores with the transformer based Electra model with 0.94 F1 score.