Memory management of density-based spam detector
Kenichi Yoshida, Fuminori Adachi, Takashi Washio, H. Motoda, Teruaki Homma, Akihiro Nakashima, Hiromitsu Fujikawa, Katsuyuki Yamazaki
The 2005 Symposium on Applications and the Internet, 2005-01-31
DOI: 10.1109/SAINT.2005.38 (https://doi.org/10.1109/SAINT.2005.38)

The volume of mass unsolicited electronic mail, commonly known as spam, has recently increased enormously and has become a serious threat not only to the Internet but also to society. A new spam detection method that uses document space density information has been proposed. Although the proposed method requires extensive e-mail traffic to acquire the necessary information, it can achieve perfect detection (i.e., both recall and precision are 100%) under practical conditions. This paper describes the memory management mechanism of this new spam detection method. Although "least recently used" (LRU) is the standard memory management strategy, we show that 1) a direct-mapped cache can be used as a substitute for the LRU cache, and 2) a "retaining multiply accessed entries" strategy can further improve memory management performance and the theoretical recall rate for spam detection.
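
The abstract does not describe the cache layout, so the following is only a minimal sketch of the two ideas it names: a direct-mapped table in place of an LRU list, and a replacement rule that retains entries accessed more than once. The class name `DirectMappedCache`, the `retain_threshold` parameter, the SHA-1 slot index, and the count decay on a blocked replacement are all assumptions made for illustration, not details taken from the paper.

```python
import hashlib


class DirectMappedCache:
    """Fixed-size table in which each key maps to exactly one slot.

    Hypothetical sketch: a colliding key replaces the resident entry only
    if that entry has been seen fewer than `retain_threshold` times.
    """

    def __init__(self, num_slots, retain_threshold=2):
        # Each slot holds [key, access_count] or None when empty.
        self.slots = [None] * num_slots
        self.retain_threshold = retain_threshold

    def _index(self, key):
        # Assumption: any stable hash works; SHA-1 is used only for determinism.
        digest = hashlib.sha1(key.encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") % len(self.slots)

    def access(self, key):
        """Record one occurrence of `key`; return its count, or 0 if not stored."""
        i = self._index(key)
        slot = self.slots[i]
        if slot is not None and slot[0] == key:
            slot[1] += 1  # hit: the same key was seen again
            return slot[1]
        if slot is not None and slot[1] >= self.retain_threshold:
            # Resident entry was accessed multiple times, so keep it.
            # Assumption: decay its count so it cannot hold the slot forever.
            slot[1] -= 1
            return 0
        self.slots[i] = [key, 1]  # empty slot or singly accessed entry: replace
        return 1


# Usage: with a single slot, every key collides, which makes the policy visible.
cache = DirectMappedCache(num_slots=1)
first = cache.access("mail-a")    # stored with count 1
second = cache.access("mail-a")   # hit, count becomes 2
blocked = cache.access("mail-b")  # "mail-a" is retained, "mail-b" is not stored
stored = cache.access("mail-b")   # "mail-a" has decayed to 1, so it is replaced
```

Unlike an LRU cache, this table needs no recency list: a lookup touches one slot, so both time and memory per entry stay constant. Whether the count threshold and decay match the paper's actual retention rule is not something the abstract confirms.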