{"title":"利用TF*PDF算法从新闻档案中提取主题","authors":"Khoo Khyou Bun, M. Ishizuka","doi":"10.1109/WISE.2002.1181645","DOIUrl":null,"url":null,"abstract":"Since the Web became widespread, the amount of electronically available information online, especially news archives, has proliferated and threatens to become overwhelming. We propose an information system that will extract main topics in a news archive on a weekly basis. By obtaining a weekly report, a user can know what the main news events were in the past week.","PeriodicalId":392999,"journal":{"name":"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"131","resultStr":"{\"title\":\"Topic extraction from news archive using TF*PDF algorithm\",\"authors\":\"Khoo Khyou Bun, M. Ishizuka\",\"doi\":\"10.1109/WISE.2002.1181645\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Since the Web became widespread, the amount of electronically available information online, especially news archives, has proliferated and threatens to become overwhelming. We propose an information system that will extract main topics in a news archive on a weekly basis. By obtaining a weekly report, a user can know what the main news events were in the past week.\",\"PeriodicalId\":392999,\"journal\":{\"name\":\"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"131\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WISE.2002.1181645\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISE.2002.1181645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Topic extraction from news archive using TF*PDF algorithm
Since the Web became widespread, the amount of electronically available information online, especially news archives, has proliferated and threatens to become overwhelming. We propose an information system that will extract main topics in a news archive on a weekly basis. By obtaining a weekly report, a user can know what the main news events were in the past week.