自动收集、监控和挖掘日本博客

WWW Alt. '04 Pub Date : 2004-05-19 DOI:10.1145/1013367.1013455

T. Nanno, Toshiaki Fujiki, Yasuhiro Suzuki, M. Okumura

{"title":"自动收集、监控和挖掘日本博客","authors":"T. Nanno, Toshiaki Fujiki, Yasuhiro Suzuki, M. Okumura","doi":"10.1145/1013367.1013455","DOIUrl":null,"url":null,"abstract":"We present a system that tries to automatically collect and monitor Japanese blog collections that include not only ones made with blog softwares but also ones written as normal web pages. Our approach is based on extraction of date expressions and analysis of HTML documents. Our system also extracts and mines useful information from the collected blog pages.","PeriodicalId":409891,"journal":{"name":"WWW Alt. '04","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"78","resultStr":"{\"title\":\"Automatically collecting, monitoring, and mining japanese weblogs\",\"authors\":\"T. Nanno, Toshiaki Fujiki, Yasuhiro Suzuki, M. Okumura\",\"doi\":\"10.1145/1013367.1013455\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a system that tries to automatically collect and monitor Japanese blog collections that include not only ones made with blog softwares but also ones written as normal web pages. Our approach is based on extraction of date expressions and analysis of HTML documents. Our system also extracts and mines useful information from the collected blog pages.\",\"PeriodicalId\":409891,\"journal\":{\"name\":\"WWW Alt. '04\",\"volume\":\"71 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"78\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"WWW Alt. '04\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1013367.1013455\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"WWW Alt. '04","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1013367.1013455","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 78

摘要

我们提出了一个系统，试图自动收集和监控日本的博客集合，其中不仅包括博客软件制作的，也包括作为正常网页编写的博客。我们的方法基于日期表达式的提取和HTML文档的分析。系统还对收集到的博客页面进行了有用信息的提取和挖掘。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automatically collecting, monitoring, and mining japanese weblogs

We present a system that tries to automatically collect and monitor Japanese blog collections that include not only ones made with blog softwares but also ones written as normal web pages. Our approach is based on extraction of date expressions and analysis of HTML documents. Our system also extracts and mines useful information from the collected blog pages.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

WWW Alt. '04

自引率

0.00%

发文量