{"title":"挖掘博客圈,为移动地图服务生成当地美食热点","authors":"C. Shih, Ting-Chun Peng, W. Lai","doi":"10.1109/ICDIM.2009.5356785","DOIUrl":null,"url":null,"abstract":"On-the-go consumers require dynamic information, particularly \"word of mouth, \" to make better purchase decisions. A popular genre of mobile map services is travel/cuisine, which is a popular topic for bloggers as well. This study attempts to generate local cuisine hotspot maps through blog content mining. The main obstacle in doing this involves recognizing and extracting restaurants and essential restaurant information (i.e., restaurant dishes) in unstructured content. In contrast to traditional Named Entity Recognition (NER) targets, dish name is a promising target that received little attention in previous studies. This study develops methods for recognizing and extracting restaurant names and dish names from Chinese blog posts and achieves satisfactory performance. The extraction results are arranged into hotspots and presented in map views. The extracted information can be fed back to POI (Point of Interest) databases, and thus a brand-new POI database comprising information extracted from User Generated Content (UGC) can be realized.","PeriodicalId":300287,"journal":{"name":"2009 Fourth International Conference on Digital Information Management","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Mining the blogosphere to generate local cuisine hotspots for mobile map service\",\"authors\":\"C. Shih, Ting-Chun Peng, W. Lai\",\"doi\":\"10.1109/ICDIM.2009.5356785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"On-the-go consumers require dynamic information, particularly \\\"word of mouth, \\\" to make better purchase decisions. A popular genre of mobile map services is travel/cuisine, which is a popular topic for bloggers as well. This study attempts to generate local cuisine hotspot maps through blog content mining. The main obstacle in doing this involves recognizing and extracting restaurants and essential restaurant information (i.e., restaurant dishes) in unstructured content. In contrast to traditional Named Entity Recognition (NER) targets, dish name is a promising target that received little attention in previous studies. This study develops methods for recognizing and extracting restaurant names and dish names from Chinese blog posts and achieves satisfactory performance. The extraction results are arranged into hotspots and presented in map views. The extracted information can be fed back to POI (Point of Interest) databases, and thus a brand-new POI database comprising information extracted from User Generated Content (UGC) can be realized.\",\"PeriodicalId\":300287,\"journal\":{\"name\":\"2009 Fourth International Conference on Digital Information Management\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Fourth International Conference on Digital Information Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDIM.2009.5356785\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Fourth International Conference on Digital Information Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2009.5356785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
摘要
忙碌的消费者需要动态信息,尤其是“口口相传”,以做出更好的购买决定。一种流行的手机地图服务类型是旅游/美食,这也是博客的热门话题。本研究尝试通过博客内容挖掘生成地方美食热点地图。这样做的主要障碍涉及在非结构化内容中识别和提取餐馆和基本餐馆信息(即餐馆菜肴)。与传统的命名实体识别(NER)目标相比,菜肴名称是一个很有前途的目标,但在以往的研究中很少受到关注。本研究开发了从中文博客文章中识别和提取餐厅名称和菜名的方法,取得了令人满意的效果。提取结果按热点排列,在地图视图中呈现。提取的信息可以反馈到POI (Point of Interest)数据库,从而实现一个全新的POI数据库,其中包含从用户生成内容(UGC)中提取的信息。
Mining the blogosphere to generate local cuisine hotspots for mobile map service
On-the-go consumers require dynamic information, particularly "word of mouth, " to make better purchase decisions. A popular genre of mobile map services is travel/cuisine, which is a popular topic for bloggers as well. This study attempts to generate local cuisine hotspot maps through blog content mining. The main obstacle in doing this involves recognizing and extracting restaurants and essential restaurant information (i.e., restaurant dishes) in unstructured content. In contrast to traditional Named Entity Recognition (NER) targets, dish name is a promising target that received little attention in previous studies. This study develops methods for recognizing and extracting restaurant names and dish names from Chinese blog posts and achieves satisfactory performance. The extraction results are arranged into hotspots and presented in map views. The extracted information can be fed back to POI (Point of Interest) databases, and thus a brand-new POI database comprising information extracted from User Generated Content (UGC) can be realized.