SMUC '10最新文献

筛选
英文 中文
On the difficulty of clustering company tweets 关于公司推文聚类的难度
SMUC '10 Pub Date : 2010-10-30 DOI: 10.1145/1871985.1872001
Fernando Pérez-Téllez, David Pinto, J. Cardiff, Paolo Rosso
{"title":"On the difficulty of clustering company tweets","authors":"Fernando Pérez-Téllez, David Pinto, J. Cardiff, Paolo Rosso","doi":"10.1145/1871985.1872001","DOIUrl":"https://doi.org/10.1145/1871985.1872001","url":null,"abstract":"Twitter is a new successful technology of the Web 2.0 genre which is used by millions of people and companies to publish brief messages (\"tweets\") with the purpose of sharing experiences and/or opinions about a product or service. Due to the huge amount of information available in this type of technology, there is a clear need for new systems that can mine these messages in order to derive information about the collective thinking of twitterers (e.g. for opinion or sentiment analysis). Tweet analysis is a very important task because comments, opinions, suggestions, complaints can be used as marketing strategies or for determining information on a company's reputation. For this purpose, it is necessary to establish whether a tweet refers to a company or not, which is not a straightforward keyword search process as there may be multiple contexts in which a name can be used. The aim of this work is to present and compare a number of different approaches based on clustering that determine whether a given tweet refers to a particular company or not. For this purpose, we have used an enriching methodology in order to improve the representation of tweets and as a consequence the performance of the clustering company tweets task. The obtained results are promising and highlight the difficulty of this task.","PeriodicalId":244822,"journal":{"name":"SMUC '10","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134456631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
A knowledge-rich approach to feature-based opinion extraction from product reviews 从产品评论中提取基于特征的意见的知识丰富的方法
SMUC '10 Pub Date : 2010-10-30 DOI: 10.1145/1871985.1871990
Fermín L. Cruz, Javier Ortega
{"title":"A knowledge-rich approach to feature-based opinion extraction from product reviews","authors":"Fermín L. Cruz, Javier Ortega","doi":"10.1145/1871985.1871990","DOIUrl":"https://doi.org/10.1145/1871985.1871990","url":null,"abstract":"Feature-based opinion extraction is a task related to information extraction, which consists of extracting structured opinions on features of some object from reviews or other subjective textual sources. Over the last years, this problem has been studied by some researchers, generally in an unsupervised, domain-independent manner. As opposed to that, in this work we propose a redefinition of the problem from a more practical point of view, and describe a domain-specific, resource-based opinion extraction system. We focus on the description and generation of those resources, and briefly report the extraction system architecture and a few initial experiments. The results suggest that domain-specific knowledge is a valuable resource in order to build precise opinion extraction systems.","PeriodicalId":244822,"journal":{"name":"SMUC '10","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116357277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Exploiting tag and word correlations for improved webpage clustering 利用标签和词的相关性来改进网页聚类
SMUC '10 Pub Date : 2010-10-30 DOI: 10.1145/1871985.1871989
Anusua Trivedi, Piyush Rai, S. Duvall, Hal Daumé
{"title":"Exploiting tag and word correlations for improved webpage clustering","authors":"Anusua Trivedi, Piyush Rai, S. Duvall, Hal Daumé","doi":"10.1145/1871985.1871989","DOIUrl":"https://doi.org/10.1145/1871985.1871989","url":null,"abstract":"Automatic clustering of webpages helps a number of information retrieval tasks, such as improving user interfaces, collection clustering, introducing diversity in search results, etc. Typically, webpage clustering algorithms only use features extracted from the page-text. However, the advent of social-bookmarking websites, such as StumbleUpon and Delicious, has led to a huge amount of user-generated content such as the tag information that is associated with the webpages. In this paper, we present a subspace based feature extraction approach which leverages tag information to complement the page-contents of a webpage to extract highly discriminative features, with the goal of improved clustering performance. In our approach, we consider page-text and tags as two separate views of the data, and learn a shared subspace that maximizes the correlation between the two views. Any clustering algorithm can then be applied in this subspace. We compare our subspace based approach with a number of baselines that use tag information in various other ways, and show that the subspace based approach leads to improved performance on the webpage clustering task. Although our results here are on the webpage clustering task, the same approach can be used for webpage classification as well. In the end, we also suggest possible future work for leveraging tag information in webpage clustering, especially when tag information is present for not all, but only for a small number of webpages.","PeriodicalId":244822,"journal":{"name":"SMUC '10","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128005185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Exploiting web reviews for generating customer service surveys 利用网络评论生成客户服务调查
SMUC '10 Pub Date : 2010-10-30 DOI: 10.1145/1871985.1871995
Suke Li, Zhong Chen
{"title":"Exploiting web reviews for generating customer service surveys","authors":"Suke Li, Zhong Chen","doi":"10.1145/1871985.1871995","DOIUrl":"https://doi.org/10.1145/1871985.1871995","url":null,"abstract":"Traditional customer satisfaction analysis relies on the work of designing, distributing, collecting and analyzing surveys. Surveys that are designed by humans may be subjective, and it is hard to know what service aspects are the most important for customers. To address this issue, this paper proposes a method of automatically generating service surveys through mining Web reviews. Candidate service aspects are extracted using simple extraction rules. Then we rank candidate service aspects in terms of their weights generated by combining co-occurrence method and linear regression method together. Experimental results demonstrate the effectiveness of the proposed method.","PeriodicalId":244822,"journal":{"name":"SMUC '10","volume":"18 9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125768295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Smuc 2010 industry panel Smuc 2010行业研讨会
SMUC '10 Pub Date : 2010-10-30 DOI: 10.1145/1871985.1872006
N. Wanas, M. Makrehchi, José Carlos Cortizo
{"title":"Smuc 2010 industry panel","authors":"N. Wanas, M. Makrehchi, José Carlos Cortizo","doi":"10.1145/1871985.1872006","DOIUrl":"https://doi.org/10.1145/1871985.1872006","url":null,"abstract":"Nayer Wanas is currently the CTO and founder of Nile innovations, a startup founded in 2010 focusing on analysis, processing and delivery of user generated content streams on different platforms. In 2007, Nayer joined the Cairo Microsoft Innovation Laboratory (CMIC) as a researcher conducting research in the area of mining social media and web 2.0 platforms. In 2005, he was involved in the design and founding of the Data Mining and Computer Modeling Centre of Excellence in Egypt, and held the position of program manager of the center till 2007. In 2004, he was a founder and associate of knowledge for development, a consultancy in utilizing ICT for socio-economic development. In 2003,","PeriodicalId":244822,"journal":{"name":"SMUC '10","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129995359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Web-based statistical fact checking of textual documents 基于web的文本文档统计事实检查
SMUC '10 Pub Date : 2010-10-30 DOI: 10.1145/1871985.1872002
A. Magdy, N. Wanas
{"title":"Web-based statistical fact checking of textual documents","authors":"A. Magdy, N. Wanas","doi":"10.1145/1871985.1872002","DOIUrl":"https://doi.org/10.1145/1871985.1872002","url":null,"abstract":"User generated content has been growing tremendously in recent years. This content reflects the interests and the diversity of online users. In turn, the diversity among internet users is also reflected in the quality of the content being published online. This increases the need to develop means to gauge the support available for content posted online. In this work, we aim to make use of the web-content to calculate a statistical support score for textual documents. In the proposed algorithm, phrases representing key facts are extracted to construct basic elements of the document. Search is used thereon to validate the support available for these elements online, leading to assigning an overall score for each document. Experimental results have shown a difference between the score distribution of factual news data and false facts data. This indicates that the approach seems to be a promising seed for distinguishing different articles based on the content.","PeriodicalId":244822,"journal":{"name":"SMUC '10","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122004015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 56
Keynote speaker 主讲人
SMUC '10 Pub Date : 2010-01-01 DOI: 10.1145/1871985.1871987
Bing Liu
{"title":"Keynote speaker","authors":"Bing Liu","doi":"10.1145/1871985.1871987","DOIUrl":"https://doi.org/10.1145/1871985.1871987","url":null,"abstract":"Following a brief introduction into 3D laser printing on the micrometer and nanometer scale based on two-photon absorption [1,2], I will emphasize recent progress of my group in this field. This includes replacing two-photon absorption by one-color two-step absorption, allowing to use compact and inexpensive continuous-wave lasers rather than femtosecond laser systems [3]. Using two-color two-step absorption combined with the idea of light-sheet laser printing [1,4], we have more recently achieved print rates approaching 10^7 voxels/s. Aligning the director of liquid-crystal elastomers during the 3D laser printing process yields 3D micro-architectures that can be actuated by light from an LED [5]. Finally, I discuss recent [6] progress in regard to laser printing of functional microelectronic devices such as diodes, memristors, and transistors.","PeriodicalId":244822,"journal":{"name":"SMUC '10","volume":"23 1","pages":"1-2"},"PeriodicalIF":0.0,"publicationDate":"2010-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123935581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信