MovieMatcher: A Heuristic to Match Movies Based on Metadata from Different Data Sources

Fernando Brito, J. Moreira
{"title":"MovieMatcher: A Heuristic to Match Movies Based on Metadata from Different Data Sources","authors":"Fernando Brito, J. Moreira","doi":"10.1145/2664551.2664587","DOIUrl":null,"url":null,"abstract":"With the advent of Web 2.0 and the behavior change which it brought, there are millions of users worldwide contributing to different databases with various forms of data, such as movie ratings, for example. Moreover, the same real-world object (a song, a band or a movie) can be modeled using different ontologies or represented in different ways within the same ontology. Thus, the same film is often described by different attributes in different databases, making it difficult to perform an automatic mapping between those databases. We propose MovieMatcher, which is a heuristic that matches films across different databases using their metadata. After performing 2 experiments with the attempt to match 500 films to IMDb and Rotten Tomatoes databases, MovieMatcher had a success rate of 97.4% and 94.1%, in contrast to an alternative, simpler approach (title exact matching), which had a success rate of 80.8% and 81.9%, respectively.","PeriodicalId":114454,"journal":{"name":"Brazilian Symposium on Multimedia and the Web","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brazilian Symposium on Multimedia and the Web","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2664551.2664587","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the advent of Web 2.0 and the behavior change which it brought, there are millions of users worldwide contributing to different databases with various forms of data, such as movie ratings, for example. Moreover, the same real-world object (a song, a band or a movie) can be modeled using different ontologies or represented in different ways within the same ontology. Thus, the same film is often described by different attributes in different databases, making it difficult to perform an automatic mapping between those databases. We propose MovieMatcher, which is a heuristic that matches films across different databases using their metadata. After performing 2 experiments with the attempt to match 500 films to IMDb and Rotten Tomatoes databases, MovieMatcher had a success rate of 97.4% and 94.1%, in contrast to an alternative, simpler approach (title exact matching), which had a success rate of 80.8% and 81.9%, respectively.
MovieMatcher:一种基于不同数据源元数据的电影匹配启发式算法
随着Web 2.0的出现及其带来的行为变化,全世界有数百万用户为不同的数据库提供各种形式的数据,例如电影评级。此外,可以使用不同的本体对相同的现实世界对象(歌曲、乐队或电影)进行建模,或者在同一本体中以不同的方式表示。因此,同一部电影通常在不同的数据库中用不同的属性来描述,这使得在这些数据库之间执行自动映射变得困难。我们提出MovieMatcher,这是一种启发式方法,可以使用不同数据库的元数据匹配电影。在进行了两次尝试将500部电影与IMDb和烂番茄数据库匹配的实验后,MovieMatcher的成功率分别为97.4%和94.1%,而另一种更简单的方法(标题精确匹配)的成功率分别为80.8%和81.9%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信