{"title":"基于网络爬行和无监督学习的印尼Youtube浏览网络模式分析(分析Pola Minat Tayangan Youtube DI Indonesia dengan网络爬行和监督学习)","authors":"Nfn Nofriani","doi":"10.33164/iptekkom.20.2.2018.93-106","DOIUrl":null,"url":null,"abstract":"YouTube is a popular video sharing website, specifically in Indonesia. Every day, in every country, the list of trending videos is updated on YouTube’s Trending page. The data of trending videos can be used for information exploration, such as analysis on the pattern of interests of YouTube browsing. This research aims to grab and analyse the metadata of trending videos to generate a classifier model and statistics of trending YouTube videos in Indonesia. The data is grabbed from YouTube’s Trending page using Scraper and Screaming Frog SEO Spider tools, every day for 10 consecutive days. The data is later classified into video categories. The approach used for this purpose is rule-based classification using J48 tree algorithm and TF-IDF filter. The result of this research shows that videos about people, blogs, sports, news, politics, comedy, entertainment and music are what interest the people in Indonesia the most.","PeriodicalId":368220,"journal":{"name":"JURNAL IPTEKKOM : Jurnal Ilmu Pengetahuan & Teknologi Informasi","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis On Internet Pattern of Youtube Browsing in Indonesia Using Web Crawling and Unsupervised Learning (Analisis Pola Minat Tayangan Youtube DI Indonesia dengan Web Crawling dan Supervised Learning)\",\"authors\":\"Nfn Nofriani\",\"doi\":\"10.33164/iptekkom.20.2.2018.93-106\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"YouTube is a popular video sharing website, specifically in Indonesia. Every day, in every country, the list of trending videos is updated on YouTube’s Trending page. The data of trending videos can be used for information exploration, such as analysis on the pattern of interests of YouTube browsing. This research aims to grab and analyse the metadata of trending videos to generate a classifier model and statistics of trending YouTube videos in Indonesia. The data is grabbed from YouTube’s Trending page using Scraper and Screaming Frog SEO Spider tools, every day for 10 consecutive days. The data is later classified into video categories. The approach used for this purpose is rule-based classification using J48 tree algorithm and TF-IDF filter. The result of this research shows that videos about people, blogs, sports, news, politics, comedy, entertainment and music are what interest the people in Indonesia the most.\",\"PeriodicalId\":368220,\"journal\":{\"name\":\"JURNAL IPTEKKOM : Jurnal Ilmu Pengetahuan & Teknologi Informasi\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JURNAL IPTEKKOM : Jurnal Ilmu Pengetahuan & Teknologi Informasi\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33164/iptekkom.20.2.2018.93-106\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JURNAL IPTEKKOM : Jurnal Ilmu Pengetahuan & Teknologi Informasi","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33164/iptekkom.20.2.2018.93-106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
YouTube是一个很受欢迎的视频分享网站,尤其是在印度尼西亚。每天,在每个国家,YouTube的趋势页面上都会更新热门视频列表。趋势视频的数据可以用于信息挖掘,例如分析YouTube浏览的兴趣模式。本研究旨在抓取和分析趋势视频的元数据,以生成印度尼西亚YouTube趋势视频的分类器模型和统计数据。这些数据是使用Scraper和scream Frog SEO Spider工具从YouTube的趋势页面抓取的,每天持续10天。这些数据随后被分类为视频类别。用于此目的的方法是使用J48树算法和TF-IDF过滤器的基于规则的分类。研究结果显示,印尼民众最感兴趣的是有关人物、部落格、体育、新闻、政治、喜剧、娱乐和音乐的影片。
Analysis On Internet Pattern of Youtube Browsing in Indonesia Using Web Crawling and Unsupervised Learning (Analisis Pola Minat Tayangan Youtube DI Indonesia dengan Web Crawling dan Supervised Learning)
YouTube is a popular video sharing website, specifically in Indonesia. Every day, in every country, the list of trending videos is updated on YouTube’s Trending page. The data of trending videos can be used for information exploration, such as analysis on the pattern of interests of YouTube browsing. This research aims to grab and analyse the metadata of trending videos to generate a classifier model and statistics of trending YouTube videos in Indonesia. The data is grabbed from YouTube’s Trending page using Scraper and Screaming Frog SEO Spider tools, every day for 10 consecutive days. The data is later classified into video categories. The approach used for this purpose is rule-based classification using J48 tree algorithm and TF-IDF filter. The result of this research shows that videos about people, blogs, sports, news, politics, comedy, entertainment and music are what interest the people in Indonesia the most.