Yantine Arsita Br. Panjaitan, I. Surjandari, Asma Rosyidah
{"title":"Text document clustering using self organizing map: Theses and dissertations of universitas Indonesia","authors":"Yantine Arsita Br. Panjaitan, I. Surjandari, Asma Rosyidah","doi":"10.1109/ICSITECH.2017.8257096","DOIUrl":null,"url":null,"abstract":"Accessibility is a critical aspect to be considered by college library in order to facilitate users in searching library collections. The Library of Universitas Indonesia, as one of Asia's largest library with more than 1,500,000 book collections, should also concern about accessibility to balance its numerous collections. UI-ana collections or works produced by and associated with Universitas Indonesia; in particular theses (undergraduate and graduate theses) and dissertations are one of the largest numbers of collections in Universitas Indonesia's Library. However, the current collection's management system was still based on the submission of the collection in Universitas Indonesia's Library. Since these collections are arranged with no exact criterion, it is harder for users to find theses and dissertations with the same topic. Therefore, management of these collections based on certain criterion is extremely needed to facilitate users in searching these collections. This research aims to determine the categories that can represent theses and dissertations through abstract text mining of each collection in 2005–2015 with a clustering algorithm, namely Self-organizing Map. This study found 139 categories which will be used to classify theses and dissertations of Universitas Indonesia.","PeriodicalId":165045,"journal":{"name":"2017 3rd International Conference on Science in Information Technology (ICSITech)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 3rd International Conference on Science in Information Technology (ICSITech)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSITECH.2017.8257096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Accessibility is a critical aspect to be considered by college library in order to facilitate users in searching library collections. The Library of Universitas Indonesia, as one of Asia's largest library with more than 1,500,000 book collections, should also concern about accessibility to balance its numerous collections. UI-ana collections or works produced by and associated with Universitas Indonesia; in particular theses (undergraduate and graduate theses) and dissertations are one of the largest numbers of collections in Universitas Indonesia's Library. However, the current collection's management system was still based on the submission of the collection in Universitas Indonesia's Library. Since these collections are arranged with no exact criterion, it is harder for users to find theses and dissertations with the same topic. Therefore, management of these collections based on certain criterion is extremely needed to facilitate users in searching these collections. This research aims to determine the categories that can represent theses and dissertations through abstract text mining of each collection in 2005–2015 with a clustering algorithm, namely Self-organizing Map. This study found 139 categories which will be used to classify theses and dissertations of Universitas Indonesia.