{"title":"Optimal Selection of Training Courses for Unemployed People based on Stable Marriage Model","authors":"Jorge Martínez Gil, B. Freudenthaler","doi":"10.1145/3366030.3366063","DOIUrl":"https://doi.org/10.1145/3366030.3366063","url":null,"abstract":"The problem that we address here is given n job seekers and n job offers, where each job seeker has ranked all job offers in order of preference given by a suitability function, and vice versa; the goal is to compute the minimum set of skills to be offered to the job seekers, so that a) a global stable marriage between job seekers and potential employers can be reached, and b) the degree of satisfaction for that stable marriage might be maximum. To achieve this goal, we have designed an iterative algorithmic solution that can be solved in polynomial time. Additionally, we illustrate our solution with an use case based on a numerical example.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128154334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Analysis of Influence of Emoticons on Affective Impressions Feeling from Tweets","authors":"Koji Nakahira, T. Kumamoto","doi":"10.1145/3366030.3366067","DOIUrl":"https://doi.org/10.1145/3366030.3366067","url":null,"abstract":"In this paper, we investigated how Twitter users perceived affective impressions from tweets (with and without emoticons), and formulated the influence of emoticons on affective impressions. Initially, we conducted questionnaires, and quantified the impressions associated with three types of text: tweets with emoticons, tweets without emoticons, and emoticons. Multiple regression analysis was then applied to the three types of impression data, and consequently, multiple regression equations representing the relationships among them were obtained, where impression data on the tweets with emoticons were used as the objective variable, and impression data on the tweets without emoticons and the emoticons were used as the explanatory variables. Finally, the accuracy of the equations was estimated for learned and unlearned data, and their effectiveness was shown. Note that our target impressions are limited to the following eight types: \"Offensive and/or Unpleasant,\" \"Negative,\" \"Good feeling,\" \"Happy and/or Pleasant,\" \"Positive,\" \"Warm feel,\" \"Gloomy,\" and \"Scary.\"","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134350534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Query Relaxation using Spreading-Activation and SKOS-Ontologies","authors":"Alexander Stenzer","doi":"10.1145/3366030.3366097","DOIUrl":"https://doi.org/10.1145/3366030.3366097","url":null,"abstract":"Digital libraries and archives adopting the Linked Open Data (LOD) approach add descriptive metadata to the objects stored in their inventory in order to facilitate searching and sorting. In many cases the available metadata terms are organized as a controlled vocabulary. Technically, the controlled vocabularies are often provided in the form of SKOS ontologies thus allowing to apply semantic web technologies to establish inter-vocabulary relations and ask queries. However, how can a search over the contents of different digital libraries each of which relies on their own vocabulary be performed without sacrificing recall or having to align the underlying ontologies beforehand? In this paper we present an approach based on query relaxation to solving this problem. Considering the graph nature of ontologies for controlled vocabularies we propose to use a spreading activation algorithm to relax and subsequently transform SPARQL queries in a way that makes them suitable for other vocabularies.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133059550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data Source Selection in Big Data Context","authors":"Hicham Moad Safhi, B. Frikh, B. Ouhbi","doi":"10.1145/3366030.3366121","DOIUrl":"https://doi.org/10.1145/3366030.3366121","url":null,"abstract":"Big Data presents promising technological and economical opportunities. In fact, it has become the raw material of production for many organizations. Data is available in large quantities, and it continues generating abundantly. However, not all the data will have valuable knowledge. Unreliable sources provide misleading and biased information, and even reliable sources could suffer from low data quality. In this paper, we propose a novel methodology for the selectability of data sources, by both considering the presence and the absence of users' preferences. The proposed model integrates multiple factors that affect the reliability of data sources, including their quality, gain, cost and coverage. Experimental results on real world data-sets, show its capability to find the subset of relevant and reliable sources with the lowest cost.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126019961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"BM25-AH: Enhanced BM25 Algorithm for Domain-Specific Search Engine","authors":"Kirk Kalian, Charles Remig, Youna Jung","doi":"10.1145/3366030.3366107","DOIUrl":"https://doi.org/10.1145/3366030.3366107","url":null,"abstract":"The Virginia Military Institute (VMI) uses Google search to provide webpage search service in the VMI website. As Google search is a general-purpose service, it does not consider VMI-specific information and in turn, often fails to retrieve relevant information. To address the limitation, in this paper, we propose the BM25-AH algorithm, an extension of the BM25 ranking algorithm that utilizes domain-specific knowledge by adding four new features to the existing BM25 algorithm. The implementation results show us the potential of the proposed algorithm and the new VMI search engine.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123499092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Big Data Management and Analytics in Intelligent Smart Environments: State-of-the-Art Analysis and Future Research Directions","authors":"A. Cuzzocrea","doi":"10.1145/3366030.3366044","DOIUrl":"https://doi.org/10.1145/3366030.3366044","url":null,"abstract":"This paper focuses on big data management and analytics in intelligent smart environments, with particular regards to intelligent transportation and logistics systems, and provides relevant research directions that may represent a milestone for future years.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121310025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Václav Jirkovský, O. Šebek, Petr Kadera, Pavel Burget, Sönke Knoch, Tilman Becker
{"title":"Facilitation of Domain-Specific Data Models Design using Semantic Web Technologies for Manufacturing","authors":"Václav Jirkovský, O. Šebek, Petr Kadera, Pavel Burget, Sönke Knoch, Tilman Becker","doi":"10.1145/3366030.3366111","DOIUrl":"https://doi.org/10.1145/3366030.3366111","url":null,"abstract":"Modern manufacturing faces a challenge of integrating data models from various sources/domains which may differ both semantically and technically when particular domain specific data models are designed by different users and stored in different formats. This paper introduces an approach for facilitating the design of domain-specific data models using semantic web technologies. In this approach, all the information required for managing the production (including a description of a product, processes involved in the production, and existing resources and their specifications) is captured in an ontology. The proposed Product, Process, and Resource (PPR) ontology defines fundamental conceptualization of the production that can be easily applied to the arbitrary domain. Application of the PPR ontology is demonstrated in the case of simple truck assembling by means of robots. Capturing the knowledge in the form of ontology provides the advantage of employing supporting tools such as reasoners for consistency checking or query languages for information extraction. The paper demonstrates the utilization of SQWRL for searching resources suitable to manipulate given truck parts on the basis of semantic matching between properties of particular elements.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130986621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Crawling Method with No Parameters for Geo-social Data based on Road Maps","authors":"Sou Ijima, Masaharu Hirota, Shohei Yokoyama","doi":"10.1145/3366030.3366094","DOIUrl":"https://doi.org/10.1145/3366030.3366094","url":null,"abstract":"Researchers must crawl geo-social data to analyze and visualize geo-social data. A conventional method to exhaustively crawl geosocial data is based on a grid. The crawler divides a specified area into a grid and uses the center coordinates of each cell to query databases using APIs. However, there is a difficult problem when using the grid-based method. It is that researchers cannot estimate the optimized grid size to exhaustively crawl geo-social data in advance because the optimized grid size depends on data density owing to geographical characteristics of an area. We focus on the fact that geo-social data are dense along roads. Thus, we propose a method based on road maps to exhaustively crawl geo-social data. We demonstrated that our method can crawl geo-social data by using almost the same number of queries compared to the crawler with an optimized grid size.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114373786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Content Based Fake News Detection Using N-Gram Models","authors":"Hnin Ei Wynne, Zar Zar Wint","doi":"10.1145/3366030.3366116","DOIUrl":"https://doi.org/10.1145/3366030.3366116","url":null,"abstract":"Fake news is very popular these days because of the increasing popularity of social media. Detecting fake news is considered as one of the most dangerous types of deception because it is created with dishonest intention to misdirect the public. Many researchers proposed fake news detection systems considering many approaches; content, social-context, and propagation. When the news is detected fake or real, there is a limitation in the accuracy and understandability of language. In this paper, we propose the fake news detection system that considers the content of the online news articles. We investigate two machine learning algorithms with the use of word n-grams and character n-grams analysis. Experiments yield better results using character n-grams with Term-Frequency-Inverted Document Frequency (TF-IDF) and Gradient Boosting Classifier achieves an accuracy of 96%.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133311605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Takuma Hirotsu, Masaharu Hirota, Tetsu Araki, Masaki Endo, H. Ishikawa
{"title":"Tourism application with CNN-Based Classification specialized for cultural information","authors":"Takuma Hirotsu, Masaharu Hirota, Tetsu Araki, Masaki Endo, H. Ishikawa","doi":"10.1145/3366030.3366073","DOIUrl":"https://doi.org/10.1145/3366030.3366073","url":null,"abstract":"Over-tourism has become an important difficulty in Japan because the number of visiting international tourists has increased in recent years. This intensive tourism leads to sightseeing problems because opportunities to inform tourists about culture and rules in tourist areas are few. Some system is needed to convey correct cultural aspects of tourist areas. This paper proposes a system to present a user with useful information such as area- specific culture from photographs taken with a convolutional neural network (CNN). Tourists can gain information by associating the contents with the real world by browsing useful information while viewing photographs. After we constructed the prototype system to present 30 types of useful information in English, we evaluated our system quantitatively. We also administered a questionnaire survey for Japanese and foreign residents. The results demonstrate that our system is effective to facilitate foreign tourists' understanding Japanese culture and norms.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"49 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133713063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}