Shaiful Bakhtiar bin Rodzman, Normaly Kamal Ismail, Nurazzah Abd Rahman, Syed Ahmad Aljunid, Hayati Abd Rahman, Z. M. Nor, Ku Muhammad Naim Ku Khalif, Ahmad Yunus Mohd Noor
{"title":"Experiment with Text Summarization as a Positive Hierarchical Fuzzy Logic Ranking Indicator for Domain Specific Retrieval of Malay Translated Hadith","authors":"Shaiful Bakhtiar bin Rodzman, Normaly Kamal Ismail, Nurazzah Abd Rahman, Syed Ahmad Aljunid, Hayati Abd Rahman, Z. M. Nor, Ku Muhammad Naim Ku Khalif, Ahmad Yunus Mohd Noor","doi":"10.1109/ISCAIE.2019.8743988","DOIUrl":null,"url":null,"abstract":"Ranking function acts as a predictive algorithm that is used to establish a simple ordering of documents according to its relevance and this process shows the effectiveness, quality and the accuracy for the variety type of Information Retrieval (IR) such as, Domain Specific Retrieval of Malay Translated Hadith. In this research, a Hierarchical Fuzzy Logic Controller of Mamdani-type Fuzzy Inference System has been built to define the ranking function based on the BM25 Model. The model examines four-inputs which are Ontology BM25 Score, Fabrication Rate of Hadith, Shia Rate of Hadith from the previous works of the researchers and the New additional Positive Rate of Hadith. It also examines four-output values of Final Ranking Score which consist of three triangular membership functions. The new Positive Rate of hadith is based on the score value of the automatic text summarization that was executed in pre-processing phase. The proposed system has outperformed the BM25 original score and the Vector Space Model (VM) on 5 topic of queries and 26 queries in the term of individual queries, while the BM25 original score and Vector Space Model only yielded better result in 3 and 0 queries respectively on the P@10, %no measures and MAP. P@10 represent the values of Precision at Rank 10 P@10), %no measures represent the percentage of queries with no relevant documents in the top ten retrieved and MAP represents Mean Average Precision of the queries. The results show the proposed system have capability to demote negative documents and move up the relevant documents in the ranking list with positive indicator and its capability to recall unseen document with the application of ontology in text retrieval. For the future works, the researcher would like to apply the usage of new ranking indicator such as reliability score from the expert and the lay users of the Domain Specific Retrieval of Malay Translated Hadith.","PeriodicalId":369098,"journal":{"name":"2019 IEEE 9th Symposium on Computer Applications & Industrial Electronics (ISCAIE)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 9th Symposium on Computer Applications & Industrial Electronics (ISCAIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCAIE.2019.8743988","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Ranking function acts as a predictive algorithm that is used to establish a simple ordering of documents according to its relevance and this process shows the effectiveness, quality and the accuracy for the variety type of Information Retrieval (IR) such as, Domain Specific Retrieval of Malay Translated Hadith. In this research, a Hierarchical Fuzzy Logic Controller of Mamdani-type Fuzzy Inference System has been built to define the ranking function based on the BM25 Model. The model examines four-inputs which are Ontology BM25 Score, Fabrication Rate of Hadith, Shia Rate of Hadith from the previous works of the researchers and the New additional Positive Rate of Hadith. It also examines four-output values of Final Ranking Score which consist of three triangular membership functions. The new Positive Rate of hadith is based on the score value of the automatic text summarization that was executed in pre-processing phase. The proposed system has outperformed the BM25 original score and the Vector Space Model (VM) on 5 topic of queries and 26 queries in the term of individual queries, while the BM25 original score and Vector Space Model only yielded better result in 3 and 0 queries respectively on the P@10, %no measures and MAP. P@10 represent the values of Precision at Rank 10 P@10), %no measures represent the percentage of queries with no relevant documents in the top ten retrieved and MAP represents Mean Average Precision of the queries. The results show the proposed system have capability to demote negative documents and move up the relevant documents in the ranking list with positive indicator and its capability to recall unseen document with the application of ontology in text retrieval. For the future works, the researcher would like to apply the usage of new ranking indicator such as reliability score from the expert and the lay users of the Domain Specific Retrieval of Malay Translated Hadith.