{"title":"基于冲突评分的孟加拉语关系抽取远程监督减少错误标签","authors":"Tanzim Mahfuz, T. Suha, M. Anwar","doi":"10.1109/CSDE50874.2020.9411604","DOIUrl":null,"url":null,"abstract":"The research area of information extraction (IE) aims to extract structured information such as types of entities and relations between them, from unstructured textual data like newswires, blogs, governmental documents etc. Relation extraction (RE) deals with the automatic detection of relationships between concepts mentioned in free texts. Knowledge-based distant supervision (DS) uses structured data to heuristically label a training corpus. However, this heuristic can generate some noisy labeled data. In this paper, we propose a method using conflict score in DS to reduce the number of wrong labels for Bangla sentences.","PeriodicalId":445708,"journal":{"name":"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Reducing Wrong Labels using Conflict Score in Distant Supervision for Relation Extraction in Bangla Language\",\"authors\":\"Tanzim Mahfuz, T. Suha, M. Anwar\",\"doi\":\"10.1109/CSDE50874.2020.9411604\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The research area of information extraction (IE) aims to extract structured information such as types of entities and relations between them, from unstructured textual data like newswires, blogs, governmental documents etc. Relation extraction (RE) deals with the automatic detection of relationships between concepts mentioned in free texts. Knowledge-based distant supervision (DS) uses structured data to heuristically label a training corpus. However, this heuristic can generate some noisy labeled data. In this paper, we propose a method using conflict score in DS to reduce the number of wrong labels for Bangla sentences.\",\"PeriodicalId\":445708,\"journal\":{\"name\":\"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSDE50874.2020.9411604\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSDE50874.2020.9411604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reducing Wrong Labels using Conflict Score in Distant Supervision for Relation Extraction in Bangla Language
The research area of information extraction (IE) aims to extract structured information such as types of entities and relations between them, from unstructured textual data like newswires, blogs, governmental documents etc. Relation extraction (RE) deals with the automatic detection of relationships between concepts mentioned in free texts. Knowledge-based distant supervision (DS) uses structured data to heuristically label a training corpus. However, this heuristic can generate some noisy labeled data. In this paper, we propose a method using conflict score in DS to reduce the number of wrong labels for Bangla sentences.