Recognizing logical parts in Vietnamese legal texts using Conditional Random Fields
N. T. Son, Nguyễn Thụy Phương Duyên, H. Quoc, Le-Minh Nguyen
The 2015 IEEE RIVF International Conference on Computing & Communication Technologies - Research, Innovation, and Vision for Future (RIVF), 2015-02-26
DOI: https://doi.org/10.1109/RIVF.2015.7049865
Citations: 3
Abstract
Analyzing the structure of sentences in legal documents is an important step in building a knowledge management system in Legal Engineering. This paper proposes a new approach to recognizing logical parts in Vietnamese legal documents based on a statistical machine learning method, Conditional Random Fields (CRFs). Besides linguistic features such as word features and part-of-speech features, we use semantic features of logical parts, namely trigger features and ontology features, to improve the output of the annotation system. Experiments on a Vietnamese Business Law data set achieved 78.12% precision and 68.72% recall. Compared with state-of-the-art systems, the approach improves recognition of some logical parts.
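The four feature families named in the abstract (word, part of speech, trigger, ontology) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration: the trigger list, the ontology mapping, the POS tags, the feature names, and the one-token context window are hypothetical and are not taken from the paper, which does not state its exact feature templates or toolkit in this abstract.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical resources: the paper's actual trigger list and ontology
# are not given in the abstract.
TRIGGERS: Set[str] = {"nếu", "thì", "phải"}
ONTOLOGY: Dict[str, str] = {"doanh_nghiệp": "Organization"}

Token = Tuple[str, str]  # (word, part-of-speech tag)


def token_features(sent: List[Token], i: int) -> Dict[str, object]:
    """Build a feature dict for the i-th token of a POS-tagged sentence."""
    word, pos = sent[i]
    lower = word.lower()
    feats: Dict[str, object] = {
        "word": lower,                          # word feature
        "pos": pos,                             # part-of-speech feature
        "is_trigger": lower in TRIGGERS,        # trigger feature
        "onto": ONTOLOGY.get(lower, "NONE"),    # ontology feature
    }
    # One-token context window (an assumed design choice).
    if i > 0:
        feats["prev_word"] = sent[i - 1][0].lower()
        feats["prev_pos"] = sent[i - 1][1]
    else:
        feats["BOS"] = True  # beginning of sentence
    if i < len(sent) - 1:
        feats["next_word"] = sent[i + 1][0].lower()
        feats["next_pos"] = sent[i + 1][1]
    else:
        feats["EOS"] = True  # end of sentence
    return feats


def sentence_features(sent: List[Token]) -> List[Dict[str, object]]:
    """Feature dicts for every token in the sentence, in order."""
    return [token_features(sent, i) for i in range(len(sent))]
```

A list of such per-token dicts, paired with one logical-part label per token, is the kind of input that linear-chain CRF toolkits such as sklearn-crfsuite accept for training; whether the authors used a comparable representation is not stated here.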