{"title":"基于语料库的海洋本体信息提取方法","authors":"Svetlana Strinyuk, Irina Scherbakova, V. Lanin","doi":"10.1109/AICT52784.2021.9620410","DOIUrl":null,"url":null,"abstract":"Extracting information from texts and representing it as a formal knowledge system has become important due to increasing volume of freely available texts, which makes difficult finding relevant information and separating meaningful data from insignificant quickly. These tasks are especially important for industries involving international collaboration such as shipping industry. Shipping industry being a lifeblood of the world economy provides 90% of goods delivery. The industry is regulated by International Maritime organization (IMO) Conventions which are due to regular revising and editing according to changing conditions of trade and multiple parties involved. This pilot research provides a corpus based approach to information extracting and building IMO Conventions Ontology. A Corpus of IMO Convention texts are processed with using semantic approach to extract definitions. Based on core shipping industry definitions extracted from IMO Conventions the Ontology will be build. Developed ontology can be used for intelligent processing of documents and teaching purposes.","PeriodicalId":150606,"journal":{"name":"2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Corpus Based Information Extraction Approach for Marine Ontology Development\",\"authors\":\"Svetlana Strinyuk, Irina Scherbakova, V. Lanin\",\"doi\":\"10.1109/AICT52784.2021.9620410\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Extracting information from texts and representing it as a formal knowledge system has become important due to increasing volume of freely available texts, which makes difficult finding relevant information and separating meaningful data from insignificant quickly. These tasks are especially important for industries involving international collaboration such as shipping industry. Shipping industry being a lifeblood of the world economy provides 90% of goods delivery. The industry is regulated by International Maritime organization (IMO) Conventions which are due to regular revising and editing according to changing conditions of trade and multiple parties involved. This pilot research provides a corpus based approach to information extracting and building IMO Conventions Ontology. A Corpus of IMO Convention texts are processed with using semantic approach to extract definitions. Based on core shipping industry definitions extracted from IMO Conventions the Ontology will be build. Developed ontology can be used for intelligent processing of documents and teaching purposes.\",\"PeriodicalId\":150606,\"journal\":{\"name\":\"2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICT52784.2021.9620410\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 15th International Conference on Application of Information and Communication Technologies (AICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICT52784.2021.9620410","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Corpus Based Information Extraction Approach for Marine Ontology Development
Extracting information from texts and representing it as a formal knowledge system has become important due to increasing volume of freely available texts, which makes difficult finding relevant information and separating meaningful data from insignificant quickly. These tasks are especially important for industries involving international collaboration such as shipping industry. Shipping industry being a lifeblood of the world economy provides 90% of goods delivery. The industry is regulated by International Maritime organization (IMO) Conventions which are due to regular revising and editing according to changing conditions of trade and multiple parties involved. This pilot research provides a corpus based approach to information extracting and building IMO Conventions Ontology. A Corpus of IMO Convention texts are processed with using semantic approach to extract definitions. Based on core shipping industry definitions extracted from IMO Conventions the Ontology will be build. Developed ontology can be used for intelligent processing of documents and teaching purposes.