N. M. Hernandez, P. Lucañas, J. C. Graciosa, C. Mamador, L. Caezar, I. Panganiban, Cong Yu, K. Maver, M. Maver
{"title":"在混合云容器中使用智能机器学习算法的非结构化井数据自动信息检索平台","authors":"N. M. Hernandez, P. Lucañas, J. C. Graciosa, C. Mamador, L. Caezar, I. Panganiban, Cong Yu, K. Maver, M. Maver","doi":"10.3997/2214-4609.201803031","DOIUrl":null,"url":null,"abstract":"There is a large amount of historic and valuable well information available stored either on paper and more recently as digital documents and reports in the oil and gas industry especially by national data management systems and oil companies. These technical documents contain valuable information from disciplines like geoscience and engineering and are in general stored in a unstructured format. To extract and utilize all this well data, a machine learning-enabled platform, consisting of a carefully selected sequence of algorithms, has been developed as a hybrid cloud container that automatically reads and understands the technical documents with little human supervision. The user can upload raw data to the platform, which are stored on a private local server. The machine learning algorithms are activated and implement the necessary processing and workflows. Structured data is generated as output, which are pushed through to a search engine that is accessible to the user in the cloud. The aim of the platform is to ease the identification of important parts of the technical documents, automatically extract relevant information and visualize it for the user, so they can easily do further analysis, share it with colleagues or agnostically port it to other platforms as input.","PeriodicalId":231338,"journal":{"name":"First EAGE/PESGB Workshop Machine Learning","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"An Automated Information Retrieval Platform For Unstructured Well Data Utilizing Smart Machine Learning Algorithms Within A Hybrid Cloud Container\",\"authors\":\"N. M. Hernandez, P. Lucañas, J. C. Graciosa, C. Mamador, L. Caezar, I. Panganiban, Cong Yu, K. Maver, M. Maver\",\"doi\":\"10.3997/2214-4609.201803031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There is a large amount of historic and valuable well information available stored either on paper and more recently as digital documents and reports in the oil and gas industry especially by national data management systems and oil companies. These technical documents contain valuable information from disciplines like geoscience and engineering and are in general stored in a unstructured format. To extract and utilize all this well data, a machine learning-enabled platform, consisting of a carefully selected sequence of algorithms, has been developed as a hybrid cloud container that automatically reads and understands the technical documents with little human supervision. The user can upload raw data to the platform, which are stored on a private local server. The machine learning algorithms are activated and implement the necessary processing and workflows. Structured data is generated as output, which are pushed through to a search engine that is accessible to the user in the cloud. The aim of the platform is to ease the identification of important parts of the technical documents, automatically extract relevant information and visualize it for the user, so they can easily do further analysis, share it with colleagues or agnostically port it to other platforms as input.\",\"PeriodicalId\":231338,\"journal\":{\"name\":\"First EAGE/PESGB Workshop Machine Learning\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"First EAGE/PESGB Workshop Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3997/2214-4609.201803031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"First EAGE/PESGB Workshop Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3997/2214-4609.201803031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Automated Information Retrieval Platform For Unstructured Well Data Utilizing Smart Machine Learning Algorithms Within A Hybrid Cloud Container
There is a large amount of historic and valuable well information available stored either on paper and more recently as digital documents and reports in the oil and gas industry especially by national data management systems and oil companies. These technical documents contain valuable information from disciplines like geoscience and engineering and are in general stored in a unstructured format. To extract and utilize all this well data, a machine learning-enabled platform, consisting of a carefully selected sequence of algorithms, has been developed as a hybrid cloud container that automatically reads and understands the technical documents with little human supervision. The user can upload raw data to the platform, which are stored on a private local server. The machine learning algorithms are activated and implement the necessary processing and workflows. Structured data is generated as output, which are pushed through to a search engine that is accessible to the user in the cloud. The aim of the platform is to ease the identification of important parts of the technical documents, automatically extract relevant information and visualize it for the user, so they can easily do further analysis, share it with colleagues or agnostically port it to other platforms as input.