Jose Carlos Morin Garcia, Juan Armando Barron Lugo, Jose Luis Gonzalez Compean, Ivan Lopez Arevalo, J. Carretero, Martha Cordero Oropeza
{"title":"由医疗保健数据科学服务的时空变量定义的数据和任务编排","authors":"Jose Carlos Morin Garcia, Juan Armando Barron Lugo, Jose Luis Gonzalez Compean, Ivan Lopez Arevalo, J. Carretero, Martha Cordero Oropeza","doi":"10.1145/3569192.3569208","DOIUrl":null,"url":null,"abstract":"Data science services have become a solution for healthcare organizations to take advantage of the large volumes of data (e.g., data lakes and data warehouses) produced during the interaction of healthcare staff with patients and government agencies. However, the data orchestration for these services is not trivial when dealing with multiple data sources where decision-making processes should combine them to create a single solid information piece (big picture) for making inferences or predictions. In this paper, we present a data and task orchestration method for supporting healthcare data science services. This method considers stages such as data fusion/integration for enabling the crossing of information, computing splits for producing, on-the-fly and on-demand, data subsets by using spatio-temporal variables, converting splited data into information, consolidation of information into segments to create a big picture of data and, in the last stage, makes available data segments for consumption on decision-making processes by using spatio-temporal queries. A case study based on the fusion of healthcare data sources about psychiatric, drug consumption, and macro-economics was conducted by using a prototype of the data orchestration proposed in this paper. The evaluation revealed the flexibility of this data orchestration approach to convert multiple data sources into useful information for healthcare decision-making processes.","PeriodicalId":249004,"journal":{"name":"Proceedings of the 9th International Conference on Bioinformatics Research and Applications","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data and task orchestration defined by spatio-temporal variables for healthcare data science services\",\"authors\":\"Jose Carlos Morin Garcia, Juan Armando Barron Lugo, Jose Luis Gonzalez Compean, Ivan Lopez Arevalo, J. Carretero, Martha Cordero Oropeza\",\"doi\":\"10.1145/3569192.3569208\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data science services have become a solution for healthcare organizations to take advantage of the large volumes of data (e.g., data lakes and data warehouses) produced during the interaction of healthcare staff with patients and government agencies. However, the data orchestration for these services is not trivial when dealing with multiple data sources where decision-making processes should combine them to create a single solid information piece (big picture) for making inferences or predictions. In this paper, we present a data and task orchestration method for supporting healthcare data science services. This method considers stages such as data fusion/integration for enabling the crossing of information, computing splits for producing, on-the-fly and on-demand, data subsets by using spatio-temporal variables, converting splited data into information, consolidation of information into segments to create a big picture of data and, in the last stage, makes available data segments for consumption on decision-making processes by using spatio-temporal queries. A case study based on the fusion of healthcare data sources about psychiatric, drug consumption, and macro-economics was conducted by using a prototype of the data orchestration proposed in this paper. The evaluation revealed the flexibility of this data orchestration approach to convert multiple data sources into useful information for healthcare decision-making processes.\",\"PeriodicalId\":249004,\"journal\":{\"name\":\"Proceedings of the 9th International Conference on Bioinformatics Research and Applications\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 9th International Conference on Bioinformatics Research and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3569192.3569208\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th International Conference on Bioinformatics Research and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3569192.3569208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data and task orchestration defined by spatio-temporal variables for healthcare data science services
Data science services have become a solution for healthcare organizations to take advantage of the large volumes of data (e.g., data lakes and data warehouses) produced during the interaction of healthcare staff with patients and government agencies. However, the data orchestration for these services is not trivial when dealing with multiple data sources where decision-making processes should combine them to create a single solid information piece (big picture) for making inferences or predictions. In this paper, we present a data and task orchestration method for supporting healthcare data science services. This method considers stages such as data fusion/integration for enabling the crossing of information, computing splits for producing, on-the-fly and on-demand, data subsets by using spatio-temporal variables, converting splited data into information, consolidation of information into segments to create a big picture of data and, in the last stage, makes available data segments for consumption on decision-making processes by using spatio-temporal queries. A case study based on the fusion of healthcare data sources about psychiatric, drug consumption, and macro-economics was conducted by using a prototype of the data orchestration proposed in this paper. The evaluation revealed the flexibility of this data orchestration approach to convert multiple data sources into useful information for healthcare decision-making processes.