{"title":"数据科学在重建2013-2021年罗萨里奥岛(哥伦比亚-加勒比地区)气象变量时间序列中的应用","authors":"Camilo Contreras Vargas, Julián Quintero Ibáñez, Ángela Solanilla","doi":"10.26640/22159045.2022.604","DOIUrl":null,"url":null,"abstract":"This study reviews two time series of meteorological variables measured by an automatic station located in Islas del Rosario (Colombian Caribbean), belonging to the Network for Measurement of Oceanographic Parameters and Marine Meteorology (RedMpomm) of the General Maritime Directorate (Dimar). The time series correspond to data of air temperature and wind magnitude in the period 2013-2021, which present some missing values. The objective of the study was to develop a model that would automatically reconstruct missing values in the time series, using the advantages of data science to complete information with estimated values. The importance of obtaining reconstructed series lies in having more solid databases to be used in the research and academic work carried out by Dimar. The methodology developed consisted of the use of imputation of medians from existing data on dates and times associated with missing values, all this through the use of data lags and complementary information such as periodicity relationships on the data set. The results showed that it was possible to implement a reliable methodology capable of estimating the most appropriate value to complete the different time series, which constitutes a first approximation for reconstruction of meteorological data.","PeriodicalId":33310,"journal":{"name":"Boletin Cientifico CIOH","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of Data Science for the reconstruction of time series of meteorological variables in the Islas del Rosario (Colombian Caribbean), between the years 2013-2021\",\"authors\":\"Camilo Contreras Vargas, Julián Quintero Ibáñez, Ángela Solanilla\",\"doi\":\"10.26640/22159045.2022.604\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study reviews two time series of meteorological variables measured by an automatic station located in Islas del Rosario (Colombian Caribbean), belonging to the Network for Measurement of Oceanographic Parameters and Marine Meteorology (RedMpomm) of the General Maritime Directorate (Dimar). The time series correspond to data of air temperature and wind magnitude in the period 2013-2021, which present some missing values. The objective of the study was to develop a model that would automatically reconstruct missing values in the time series, using the advantages of data science to complete information with estimated values. The importance of obtaining reconstructed series lies in having more solid databases to be used in the research and academic work carried out by Dimar. The methodology developed consisted of the use of imputation of medians from existing data on dates and times associated with missing values, all this through the use of data lags and complementary information such as periodicity relationships on the data set. The results showed that it was possible to implement a reliable methodology capable of estimating the most appropriate value to complete the different time series, which constitutes a first approximation for reconstruction of meteorological data.\",\"PeriodicalId\":33310,\"journal\":{\"name\":\"Boletin Cientifico CIOH\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Boletin Cientifico CIOH\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26640/22159045.2022.604\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Boletin Cientifico CIOH","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26640/22159045.2022.604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of Data Science for the reconstruction of time series of meteorological variables in the Islas del Rosario (Colombian Caribbean), between the years 2013-2021
This study reviews two time series of meteorological variables measured by an automatic station located in Islas del Rosario (Colombian Caribbean), belonging to the Network for Measurement of Oceanographic Parameters and Marine Meteorology (RedMpomm) of the General Maritime Directorate (Dimar). The time series correspond to data of air temperature and wind magnitude in the period 2013-2021, which present some missing values. The objective of the study was to develop a model that would automatically reconstruct missing values in the time series, using the advantages of data science to complete information with estimated values. The importance of obtaining reconstructed series lies in having more solid databases to be used in the research and academic work carried out by Dimar. The methodology developed consisted of the use of imputation of medians from existing data on dates and times associated with missing values, all this through the use of data lags and complementary information such as periodicity relationships on the data set. The results showed that it was possible to implement a reliable methodology capable of estimating the most appropriate value to complete the different time series, which constitutes a first approximation for reconstruction of meteorological data.