Sara Dakrory, Bahgat Abdelhamid Abdelatif, Mohammed Kayed, A. A. Ali
{"title":"Extracting Geographic Addresses from Social Media using Deep Recurrent Neural Networks","authors":"Sara Dakrory, Bahgat Abdelhamid Abdelatif, Mohammed Kayed, A. A. Ali","doi":"10.1109/JAC-ECC54461.2021.9691442","DOIUrl":null,"url":null,"abstract":"The importance of geographical, addresses in people's daily lives cannot be underestimated. People usually use the Internet to search for unfamiliar areas and then use map services to mark locations. Using social media to extract information, particularly geographical addresses, is rapidly increasing worldwide. Social media represents the right choice as a source in identifying the location that people need to find. In this paper, a deep neural network using a Bidirectional Long Short-Term Memory with CRF (BI-LSTM-CRF) model is applied for address extraction. In addition, a Bidirectional Encoder Representations from Transformers (BERT) model is implemented to extract the geographical addresses from Facebook posts. Further, we reveal how to use the BIEO tagging method to apply the sequence labeling technique to Arabic postal address extraction. An Arabic corpus from social media is annotated to evaluate our proposed model. The results show that Arabic postal addresses can be extracted through BI-LSTM-CRF and BERT models with a high F-measure.","PeriodicalId":354908,"journal":{"name":"2021 9th International Japan-Africa Conference on Electronics, Communications, and Computations (JAC-ECC)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 9th International Japan-Africa Conference on Electronics, Communications, and Computations (JAC-ECC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JAC-ECC54461.2021.9691442","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The importance of geographical, addresses in people's daily lives cannot be underestimated. People usually use the Internet to search for unfamiliar areas and then use map services to mark locations. Using social media to extract information, particularly geographical addresses, is rapidly increasing worldwide. Social media represents the right choice as a source in identifying the location that people need to find. In this paper, a deep neural network using a Bidirectional Long Short-Term Memory with CRF (BI-LSTM-CRF) model is applied for address extraction. In addition, a Bidirectional Encoder Representations from Transformers (BERT) model is implemented to extract the geographical addresses from Facebook posts. Further, we reveal how to use the BIEO tagging method to apply the sequence labeling technique to Arabic postal address extraction. An Arabic corpus from social media is annotated to evaluate our proposed model. The results show that Arabic postal addresses can be extracted through BI-LSTM-CRF and BERT models with a high F-measure.