Johannes Niu, Mila Stillman, Philipp Seeberger, Anna Kruspe
{"title":"A dataset of Open Source Intelligence (OSINT) Tweets about the Russo-Ukrainian war","authors":"Johannes Niu, Mila Stillman, Philipp Seeberger, Anna Kruspe","doi":"arxiv-2409.01052","DOIUrl":null,"url":null,"abstract":"Open Source Intelligence (OSINT) refers to intelligence efforts based on\nfreely available data. It has become a frequent topic of conversation on social\nmedia, where private users or networks can share their findings. Such data is\nhighly valuable in conflicts, both for gaining a new understanding of the\nsituation as well as for tracking the spread of misinformation. In this paper,\nwe present a method for collecting such data as well as a novel OSINT dataset\nfor the Russo-Ukrainian war drawn from Twitter between January 2022 and July\n2023. It is based on an initial search of users posting OSINT and a subsequent\nsnowballing approach to detect more. The final dataset contains almost 2\nmillion Tweets posted by 1040 users. We also provide some first analyses and\nexperiments on the data, and make suggestions for its future usage.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Social and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Open Source Intelligence (OSINT) refers to intelligence efforts based on
freely available data. It has become a frequent topic of conversation on social
media, where private users or networks can share their findings. Such data is
highly valuable in conflicts, both for gaining a new understanding of the
situation as well as for tracking the spread of misinformation. In this paper,
we present a method for collecting such data as well as a novel OSINT dataset
for the Russo-Ukrainian war drawn from Twitter between January 2022 and July
2023. It is based on an initial search of users posting OSINT and a subsequent
snowballing approach to detect more. The final dataset contains almost 2
million Tweets posted by 1040 users. We also provide some first analyses and
experiments on the data, and make suggestions for its future usage.