Annika Tillander , Susanna Lehtinen-Jacks , Nisha Singh , Oskar Halling Ullberg , Ulrika Florin , Katarina Bälter
{"title":"瑞典问卷职业开放式回答中办公室工作人员代理变量分配的数据","authors":"Annika Tillander , Susanna Lehtinen-Jacks , Nisha Singh , Oskar Halling Ullberg , Ulrika Florin , Katarina Bälter","doi":"10.1016/j.dib.2025.112105","DOIUrl":null,"url":null,"abstract":"<div><div>In numerous research disciplines, including epidemiology, it is common to compare different occupational categories, such as office workers and non-office workers. When only self-reported occupation titles are available, it is necessary to categorize individuals based on their self-reported titles. Thus, the possibility to identify office workers via self-reported occupation titles can enhance research on the health and well-being of office workers in large population-based epidemiological studies, even without specific questions about office work.</div><div>This paper introduces data and R code that can be used to assign a proxy variable for office worker based on responses to an open-ended question (OEQ) about occupation in Swedish questionnaires. The proxy variable is based on the Swedish Standard Classification of Occupations 2012 (SSYK 2012), which includes 8946 occupation titles. Using a translation key, the titles have been categorized into three groups: managers, white-collar workers, and blue-collar workers. White-collar workers (including managers) are considered office workers, while blue-collar workers are classified as non-office workers. The proxy variable has been refined using pilot data from the Swedish population-based epidemiological resource LifeGene.</div><div>The R code, together with the proxy variable, can be used in any dataset with a Swedish OEQ about occupation, facilitating the categorization of respondents as either white-collar or blue-collar workers and serving as a proxy variable for office worker. The R code can be used for OEQs regardless of language, provided there is a dataset with a standard classification of occupation in the desired language.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"63 ","pages":"Article 112105"},"PeriodicalIF":1.4000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data for assigning a proxy variable for office worker in open-ended responses on occupation in Swedish questionnaires\",\"authors\":\"Annika Tillander , Susanna Lehtinen-Jacks , Nisha Singh , Oskar Halling Ullberg , Ulrika Florin , Katarina Bälter\",\"doi\":\"10.1016/j.dib.2025.112105\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In numerous research disciplines, including epidemiology, it is common to compare different occupational categories, such as office workers and non-office workers. When only self-reported occupation titles are available, it is necessary to categorize individuals based on their self-reported titles. Thus, the possibility to identify office workers via self-reported occupation titles can enhance research on the health and well-being of office workers in large population-based epidemiological studies, even without specific questions about office work.</div><div>This paper introduces data and R code that can be used to assign a proxy variable for office worker based on responses to an open-ended question (OEQ) about occupation in Swedish questionnaires. The proxy variable is based on the Swedish Standard Classification of Occupations 2012 (SSYK 2012), which includes 8946 occupation titles. Using a translation key, the titles have been categorized into three groups: managers, white-collar workers, and blue-collar workers. White-collar workers (including managers) are considered office workers, while blue-collar workers are classified as non-office workers. The proxy variable has been refined using pilot data from the Swedish population-based epidemiological resource LifeGene.</div><div>The R code, together with the proxy variable, can be used in any dataset with a Swedish OEQ about occupation, facilitating the categorization of respondents as either white-collar or blue-collar workers and serving as a proxy variable for office worker. The R code can be used for OEQs regardless of language, provided there is a dataset with a standard classification of occupation in the desired language.</div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"63 \",\"pages\":\"Article 112105\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340925008273\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925008273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Data for assigning a proxy variable for office worker in open-ended responses on occupation in Swedish questionnaires
In numerous research disciplines, including epidemiology, it is common to compare different occupational categories, such as office workers and non-office workers. When only self-reported occupation titles are available, it is necessary to categorize individuals based on their self-reported titles. Thus, the possibility to identify office workers via self-reported occupation titles can enhance research on the health and well-being of office workers in large population-based epidemiological studies, even without specific questions about office work.
This paper introduces data and R code that can be used to assign a proxy variable for office worker based on responses to an open-ended question (OEQ) about occupation in Swedish questionnaires. The proxy variable is based on the Swedish Standard Classification of Occupations 2012 (SSYK 2012), which includes 8946 occupation titles. Using a translation key, the titles have been categorized into three groups: managers, white-collar workers, and blue-collar workers. White-collar workers (including managers) are considered office workers, while blue-collar workers are classified as non-office workers. The proxy variable has been refined using pilot data from the Swedish population-based epidemiological resource LifeGene.
The R code, together with the proxy variable, can be used in any dataset with a Swedish OEQ about occupation, facilitating the categorization of respondents as either white-collar or blue-collar workers and serving as a proxy variable for office worker. The R code can be used for OEQs regardless of language, provided there is a dataset with a standard classification of occupation in the desired language.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.