{"title":"The performance of large language models on quantitative and verbal ability tests: Initial evidence and implications for unproctored high-stakes testing","authors":"Louis Hickman, Patrick D. Dunlop, Jasper Leo Wolf","doi":"10.1111/ijsa.12479","DOIUrl":"10.1111/ijsa.12479","url":null,"abstract":"<p>Unproctored assessments are widely used in pre-employment assessment. However, widely accessible large language models (LLMs) pose challenges for unproctored personnel assessments, given that applicants may use them to artificially inflate their scores beyond their true abilities. This may be particularly concerning in cognitive ability tests, which are widely used and traditionally considered to be less fakeable by humans than personality tests. Thus, this study compares the performance of LLMs on two common types of cognitive tests: quantitative ability (number series completion) and verbal ability (use a passage of text to determine whether a statement is true). The tests investigated are used in real-world, high-stakes selection. We also examine the performance of the LLMs across different test formats (i.e., open-ended vs. multiple choice). Further, we contrast the performance of two LLMs (Generative Pretrained Transformers, GPT-3.5 and GPT-4) across multiple prompt approaches and “temperature” settings (i.e., a parameter that determines the amount of randomness in the model's output). We found that the LLMs performed well on the verbal ability test but extremely poorly on the quantitative ability test, even when accounting for the test format. GPT-4 outperformed GPT-3.5 across both types of tests. Notably, although prompt approaches and temperature settings did affect LLM test performance, those effects were mostly minor relative to differences across tests and language models. We provide recommendations for securing pre-employment testing against LLM influences. Additionally, we call for rigorous research investigating the prevalence of LLM usage in pre-employment testing as well as on how LLM usage affects selection test validity.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 4","pages":"499-511"},"PeriodicalIF":2.6,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ijsa.12479","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140964082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Examining the efficacy of inoculation and value-affirmation interventions in improving precandidate reactions among prospective military recruits","authors":"Justin R. Feeney, Ben Sylvester, Steve Gooch","doi":"10.1111/ijsa.12475","DOIUrl":"10.1111/ijsa.12475","url":null,"abstract":"<p>This study engaged 4848 first-time, English-speaking prospective Canadian Armed Forces applicants to evaluate pre-application interventions' efficacy on the Practice Canadian Forces Aptitude Test (PCFAT). Using a five-level between-subjects design, participants were randomly assigned to one of the following intervention conditions: inoculation message, value-affirmation message, a combination of both, placebo writing intervention, or a no-intervention control group. The interventions were anchored in inoculation theory and value-affirmation theory and aimed to reduce math anxiety and close the gender gap in test performance. Contrary to expectations, the interventions did not significantly reduce math anxiety or improve problem-solving performance. Consistent with the literature, a negative relationship was found between levels of math anxiety and problem-solving scores, and men outscored women in problem-solving across all conditions. Despite these outcomes, the study lays a foundation for future research on enhancing pre-applicant experiences in an increasingly competitive labor market. Implications and future directions are discussed.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 4","pages":"491-498"},"PeriodicalIF":2.6,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ijsa.12475","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140837435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sumayya Saleem, Linda White, Michal Perlman, Elizabeth Dhuey
{"title":"Promoting equity in hiring: An evaluation of the HireNext Job Posting Assessment","authors":"Sumayya Saleem, Linda White, Michal Perlman, Elizabeth Dhuey","doi":"10.1111/ijsa.12477","DOIUrl":"10.1111/ijsa.12477","url":null,"abstract":"<p>The language used in job postings can deter applicants and contribute to the employment gap, which refers to high rates of youth unemployment occurring simultaneously with high levels of job vacancies. We tested youth preferences for job postings modified using a free online tool that uses natural language processing to make them more appealing to young and diverse candidates. Using data from 1050 respondents aged 18–35 with education below a postsecondary degree, we found a consistent and statistically significant preference for modified postings, irrespective of the extent or types of changes made. More traditionally disadvantaged respondents (i.e., with lower education, lower incomes, disabilities, women, and unemployed youth) displayed a stronger preference for modified postings. These findings suggest that this tool can help employers recruit disadvantaged youth and bridge the employment gap.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 4","pages":"479-490"},"PeriodicalIF":2.6,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ijsa.12477","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140837267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Investigating impression management use in asynchronous video interviews across 10 countries","authors":"René Arseneault, Nicolas Roulin","doi":"10.1111/ijsa.12476","DOIUrl":"https://doi.org/10.1111/ijsa.12476","url":null,"abstract":"<p>This cross-cultural study investigates how interviewees from 10 culturally-distinct countries differ in their use of impression management (IM) tactics in asynchronous video interviews (AVIs), and the relationship(s) between those tactics and interview performance. A total of 582 participants from ten countries (India, Canada, South Africa, Poland, Spain, Iran, Germany, Chile, Philippines, China) completed an 8-question AVI for a mock position as a manager in a bank. We drew upon GLOBE's cultural framework to predict and explain observed differences in self-reported IM use and performance. We used multi-level modeling to test our hypotheses. Interviewees from our ten countries differed slightly in their IM use for various tactics, but IM use was seldom related to GLOBE cultural dimensions. Partially consistent with previous in-person interview research, honest IM tactics (e.g., self promotion) were positively, but deceptive tactics (e.g., extensive image creation) negatively, associated with interview performance. This research is the first to investigate cross-cultural IM differences in AVIs, thus addressing a critical gap in the selection literature at a time when many organizations conduct interviews virtually to save costs, streamline the hiring process, or simply conduct most of their activities remotely.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 3","pages":"461-477"},"PeriodicalIF":2.6,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ijsa.12476","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141967618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Justin R. Feeney, Kabir N. Daljeet, Richard D. Goffin, Travis J. Schneider
{"title":"Rating accuracy, leniency, and rater perceptions when using the RPM and BARS","authors":"Justin R. Feeney, Kabir N. Daljeet, Richard D. Goffin, Travis J. Schneider","doi":"10.1111/ijsa.12474","DOIUrl":"10.1111/ijsa.12474","url":null,"abstract":"<p>Researchers have argued that social-comparative rating formats hold important psychometric advantages over traditional absolute ratings. We asked 152 participants to observe and assess the videotaped performance of individuals completing a task using a social-comparative (Relative Percentile Method; RPM) and absolute rating (Behaviorally Anchored Rating Scale; BARS) formats. After collecting expert ratings on the same set of videos, we calculated accuracy indices and leniency. Additionally, we collected rater perceptions of accuracy and fairness for both formats. Our study revealed that the BARS was perceived as more accurate and fairer than the RPM. However, the RPM was found to be better at combating rater leniency. We discuss the implications of these findings.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 3","pages":"451-460"},"PeriodicalIF":2.6,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ijsa.12474","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140677838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"How do avatar characteristics affect applicants' interactional justice perceptions in artificial intelligence-based job interviews?","authors":"Qingfei Min, Haoye Sun, Xiaodi Wang, Crystal Zhang","doi":"10.1111/ijsa.12472","DOIUrl":"10.1111/ijsa.12472","url":null,"abstract":"<p>Artificial intelligence (AI)-based job interviews are increasingly adopted in organizations' recruitment activities. Despite their standardization and flexibility, concerns about fairness for applicants remain a critical challenge. Taking a perspective on interface design, this research examines the role of avatar characteristics in shaping perceptions of interactional justice in AI-based job interviews. Through a scenario-based study involving 465 participants, the impact of avatar characteristics—specifically, appearance, linguistic style, and feedback informativeness—on applicants' perceptions of interpersonal justice and informational justice was investigated. The findings indicate that avatars characterized by a warm and cheerful appearance, coupled with an affective expression style and informative feedback, significantly enhance perceptions of interpersonal justice and informational justice. These insights offer valuable practical guidance for avatar design in AI-based job interview systems.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 3","pages":"442-450"},"PeriodicalIF":2.6,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140602592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Meagan E. Brock Baskin, James M. Vardaman, Matthew R. Leon
{"title":"Like clockwork: A within-person analysis of time theft in the workplace","authors":"Meagan E. Brock Baskin, James M. Vardaman, Matthew R. Leon","doi":"10.1111/ijsa.12471","DOIUrl":"10.1111/ijsa.12471","url":null,"abstract":"<p>Employee time theft is a silent and costly behavior that is difficult to manage in organizations. Time theft is less understood than other forms of deviance as it is engaged in more frequently and considered to be less harmful by those who engage in it. This study tests a within-person model of time theft in which negative emotion mediates the positive relationship between job stressors (task ambiguity, task conflict) and time theft. Results show support for the mediating role of emotions in the positive stressor−time theft relationship at the within-person level. Results also show that high levels of between-person emotional stability weakened the impact of negative emotions on time theft. These results indicate that job related stressors create an environment in which time theft may become a frequent behavior. In discussing these results, we provide recommendations for employers to combat this phenomenon.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 3","pages":"429-441"},"PeriodicalIF":2.6,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140576623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nelli Niemitz, Lucas Rietheimer, Cornelius König, Markus Langer
{"title":"The (null) effects of video questions on applicant reactions in asynchronous video interviews: Evidence from an actual hiring context","authors":"Nelli Niemitz, Lucas Rietheimer, Cornelius König, Markus Langer","doi":"10.1111/ijsa.12470","DOIUrl":"10.1111/ijsa.12470","url":null,"abstract":"<p>Asynchronous video interviews (AVIs) are growing in popularity, but tend to suffer from negative applicant reactions, possibly due to lower social presence compared to other interview formats. Research has suggested that specific design features may influence applicant reactions by increasing perceived social presence. In this study, we manipulated the question format (video vs. text) during an actual hiring process (<i>N</i> = 76), testing whether video questions influence social presence, applicant reactions, impression management, and interview performance. There was no evidence that video (vs. text) questions affected any of these variables. We discuss how specific AVI design choices may have affected our results and suggest that future research could investigate the additive and interactive effects of different AVI design features.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 3","pages":"421-428"},"PeriodicalIF":2.6,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ijsa.12470","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140576512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Timothy G. Wingate, Sabah Rasheed, Stephen D. Risavy, Chet Robie
{"title":"How does bias enter the employment interview? Identifying the riskiest applicant characteristics, interviewer characteristics, and sources of potentially biasing information","authors":"Timothy G. Wingate, Sabah Rasheed, Stephen D. Risavy, Chet Robie","doi":"10.1111/ijsa.12467","DOIUrl":"10.1111/ijsa.12467","url":null,"abstract":"<p>The current study examines the riskiest forms and sources of potential bias in the employment interview. A mixed methods survey focused on interviewers' attention to various potentially biasing applicant characteristics, how interviewers learn about such characteristics, the traits of interviewers who are sensitive to such characteristics, and how knowledge of such characteristics affects interviewers' opinions of applicants. Participants were 680 professional interviewers from the U.S., U.K., and Canada. High risks of bias were associated with six applicant characteristics (as targets of bias), three interviewer characteristics (as predisposing toward bias), and three sources of information in the interview process. Interviewers commonly rationalized their judgments in job-relevant terms. These results have implications for understanding and limiting the risk of common selection biases.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 3","pages":"399-420"},"PeriodicalIF":2.6,"publicationDate":"2024-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ijsa.12467","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140373415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Playful personnel selection: The use of traditional versus game-related personnel selection methods and their perception from the recruiters’ and applicants’ perspectives","authors":"Marie L. Ohlms, Klaus G. Melchers, Uwe P. Kanning","doi":"10.1111/ijsa.12466","DOIUrl":"10.1111/ijsa.12466","url":null,"abstract":"<p>Game-related assessments (GRAs) are increasingly gaining interest in practice and research. However, little is known about how recruiters view these methods. This study examined recruiters’ (<i>N</i> = 158) reasons for using GRAs, the influence of their characteristics on their use of these methods, and recruiters’ perceptions of GRAs compared to those of potential applicants (<i>N</i> = 144). Results indicated that perceived ease of use, perceived usefulness, innovativeness in IT, and video game pursuit partly explained recruiters’ intentions to use GRAs, which then predicted actual usage. The expected positive impact on the organizational image and applicant reactions were among the most important reasons for using GRAs. Recruiters and potential applicants perceived GRAs as more enjoyable and modern than traditional assessment methods.</p>","PeriodicalId":51465,"journal":{"name":"International Journal of Selection and Assessment","volume":"32 3","pages":"381-398"},"PeriodicalIF":2.6,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/ijsa.12466","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140375924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}