ESCOX: A tool for skill and occupation extraction using LLMs from unstructured text
Authors: Dimitrios Christos Kavargyris, Konstantinos Georgiou, Eleanna Papaioannou, Konstantinos Petrakis, Nikolaos Mittas, Lefteris Angelis
Journal: Software Impacts, volume 25, Article 100772 (JCR category: Computer Science, Software Engineering; Q3; impact factor 1.3)
Published: 2025-06-10
DOI: 10.1016/j.simpa.2025.100772
URL: https://www.sciencedirect.com/science/article/pii/S2665963825000326
Abstract
ESCOX, also known as ESCOSkillExtractor, is an open-source, non-proprietary tool for identifying and classifying skills, skillsets, and occupations from job postings and general text. It utilizes the European Skills, Competences, Qualifications and Occupations (ESCO) taxonomy to structure extraction, addressing the need for taxonomy-aligned skill identification in unstructured labor market data. Developed within the SKILLAB EU Horizon project, ESCOX combines LLMs and text embeddings to map content to standardized categories. It offers a user-friendly graphical interface for researchers, educators, and HR professionals, supporting skills gap analysis, training, recruitment, and policy planning, and contributing to the development of a skills-based economy.
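As a rough illustration of the embedding-based mapping the abstract describes, the sketch below scores a piece of text against a small list of category labels using cosine similarity and keeps the labels above a threshold. It is not ESCOX's code or API: the labels are hand-written placeholders rather than entries loaded from ESCO, the names `embed`, `cosine`, `map_to_labels` and the `threshold` value are assumptions made for this example, a toy bag-of-words vector stands in for a real text-embedding model, and the LLM step is omitted entirely.

```python
import math
import re
from collections import Counter

# Hypothetical labels for illustration only; not loaded from the ESCO taxonomy.
TAXONOMY_LABELS = [
    "programming in Python",
    "database management",
    "project management",
]


def embed(text: str) -> Counter:
    """Toy 'embedding': lowercase word counts, standing in for a real embedding model."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def map_to_labels(text: str, labels=TAXONOMY_LABELS, threshold: float = 0.2):
    """Return (label, score) pairs scoring at or above the threshold, best match first."""
    query = embed(text)
    scored = [(label, cosine(query, embed(label))) for label in labels]
    kept = [pair for pair in scored if pair[1] >= threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)


matches = map_to_labels("Experience with Python programming and database design")
for label, score in matches:
    print(f"{label}: {score:.2f}")
```

A dense embedding model would replace `embed` in practice; the thresholded nearest-label lookup is the only part this sketch is meant to show.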