{"title":"植食性节肢动物学名自动标注及大语言模型词源趋势分析。","authors":"Kota Nojiri, Keito Inoshita, Haruto Sugeno","doi":"10.2108/zs250025","DOIUrl":null,"url":null,"abstract":"<p><p>Scientific names, especially epithets (specific names in the zoological nomenclature), are derived from various factors, not only species characteristics but also cultural backgrounds, such as the names of people. They reflect how species were perceived at the time. However, several ethical issues have been raised, such as naming species after criminals and gender imbalance in eponyms (epithets named after people). Previous research has been conducted through thorough literature reviews with random sampling, which requires significant time and effort. In this study, the accuracy of the automated labeling using a large language model (LLM) was assessed, and the temporal etymological trends of 2705 species of phytophagous arthropods were investigated. LLM-based classification achieved <i>F1</i> scores above 75% and accuracy above 90% in <i>Morphology</i>, <i>Host</i>, <i>Geography</i>, and <i>People</i>. However, <i>Ecology & Behavior</i> and <i>Other</i> exhibited accuracy issues. Analyses using the generalized additive model (GAM) revealed shifting naming trends, with a decrease in <i>Morphology</i> and an increase in <i>Geography</i> and <i>People</i>, consistent with previous research on spiders. This study demonstrates the effectiveness of LLM-based classification for epithets and provides a new perspective on the social and scientific debates surrounding scientific names based on etymological trends.</p>","PeriodicalId":24040,"journal":{"name":"Zoological Science","volume":"42 5","pages":"492-497"},"PeriodicalIF":1.0000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated Labeling of Scientific Names and Etymological Trend Analysis in Phytophagous Arthropods Using Large Language Model.\",\"authors\":\"Kota Nojiri, Keito Inoshita, Haruto Sugeno\",\"doi\":\"10.2108/zs250025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Scientific names, especially epithets (specific names in the zoological nomenclature), are derived from various factors, not only species characteristics but also cultural backgrounds, such as the names of people. They reflect how species were perceived at the time. However, several ethical issues have been raised, such as naming species after criminals and gender imbalance in eponyms (epithets named after people). Previous research has been conducted through thorough literature reviews with random sampling, which requires significant time and effort. In this study, the accuracy of the automated labeling using a large language model (LLM) was assessed, and the temporal etymological trends of 2705 species of phytophagous arthropods were investigated. LLM-based classification achieved <i>F1</i> scores above 75% and accuracy above 90% in <i>Morphology</i>, <i>Host</i>, <i>Geography</i>, and <i>People</i>. However, <i>Ecology & Behavior</i> and <i>Other</i> exhibited accuracy issues. Analyses using the generalized additive model (GAM) revealed shifting naming trends, with a decrease in <i>Morphology</i> and an increase in <i>Geography</i> and <i>People</i>, consistent with previous research on spiders. This study demonstrates the effectiveness of LLM-based classification for epithets and provides a new perspective on the social and scientific debates surrounding scientific names based on etymological trends.</p>\",\"PeriodicalId\":24040,\"journal\":{\"name\":\"Zoological Science\",\"volume\":\"42 5\",\"pages\":\"492-497\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Zoological Science\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.2108/zs250025\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ZOOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Zoological Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2108/zs250025","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ZOOLOGY","Score":null,"Total":0}
Automated Labeling of Scientific Names and Etymological Trend Analysis in Phytophagous Arthropods Using Large Language Model.
Scientific names, especially epithets (specific names in the zoological nomenclature), are derived from various factors, not only species characteristics but also cultural backgrounds, such as the names of people. They reflect how species were perceived at the time. However, several ethical issues have been raised, such as naming species after criminals and gender imbalance in eponyms (epithets named after people). Previous research has been conducted through thorough literature reviews with random sampling, which requires significant time and effort. In this study, the accuracy of the automated labeling using a large language model (LLM) was assessed, and the temporal etymological trends of 2705 species of phytophagous arthropods were investigated. LLM-based classification achieved F1 scores above 75% and accuracy above 90% in Morphology, Host, Geography, and People. However, Ecology & Behavior and Other exhibited accuracy issues. Analyses using the generalized additive model (GAM) revealed shifting naming trends, with a decrease in Morphology and an increase in Geography and People, consistent with previous research on spiders. This study demonstrates the effectiveness of LLM-based classification for epithets and provides a new perspective on the social and scientific debates surrounding scientific names based on etymological trends.
期刊介绍:
Zoological Science is published by the Zoological Society of Japan and devoted to publication of original articles, reviews and editorials that cover the broad field of zoology. The journal was founded in 1984 as a result of the consolidation of Zoological Magazine (1888–1983) and Annotationes Zoologicae Japonenses (1897–1983), the former official journals of the Zoological Society of Japan. Each annual volume consists of six regular issues, one every two months.