{"title":"大型语言模型的伦理评价及其优化","authors":"Yujing Lyu, Yanyong Du","doi":"10.1007/s43681-024-00654-9","DOIUrl":null,"url":null,"abstract":"<div><p>The utilization of large language models (LLMs)has experienced tremendous growth in the past few years, bringing numerous benefits and conveniences. Yet, this expansion has also underscored ethical concerns, including issues such as hallucinations, toxic content, biased data and other unintended consequences. While the governance of these risks has garnered attention, a comprehensive and rigorous analysis of ethical evaluation connected to LLMs remains lacking. Against the background, this paper conducts an analysis of 105 assessment tools developed by governmental agencies, academic institutions, research groups, and technology corporations. The findings reveal a convergence emerging of these assessment principles, primarily focusing on data ethic, bias, discrimination and fairness, safety, robustness, human preferences alignment, particular ethical scenarios, responsibility, transparency and interpretability, and public participation. The study also presents the limitations of current ethical assessments paired with a critical analysis. This involves considering the collaboration between various institutions while taking into account the general public, the necessity of incorporating multidimensional real-world ethical contexts and related datasets, and the importance of integrating worldwide AI ethics guidelines with the ethical evaluation of LLMs. Such optimization can be incorporated into future evaluation efforts, aligning the technical advancements of LLMs with ethical considerations.</p></div>","PeriodicalId":72137,"journal":{"name":"AI and ethics","volume":"5 5","pages":"4579 - 4592"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The ethical evaluation of large language models and its optimization\",\"authors\":\"Yujing Lyu, Yanyong Du\",\"doi\":\"10.1007/s43681-024-00654-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The utilization of large language models (LLMs)has experienced tremendous growth in the past few years, bringing numerous benefits and conveniences. Yet, this expansion has also underscored ethical concerns, including issues such as hallucinations, toxic content, biased data and other unintended consequences. While the governance of these risks has garnered attention, a comprehensive and rigorous analysis of ethical evaluation connected to LLMs remains lacking. Against the background, this paper conducts an analysis of 105 assessment tools developed by governmental agencies, academic institutions, research groups, and technology corporations. The findings reveal a convergence emerging of these assessment principles, primarily focusing on data ethic, bias, discrimination and fairness, safety, robustness, human preferences alignment, particular ethical scenarios, responsibility, transparency and interpretability, and public participation. The study also presents the limitations of current ethical assessments paired with a critical analysis. This involves considering the collaboration between various institutions while taking into account the general public, the necessity of incorporating multidimensional real-world ethical contexts and related datasets, and the importance of integrating worldwide AI ethics guidelines with the ethical evaluation of LLMs. Such optimization can be incorporated into future evaluation efforts, aligning the technical advancements of LLMs with ethical considerations.</p></div>\",\"PeriodicalId\":72137,\"journal\":{\"name\":\"AI and ethics\",\"volume\":\"5 5\",\"pages\":\"4579 - 4592\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AI and ethics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s43681-024-00654-9\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AI and ethics","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s43681-024-00654-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The ethical evaluation of large language models and its optimization
The utilization of large language models (LLMs)has experienced tremendous growth in the past few years, bringing numerous benefits and conveniences. Yet, this expansion has also underscored ethical concerns, including issues such as hallucinations, toxic content, biased data and other unintended consequences. While the governance of these risks has garnered attention, a comprehensive and rigorous analysis of ethical evaluation connected to LLMs remains lacking. Against the background, this paper conducts an analysis of 105 assessment tools developed by governmental agencies, academic institutions, research groups, and technology corporations. The findings reveal a convergence emerging of these assessment principles, primarily focusing on data ethic, bias, discrimination and fairness, safety, robustness, human preferences alignment, particular ethical scenarios, responsibility, transparency and interpretability, and public participation. The study also presents the limitations of current ethical assessments paired with a critical analysis. This involves considering the collaboration between various institutions while taking into account the general public, the necessity of incorporating multidimensional real-world ethical contexts and related datasets, and the importance of integrating worldwide AI ethics guidelines with the ethical evaluation of LLMs. Such optimization can be incorporated into future evaluation efforts, aligning the technical advancements of LLMs with ethical considerations.