Exploration of the cultural attributes of Chinese character sculpture using machine learning technology

Zhen Luo

Journal of Autonomous Intelligence, published 2024-02-05. DOI: 10.32629/jai.v7i4.1471
This article employs machine learning, specifically the CLIP (Contrastive Language-Image Pretraining) model, to analyze the cultural attributes of Chinese character sculptures, addressing the challenges of multi-dimensional data processing and high digitization costs. The pipeline normalizes the sculpture images, uses FastText to obtain vector representations of the Chinese characters, and maps the text into the same embedding space as the images. Through unsupervised contrastive training, the CLIP model minimizes a negative log-likelihood loss between matched image and text embeddings to build representations of cultural attributes. The CLIP model outperforms the M3 baseline, achieving a 5.4% higher average AUC, and demonstrates high efficiency and accuracy: a low RMSE (0.034) and MAE (0.025), with an analysis time of 182 ms. The approach thus analyzes the cultural attributes of Chinese character sculptures effectively and accurately, addressing existing research gaps.
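The core mechanism the abstract describes — projecting image and text embeddings into a shared space and minimizing a negative log-likelihood over matched pairs — can be sketched as a CLIP-style symmetric contrastive loss. The snippet below is a minimal NumPy illustration under stated assumptions, not the paper's implementation: the function name, batch shapes, and the temperature value of 0.07 are all assumptions for the sketch.

```python
import numpy as np

def clip_contrastive_loss(image_emb, text_emb, temperature=0.07):
    """Symmetric negative log-likelihood over matched image-text pairs.

    image_emb, text_emb: (N, D) arrays where row i of each is a matched pair
    (e.g., a sculpture image embedding and its FastText character embedding
    projected into the same space). Returns a scalar loss.
    """
    # L2-normalize so dot products become cosine similarities.
    img = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    txt = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)

    logits = img @ txt.T / temperature   # (N, N) pairwise similarity matrix
    labels = np.arange(len(img))         # matched pairs sit on the diagonal

    def cross_entropy(l, y):
        # Numerically stable softmax cross-entropy (negative log-likelihood).
        l = l - l.max(axis=1, keepdims=True)
        log_probs = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -log_probs[np.arange(len(y)), y].mean()

    # Average the image-to-text and text-to-image directions (symmetric loss).
    return 0.5 * (cross_entropy(logits, labels) + cross_entropy(logits.T, labels))
```

Minimizing this loss pulls each image embedding toward its own text embedding and away from the other texts in the batch, which is how the shared cultural-attribute representation is learned without supervised labels.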