Predicting the MBTI Personality Types of K-Pop Artists from Instagram Captions Using Word2Vec and Long Short-Term Memory (LSTM)

Alfian Hakim, Satrio Hadi Wijoyo, Nanang Yudi Setiawan
Journal: Jurnal Teknologi Informasi dan Ilmu Komputer
Published: 2023-10-17
DOI: 10.25126/jtiik.20231057064 (https://doi.org/10.25126/jtiik.20231057064)
Citations: 0

Abstract

The Myers-Briggs Type Indicator (MBTI) is a psychological assessment that distinguishes personality types. It is among the most popular personality typologies in the world, including in South Korea, where K-Pop artists share their MBTI types to feel closer to their fans. Instagram is one of the social media platforms K-Pop artists commonly use. This study develops a personality type classification model based on the Instagram captions of K-Pop artists using Word2Vec and Long Short-Term Memory (LSTM). The dataset contains 118,401 captions from 458 artists, cleaned through a series of pre-processing steps. The distribution of personality types shows that the target classes are imbalanced, which is handled by shifting the decision threshold after modeling. Evaluation of the combined model yields a macro F1 of 0.65 on the artist data. Individually, the Extroversion-Introversion, Sensing-Intuition, and Thinking-Feeling models each reach a macro F1 of 0.88, while the Judging-Perceiving model is slightly better at 0.90. The model is deployed in a Streamlit web application so that K-Pop fans can predict an MBTI type by entering Instagram captions. The web application was evaluated with the System Usability Scale (SUS) questionnaire and scored 84.55, which falls in the acceptable category.

Original title (Indonesian): Prediksi Tipe Kepribadian MBTI Artis K-Pop Berdasarkan Caption Instagram Menggunakan Word2Vec dan Long-Short Term Memory (LSTM)
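The abstract mentions three mechanics that a small example may make concrete: shifting the decision threshold after modeling, scoring with macro F1, and combining four binary models into one MBTI type. The sketch below is not the authors' code. The function names, the threshold grid, the toy labels and probabilities, and the convention that each probability refers to the second letter of a pair are all assumptions made for illustration.

```python
from typing import List, Optional, Sequence, Tuple

# The four MBTI dichotomies. Assumption: each model outputs the probability
# of the SECOND letter in its pair (I, N, F, P).
AXES = (("E", "I"), ("S", "N"), ("T", "F"), ("J", "P"))


def macro_f1(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Unweighted mean of the per-class F1 scores for binary labels 0/1."""
    scores = []
    for cls in (0, 1):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != cls and p == cls)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p != cls)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def best_threshold(
    y_true: Sequence[int],
    probs: Sequence[float],
    candidates: Optional[List[float]] = None,
) -> Tuple[float, float]:
    """Pick the probability cut-off that maximizes macro F1.

    The default grid (0.05 to 0.95 in steps of 0.05) is a hypothetical choice.
    """
    if candidates is None:
        candidates = [i / 100 for i in range(5, 100, 5)]
    best_t, best_score = 0.5, -1.0
    for t in candidates:
        preds = [1 if p >= t else 0 for p in probs]
        score = macro_f1(y_true, preds)
        if score > best_score:  # ties keep the lowest threshold
            best_t, best_score = t, score
    return best_t, best_score


def combine_axes(probs: Sequence[float], thresholds: Sequence[float]) -> str:
    """Join four thresholded binary decisions into a four-letter type."""
    return "".join(
        second if p >= t else first
        for (first, second), p, t in zip(AXES, probs, thresholds)
    )


if __name__ == "__main__":
    # Toy imbalanced data: 8 negatives, 2 positives, low-confidence model.
    y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
    probs = [0.05, 0.10, 0.10, 0.15, 0.20, 0.20, 0.25, 0.30, 0.35, 0.45]
    default_preds = [1 if p >= 0.5 else 0 for p in probs]
    print("macro F1 at 0.5:", round(macro_f1(y_true, default_preds), 3))
    print("tuned (threshold, macro F1):", best_threshold(y_true, probs))
    print("type:", combine_axes([0.2, 0.7, 0.6, 0.1], [0.5] * 4))
```

On the toy data the default cut-off of 0.5 never predicts the minority class, so its class F1 is zero and the macro average suffers; a lower cut-off recovers it. In practice the threshold would be tuned on held-out validation data, not on the test set.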