Alfian Hakim, Satrio Hadi Wijoyo, Nanang Yudi Setiawan
{"title":"基于Instagram标题的MBTI个性类型预测,使用Word2Vec和Long-Short Term内存(LSTM)","authors":"Alfian Hakim, Satrio Hadi Wijoyo, Nanang Yudi Setiawan","doi":"10.25126/jtiik.20231057064","DOIUrl":null,"url":null,"abstract":"Myers-Briggs Type Indicator (MBTI) adalah metode pengujian psikologi yang membedakan kepribadian seseorang. MBTI termasuk pembagian tipe kepribadian yang paling populer di dunia, termasuk di Korea Selatan. Tren MBTI di Korea Selatan juga dimanfaatkan oleh para artis K-Pop untuk berbagi tipe MBTI sehingga bisa mendekatkan hubungan antara penggemar dan idolanya. Salah satu media sosial yang umum digunakan oleh artis K-Pop adalah Instagram. Penelitian ini mencoba membuat model klasikasi tipe kepribadian berdasarkan caption Instagram artis K-Pop menggunakan Word2Vec dan Long-Short Term Memory (LSTM). Terdapat 118.401 data caption yang sudah dibersihkan melalui serangkaian langkah pre-processing dari 458 artis. Distribusi tipe kepribadian menunjukkan bahwa target tidak seimbang sehingga perlu dilakukan penanganan yaitu penggeseran threshold yang dilakukan pasca pemodelan. Evaluasi kombinasi model menghasilkan nilai macro f1 0,65 pada data artis, dengan rincian model Extroversion-Introversion, Sensing-Intuition, Thinking-Feeling memiliki nilai macro f1 yang sama yaitu 0,88, sedangkan model Judging-Perceiving memiliki nilai macro f1 yang sedikit lebih baik yaitu 0,90. Model diimplementasikan dalam aplikasi web Streamlit agar penggemar K-Pop dapat menggunakannya untuk memprediksi tipe MBTI dengan masukan caption Instagram. Aplikasi web dievaluasi menggunakan kuesioner System Usability Scale (SUS) dan mendapatkan skor 84,55 sehingga sudah termasuk kategori acceptable. Abstract Myers-Briggs Type Indicator (MBTI) is a psychological test that distinguishes a person's personality. The MBTI is one of the most popular personality types in the world, including South Korea. The MBTI trend in South Korea is also used by K-Pop artists to share their MBTI types so they could be closer to their fans. One of the social media commonly used by K-Pop artists is Instagram. This study tries to develop a personality type classification model based on Instagram captions of K-Pop artists using Word2Vec and Long-Short Term Memory (LSTM). There are 118,401 caption data that have been cleaned through a series of pre-processing steps from 458 artists. The distribution of personality types shows that the target is not balanced, so it is necessary to handle imbalanced data, namely shifting the threshold after modeling. Evaluation of the combination model yields a macro f1 value of 0.65 in the artist data, with the details of the Extroversion-Introversion, Sensing-Intuition, Thinking-Feeling models having the same macro f1 value of 0.88, while the Judging-Perceiving model has a slightly better macro f1 value of 0.90. The model is deployed in the Streamlit web application so that K-Pop fans can use it to predict the MBTI type by inputting Instagram captions. The web application is evaluated using the System Usability Scale (SUS) questionnaire and gets a score of 84.55 so it is considered acceptable.","PeriodicalId":32501,"journal":{"name":"Jurnal Teknologi Informasi dan Ilmu Komputer","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediksi Tipe Kepribadian MBTI Artis K-Pop Berdasarkan Caption Instagram Menggunakan Word2Vec dan Long-Short Term Memory (LSTM)\",\"authors\":\"Alfian Hakim, Satrio Hadi Wijoyo, Nanang Yudi Setiawan\",\"doi\":\"10.25126/jtiik.20231057064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Myers-Briggs Type Indicator (MBTI) adalah metode pengujian psikologi yang membedakan kepribadian seseorang. MBTI termasuk pembagian tipe kepribadian yang paling populer di dunia, termasuk di Korea Selatan. Tren MBTI di Korea Selatan juga dimanfaatkan oleh para artis K-Pop untuk berbagi tipe MBTI sehingga bisa mendekatkan hubungan antara penggemar dan idolanya. Salah satu media sosial yang umum digunakan oleh artis K-Pop adalah Instagram. Penelitian ini mencoba membuat model klasikasi tipe kepribadian berdasarkan caption Instagram artis K-Pop menggunakan Word2Vec dan Long-Short Term Memory (LSTM). Terdapat 118.401 data caption yang sudah dibersihkan melalui serangkaian langkah pre-processing dari 458 artis. Distribusi tipe kepribadian menunjukkan bahwa target tidak seimbang sehingga perlu dilakukan penanganan yaitu penggeseran threshold yang dilakukan pasca pemodelan. Evaluasi kombinasi model menghasilkan nilai macro f1 0,65 pada data artis, dengan rincian model Extroversion-Introversion, Sensing-Intuition, Thinking-Feeling memiliki nilai macro f1 yang sama yaitu 0,88, sedangkan model Judging-Perceiving memiliki nilai macro f1 yang sedikit lebih baik yaitu 0,90. Model diimplementasikan dalam aplikasi web Streamlit agar penggemar K-Pop dapat menggunakannya untuk memprediksi tipe MBTI dengan masukan caption Instagram. Aplikasi web dievaluasi menggunakan kuesioner System Usability Scale (SUS) dan mendapatkan skor 84,55 sehingga sudah termasuk kategori acceptable. Abstract Myers-Briggs Type Indicator (MBTI) is a psychological test that distinguishes a person's personality. The MBTI is one of the most popular personality types in the world, including South Korea. The MBTI trend in South Korea is also used by K-Pop artists to share their MBTI types so they could be closer to their fans. One of the social media commonly used by K-Pop artists is Instagram. This study tries to develop a personality type classification model based on Instagram captions of K-Pop artists using Word2Vec and Long-Short Term Memory (LSTM). There are 118,401 caption data that have been cleaned through a series of pre-processing steps from 458 artists. The distribution of personality types shows that the target is not balanced, so it is necessary to handle imbalanced data, namely shifting the threshold after modeling. Evaluation of the combination model yields a macro f1 value of 0.65 in the artist data, with the details of the Extroversion-Introversion, Sensing-Intuition, Thinking-Feeling models having the same macro f1 value of 0.88, while the Judging-Perceiving model has a slightly better macro f1 value of 0.90. The model is deployed in the Streamlit web application so that K-Pop fans can use it to predict the MBTI type by inputting Instagram captions. The web application is evaluated using the System Usability Scale (SUS) questionnaire and gets a score of 84.55 so it is considered acceptable.\",\"PeriodicalId\":32501,\"journal\":{\"name\":\"Jurnal Teknologi Informasi dan Ilmu Komputer\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jurnal Teknologi Informasi dan Ilmu Komputer\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.25126/jtiik.20231057064\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Teknologi Informasi dan Ilmu Komputer","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25126/jtiik.20231057064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Prediksi Tipe Kepribadian MBTI Artis K-Pop Berdasarkan Caption Instagram Menggunakan Word2Vec dan Long-Short Term Memory (LSTM)
Myers-Briggs Type Indicator (MBTI) adalah metode pengujian psikologi yang membedakan kepribadian seseorang. MBTI termasuk pembagian tipe kepribadian yang paling populer di dunia, termasuk di Korea Selatan. Tren MBTI di Korea Selatan juga dimanfaatkan oleh para artis K-Pop untuk berbagi tipe MBTI sehingga bisa mendekatkan hubungan antara penggemar dan idolanya. Salah satu media sosial yang umum digunakan oleh artis K-Pop adalah Instagram. Penelitian ini mencoba membuat model klasikasi tipe kepribadian berdasarkan caption Instagram artis K-Pop menggunakan Word2Vec dan Long-Short Term Memory (LSTM). Terdapat 118.401 data caption yang sudah dibersihkan melalui serangkaian langkah pre-processing dari 458 artis. Distribusi tipe kepribadian menunjukkan bahwa target tidak seimbang sehingga perlu dilakukan penanganan yaitu penggeseran threshold yang dilakukan pasca pemodelan. Evaluasi kombinasi model menghasilkan nilai macro f1 0,65 pada data artis, dengan rincian model Extroversion-Introversion, Sensing-Intuition, Thinking-Feeling memiliki nilai macro f1 yang sama yaitu 0,88, sedangkan model Judging-Perceiving memiliki nilai macro f1 yang sedikit lebih baik yaitu 0,90. Model diimplementasikan dalam aplikasi web Streamlit agar penggemar K-Pop dapat menggunakannya untuk memprediksi tipe MBTI dengan masukan caption Instagram. Aplikasi web dievaluasi menggunakan kuesioner System Usability Scale (SUS) dan mendapatkan skor 84,55 sehingga sudah termasuk kategori acceptable. Abstract Myers-Briggs Type Indicator (MBTI) is a psychological test that distinguishes a person's personality. The MBTI is one of the most popular personality types in the world, including South Korea. The MBTI trend in South Korea is also used by K-Pop artists to share their MBTI types so they could be closer to their fans. One of the social media commonly used by K-Pop artists is Instagram. This study tries to develop a personality type classification model based on Instagram captions of K-Pop artists using Word2Vec and Long-Short Term Memory (LSTM). There are 118,401 caption data that have been cleaned through a series of pre-processing steps from 458 artists. The distribution of personality types shows that the target is not balanced, so it is necessary to handle imbalanced data, namely shifting the threshold after modeling. Evaluation of the combination model yields a macro f1 value of 0.65 in the artist data, with the details of the Extroversion-Introversion, Sensing-Intuition, Thinking-Feeling models having the same macro f1 value of 0.88, while the Judging-Perceiving model has a slightly better macro f1 value of 0.90. The model is deployed in the Streamlit web application so that K-Pop fans can use it to predict the MBTI type by inputting Instagram captions. The web application is evaluated using the System Usability Scale (SUS) questionnaire and gets a score of 84.55 so it is considered acceptable.