Kiss up, Kick down: Exploring Behavioral Changes in Multi-modal Large Language Models with Assigned Visual Personas
- Authors
- Sun, Seungjong; Lee, Eungu; Baek, Seo Yeon; Hwang, Seunghyun; Lee, Wonbyung; Nan, Dongyan; Jansen, Bernard J.; Kim, Jang Hyun
- Issue Date
- 2024
- Publisher
- Association for Computational Linguistics (ACL)
- Citation
- EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, pp 10888 - 10901
- Pages
- 14
- Indexed
- SCOPUS
- Journal Title
- EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
- Start Page
- 10888
- End Page
- 10901
- URI
- https://scholarx.skku.edu/handle/2021.sw.skku/120726
- Abstract
- This study is the first to explore whether multi-modal large language models (LLMs) can align their behaviors with visual personas, addressing a significant gap in the literature that predominantly focuses on text-based personas. We developed a novel dataset of 5K fictional avatar images for assignment as visual personas to LLMs, and analyzed their negotiation behaviors based on the visual traits depicted in these images, with a particular focus on aggressiveness. The results indicate that LLMs assess the aggressiveness of images in a manner similar to humans and output more aggressive negotiation behaviors when prompted with an aggressive visual persona. Interestingly, the LLM exhibited more aggressive negotiation behaviors when the opponent's image appeared less aggressive than their own, and less aggressive behaviors when the opponent's image appeared more aggressive. © 2024 Association for Computational Linguistics.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Graduate School > Department of Applied Data Science > 1. Journal Articles
- Computing and Informatics > Convergence > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.