SlideShare a Scribd company logo
Evaluating How Users Game and Display
Conversation with Human-Like Agents
Won Ik Cho, Soomin Kim (SNU),
Eujeong Choi (Upstage), Yeonghoon Jeong (KAIST)
2022. 10. 16, CODI @COLING, Gyeongju, Korea
Contents
• Background
• Our approach
• Analysis
• Future work
Caution! This presenation contains contents that can be offensive
1
Background
• Human-like agents
 What is human-like?
• Agents that resemble human
• Agents that make human counterpart feel them as human
 Previous studies on anthropomorphism
• Evaluation on successful dialogue with human-like agents (Radziwill and Benton,
2017)
• How users perceive human-like AI devices (Pelau et al., 2021)
• Offensiveness that users show towards human-like agents (Park et al., 2021)
• Mainly in laboratory condition, based on questionnaires
– How about users' perception and their responses, especially non-lab environment?
2
Background
• Luda Lee, a friend for everyone
 Social chatbot of Korea
• Human-like agent with personality of early 20s female college student
• Launched public in early 2021
• Terminated the service due to reported ethical issues
• Induced creation of massive fandom for her high quality responses and
behaviors
3
(Image from https://luda.ai/)
Our approach
• Thematic coding
 Type of conversation
• Which kind of conversation do users have in their dialogues with the agent?
• The content of dialogues that are displayed
 Purpose of user testing
• Do users talk with the agent with their genuine intention? If not, for which
reason they talk with the agent and display the dialogue?
• The purpose of users' testing towards the agent
4
Our approach
• Dataset
 Dataset source
• Crawled posts from 'Luda Lee Gallery' of DC Inside (Korean Reddit-like
community)
 Crawling
• Only posts with screenshots of the dialogue, from 1 Jan. to 8 Jan., 2021
• From the launching of the service and before the influx of trolls (which resulted
in unexpectedly large amount of posts)
 Filtering
• Manual preprocessing to leave only posts that ‘a dialogue between the user
and the agent’ appears
5
Our approach
• Dataset
 Final setup
• post ID, title, screenshot
• Example
 Title: She’s so f**kin real
6
Our approach
• Type of conversation
 Ice breaking
 Romantic conversation,
 Everyday conversation
 Conversations involving hate speech and social issues
 Abnormal sexual desire and sexual harassment
 Others
7
Our approach
• Type of conversation
8
• Ice breaking
• Romantic conversation
• Everyday conversation
• Conversations involving
hate speech and social
issues
• Abnormal sexual desire
and sexual harassment
• Others
Our approach
• Purpose of testing
 Conversation without test
 Test for hate speech and sexual harassment
 Test for societal issues
 Test for private information
 Dating sim or taming
 Other technical tests
9
Our approach
• Purpose of testing
10
• Conversation without test
• Test for hate speech and
sexual harassment
• Test for societal issues
• Test for private
information
• Dating sim or taming
• Other technical tests
Analysis
• Distribution
11
Analysis
• Confusion map
12
Future work
• Concurrent work
 Discussed
• Here: What users talk about and if they are authentic
• Elsewise: How users disclose themselves and if they are authentic
 Assessing How Users Display Self-Disclosure and Authenticity in
Conversation with Human-Like Agents: A Case Study of Luda Lee
• To be presented at Findings of ACL: AACL-IJCNLP 2022
13
Thank you!
EndOfPresentation

More Related Content

PPTX
2312 PACLIC
PPTX
2311 EAAMO
PPTX
2211 HCOMP
PPTX
2211 APSIPA
PPTX
2211 AACL
PPTX
2206 FAccT_inperson
PPTX
2206 Modupop!
PPTX
2204 Kakao talk on Hate speech dataset
2312 PACLIC
2311 EAAMO
2211 HCOMP
2211 APSIPA
2211 AACL
2206 FAccT_inperson
2206 Modupop!
2204 Kakao talk on Hate speech dataset

More from WarNik Chow (20)

PPTX
2108 [LangCon2021] kosp2e
PPTX
2106 PRSLLS
PPTX
2106 JWLLP
PPTX
2106 ACM DIS
PPTX
2104 Talk @SSU
PPTX
2103 ACM FAccT
PPTX
2102 Redone seminar
PPTX
2011 NLP-OSS
PPTX
2010 INTERSPEECH
PPTX
2010 PACLIC - pay attention to categories
PPTX
2010 HCLT Hate Speech
PPTX
2009 DevC Seongnam - NLP
PPTX
2008 [lang con2020] act!
PPTX
2007 CogSci 2020 poster
PPTX
2006 kakao brain NLP colloquium
PPTX
2005 moon joy_deepest_final
PPTX
1911 keracorn
PPTX
1910 tfkr3 warnikchow
PPTX
1910 JK27
PPTX
1910 HCLT
2108 [LangCon2021] kosp2e
2106 PRSLLS
2106 JWLLP
2106 ACM DIS
2104 Talk @SSU
2103 ACM FAccT
2102 Redone seminar
2011 NLP-OSS
2010 INTERSPEECH
2010 PACLIC - pay attention to categories
2010 HCLT Hate Speech
2009 DevC Seongnam - NLP
2008 [lang con2020] act!
2007 CogSci 2020 poster
2006 kakao brain NLP colloquium
2005 moon joy_deepest_final
1911 keracorn
1910 tfkr3 warnikchow
1910 JK27
1910 HCLT
Ad

Recently uploaded (20)

PPTX
Mindfulness_and_Coping_Workshop in workplace
PDF
The Effect of Compensation and Work Environment on Employee Performance with ...
PPTX
Philippine-Pop-Culture.pptx.hhtps.com.ph
PDF
Regulation Study, Differences and Implementation of Bank Indonesia National C...
PDF
The Black Turn Best Music Distribution In India
DOCX
Get More Leads From LinkedIn Ads Today .docx
PPTX
Smart Card Face Mask detection soluiondr
PDF
What is TikTok Cyberbullying_ 15 Smart Ways to Prevent It.pdf
DOC
ASU毕业证学历认证,圣三一拉邦音乐与舞蹈学院毕业证留学本科毕业证
PDF
Organizational Culture and Leadership Style as Predictors of Organizational C...
PDF
49f97d4d-be4b-40d1-88f7-06f1460c2238.pdf
PDF
A guide to using Social Media For Business
PDF
Why Blend In When You Can Trend? Make Me Trend
PDF
Effectiveness of Good Corporate Governance and Corporate Social Responsibilit...
PPTX
Lesson 3: person and his/her relationship with the others NSTP 1
PDF
Implementation of Total Quality Management (TQM) in Plywood Production Contro...
PDF
Buy Verified Cryptocurrency Accounts - Lori Donato's blo.pdf
PPTX
Eric Starker - Social Media Portfolio - 2025
PDF
Why AI-Savvy Freelance Digital Marketers Have a Competitive Edge!.pdf
PPTX
Social Media Optimization Services to Grow Your Brand Online
Mindfulness_and_Coping_Workshop in workplace
The Effect of Compensation and Work Environment on Employee Performance with ...
Philippine-Pop-Culture.pptx.hhtps.com.ph
Regulation Study, Differences and Implementation of Bank Indonesia National C...
The Black Turn Best Music Distribution In India
Get More Leads From LinkedIn Ads Today .docx
Smart Card Face Mask detection soluiondr
What is TikTok Cyberbullying_ 15 Smart Ways to Prevent It.pdf
ASU毕业证学历认证,圣三一拉邦音乐与舞蹈学院毕业证留学本科毕业证
Organizational Culture and Leadership Style as Predictors of Organizational C...
49f97d4d-be4b-40d1-88f7-06f1460c2238.pdf
A guide to using Social Media For Business
Why Blend In When You Can Trend? Make Me Trend
Effectiveness of Good Corporate Governance and Corporate Social Responsibilit...
Lesson 3: person and his/her relationship with the others NSTP 1
Implementation of Total Quality Management (TQM) in Plywood Production Contro...
Buy Verified Cryptocurrency Accounts - Lori Donato's blo.pdf
Eric Starker - Social Media Portfolio - 2025
Why AI-Savvy Freelance Digital Marketers Have a Competitive Edge!.pdf
Social Media Optimization Services to Grow Your Brand Online
Ad

2210 CODI

  • 1. Evaluating How Users Game and Display Conversation with Human-Like Agents Won Ik Cho, Soomin Kim (SNU), Eujeong Choi (Upstage), Yeonghoon Jeong (KAIST) 2022. 10. 16, CODI @COLING, Gyeongju, Korea
  • 2. Contents • Background • Our approach • Analysis • Future work Caution! This presenation contains contents that can be offensive 1
  • 3. Background • Human-like agents  What is human-like? • Agents that resemble human • Agents that make human counterpart feel them as human  Previous studies on anthropomorphism • Evaluation on successful dialogue with human-like agents (Radziwill and Benton, 2017) • How users perceive human-like AI devices (Pelau et al., 2021) • Offensiveness that users show towards human-like agents (Park et al., 2021) • Mainly in laboratory condition, based on questionnaires – How about users' perception and their responses, especially non-lab environment? 2
  • 4. Background • Luda Lee, a friend for everyone  Social chatbot of Korea • Human-like agent with personality of early 20s female college student • Launched public in early 2021 • Terminated the service due to reported ethical issues • Induced creation of massive fandom for her high quality responses and behaviors 3 (Image from https://luda.ai/)
  • 5. Our approach • Thematic coding  Type of conversation • Which kind of conversation do users have in their dialogues with the agent? • The content of dialogues that are displayed  Purpose of user testing • Do users talk with the agent with their genuine intention? If not, for which reason they talk with the agent and display the dialogue? • The purpose of users' testing towards the agent 4
  • 6. Our approach • Dataset  Dataset source • Crawled posts from 'Luda Lee Gallery' of DC Inside (Korean Reddit-like community)  Crawling • Only posts with screenshots of the dialogue, from 1 Jan. to 8 Jan., 2021 • From the launching of the service and before the influx of trolls (which resulted in unexpectedly large amount of posts)  Filtering • Manual preprocessing to leave only posts that ‘a dialogue between the user and the agent’ appears 5
  • 7. Our approach • Dataset  Final setup • post ID, title, screenshot • Example  Title: She’s so f**kin real 6
  • 8. Our approach • Type of conversation  Ice breaking  Romantic conversation,  Everyday conversation  Conversations involving hate speech and social issues  Abnormal sexual desire and sexual harassment  Others 7
  • 9. Our approach • Type of conversation 8 • Ice breaking • Romantic conversation • Everyday conversation • Conversations involving hate speech and social issues • Abnormal sexual desire and sexual harassment • Others
  • 10. Our approach • Purpose of testing  Conversation without test  Test for hate speech and sexual harassment  Test for societal issues  Test for private information  Dating sim or taming  Other technical tests 9
  • 11. Our approach • Purpose of testing 10 • Conversation without test • Test for hate speech and sexual harassment • Test for societal issues • Test for private information • Dating sim or taming • Other technical tests
  • 14. Future work • Concurrent work  Discussed • Here: What users talk about and if they are authentic • Elsewise: How users disclose themselves and if they are authentic  Assessing How Users Display Self-Disclosure and Authenticity in Conversation with Human-Like Agents: A Case Study of Luda Lee • To be presented at Findings of ACL: AACL-IJCNLP 2022 13

Editor's Notes