KorraAI Framework
A framework for building probabilistic virtual agents
ANTON ANDREEV, CNRS, GIPSA-LAB, GRENOBLE
JULY 2022
About me
• 7 years in the private sector
• 2 years NLP + Semantic Annotation
• 10 years CNRS - Ingénieur d'études
• Brain Computer Interface (BCI), Brain Invaders, P300 speller for
patients, Artificial Nose, Virtual Reality
• C/C++, C#, Matlab, Unity, Python, Julia
• Data scientist
• 2.5 years development of KorraAI framework
Humans
• Search for repeating patterns
• to predict future events
• repeating events can be uninteresting (e.g. computer games)
• Have periodic needs
• hunger, thirst, communication, sleep, rest …
• Do perform certain activities periodically
• watch movies, play computer games, do sport …
• Can work with uncertain information
• difficult to model with current computer hardware architectures
ECA Challenges
ECA (Embodied Conversational Agent) = Virtual Agent = Bot
• Users have too high expectations
• Natural language is not a precise language (Lojban is)
• Reasoning and decision making
• Humans learn faster (from just a few examples)
• Synchronization with facial expressions and gestures
• Real time processing
General ECA vs Specialized ECA
Specialized ECA
• Works over a specific domain
• Limits the ambiguity in user’s responses
• Allows for pre-defined suggested answers and questions
• Requires less NLP effort
• Is more practically oriented
KANOPEE is a specialized ECA in the domains of
sleep disorders and smoking addiction
KorraAI original design goals
• Website: https://github.com/toncho11/KorraAI
• Platform for experimentation and research
• Example: use of Probabilistic Programming (PP) in ECAs
• NLP + Semantic Annotation + Ontologies + RDF database
• Favors the use of probabilistic models – distributions and Bayesian networks
• Provides a natural experience
• No fixed events
• For both specialized and general ECAs
• Focuses on ECA-to-user communication
• Easy to install and usable as a commercial product
KorraAI main features (1)
• At every start it generates a new
communication pipeline called Interaction
Queue
• Pro-active
•Can start communications by itself
•Does not block
• Manages user's responses with a level of
confidence
•Used directly in Bayesian networks
•Used in probabilistic sampling
KorraAI main features (2)
• Handles changes in behavior over time
• Built-in support for Probabilistic
Programming and Bayesian networks
• Extensible – each ECA is a plugin
• Can be used on mobile phones and in VR
• Real-time, supports facial expressions
Application targets
• Everyday companion
• Sales agent
• Teacher
• Customer service
• Coach
KorraAI Technical Details
• Unity component Morph3D / MCS (free)
• Uses Windows voice synthesis or Amazon Polly (paid)
• Unity component for the lip-sync (paid)
• Unity component for music playback (paid)
KorraAI’s Plugin Architecture
• Separation between ECA
model and execution engine
• ECA models are hosted in
the KorraAI.dll
• Same 3D model, totally
different behavior
ECAs in KorraAI
In practice, building a new ECA should start from an existing model
• Ana
• simple – used to show how to encode your first ECA
• Joi
• everyday companion
• April – commercial
• everyday companion and sales agent
KorraAI Architecture
How to encode a personality?
The distribution over the activities and needs of a person (or an ECA) – sleep, hunger, playing video
games, shopping, etc. – is called the "Main Distribution".

In the real world → In KorraAI
• The target domain → Categories and Interactions
• How often do we do certain things? → Distributions (e.g. the "Main Distribution")
• Behavior can change over time or through interaction with others → Model Update Trigger class
• Reasoning → Probabilistic reasoning
• Evaluate state and react → Model Evaluate Trigger class with a probabilistic model
Tutorial on new ECA model in KorraAI
• Interaction Categories (required)
• Interactions (required)
• Main Distribution (required)
• Timing distributions
• Distributions within each category
• Triggers
• Probabilistic variables and Bayesian networks
Distributions
KorraAI is a non-deterministic program. It has a probabilistic sampler.
• "Main Distribution"
• generates the "Interaction Queue" (at start-up or on a Trigger request)
• using probabilistic variables for each category
• Internal category distributions
• Time distributions (no fixed time intervals)
• Speech pauses – Normal (m=3.7s, v=0.25)
• When to smile – Normal (m=12s, v=3)
Distributions are applied wherever possible.
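As a rough illustration, here is a minimal sketch of sampling an Interaction Queue from a "Main Distribution". The dictionary representation, the category names and the sampling helper are assumptions for illustration, not KorraAI's actual data structures.

// Illustrative sketch only: sample interaction categories from a "Main Distribution".
using System;
using System.Collections.Generic;
using System.Linq;

var rng = new Random();
var mainDistribution = new Dictionary<string, double>
{
    ["Ask introduction questions"] = 0.3,  // example categories and probabilities
    ["Tell Jokes"] = 0.2,
    ["Suggest movies"] = 0.3,
    ["Motivate to do sport"] = 0.2,
};

// Draw one category proportionally to its probability.
string SampleCategory()
{
    double u = rng.NextDouble(), acc = 0.0;
    foreach (var kv in mainDistribution)
    {
        acc += kv.Value;
        if (u <= acc) return kv.Key;
    }
    return mainDistribution.Keys.Last();
}

// A queue of 10 category picks roughly follows the Main Distribution.
var interactionQueue = Enumerable.Range(0, 10).Select(_ => SampleCategory()).ToList();
Console.WriteLine(string.Join(" | ", interactionQueue));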
Speech generation (1)
An example of how several distributions are used to generate speech
Speech generation (2)
Example: movie suggestion text generation using a distribution
string text = KorraModelHelper.GetChance(new string[] {
  "How about this " + type + " " + movieName + "? I recommend it.", //1
  "I have a new " + type + " suggestion for you. It is called : " + movieName + ". You should try it.", //2
  (KorraModelHelper.GetChance(3) ? "Do you feel bored? " : "") + "I recommend you to watch the " + type + " " + movieName + ".", //3
  "Time for something interesting to watch. Try this one: " + movieName + ".", //4
  "You are going to like this " + type + ": " + movieName + ". I highly recommend it." //5
});
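The speaker notes describe KorraModelHelper.GetChance(string[]) as a uniform pick from the array and GetChance(n) as returning true with probability 1/n. Here is a minimal sketch of helpers with that behavior (an assumption for illustration, not the actual KorraAI implementation):

// Sketch of the assumed helper behavior (not the actual KorraAI code):
// GetChance(string[]) picks one item uniformly; GetChance(n) is true with probability 1/n.
using System;

public static class KorraModelHelperSketch
{
    private static readonly Random Rng = new Random();

    // Uniformly select one phrase template from the array.
    public static string GetChance(string[] options) => options[Rng.Next(options.Length)];

    // Return true with probability 1/n (e.g. GetChance(3) ≈ 33%).
    public static bool GetChance(int n) => Rng.Next(n) == 0;
}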
Pre-generated Interaction Queue
Advantages:
• Easier to verify the resulting "Main Distribution"
• Easier to inspect
Example:
|1. Hi|2. What is your name?|3. My name is Joi.|4. Listen to this song: Robin
Schulz - Sugar. Please note that pressing Escape at any time will stop playback.|5. Time
for another outfit.|6. How about this TV series Shameless? I recommend it.|7. Time for
some sport. You should go to the gym. Sport is good for both physical and mental
health.|8. <prosody pitch="+0%">I was thinking. God must love stupid people. <break
time="600ms"/>He created SO many of them!</prosody>|9. ###place holder for
InAGoodMood|10. How old are you?|
Two models: ECA and User
We keep two interconnected models. Humans also build a
model of their interlocutors.
Example:
• We model the user's likelihood of watching a movie
• And use this model in the ECA's movie suggestions model
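As a rough sketch in the same Probabilistic C# style used later in this deck (the variable names LikesMovies and SuggestMovieRate are illustrative, not actual KorraAI variables), the user model can feed the ECA model like this:

// Hypothetical sketch: a user-model variable feeding an ECA decision variable.
// Assumes the same static imports from the csharp-probability-monad library as the joke example.
static FiniteDist<bool> LikesMovies = BernoulliF(Prob(0.65));   // user model

static FiniteDist<bool> SuggestMovieRate =                      // ECA model
    from likes in LikesMovies
    from suggest in BernoulliF(likes ? Prob(0.5) : Prob(0.15))
    select suggest;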
Naturalization
We need to follow some human norms during conversation
Speech
• Some interactions are added manually – e.g. the initial greeting
• Certain interactions are coupled together
• Addressing by name: "Peter, …" follows a distribution
Non-verbal:
• The reaction-time pause is shorter than the pause before starting a new interaction
• The time the ECA’s eyes stay focused on the user is limited
• Voice annotation with SSML
• Facial expressions available: smile, surprise, blink with one eye, …
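A minimal sketch of the "addressing by name" rule above, using the 33% figure from the speaker notes (the function itself is hypothetical and reuses the GetChance helper sketched earlier):

// Hypothetical sketch: prepend the user's name in roughly 33% of eligible interactions.
string AddressByName(string phrase, string userName)
{
    if (userName != null && KorraModelHelperSketch.GetChance(3))  // true ~33% of the time
        return userName + ", " + phrase;
    return phrase;
}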
Handling of Empty Categories
Problem: when a category is depleted, normalization is applied, which
increases the probabilities of the other categories
• This can be good – we compensate with what we have
• This can be bad – we can no longer maintain the desired “Main Distribution”
Strategies:
• Set explicitly how a distribution changes
• If category A is depleted – use this new distribution
• If category A and B are depleted – use this new distribution
• Normalize only over certain categories; the others stay fixed (better – see the sketch below)
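A minimal sketch of that last strategy, assuming a simple dictionary representation of the "Main Distribution" (this helper is illustrative, not the KorraAI API): depleted categories get zero mass, fixed categories keep their designed probabilities, and only the remaining categories are renormalized.

// Illustrative sketch: renormalize only the non-fixed, non-depleted categories.
using System.Collections.Generic;
using System.Linq;

static Dictionary<string, double> Renormalize(
    Dictionary<string, double> mainDistribution,
    ISet<string> depleted,
    ISet<string> fixedCategories)
{
    // Probability mass that must be preserved exactly.
    double fixedMass = mainDistribution
        .Where(kv => fixedCategories.Contains(kv.Key) && !depleted.Contains(kv.Key))
        .Sum(kv => kv.Value);

    // Free categories share the remaining mass proportionally to their original weights.
    double freeMass = mainDistribution
        .Where(kv => !fixedCategories.Contains(kv.Key) && !depleted.Contains(kv.Key))
        .Sum(kv => kv.Value);

    var result = new Dictionary<string, double>();
    foreach (var kv in mainDistribution)
    {
        if (depleted.Contains(kv.Key)) result[kv.Key] = 0.0;
        else if (fixedCategories.Contains(kv.Key)) result[kv.Key] = kv.Value;
        else result[kv.Key] = freeMass > 0 ? kv.Value / freeMass * (1.0 - fixedMass) : 0.0;
    }
    return result;
}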
Probabilistic Programming (1)
Problem with probabilistic models:
• Tools for creating them are a complete mess (math, English, diagrams,
pictures) or some bizarre programming language, e.g. BUGS
Advantages of PP:
• inference and modelling are completely separate – we can define models
without caring how we are going to do inference
• probabilistic inference is built into the PPL
• models can be specified in a declarative manner
• models compose freely, allowing us to construct complex models from
simpler ones
• a popular host language can be used, with probabilistic extensions

KorraAI uses PP
Probabilistic Programming (2)
Probabilistic Programming Languages (PPL):
• WebPPL – http://webppl.org
• Figaro – https://github.com/p2t2/figaro
• Microsoft Infer.NET – https://github.com/dotnet/infer
In KorraAI:
• Inference is exact
• Probabilistic C# library using LINQ extensions
https://github.com/joashc/csharp-probability-monad
• Allows for encoding of Bayesian networks
• Best alternative is Infer.NET
Example Probabilistic C# (1)
Model the “JokeTellingRate” used by the ECA
The initial model of the user:
• InAGoodMood = BernoulliF(Prob(0.6));
• LikesJoke = BernoulliF(Prob(0.7));
LikesJoke can be adjusted by a natural question, "Do you like
jokes?", using predefined answers mapped to probabilities:
• Absolutely – 0.9
• Mostly – 0.6
• Rather not – 0.3
We are building a model of the user that
is used in the ECA’s “Main Distribution”
and thus affects the ECA’s behavior.
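A minimal sketch of that mapping (the method name UpdateLikesJoke and the fallback value are assumptions for illustration; BernoulliF and Prob are the same library calls used above):

// Hypothetical sketch: map a predefined answer to a probability and rebuild LikesJoke.
static FiniteDist<bool> UpdateLikesJoke(string answer)
{
    double p = answer switch
    {
        "Absolutely" => 0.9,
        "Mostly" => 0.6,
        "Rather not" => 0.3,
        _ => 0.7   // assumed: keep the initial prior for an unrecognized answer
    };
    return BernoulliF(Prob(p));
}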
Example Probabilistic C# (2)
private static Func<bool, bool, Prob>
TellJokeProb = (likesJoke, inGoodMood) =>
{
if (likesJoke && inGoodMood) return Prob(0.4);
if (likesJoke && !inGoodMood) return Prob(0.9);
return Prob(0.2);
};
public static FiniteDist<bool> TellJokeRate =
from like in LikesJoke
from mood in InAGoodMood
from joke in BernoulliF(TellJokeProb(like, mood))
select joke;
CPD (Conditional Probability Distribution) –
implementation of the JokeTellingRate:
InAGoodMood = BernoulliF(Prob(0.6));
LikesJoke = BernoulliF(Prob(0.7));
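For intuition (this calculation is not on the slide but follows directly from the numbers above), marginalizing the CPD over the two priors gives the overall rate at which the ECA tells jokes:

P(TellJoke) = 0.7·0.6·0.4 + 0.7·0.4·0.9 + 0.3·0.2 = 0.168 + 0.252 + 0.06 ≈ 0.48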
Model Update Trigger (MUT)
• Can track elapsed time (since ECA start) or 24h time
• Can track the user's responses
• Decides whether to re-sample the “Main Distribution”
Example MUTs:
• A MUT can track the user's response to "Did you watch a movie
yesterday?", update the probabilistic variable responsible for the
Movie Suggestions category and force a probabilistic re-sampling,
thus modifying the ECA’s behavior by suggesting more (or fewer) movies
• After 15 minutes we can increase the number of jokes and again
request a re-sampling
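A rough sketch of the second example, a time-based MUT (the class name and the ShouldResample contract are assumptions for illustration, not the actual KorraAI interface):

// Hypothetical sketch: after 15 minutes, shift probability mass toward jokes
// and ask the engine to re-sample the "Main Distribution".
using System;
using System.Collections.Generic;

public class IncreaseJokesAfter15Min
{
    private readonly DateTime start = DateTime.UtcNow;
    private bool fired;

    // Assumed to be polled periodically by the engine.
    public bool ShouldResample(Dictionary<string, double> mainDistribution)
    {
        if (fired || DateTime.UtcNow - start < TimeSpan.FromMinutes(15))
            return false;

        fired = true;
        mainDistribution["Tell Jokes"] += 0.1;   // illustrative shift
        mainDistribution["Small talk"] -= 0.1;   // keep the total at 1.0
        return true;                             // request a probabilistic re-sampling
    }
}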
Model Evaluate Trigger (MET)
• Can also track the user's responses
• Usually stores a probabilistic model such as a Bayesian network (BN)
• Decides whether to inject a new interaction
Example MET
• Encode a surprise – a Bayesian model evaluates the plausibility
(or POE, probability of observed evidence) of the user's response and,
if a contradiction is detected, supplies a reaction – an interaction that
goes first in the Interaction Queue
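A rough sketch of such a MET (the class name, the threshold value and the Evaluate contract are assumptions, not the actual KorraAI interface):

// Hypothetical sketch: react with surprise when the probability of the
// observed evidence (POE) under the stored Bayesian model is too low.
public class SurpriseTrigger
{
    private const double ContradictionThreshold = 0.05;  // assumed value

    // poe: probability of the user's answer under the Bayesian model.
    // Returns an interaction to inject at the front of the Interaction Queue, or null.
    public string Evaluate(double poe)
    {
        return poe < ContradictionThreshold
            ? "Really? That surprises me!"
            : null;
    }
}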
Execution statistics
Problem: constructing the Main Distribution for N interactions works, but it
won’t work for a fixed interval of time (1 hour, for example). Interactions
take time. We can estimate how much real time the Interaction
Queue will take to execute (FIT, Forecasted Interactions Time) and adjust
the Main Distribution accordingly.

FIT = Σ (Ai + Pi) · Ci over all categories i = 1..N
• Ai - average time per category (question + user's response)
• Pi – average time between interactions
• Ci – interactions per category
An ECA must run for some time in order to collect these timing
statistics from real executions
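A minimal numeric sketch of the FIT estimate (the per-category values below are made-up examples):

// Illustrative FIT computation: FIT = Σ (Ai + Pi) * Ci over all categories.
using System;

double[] A = { 8.0, 12.0, 6.0 };    // Ai: average interaction time per category (s)
double[] P = { 20.0, 20.0, 20.0 };  // Pi: average pause between interactions (s)
int[] C = { 10, 5, 8 };             // Ci: planned interactions per category

double fit = 0.0;
for (int i = 0; i < A.Length; i++)
    fit += (A[i] + P[i]) * C[i];

Console.WriteLine($"Forecasted Interaction Queue duration: {fit / 60.0:F1} minutes");  // ≈ 10.8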
Sales agent ECA
• Set Categories: “Getting to know the user”, “Small talk”, “Introducing the
domain of the product”
• Set the initial “Main Distribution” so that “Getting to know the user” and “Small
talk” have higher probabilities than “Introducing the domain of the
product”
• Set Triggers
• MUT – modify the “Main Distribution” to include more of the category “Introducing the
domain of the product” after 10-15 minutes
• MET – evaluate the user’s confidence in the ECA using a probabilistic model to decide
when to ask the user to buy a product (insert buy interaction in the Interaction Queue)
• Infer hidden states
• Instead of asking directly “Do you earn well?” we can infer the user’s purchasing power
based on other questions using a probabilistic model based on published statistics
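Putting the first two steps together, an illustrative initial “Main Distribution” for such a sales agent might look as follows (the numbers and the dictionary form are assumptions):

// Illustrative initial weights for the sales agent's "Main Distribution".
using System.Collections.Generic;

var mainDistribution = new Dictionary<string, double>
{
    ["Getting to know the user"] = 0.40,
    ["Small talk"] = 0.40,
    ["Introducing the domain of the product"] = 0.20,
};
// A time-based MUT later shifts mass toward "Introducing the domain of the product";
// a MET decides when to insert the "buy" interaction into the Interaction Queue.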
Resources
Links on Probabilistic Programming:
◦ https://www.youtube.com/watch?v=fclvsoaUI-U
◦ https://www.cs.cornell.edu/courses/cs4110/2016fa/lectures/lecture33.html
◦ https://www.youtube.com/watch?v=9SEIYh5BCjc&ab_channel=MITCBMM
Questions?
Editor's Notes
  • #2: Last updated July 2022.
• #4: For example, humans can get bored when predictions match outcomes in a computer game. Communication with other people can be considered either a "Need" or a "Periodic Activity". The Von Neumann architecture works with exact data and is the one used in every computer today. It is not designed (from a hardware point of view) to store and handle uncertain information, but this can still be done in software. There are many probabilistic frameworks available.
• #5: One needs very good NLP and semantic analysis to make a good ECA. Expectations are high because movies make AI look human-like in both reasoning and appearance. NLP with 80% performance is doable, but it is then a long way from 80% to 82%. Lojban is a carefully constructed spoken language. Its syntactic structure and the validity of a sentence are unambiguous. Lojban's grammar is based on simple rules, and its linguistic features are inspired by predicate logic.
• #7: Easy to install and execute – no need to install third-party runtimes such as Java, extra compilers, etc.
• #9: Also: currently only one programming language, C# (which helps development a lot). Probabilistic Programming and Bayesian networks are used for modelling.
  • #10: It can employ different strategies and behaviors over time.
• #12: KorraAI.dll is the plugin and can host several ECAs. We exchange models as source code or compiled into KorraAI.dll.
  • #13: April is the combination of an everyday companion and sales agent. April is closed source.
• #14: On the left we see the "engine" of KorraAI; on the right, components 7, 8, 9, 10, 11 and 12 represent the "model" provided by an ECA designer. The engine executes a loop defined by components 1, 2, 3 and 4. The "Main Distribution" controls the overall behavior and is stored in component 11. The engine ensures that the interactions are loaded from component 10, then uses the Main Distribution to produce the Interaction Queue (component 3).
• #16: The KorraAI engine consults the ECA model and executes accordingly. One needs to provide at least "Interaction Categories", "Interactions" and "Timing Distributions". - "Interaction Categories" can be "The ECA introduces herself/himself", "The ECA asks the user introduction questions", "Jokes", "Suggest movies to watch", "Motivate the user to do sport". - An "interaction" is, for example, "Ask the user his/her name" from the interaction category "The ECA asks the user introduction questions". An interaction can also be non-verbal. - Triggers are described later in the presentation. They track the user's responses and elapsed time and, depending on the type of trigger, may contain a probabilistic model; they can either update the "Main Distribution" (and thus change the ECA behavior) or add a new interaction to the "Interaction Queue". - KorraAI promotes the use of probabilistic variables and probabilistic models.
• #17: Non-deterministic program – the output is not the same every time the program is executed with the same input. - The "Main Distribution" sets which categories are used to define the principal behavior of the ECA. It also sets how much each Interaction Category should be used. For example, Category 1 can have probability 0.7 and Category 2 probability 0.3, so we will have many more interactions from Category 1. - Each category can also have a distribution. By default it is uniform. The idea is that certain interactions in a category should be used before others (depending on elapsed time, for example). For example, some types of mildly offensive jokes should be reserved for later in the communication, so we reduce the chance of using them in the beginning, although a mildly offensive joke at the start is still possible. - Time should not be fixed. Fixed time intervals let the user get accustomed to the ECA and know what to expect (and when). Timing distributions are, for example: "Distribution of the pause between two interactions", "When to smile", "Pause before reacting to the user's response", "How much time before the ECA gives up waiting for the user's response". Often a normal distribution is used, centered around some value in seconds, e.g. mean = 20 seconds and variance = 1.5.
• #18: Here three distributions are used to select which phrases the ECA will utter. 1) First a category is selected using a probabilistic sampler, for example "Suggest a Movie". 2) Then we sample over all the interactions inside the category. We select a movie, where some movies may have a higher likelihood of being selected – this can be based on the genre of the movie, for example. 3) The interaction itself contains code that generates phrases in real time. Step 3) is explained on the next slide.
• #19: The name of the movie is stored in the "movieName" variable. This is actual code from KorraAI, executed when an interaction from the category "Suggest a Movie" has been sampled (selected). The following methods can be used: - KorraModelHelper.GetChance – uniformly select an item from an array - KorraModelHelper.GetChance(3) – return true with probability 33% - KorraModelHelper.GetChance(4) – return true with probability 25%. The phrase is built dynamically by selecting a principal phrase with KorraModelHelper.GetChance(), and then there is a 33% chance to add "Do you feel bored? " if principal phrase 3 is selected.
• #20: There are two possibilities: 1) Use a pre-generated Interaction Queue. This is easier to test and can be inspected at any time; one can verify whether the generated interactions really follow the "Main Distribution" from which they were generated. This is the method used in KorraAI. We cannot generate the exact text for all interactions in advance, which is why we have placeholders such as "###place holder for InAGoodMood"; the exact text for the mood interaction is generated just before execution, based on the current mood of the ECA. 2) Supply each interaction (probabilistically sampled) just before it is needed. This method also has advantages: we do not need to track when to regenerate (re-sample) the "Main Distribution", which is needed when communicating with the user or over time, and we do not need placeholders for certain interactions.
  • #21: Both are probabilistic models.
• #22: There is a SpeechAdaptation module in KorraAI that handles some of the naturalization tasks. Speech: - At the beginning of the session a "Hello" interaction is inserted, which uses a uniform distribution over "Hi", "Hello", "Hello there", "Hey"; if this is not the first interaction, the word "again" is added. If the name of the user is available, it can also be included in the greeting. - When the ECA asks the user his name, it is better that the ECA also introduces her own name at that very moment. In general these are two independent interactions in the Interaction Queue and might not be sampled one after the other. - Sometimes we address people by name, but this should not happen all the time or it may become annoying (for some people). That is why a distribution specifies that addressing by name should happen in only 33% of cases and only for specific interaction categories. Non-verbal: - The "reaction time pause" follows a different distribution (producing a shorter interval) than the "new interaction pause". - When the ECA starts a new speech interaction, it stays focused on the user for a time interval drawn from a normal distribution (mean = 7 seconds, variance = 1.2). - Voice annotation: pauses, pitch, speed, intensity. SSML – Speech Synthesis Markup Language.
• #23: Normalization: the sum of the probabilities of all categories is 1. After some categories are depleted, music suggestions might, for example, increase from 10 per hour to 18. This can be unwanted behavior – the user might not like too many music suggestions – so we are no longer following the desired behavior. The best solution is to keep the proportions for some categories and let the others compensate after normalization. This must be set in advance by the ECA designer.
• #24: PP = Probabilistic Programming. BUGS is a software package for performing Bayesian inference using Gibbs sampling.
• #25: All three – JavaScript, Scala and C# – have elements from functional languages; C#'s LINQ is declarative functional programming. WebPPL uses JavaScript as its host language, Figaro uses Scala, and Infer.NET uses C#. Exact inference is possible for small probabilistic models; otherwise it might take a lot of time. Figaro and Infer.NET are very similar – the probabilistic part is provided as a library instead of language extensions. Book for Figaro: "Practical Probabilistic Programming": https://www.manning.com/books/practical-probabilistic-programming
• #26: We set the initial preferences of the user in the form of the probabilistic variables InAGoodMood and LikesJoke. They can be updated by using a question that maps predefined answers to probabilities. Here we are able to incorporate an answer with a variable level of confidence into the "Main Distribution" for the "Tell Jokes" category and thus modify the ECA behavior.
• #27: The syntax here is similar to LINQ in C# and to SQL in general, and it is actually rather intuitive. We define the case when the user likes jokes in general but is not in a good mood to have probability 0.9, because we want to cheer her/him up. Another case is 0.4, and all the others have probability 0.2. Next we define a sampling method that intuitively does the following: it takes samples from LikesJoke and InAGoodMood and applies probabilistic reasoning using the function we defined previously, TellJokeProb. After that, TellJokeRate is used in the "Main Distribution" for the category "Tell Jokes".
• #28: 24h time means: is it morning, time to eat, time to sleep, time to rest, etc. Another example of a time-based MUT is to update the "Main Distribution" so that movie suggestions are increased in the evening (after 19h); we know the user is occupied during the day, so we should not tempt him/her with many movie suggestions then.
• #29: POE – probability of observed evidence. A contradiction score is calculated, and then we need to set a threshold – the value at which the evidence for a contradiction is sufficient to trigger a response in the form of surprise.
• #30: Forecasted Interactions Time (FIT). When constructing the "Main Distribution" we specify how much each category should be represented. This works when we sample, for example, 100 interactions. But in order to respect the Main Distribution over a specific time interval – for example 1 hour – it is necessary to know how much time each interaction takes. Pi is controlled by a distribution, so we already know its mean value. Ci is also known.
  • #31: The hidden states can be used in probabilistic models. For example a good education is a factor that usually means that the user will have a higher purchasing power.