Context-dependent Incremental Intention Recognition ...

7 downloads 336 Views 989KB Size Report
Jul 14, 2011 - logo. Introduction. Models & Algorithms. Results & Conclusions ... Bayesian Network for Intention Recognition (IRBN). Causes/Reasons. C-2.
Introduction Models & Algorithms Results & Conclusions

Context-dependent Incremental Intention Recognition through Bayesian Network Model Construction Han The Anh ([email protected]) Luís Moniz Pereira ([email protected]) /UNL

UAI "Bayesian Modelling Applications Workshop" Barcelona, Spain, 14 July, 2011 logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Intention Recognition

Outline 1

Introduction Intention Recognition

2

Models & Algorithms Bayesian Network Construction Operators for Handling Bayesian Networks for Intention Recognition (IRBNs) Relations Amongst Intentions

3

Results & Conclusions Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Intention Recognition

Why Intention Recognition?

Intention recognition (IR): inferring intentions of a single agent (individual) or a group of agents (collective) based on their observed actions. Why IR is necessary? Acting on environment, agent may have to deal with other agents Ease interactions, improve cooperation and coordination, especially when communication is limited. Defend from potential hostile behaviors, plan in advance to take advantage. logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Intention Recognition

Intention recognition systems: Typical Components Sequence of Observed Actions

A1 A2 A3 ....... Observed Agent

Model

-Bayesian KB -Markov KB ....

Set of Intentions Plan Library Plan Corpus Action Theory

Recognized Intentions

With Rankings (Probabilistic) Without Rankings (Non-probabilistic) logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Intention Recognition

System Description Preprocessing Bayesian Network Fragments

Plan Corpus Experts' Knowledge

I

A

Unit Fragments Operators

A1 A2 A3 ....... Bayesian Network for Intention Recognition

Incremental Prediction

-Probability Inference -Simplification

-Selection -Combination -Remove ....

IRBNs logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Bayesian Network Construction Operators for Handling IRBNs Relations Amongst Intentions

Bayesian Network for Intention Recognition (IRBN) Causes/Reasons P(C1)

C-1

Subject to Changes

Actions Intentions

I-1

C-2

. . . .

. . .

A-1

P(A1|I1,IM) . . . .

I-M

A-P C-N

CPD table for each node X P(X|parents(X)) T.A.Han, L.M.Pereira

IR: Compute P(I-i|obs) i = 1,...,M

logo

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Bayesian Network Construction Operators for Handling IRBNs Relations Amongst Intentions

Unit Fragments → Unit IRBN

Figure: Combine context-dependently selected unit fragments of a new action A with NOISY-OR. I1

. . . .

IN

A

A

I1

. . . .

A

IN

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Bayesian Network Construction Operators for Handling IRBNs Relations Amongst Intentions

Integrate into the current model ... When a new action A3 arises, integrate it into the current model

Current IRBN

Unit IRBN for New Action

C-1

C-1 I-1

I-1 A-1 C-2

C-2

. . . .

A-3 I-2

I-2 A-2

. . . .

I-4

I-3 C-M

C-N

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Bayesian Network Construction Operators for Handling IRBNs Relations Amongst Intentions

Remove Irrelevant Intentions If some intentions are found out to be irrelevant, e.g. when their probability is very small, they are removed from the model.

Current IRBN

After Removing I-3

C-1 C-1

I-1

A-1

A-1 I-1 C-2

Remove I-3

I-2

P(A|I1,I2,I3=F)

A-2

I-2

C-3

A-2 I-3

C-2

C-4

Very Unlikely

T.A.Han, L.M.Pereira

logo

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Bayesian Network Construction Operators for Handling IRBNs Relations Amongst Intentions

Incremental Intention Recognition Algorithm

Repeat until one intention remains or time limit is reached. 1

If new actions are observed: combine unit IRBNs for them with the current IRBN.

2

Compute conditional probability of each intention given current observed actions. Remove irrelevant intentions.

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Bayesian Network Construction Operators for Handling IRBNs Relations Amongst Intentions

Relations Amongst Intentions: multiple intentions

In case agent pursues multiple intentions simultaneously, intentions may support or exclude each other. Two mutually exclusive intentions cannot be parents of an action: P(I1 = T , I2 = T ) = 0 → P(A|I1 , I2 , ...) is undefined. Need to combine mutually exclusive intentions in a single node.

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Bayesian Network Construction Operators for Handling IRBNs Relations Amongst Intentions

Single Intention Case

In case agent pursues a single intention (Linux Plan corpus), all intentions are mutually exclusive. They are combined in a single node. A1 , ..., Am : current observed actions. Then, Q P(Ij ) m P(A |I ) i=1 P Qm i j P(I = Ij |A1 , ..., Am ) = n P(I ) j j=1 i=1 P(Ai |Ij ) The recognizer has linear complexity on the number of intentions being modeled. logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Evaluation Metrics Seq = a1 , ..., an : sequence of actions achieving intention I . N-best prediction: correct(A) = 1 if I is one of N most likely intentions, and 0 otherwise. n X precision(Seq) = ( correct(ai ))/z i=1 n X recall (Seq) = ( correct(ai ))/Z i=1

convergence(Seq) = (z − t + 1)/z z: #predictions; Z : #prediction opportunities; t: smallest number from that on correctly predicts. T.A.Han, L.M.Pereira

logo

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Linux Plan Corpus

One of rare, often used, plan corpora available for evaluating intention/plan recognizers. goals = tasks in Linux (e.g., find a file, copy some files); actions = Linux commands (e.g., find, cp, cd, ls). 19 goals; 43 actions (commands); 547 sessions. Collected from Linux users.

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Intention Recognition Results on Linux Plan Corpus τ – confidence level: make prediction only when probability ≥ τ . 0.8 0.8

Precision

Critical Values decreasing

0.6

1�best

0.5

Convergence

0.7 0.7

0.6

Critical Values decreasing

1�best

0.5

2�best 3�best

0.4

2�best 0.4

3�best

4�best

4�best 0.3

0.3 0.0

0.2

0.4

0.6

0.8

Τ

0.0

0.2

0.4

0.6

0.8

Τ

Figure: Precision and Convergence for τ ∈ [0, 1]; N = 1, 2, 3, 4. logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

We obtained better results than existent corpus-based intention recognizers. N-best τ Precision Recall Converg.

1-best 0.95 0.786 0.308 0.722

2-best 0.5 0.847 0.469 0.799

3-best 0.45 0.870 0.518 0.822

4-best 0.42 0.883 0.612 0.824

Table: Intention Recognition Results on the Linux Plan Corpus

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Iterated Prisoner’s Dilemma Corpus (IPD)

Prisoner’s Dilemma is a symmetric two-player non-zero game defined by the payoff matrix

C D



C R, R T,S

D  S, T P, P

Intentions to be recognized are strategies in IPD: TFT, WSLS, GTFT, GRIM, FBF, etc. logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

1.0

1.0

0.9

0.9

0.8

0.8

0.7 0.6

1�best

0.5

2�best

0.4

3�best

0.3

Convergence

Precision

Intention Recognition Results on IPD Plan Corpus

0.7 0.6

1�best

0.5

2�best

0.4

3�best

0.3 0.0

0.2

0.4

0.6

0.8

Τ

0.0

0.2

0.4

0.6

0.8

Τ

Figure: Precision and Convergence for τ ∈ [0, 1]; N = 1, 2, 3 logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Social Learning Corpus (IRCHANGE)

Agents may change initial intention, by learning from more successful agents Imitation Event (IE)

A-1

Intention (I) Success Difference (SD)

. . . . A-m

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Results on IRCHANGE corpus Significant improvement of recognition performance when the more contextual information is observable. 0.8

� � � �

� � � � � � � � � � � � � � � �

precision Precision

N=2

� � � � � � � � � � � � � � � � 0.7 � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � � 0.6 � � � � � � � � � � � � � � � � 0.5 � � � � � � � � � � � � � � � � � � � � � � � � � � � �

N=1

0.4

Strategy

Successes

No Info

0.3 0.0

0.2

0.4

0.6

0.8

Τ

τ

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Summary

Incremental IR model via incrementally constructing a Bayesian network model. Evaluated on Linux Plan corpus, obtaining better results than existent systems. Present a new benchmark for intention/plan recognition based on social dilemmas (Prisoner’s Dilemma, Stag hunt,...). Show significant improvement by including more contextual information. logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Future Work

Create plan corpora to evaluate multiple-intention recognition. Apply to explain the role of intention recognition for the evolution of cooperation (see our papers in refs).

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Thank you!

QUESTIONS?

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

IPD Plan Corpus Intentions are famous strategies in IPD: TFT, WSLS, GTFT, GRIM, FBF. There are infinite number of strategies. Corpus actions: s1 ...sM ξ, where si ∈ {E , R, T , S, P} – states of the M last interactions; ξ ∈ {C , D} – current move. Σ1 = {EC , RC , TC , SC , PC , ED, RD, TD, SD, PD} A plan session of TFT: [EC , RC , SD, PD, TD] 0

1

TFT : −

C

C

D

D

D

X: −

C

D

D

C

D

TFT-states : E

R

S

P

T

P

round :

2

3

4

5

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Introduction Models & Algorithms Results & Conclusions

Evaluation Metrics Linux Plan Corpus Prisoner’s Dilemma Corpus

Generating Training and Testing Datasets

7 strategies: AllC, AllD, TFT, GTFT, WSLS, GRIM and FBF. Training corpus is generated by playing with each strategy all the possible combinations 10 times (r = 5, . . . , 10). The testing dataset is generated by playing a random choice with each strategy in each round (r = 5, . . . , 10).

logo

T.A.Han, L.M.Pereira

Context-dependent Incremental Intention Recognition

Suggest Documents