Bayes linear kinematics in the design of experiments: the user ...

Introduction BLKs in the user experiment Conclusions and further work

Bayes linear kinematics in the design of experiments: the user experiment Kevin Wilson Newcastle University Supervised by Dr Malcolm Farrow and Dr Tom Nye

July 5, 2011

Kevin Wilson

BLK DoE


1

Introduction Design of Experiments The user experiment

2

BLKs in the user experiment Analysis via BLKs Utility functions Optimal solution

3

Conclusions and further work Conclusions Further work

Kevin Wilson

BLK DoE


Design of Experiments The user experiment

The choice of the design of an experiment can be viewed as a decision problem. There are trade offs between the costs of performing the experiment and the benefits derived from it. The benefits can be thought of in terms of gains in knowledge; either increase in information or some terminal payoff. The optimal design of an experiment is the best possible choice of design, found in a decision theoretic way.

Kevin Wilson

BLK DoE



- P 1 6 θ

d1

? - X

- d2

Figure: The process of Bayesian experimental design Kevin Wilson

BLK DoE



The solution to the experimental design problem is the design which maximises the expected utility. In order to do this we specify a utility function. Utility function A utility function U is a means of expressing a decision maker’s preferences over the decision space.

Kevin Wilson

BLK DoE



If we have initial decision d1 , terminal decision d2 , unknown parameters θ, observed data x, and utility function U(d1 , d2 , θ, x) then for a design d1 the expected utility of the best decision is given by Z Z U(d1 ) = max U(d1 , d2 , θ, x)f (θ|x, d1 )f (x|d1 )dθdx, X d2 ∈D2

Θ

Optimal design The optimal design d1∗ maximises this: Z Z U(d1∗ ) = max max U(d1 , d2 , θ, x)f (θ|x, d1 )f (x|d1 )dθdx. d1 ∈D1

X d2 ∈D2

Θ

Kevin Wilson

BLK DoE



Thus the theory of Bayesian experimental design is ‘fairly’ straightforward. However, a full Bayesian analysis tends to be computationally difficult as only the simplest models allow integrals such as these to be solved analytically. Except in special cases neither the maximisation nor the the integration can be solved analytically and approximation and/or simulation based methods are needed. This means that solution of these problems is typically very computationally intensive and can quickly become infeasible.

Kevin Wilson

BLK DoE



Before software goes on sale or a website goes live testing is undertaken. One important aspect of this is to see whether the product is user-friendly. This is generally done by taking a sample of users and asking each to perform a number of tasks. From the results of these tasks usability problems with the software are identified and the software can either be launched as it is or rewritten. Kevin Wilson

BLK DoE



Sample size n

outcome

X=x Terminal decision

launch

rewrite Kevin Wilson

BLK DoE


Analysis via BLKs Utility functions Optimal solution

If n users are asked to perform a task then the number who successfully complete that task, X , follows a binomial distribution X | θl ∼ bin(n, θl ), We shall make an exchangeability assumption; that users in the experiment and users who buy the product in the future are exchangeable. We are also interested in a second probability, θr . The two probabilities can be given marginal beta distributions θl ∼ beta(al , bl ), θr ∼ beta(ar , br ).

Kevin Wilson

BLK DoE



θl and θr will not in general be independent in our prior beliefs. First we transform; θl θr ηl = log , ηr = log , 1 − θl 1 − θr and then link ηl and ηr in a Bayes linear structure. Prior beliefs can be converted via E0 (θk ) =

ak ⇒ E0 (ηk ) = ψ(ak ) − ψ(bk ) ak + bk

and Var0 (θk ) =

ak b k ⇒ Var0 (ηk ) = ψ1 (ak ) + ψ1 (bk ) (ak + bk )2 (ak + bk + 1)

for k = l, r . We also need Cov0 (ηl , ηr ). Kevin Wilson

BLK DoE



Now, when X = x successes are observed θl is updated to θl | X = x ∼ beta(al∗ , bl∗ ), where al∗ = al + x and bl∗ = bl + n − x. This gives E(ηl | X = x) and Var(ηl | X = x). These updates can be propagated through to ηr via; E(ηr | X = x)

=

Var(ηr | X = x)

=

Cov0 (ηr , ηl ) [E(ηl | X = x) − E0 (ηl )] , Var0 (ηl ) Cov20 (ηr , ηl ) Var(ηl | X = x) Var0 (ηr ) − 1− . Var0 (ηl ) Var0 (ηl )

E0 (ηr ) +

The parameters of the posterior marginal beta distribution for θr can then be found by solving for ar∗ and br∗ ; E(ηr | X = x) = ψ(ar∗ ) − ψ(br∗ ),

Kevin Wilson

Var(ηr | X = x) = ψ1 (ar∗ ) + ψ1 (br∗ ).

BLK DoE



We shall also define Y , the number of customers who complete the task after the software has been released. Y ∼ bin(N, θk | X = x). The optimal terminal decision is E[U(topt | X = x)] = max{E[U(tl | X = x)], E[U(tr | X = x)]},

E[U(tk | X = x)] =

N Z 1Z 1 X y =0

0

0

U(sn , tk , Y , Ck )f (Y = y | X = x, θk )f (θl , θr | X = x)dθl dθr ,

for k = l, r . Then the optimal sample size is E[U(sopt )] = max{E[U(sn )]}, n ∈ N, where E[U(sn )] =

n X

f (X )E[U(topt | X = x)].

x=0 Kevin Wilson

BLK DoE

(1)



The probability that X = x is given by Z 1 f (X ) = 0

f (X | θl )f0 (θl )dθl =

n x

Γ(al + bl ) Γ(al + bl + n)

Γ(al + x) Γ(bl + n − x) Γ(al )

Γ(bl )

Having made the update X = x the overall utility function U(sn , tk , Y , Ck ) depends only upon the chosen θ. Therefore one of the marginal adjusted beta distributions will always be sufficient to find the expected utilities. Thus E[U(tk | X = x)] =

N Z 1 X y =0

0

U(sn , tk , Y , Ck )f (Y = y | X = x, θk )f (θk | X = x)dθk ,

for k = l, r . Then

E[U(tk | X = x)]

=

N Z 1 X Γ(ak∗ + bk∗ ) ak∗ −1 N b ∗ −1 y N−y θk (1 − θk ) × θ (1 − θk ) k dθk U() ∗ )Γ(b ∗ ) k y Γ(a 0 y =0 k k

=

N X Γ(ak∗ + bk∗ ) Γ(ak∗ + y ) Γ(bk∗ + N − y ) N U(sn , tk , Y , Ck ). ∗ ∗ y Γ(ak + bk + N) Γ(ak∗ ) Γ(bk∗ ) y =0

Kevin Wilson

BLK DoE



Design

Benefits

Costs

Experiment

No. successes

Per subject

Fixed Kevin Wilson

Rewrite

BLK DoE


0.0

0.2

0.4

Utility

0.6

0.8

1.0


0

20

40

60

80

100

Number of successes

The utility for the number of successes in N = 100 future customers in the task is given by y 1 − exp(− ) 10 . Us (Y ) = y 1 + 100 exp(− ) 10 Kevin Wilson

BLK DoE


0.0

0.2

0.4

Utility

0.6

0.8

1.0


0

50000

100000

150000

200000

Cost (£)

The utility for cost shall be 2Ck Uf (C ) = 1 − k log 1 + . Cmax Kevin Wilson

BLK DoE



If we assume mutual utility independence we can use a binary node U(sn , tk , Y , Ck ) = p1 Uf (C ) + p2 Us (Y ) + p3 Uf (C )Us (Y ). Suppose costs are Cmax = 200, 000, Cw = 50, 000, Co = 5, 000 and Cu = 500. Thus Cl = Co + Cu n = 5000 + 500n, and if the product is rewritten Cr = Co + Cu n + Cw = 55000 + 500n. We take p1 = 16 , p2 =

4 6

and p3 =

Kevin Wilson

1 6

BLK DoE

and, initially, N = 100.


●

0.815

● ●

●

●

●

●●


●● ●●

●

●

●

●

●

● ● ● ● ●

●

●

● ●

0.805

● ● ● ● ● ●

●

●

0.800

Expected Utility

0.810

●

●

● ● ●

●

● ●

0.795

● ● ● ● ● ●

0.790

● ● ●

●

0

10

20

30

40

50

Sample Size

Figure: A plot of E [U(sn )], the expected utility for each sample size

Kevin Wilson

BLK DoE


0.2


● ● ● ●

0.0

●

●

−0.2

Difference

●

●

−0.4

●

●

● ●

0

2

4

6

8

10

Number of Successes

Figure: The difference between the expected utilities of launch and rewrite with 11 users in the experiment Kevin Wilson

BLK DoE


Conclusions Further work

Bayes linear kinematics reduces the computational burdens when solving problems in Bayesian experimental design. Issues of commutativity that are associated with Bayes linear kinematics were not a problem as a single update was made. Breaking down utilities into mutually independent utility hierarchies can aid in the specification of an overall utility function.

Kevin Wilson

BLK DoE



Extension of this approach to design of experiments to multiple updates. Discounting of future years in benefit utility. Defining a Bayes linear kinematic benefit utility in terms of information gain. Valencia poster!

Kevin Wilson

BLK DoE


Kevin Wilson


BLK DoE

Bayes linear kinematics in the design of experiments: the user ...

Bayes linear kinematics in the design of experiments: the user ...

Suggest Documents

Bayes prediction in the linear model with spherically - ucsb pstat

Empirical Bayes Estimation in Multiple Linear

Proceedings of the Conference on the Design of Experiments in ...

Empirical Bayes Moderation of Asymptotically Linear Parameters

Empirical Bayes Moderation of Asymptotically Linear Parameters

Empirical Bayes Moderation of Asymptotically Linear Parameters

A Second Report on the User Experiments in the ...

Empirical Bayes Moderation of Asymptotically Linear Parameters

The design of laboratory experiments in the 1980's

REACTIVE TRANSPORT EXPERIMENTS OF LINEAR ...

Design of Experiments: Optimizing the Polycarboxylation ... - MDPI

Exploring the Use of Design of Experiments in Industrial Processes ...

Design of Experiments - Springer

Design of Experiments

Design of Experiments

Design of Experiments - minitab.co.kr

Design of Experiments

Design of Experiments Guide.

Design of Experiments - JMP.com

Design of Experiments

APOGEE Kinematics I: Overview of the Kinematics of the Galactic ...

The Arctic Journey: Design Experiments in the North - Springer Link

bayes linear networks and nonlinearities - CiteSeerX

Using experiments design to model linear firing ...