Exact Lifted Inference with Distinct Soft Evidence ... - Semantic Scholar

1 downloads 215 Views 373KB Size Report
Exact Lifted Inference with. Distinct Soft Evidence on Every. Object. Hung Hai Bui, Tuyen N. Huynh, Rodrigo de Salvo Bra
AAAI 2012

Exact Lifted Inference with Distinct Soft Evidence on Every Object Hung Hai Bui, Tuyen N. Huynh, Rodrigo de Salvo Braz Artificial Intelligence Center SRI International Menlo Park, CA, USA

July 26, 2012

1/18

Outline

1 Outline

2 Distinct Soft Evidence is Problematic

3 LIDE (Lifted Inference with Distinct Evidence)

4 Experiments

AAAI 2012

2/18

AAAI 2012

Lifted Inference and the Problematic Soft Evidence • The main idea of lifted inference is to exploit symmetry of the

probabilistic models. This leads to algorithms that can be very efficient on high-tree width, but symmetric models • Soft evidence at the level of every object destroys the model’s

symmetry • Everyone has different weight, cholesterol level, etc

Symmetric   Symmetry  destroyed  

• Aim: lifted inference with distinct soft evidence on every object 3/18

AAAI 2012

Distinct Soft Evidence on a Unary Predicate • The simplest form of distinct soft evidence: on every

grounding of a single unary predicate • Consider an MLN M consists of • An MLN M0 with a unary predicate q. • A set of soft evidence of the form wi : q(i) for every object i.

Evidence

M0 1.4

:

¬Smokes(x)

w1

:

Cancer (P1 )

2.3

:

¬Cancer (x)

w2

:

Cancer (P2 )

4.6

:

¬Friends(x, y )

1.5

:

Smokes(x) ⇒ Cancer (x)

w1000

:

Cancer (P1000 )

1.1

:

Smokes(x) ∧ Friends(x, y )

...

⇒ Smokes(y )

(tree-width = 1000) 4/18

AAAI 2012

LIDE (Lifted Inference with Distinct Evidence)

• Most lifted inference methods applied to M would completely

shatter the model, thus reverting to ground inference. • LIDE’s approach 1 2

Perform lifted inference on M0 only Use special operations to absorb the soft evidences • Instead of exploiting symmetry of the model, we exploit symmetry of the partition function

5/18

Symmetric Function

Definition A n-variable function F (t1 , . . . , tn ) is symmetric if for all permutation π, permuting the variables of F by π does not change the output value, that is, F (t1 , . . . , tn ) = F (tπ(1) . . . , tπ(n) ).

AAAI 2012

• F depends only on the histogram of its arguments. • If ti ∈ {0, 1}, the set {ck }, k = 0, . . . , n, where ck = F (t) for

any t such that ktk1 = k is termed the counting representation of the symmetric function F . • An exchangable distribution is a symmetric function, so it

admits a counting representation.

6/18

Exchangeability of Groundings of a Unary Predicate Theorem Let D∗ = {d1 , . . . , dn } be the set of individuals that do not appear as constants in the MLN M0 and q be a unary predicate in M0 . Let P0 (.) = Pr(q(d1 ) . . . q(dn ) | M0 ). Then, the random vector (q(d1 ) . . . q(dn )) is exchangeable under P0 .

AAAI 2012

• Proof is in the paper. • This seems trivial: d1 , . . . dn do not appear in M0 so they are

“indistinguishable”. But beware, “indistinguishable” does not necessarily imply exchangeable: groundings of an n-ary predicate in general are NOT exchangeable when n > 1.

7/18

AAAI 2012

LIDE as a Wrapper

1

Step 1: apply any applicable lifted inference technique on M0 to compute the counting representation {ck } of P0 (). • One natural method is counting elimination.

2

Step 2: Absorb the soft evidence • Equivalent to compute the posterior of a set of exchangable

binary random variables n

P(q1 , . . . , qn ) =

Y 1 P0 (q1 . . . qn ) φi (qi ) Z i=1

where qi = q(di )

8/18

Posterior of Exchangeable Binary RVs

n

Pr (q1 , . . . , qn ) =

Y 1 P0 (q1 . . . qn ) φi (qi ) Z i=1

We discuss three related problems, to compute • The MAP configuration q under the marginal Pr(q) (a.k.a the

marginal-map problem) • The partition function Z • The marginal Pr(qi ) for each individual di

AAAI 2012

9/18

MAP Inference Let αi =

AAAI 2012

φi (1) φi (0) ,

Φ=

Q

φi (0). Then n

P(q) =

Y q Φ P0 (q1 . . . qn ) αi i Z i=1

max P(q) = q

n Y Φ max ck max αiqi Z k q:kqk1 =k i=1

• Observation: the 2nd maximization simply picks k largest

elements of α. • By sorting the vector α, the MAP problem can be solved in

O(n log(n)) given {ck } as input.

10/18

AAAI 2012

Partition Function Z

Z (α1 , . . . , αn ) = Φ

X

P0 (q1 , . . . , qn )

q1 ...qn

n Y

αiqi

i=1

• Observation: Z is a polynomial in α. More importantly Z is a

symmetric polynomial. • According to the fundamental theorem of symmetric

polynomials, it can be expressed in terms of a small number of building units called elementary symmetric polynomials.

Z (α) = Φ

n X

ck ek (α)

k=0

11/18

Elementary Symmetric Polynomials • ek (α) is the k-th order elementary symmetric polynomial in α,

the sum of all products of distinct k elements of α X ek (α) = αi1 . . . αik 1≤i1

Suggest Documents