A user browsing model to predict search engine click ...

0 downloads 0 Views 2MB Size Report
Simple Click Models where E and R are independent. More precisely, we can represent the click-through rate as: P(Ci = 1) = P(Ei = 1). ︸ ︷︷ ︸ position bias.
Click Models & Diversified Search Aleksandr Chuklin Pavel Serdyukov Maarten de Rijke

Click Models Many tasks that leverage web search users’ implicit feedback rely on a proper and unbiased interpretation of user clicks. “Efficient Multiple-Click Models in Web Search”, Guo et al.

Why Predicting Clicks?

•  Simulate user behavior when real clicks are not available (e.g. Hofmann et al. WSDM’13) •  Estimate relevance of the documents (e.g. Chapelle et al. WWW’09) •  Understanding user behavior and click biases (e.g. Dupret et al. SIGIR’08)

Clicks and Relevance

Can we say that the click probability is determined by the relevance of the document?

P (Ri = 1) = rd . ( algorithm for model inference. Cascade Hypothesis assumes that examinati the us Next, we introduce two[7] important hypothesis: Simple Click hypothesis andbottom cascade hypothesis, which areresult the foundatio linearly Models from top to of the search pag of most existing click models. documentExamination is examined only if all the Bing SERP pageashowHypothesis ([20,previous 7]) assumes docum that a do ts at different slots. examined ument is clicked and only if it is is examined and relevan and the firstifdocument always examine i

which can be formulated as:

udy the user behavior in federP (E = 1 | E = 0) = 0 i+1 i C = 1 ⇐⇒ E = 1, R = 1 ( i i i d an observation that user bes highly different from that in where Ei and Ri are independent. we c P (E1 = 1)More = precisely, 1 better characterize user behavrepresent the click-through rate as: propose a novel Bayesian model Based the cascade P (Ci = 1) = P (E P (Ci model = 1|Ei = [7] 1) co( i = 1) el (FCM), which introduces two on the hypothesis, ! "# $ ! "# $ a user will continue the examination the fir ure the distinctive userthat behavposition bias documentuntil relavance first illustrates that users tend and then she abandons the whole search session: results and the visual attention where P (Ri = 1) = P (Ci = 1 | Ei = 1) indicates t xamination probability of other probability of click after examination. [20] assumes that t P (C =on rposition i = 1 | E i = 1) di otivates us to reconsider the exexamination probability depends solely i, a h document in federated search thus P (Ci = 1) = λi rdi , in which λi models the positi P (E = 1 | E = 1, C ) = 1 − C i+1 i i i del beyond the cascade hypothbias.

•  Examination model:

•  Cascade model:

The dependent click model (DCM) [14] generalizes cade model to allow multiple clicks: P (Ei+1 = 1 | Ei = 1, Ci = 1) = λi .

Probabilistic Graphical Models

“Efficient Multiple-Click Models in Web Search”, Guo et al. WSDM’09

Diversified Result Page

Diversified Result Page (Fresh Block)

Challenges for Click Prediction

•  Some result items may be more attractive than others •  Attractive results might appear in different positions (not necessarily top/bottom) •  Different users may have different intents

Solution: Explicitly Add Intent into Model

Fig. 2: The graphical model for UBM-IA. Gray squares correspond to observed va ables, blue circles to hidden variables. Arrows show dependency links.

The aim our work is that not towe studyhave how toafind intents corresponding to the que 1. ofAssume prior intent nstead, given that we know the query intent spectrum, we aim to investigate the eff distribution P(I) of this distribution on the users’ click-through behavior. So we assume that for ea 4 session2.  we Recalculate have a prior distribution of the intents P (I = i). Importantly, unlike click probability using Bayes rule: et al. [17] we do not assume that our intent distribution is fixed for the session. Wh predicting the next click, we modify the intent distribution P (I) using Bayes’ rule: X P (Ck |C1 , . . . , Ck 1 ) = P (Ck |C1 , . . . , Ck 1 , I) · P (I|C1 , . . . , Ck 1 ) | {z } | {z } I

probability from single intent model

posterior intent distribution

Dupret and Piwowarski [13] find that the single browsing model outperforms a mixt of browsing models when inferring intent distribution from clicks. We show that ng layout information and prior knowledge of intent distribution, we can significan outperform the single browsing model.

UBM click model •  C – click •  E – examination

Ek γ kd

Ek indicates whether the user looked at the document at rank k (hidden variables C C C Ck indicates whether the user clicked on the k-th document (observed variables). ... 1

k −1

k

a uI

rder to define a click model we need to denote dependencies between these var s. For example, for the UBM model we define k

P (Ek = 1 | C1 , . . . , Ck

1)

Ek = 0 ) C k = 0

=

P (Ck = 1 | Ek = 1) = auk ,

kd

(1

(2

(3

re kd is a function of two integer parameters: the current position k and the distanc UBM model by Dupret et al. SIGIR’08 e rank of previous click d = k P revClick = k max{j | 0  j < k & Cj = 1

Intent-Aware Version: UBM-IA Gk Ek

C1

...

Ck

C k −1 γ kd (G k , I )

a uI I

k

•  C – click •  G – document type (“layout”): fresh or web •  E – examination •  I – user intent: fresh or web

Based on UBM model by Dupret et al. SIGIR’08

Experiments

Experimental Setup

•  Result pages with fresh results sampled from Yandex query log in July 2012 •  30 days, ~15*106 sessions, ~3*106 queries •  Train click model on a day j, test on a day j + 1 •  Report average perplexity of predicting clicks

Table 1: Average perplexity gain for the combined UBM-IA model. Model Average Perplexity Gain Results

Confidence Interval (Bootstrap)

Table Table 1: 1: Average Average perplexity perplexity gain gain for for the the combined combined UBM-IA UBM-IA model. model. M-IA vs. UBM % gain for the combined [1.25%, 1.43%] Table 1: Average1.34 perplexity UBM-IA model. Model Average Model Average Perplexity Perplexity Gain Gain Confidence Confidence Interval Interval (Bootstrap) (Bootstrap) Model

UBM-IA vs. UBM UBM-IA vs. UBM UBM-IA vs. UBM

Average Perplexity Gain

1.34 % 1.34 % 1.34 %

Confidence Interval (Bootstrap)

[1.25%, 1.43%] [1.25%, 1.43%] [1.25%, 1.43%]

1.5 %

1.5 % 1.51.5 %%

1.1 1.5 % 1.5 % 1.5 %

%

1.1 % 1.11.1 %%

1.1 % % 1.1 % 1.1 0.7

%

0.7 % 0.70.7 %%

0.7 % 0.7 0.7 % %

0.3 % 0.30.3 %%

0.3 % 0.3 0.3 % %

-0.1 -0.1 % combined -0.1 %%

0.3 %

-0.1 %

layout

combined combined combined

intents only

layout layout layout

intents intentsonly only

intents only

(a) Fresh results withresults special (a) Fresh withsnippets special snippets

(a) (a) Fresh Fresh results results with with special special snippets snippets

combined

combined combined

combined

layout

%-0.1 intents-0.1 only -0.1 % %

layout only only layout intents intents

layout

intents only

(b)(b)Fresh results with ordinary Fresh results with ordinary snippetssnippets (b) (b) Fresh Fresh results results with with ordinary ordinary snippets snippets

Fig. 3: Perplexity gains for layout and intent models models compared to UBM. ig. 3: Perplexity gains for layout and intent compared UBM. Fig. UBM. Fig. 3: 3: Perplexity Perplexity gains gains for for layout layout and and intent intent models models compared compared to toto UBM.

then test these ideas separately and see what their contribution is. We call the resulting then test these ideas and see their contribution is. We the resulting then test these ideas separately separately andwhat see what what their contribution is. (5), We call thethe resulting click models UBM-layout andsee UBM-intents; they are defined using (9), (10) andresu these ideas separately and their contribution is. Wecall call

Results: Perplexity for Different Positions

5%

combined layout only intents only

4% 3% 2% 1% 0% -1 % 1

2

3

4

5

6

7

8

9

10

Comparison to Other Models 1.5 % 1.2 % 0.9 % 0.6 % 0.3 % 0% UBM-IA vs. UBM

EB_UBM vs. UBM

EB_UBM-IA vs. UBM

EB_UBM-IA vs. EB_UBM

•  UBM – model that does not consider vertical results •  UBM-IA – current work •  EB_UBM – exploration bias model by Chen et al. (WSDM’12)

Conclusion

Why Predicting Clicks (reminder)

•  Simulate user behavior when real clicks are not available •  Estimate relevance of the documents •  Understanding user behavior and click biases

Intent-Aware Click Models

•  Any existing click model can be modified to use intent and layout information for better click prediction •  The model allows to infer per-topic relevance from the click logs •  Per-topic analysis of user behavior and click biases (e.g. user patience)

Code available at

github.com/varepsilon/clickmodels

References 1.  Hofmann, K. et al. Reusing historical interaction data for faster online learning to rank for IR. WSDM (2013). 2.  Chapelle, O. and Zhang, Y. A dynamic bayesian network click model for web search ranking. WWW (2009). 3.  Dupret, G. and Piwowarski, B. A user browsing model to predict search engine click data from past observations. SIGIR (2008). 4.  Guo, F. et al. Efficient multiple-click models in web search. WSDM (2009). 5.  Chen, D. et al. Beyond ten blue links: enabling user click modeling in federated web search. WSDM (2012).

Suggest Documents