Feb 17, 2009 - Web-based Supplementary Materials. The Predictive Receiver Operating. Characteristic Curve for the Joint. Assessment of the Positive and ...
Web-based Supplementary Materials The Predictive Receiver Operating Characteristic Curve for the Joint Assessment of the Positive and Negative Predictive Value SHANG-YING SHIU Institute of Statistical Science, Academia Sinica, Taipei 115, Taiwan CONSTANTINE GATSONIS Center for Statistical Sciences, Brown University, Providence, RI 02912, U.S.A.
Appendix A An interpretation of the hazard rate order In this appendix we give an interpretation of the hazard rate order and provide an example of when the condition fails. Similar argument can apply to the reversed hazard rate order. Definition 1. (The Hazard Rate Order) Let X and Y be two random variables with absolutely continuous distribution functions F and G, respectively, such that g(t) f(t) ≤ , 1 − F (t) 1 − G(t)
∀t > −∞.
Then X is said to be smaller than Y in the hazard rate order. 1
0.0 -0.2 log(1-CDF)
-0.4 -0.6 -5
-4
-3
-2
-1
0
t
Figure 1. A violation of the hazard rate order: the binormal assumption with a = 1 and b = 2. The hazard rate order is defined on the ratio of the PDF and the survival function. This ratio f(t)/(1 − F (t)) is the first derivative of log(1 − F (t)). Therefore, the hazard rate order implies that the slope of the log of one survival function is uniformly smaller than that of the other. If one function does not have a uniformly smaller slope than the other, and if the two functions have the same starting points (which are zero here), then for the function initially having a lower speed therefore falling behind at the beginning may pass over the other function because of a higher speed later. In other words, the two functions are likely to cross at some point. Figure 1 demonstrates the failure of the hazard rate order. The solid curve represents the log of the survival function from N(0,1), and the dashed curve
2
is from N(1,2). Note that the two curves cross at some point around minus one. To summarize, one interpretation of the hazard rate order is that the log of the survival function of one random variable declines uniformly faster than that of the other. A common example that the hazard rate order fails, is that the log of the survival functions of two random variables cross. However, the non-crossing of the log of the two survival functions does not imply that the hazard rate order is satisfied.
Appendix B Proofs of Proposition 3.1 and Proposition 3.2 In this appendix, we will prove Proposition 3.1 and Proposition 3.2. For convenience of notation, let t−a t−a ) ) Φ( − b b and h . (t) = (t) = h+ a,b a,b t−a t−a )/b )/b φ( φ( b b 1 − Φ(
Then we have d P P V (t) ≥ 0 dt d NP V (t) ≥ 0 dt
⇐⇒
+ h+ a,b (t) ≥ h0,1 (t),
⇐⇒
− h− a,b (t) ≥ h0,1 (t).
− We begin with studying the geometric properties of h+ a,b and ha,b . The fol-
lowing lemma describes the properties, and will be used to prove Proposition 3.1 and Proposition 3.2. Lemma 1.
∀a, b > 0, 3
(i) h+ a,b is strictly decreasing and convex on t ∈ (−∞, ∞). (ii) h− a,b is strictly increasing and convex on t ∈ (−∞, ∞). Proof. In the following, we only prove statement (i). Statement (ii) can be obtained by similar arguments. It is equivalent to prove that h+ 0,1 is strictly decreasing and convex. By taking the first derivative of h+ 0,1 , we have g(t) d + h0,1(t) = , dt φ(t) where g(t) = t[1 − Φ(t)] − φ(t). Since d g(t) = 1 − Φ(t) > 0, dt and since lim g(t) = lim −φ(t) +
t→∞
t→∞
t = 0, 1 1 − Φ(t)
we obtain that g(t) < 0 ∀t ∈ (−∞, ∞). Therefore h+ 0,1 is strictly decreasing. By taking the second derivative of h+ 0,1 , we have f(t) d2 + h (t) = , 0,1 dt2 φ(t) where f(t) = 1 − Φ(t) + g(t)t.
Since d f(t) = 2g(t) < 0, dt and since lim f(t) = lim 1 − Φ(t) +
t→∞
t→∞
4
t = 0, 1 g(t)
we obtain that f(t) > 0 ∀t ∈ (−∞, ∞). Therefore h+ 0,1 is convex.
− Proof of Proposition 3.1. Since h+ a,b is strictly decreasing, and since ha,b
is strictly increasing, we have + + h+ a,1 (t) = h0,1 (t − a) > h0,1 (t) ∀t, − − h− a,1 (t) = h0,1 (t − a) < h0,1 (t) ∀t.
These two inequalities complete the proof.
Proof of Proposition 3.2. In the following, we will only prove Proposition 3.2 for b > 1. The statements for b < 1 can be obtained using similar arguments. (a) Since b > 1, we have 1 − Φ(
t−a ) < 1 − Φ(t) ∀t < c∗1 , b
where c∗1 = a/(1 − b). Moreover, since φ(
t−a )/b 1 a 2 1 1 1 2 a b = e− 2 ( b2 −1)t + b t− 2 ( b ) → ∞ as t → −∞, φ(t) b
that is, φ(t) < 1 ∀t < c∗2 . t−a )/b φ( b + ∗ ∗ Therefore, we have h+ a,b (t) < h0,1 (t) ∀t < min(c1 , c2 ). ∃c∗2 < 0 such that
(b) Since h+ a,b (t) is decreasing and b > 1, we have + h+ a,b (t) = bh0,1 (
t−a a + ) ≥ bh+ . 0,1 (t) > h0,1 (t) ∀t ≥ b 1−b 5
(c) Since h+ a,b is increasing, and since
+ h+ a,b (t) = h0,1 (
t−a ), b
we have
+ h+ a,b (t) > h0,1 (t), + h+ a,b (t) = h0,1 (t), + h+ a,b (t) < h0,1 (t),
a , 1−b a t= , 1−b a ∀t > . 1−b ∀t
h0,1 (t) ∀t > cP P V .
Therefore, the positive predictive value is strictly decreasing on t ∈ (−∞, c∗P P V ) and is strictly increasing on t ∈ (c∗P P V , ∞). Similarly, we can prove that the negative predictive value is strictly increasing on t ∈ (−∞, c∗N P V ) and is strictly decreasing on t ∈ (c∗N P V , ∞), where c∗N P V > a/(1 − b). Therefore, we also conclude that c∗P P V ≤ c∗N P V .
Appendix C Existence of pseudo-likelihood estimates and the variance-covariance matrix (i) The existence of pseudo-likelihood estimates depends on the identifiability of the model. For the predictive model presented in Section 4, the identifiability is guaranteed whenever the number of ordinal categories K ≥ 3. 6
(ii) Examine if the matrix J=−
K
δk EΨ
k=1
∂ 2lnfk (D|Ik ; Ψ) ∂ψu ∂ψl
is positive definite, where Ψ = {θ1, ..., θk−1, a, b, r}, lnfk (D|Ik ; Ψ) = −Dln(1 + hk ) + (1 − D)ln(1 −
1 ), 1 + hk
and hk = r(Ik − Φ(θk ))/(Ik − Φ((θk − a)/b). Since ∂lnfk (D|Ik ; Ψ) ∂hk 1 1 ∂hk ∂lnfk (D|Ik ; Ψ) = = − + (1 − D) , ∂ψu ∂hk ∂ψu 1 + hk hk ∂ψu the expected second derivative is
∂lnfk (D|Ik ; Ψ) ∂lnfk (D|Ik ; Ψ) = EΨ −EΨ ∂ψu ∂ψl 2 1 ∂hk ∂hk 1 = EΨ − + (1 − D) , ∂ψu ∂ψl 1 + hk hk which shows that the matrix J is of the form Vk VkT ck , where Vk is the ∂ 2lnfk (D|Ik ; Ψ) ∂ψu ∂ψl
vector of ∂hk /∂Ψ and ck is a constant larger than zero. Hence, it is proved that J is positive definite.
7