Knowledge-Based Formulation of Dynamic Decision Models

0 downloads 0 Views 99KB Size Report
model construction (KBMC) systems have emerged in recent years. ... base. DynaMoL model. DYNAMO solution dynamic decision problem description ... concept definition has at least an attribute indicating the specialization relation, AKO.
Knowledge-Based Formulation of Dynamic Decision Models Chenggang Wang, Tze-Yun Leong Medical Computing Laboratory, School of Computing, National University of Singapore, Lower Kent Ridge Road, Singapore 119260 {wangcg, leongty}@comp.nus.edu.sg

Abstract. We present a new methodology to automate decision making over time and uncertainty. We adopt a knowledge-based model construction approach to support automated and interactive formulation of dynamic decision models, i.e., models that explicitly consider the effects of time. Our work integrates and extends different features of the existing frameworks. We incorporate a hybrid knowledge representation scheme that integrates categorical knowledge, probabilistic knowledge, and deterministic knowledge. We provide a set of knowledge-based modification operations for automatic and interactive generation, abstraction, and refinement of the model components. We have built a knowledge base in a real-world domain and shown that it can support automated construction of a reasonable dynamic decision model. The results indicate the practical promise of the proposed design.

1 Introduction Dynamic decision modeling is a very challenging task. The multitude of problems, the domain-specificity, the uncertainty, and the temporal nature of the underlying phenomena all contribute to the intricacy of the dynamic decision modeling process [5]. To automate the model construction process, efforts in developing knowledge-based model construction (KBMC) systems have emerged in recent years. The relevant works include ALTERID [1], Frail3 [4], SUDO-PLANNER [14], and DYNASTY [9]. These systems advocate that the decision models for different problems should be constructed on demand from a knowledge base [13]. This approach facilitates scalability and reusability of the knowledge base. Moreover, the resulting decision models are parsimonious and most relevant to the problems at hand. An important issue in KBMC research is the choice of the target model. All of the existing KBMC systems synthesize only one type of decision models; influence diagram or its variants are the choice in all the systems mentioned. Since each type of decision models can convey only certain information explicitly, some important characteristics of the decision situations are lost when the models are completed. Moreover, most of the existing frameworks do not support knowledge-based model modification. The model construction process terminates when an initial model is completed. If the user is not satisfied with the resulting model, he has to manually modify it separately. We propose a KBMC framework that adopts a new dynamic decision modeling

language, DynaMoL [5], to model target decisions, and provides a set of knowledgebased modification operations for interactive generation, abstraction, and refinement of the model components. The framework incorporates a hybrid knowledge base representation scheme that integrates categorical knowledge, probabilistic knowledge, and deterministic knowledge. We have tested the design by building a knowledge base in a real-world domain and demonstrating how it can support automated formulation of a reasonable dynamic decision model. We illustrate the proposed design and address some future research issues in this paper.

2 System Architecture The proposed system architecture is depicted in Figure 1. Given as input a dynamic decision problem description, the model constructor will access the knowledge base via the KB-manager to formulate a model. Further modification operations can be applied interactively by the user. The final model is then evaluated or solved, with respect to some pre-specified optimality and evaluation criteria, e.g., life expectancy, monetary cost, and/or desirability or utility, by DYNAMO, a prototype implementation of DynaMoL, to determine an optimal course of action.

dynamic decision problem description

user

interface

model constructor model modification

solution

KB-manager

DynaMoL model

DYNAMO

knowledge base

Figure 1. The system architecture

3 An Example Problem For illustration, we consider a simplified dynamic decision problem in colorectal cancer management [2]. In managing the follow up of colorectal cancer patients, a series of diagnostic tests are performed to detect possible recurrence, metastasis, or both recurrence and metastasis of the cancer; treatment is prescribed if cancer is detected. The diagnostic tests available are different in terms of monetary cost, sensitivity, and specificity. The decision is to determine the optimal course of diagnostic tests for

effective disease detection so that timely treatment can be given.

4 The Target Language: DynaMoL DynaMoL has four major components. First, the decision grammar supports problem formulation with multiple interfaces. Second, the presentation convention, in the tradition of graphical decision models, governs parameter visualization in multiple perspectives; two graphical representations are currently included: transition view which corresponds to the Markov state transition diagram, as shown in Figure 2, and influence view which corresponds to the dynamic influence diagram [12] without decision nodes or value nodes, as shown in Figure 3. Third, the mathematical representation, in terms of a Semi-Markov Decision Process (SMDP), provides a concise formulation of the decision problem; it also admits various solution methods. Fourth, the translation convention establishes automatic transformations among the different graphical representations, and from these representations into an SMDP [5].

recurrent rec-met

well metastatic

Figure 2. Transition view for an action. The nodes denote the states; the links denote the possible transitions from one state to another

Figure 3. Influence view for an action. The nodes depict the possible event or chance variables, each with a possible set of outcomes or values, that affect the state transitions, and the links the probabilistic dependences. The states are captured as outcomes of the state variables

DynaMoL makes a good target decision modeling language because the high-level

modeling constructs and the multiple graphical perspectives allow simple and explicit specification of the decision factors and constraints, thus facilitating automated formulation. DynaMoL also supports incremental language extension, which allows the scope of the dynamic decision problems addressed to be gradually expanded.

5 Knowledge Base Representation Our knowledge base representation design is based on the first-order logic-like representations [1, 4, 7, 9] and the fluid multilevel representation [14] in existing KBMC systems. We adopt a hybrid framework that integrates categorical knowledge, probabilistic knowledge and deterministic knowledge. The categorical knowledge captures the definitional and structural relations of the domain concepts. This type of knowledge provides the power of abstraction and inheritance; it supports modeling at multiple levels of details. The probabilistic knowledge captures the interactions (probabilistic dependencies) among the concepts. These relations are needed to support the derivation of missing information in model construction. The deterministic knowledge expresses deterministic rules and declaratory constraints about the domain. 5.1 Categorical Knowledge We use knowledge frames to describe domain concepts, e.g., diseases, tests, treatments, and other entities related to the decision problems. All the concepts may be instantiated as actions and event variables in the probabilistic knowledge base. Every concept definition has at least an attribute indicating the specialization relation, AKO. For all concepts a and b and all individuals α, AKO = {(a, b) | a⊆ b, i.e., ∀α, α∈ a ⇒ α∈b}. The AKO relation imposes a hierarchy or partial ordering of the domain concepts. Figure 4 shows part of such a hierarchy for the example problem. The AKO links will always point upward. A parent of a concept indicates a concept immediately above the given concept in the hierarchy. An ancestor of a concept is situated somewhere above the given concept. A child and a descendent of a concept are analogously defined, but downward in the hierarchy. The lowest common ancestor of a set of concepts {C1, C2,… Cn} is a concept C which is an ancestor of these concepts, and for all children S1, S2,…Sq of C, there exists a Ck, 1 ≤k ≤ n, where Sj, 1≤ j≤ q, is not a parent of Ck. The lowest common ancestors of a set of concepts may not be unique.

event chance event findings

action

Physiological test state

test historical sign symptom result disease fact

complication

Figure 4. Domain knowledge base hierarchy

treatment

5.2 Probabilistic Knowledge To represent the probabilistic knowledge, we must be able to model the action effects, the relations among event variables, and the temporal nature of the decision environment. We use timed concept instances to represent actions and event variables. For example, test-A(t) represents the action to perform test-A at time t. Similarly, test-Aresult(t), which can take either the positive or negative value, represents the event variable test-A-result at time t. It follows that test-A-result(t, positive) is an event. Probabilistic dependency statements, similar to those in [1] except that we separate the logic program clauses as the preconditions in the probabilistic statements for representational clarity and inferential efficiency, represent the action effects and probabilistic relationships among the event variables. They have the form E0|p E1,…, En = Pr(ωE0|ωE1, ωE2… ωEn) ← C1, C2, … C m where n>=0, m>=0. Ei, i = 1,…, n, is an event variable and Cj, j = 1,…, m, is a conditioning literal, which could be an event or an action taken. ω refers to one of the alternative outcomes of the event variables. The left hand side of the equality indicates the possible probabilistic dependence between E1, E2,…, En and E0. The right hand side indicates the conditional probability distribution over the alternative outcomes of the event variable E0 given the outcomes for the event variables E1,…, En. The last part is a logical expression which must be true for this probabilistic relation to be applicable. If P is the above probabilistic sentence, we define precondition(P) to be the conjunction C1, C2, … C m, (probabilistic) antecedent(P) to be the conjunction E1,…,En, and (probabilistic) consequent(P) to be E0. As an example, consider the following dependency, which expresses the probability distribution of metastasis at time t+1 given the outcome of loss-of-appetite at time t when action “observation” is taken and the Dukes’ stage of the patient is C. metastatic(t+1)|ploss-of-appetite(t)=Pr(ωmetastatic(t+1)|ωloss-of-appetite(t)) metastasis (t+1, present) metastasis(t+1, absent) loss-of-appetite(t, yes) 0.6 0.4 loss-of-appetite(t, no) 0.3 0.7 ← observation(t), duke-stage(t,C)

A probabilistic knowledge base is a finite set of probabilistic dependency statements. Event variable e1 directly influences e2, denoted e 1 → e 2 , if there exists a probabilistic rule P such that e 1 ∈ antecedent (P) and e2 = consequent (P). There is a directed path from e1 to e2, if there exists a sequence e 1 → I 1 , I 1 → I 2 ,…, I n → e 2 , where I1, I2,…In are the intermediate nodes on the path between e1 and e2. Since an influence view is a directed acyclic graph, we must ensure that the knowledge base contains no cycles [6]. Since it is not easy to enforce this “acyclicity” requirement for the knowledge base, the KB-manager will include a function which, after new probabilistic rules are added to the knowledge base, will check if they will lead to any cycles in the originally acyclic knowledge base. 5.3 Deterministic Knowledge A deterministic knowledge base is a set of logical statements, called deterministic

dependency statements, of the form L0 ← L1 and L2 … and L n, or L0 ← L1 or L2 … or Ln n>=0, where ← stands for implication, Li, i = 1, …, n, is a condition literal, which can be an event or an action. A deterministic dependency statement expresses the logical relationship or constraint among the events and/or actions; it allows deduction of the implicit values for the preconditions. For example, if a patient is assumed to be dead after having two successive metastasis, the following rule can be included: status(t,dead) ← metastasis(t, yes) and metastasis(t-1, yes). 5.4 Inferences Supported by the KB-Manager With reference to Figure 1, the model constructor can access the knowledge base via some general queries to the KB-manager. This implies that the knowledge base representation is modular with respect to the inferences supported. Therefore, when new constructs are added to the knowledge base representation, or when the structure of the knowledge base is changed, we need not re-implement the model constructor. An example of the queries supported is “what are the event variables that directly influence given at