Tetsutaro Shibahara, John K. Tsotsos, John Mylopoulos, and H. Dominic Covvey ... system [Tsotsos 1981], such as the change/focus attention mechanism with ...
CAA: A KNOWLEDGE BASED SYSTEM USING CAUSAL KNOWLEDGE TO DIAGNOSE CARDIAC RHYTHM DISORDERS
T e t s u t a r o Shibahara, John K. T s o t s o s , John M y l o p o u l o s , and H. Dominic C o v v e y L a b o r a t o r y f o r Computational Medicine Dept. of Computer S c i e n c e , Univ. of T o r o n t o T o r o n t o , Ontario, CANADA M5S 1A4 ABSTRACT open A n e x p e r t s y s t e m , C a u s a l A r r h y t h m i a A n a l y z e r (CAA), is
being
developed
to
establish
a
framework
for
the
r e c o g n i t i o n of time v a r y i n g signals of a c o m p l e x r e p e t i t i v e
question
in t h e ECG r e s e a r c h domain d e s p i t e t h e
e f f o r t s and t h e s u c c e s s in c o m p u t e r i z e d ECG i n t e r p r e t a t i o n ( e a r l y s u c c e s s : e.g.
[Bonner 1 9 7 2 ] , r e c e n t Al a p p r o a c h :
[Birman 1 9 8 2 ] ) .
n a t u r e , s u c h as e l e c t r o c a r d i o g r a m s (ECGs). Using a s t r a t i f i e d k n o w l e d g e b a s e discerns
several
perspectives
underlying
entities,
knowledge
of
the
such
about
as
cardiac
the
phenomena o f
physiological
conduction
To
t h e CAA s y s t e m
the
system
event
and
the
diagnose
rhythm
disorders
of
the
heart,
the
e v e n t s in t h e u n d o r l y i n g c a r d i a c c o n d u c t i o n s y s t e m must be e x a c t l y d e t e r m i n e d from o n e o r more s t r e a m s o f o b s e r v e d b o d y s u r f a c e ECG signals.
Unlike o t h e r e x i s t i n g or p r o p o s e d
morphological w a v e f o r m k n o w l e d g e of ECG t r a c i n g s , w h e r e
ECG s y s t e m s , our s y s t e m utilizes k n o w l e d g e of t h e c a u s a l
conduction
s t r u c t u r e of physiological e v e n t s in the conduction s y s t e m ,
events
w a v e f o r m domain.
are
projected
into
the
observable
P r o j e c t i o n links have been defined to
r e p r e s e n t p r o j e c t i o n in CAA's f r a m e - b a s e d formalism and
and t r i e s
t o d e t e r m i n e t h e most likely s e t (or s e t s ) o f
u n d e r l y i n g e v e n t s t h a t explain t h e input w a v e signals.
a r e u s e d t o r a i s e h y p o t h e s e s a c r o s s d i f f e r e n t KBs. The c a u s a l l i n k s , t h e r e f o r e , w e r e i n t r o d u c e d
The CAA s y s t e m also i n t r o d u c e s and u s e s c a u s a l l i n k s extensively relations domain.
to
represent
between
various
concepts
in
causal
the
and
temporal
physiological
event
Its c o n t r o l s t r u c t u r e u s e s c a u s a l links t o p r e d i c t
in the
CAA s y s t e m and t h e y play an e s s e n t i a l role in c h a r a c t e r i z i n g a c o m p l e x e v e n t c o n c e p t a g g r e g a t e d from o t h e r more basic (component)
concepts
specifying the
causal
component
and
e v e n t h y p o t h e s e s a g a i n s t input d a t a , and t o c a l c u l a t e t h e
H e n c e , their r e p r e s e n t a t i o n a l role is analogous to t h a t of
The
among
the
temporal
d e g r e e o f i n t e g r i t y among c a u s a l l y r e l a t e d e v e n t s .
relationships
by
u n s e e n e v e n t s from r e c o g n i z e d e v e n t s , t o c o n f i r m t h e s e
events.
s t r u c t u r a l d e s c r i p t i o n s in a c o m p o s i t e s t r u c t u r e c o n c e p t in
m e t a - k n o w l e d g e representation of statistical information
the
a b o u t e v e n t s f a c i l i t a t e s a d e f a u l t r e a s o n i n g mechanism and
l a n g u a g e s s u c h as KLONE [ B r a c h m a n , R.J. 1 9 7 9 ] .
supports
e v e n t r e c o g n i t i o n , c a u s a l links a r e most e f f e c t i v e l y u s e d t o
this
expectation
process
providing
context
The CAA s y s t e m i n h e r i t s its basic c o n t r o l mechanisms from t h e ALVEN (A L e f t VENtricular Wall Motion A n a l y s i s ) [Tsotsos
attention
1981],
mechanism
such
with
domain,
as
the
similarity
links
and
other
representational In CAA's
d i f f i c u l t or impossible by noise or n a t u r e . A f r a m e - t y p e r e p r e s e n t a t i o n of s e m a n t i c n e t w o r k s is
the
u s e d t o maintain a s t r a t i f i e d k n o w l e d g e b a s e t h a t c o n t a i n s
A p r o t o t y p e CAA s y s t e m w i t h a limited number of
k n o w l e d g e a b o u t t h e w a v e f o r m s of ECG s i g n a l s , k n o w l e d g e about
representation
interrelationships.
PSN
(Procedural
Semantic
the
physiology
N e t w o r k s ) [ L e v e s q u e & Mylopoulos 1 9 7 9 ] .
The p r o t o t y p e
describe
has
results
d i f f e r e n t KBs.
far
in
e v e n t s w h e n t h e d i r e c t o b s e r v a t i o n s o f t h o s e e v e n t s are
abnormalities has b e e n implemented using t h e k n o w l e d g e
so
seen
change/focus
h y p o t h e s i s r a t i n g mechanism.
language
as
make e x p e c t a t i o n s o f e v e n t s linked t o a l r e a d y r e c o g n i z e d
sensitive statistical information.
system
spatial
demonstrated
satisfactory
using
the
of
rhythm
Projection
links
concept-to-concept
disorders, were
and
their
introduced
relationships
to
across
i n d e p e n d e n t l y sampled ECG d a t a . As for t h e c o n t r o l s t r u c t u r e , ALVEN's basic c o n t r o l 7
INTRODUCTION
s t r u c t u r e [ T s o t s o s 1 9 8 1 ] i s u s e d and e x t e n d e d t o f a c i l i t a t e t h e a b o v e e x p e c t a t i o n and p r o j e c t i o n m e c h a n i s m s , a n d t o
The main o b j e c t i v e of this s t u d y is to e s t a b l i s h a f r a m e w o r k for t h e r e c o g n i t i o n of time v a r y i n g signals of a
handle r e c u r s i v e h y p o t h e s i s g e n e r a t i o n f o r r e p e t i t i v e e v e n t recognition.
c o m p l e x r e p e t i t i v e n a t u r e , s u c h as e l e c t r o c a r d i o g r a m s . To t h i s e n d , an e x p e r t s y s t e m called CAA (Causal Arrhythmia A n a l y z e r ) is being d e v e l o p e d to d i a g n o s e r h y t h m d i s o r d e r s (usually monitoring.
called We
arrhythmias) have
chosen
in
electrocardiographic
the
arrhythmia
analysis
problem b e c a u s e it is a domain rich in t e m p o r a l and c a u s a l i n t e r r e l a t i o n s h i p s , and b e c a u s e it is c o n s i d e r e d a remaining
From t h e Al v i e w p o i n t , t h e r e f o r e , t h e CAA s y s t e m is c o n s i d e r e d a s a n empirical s e m a n t i c n e t w o r k s y s t e m f o r e v e n t s e q u e n c e r e c o g n i t i o n , w h i c h Includes t h e e x p l i c i t d e s c r i p t i o n and u s e o f c a u s a t i v e t e m p o r a l k n o w l e d g e a n d t h e use of a s t r a t i f i e d k n o w l e d g e b a s e s t r u c t u r e w i t h i n t e r - r e l a t e d d i s t i n c t KBs.
T. Shibahara et al. 243
2 REPRESENTATION OF DOMAIN KNOWLEDGE At present, the CAA stratified knowledge base Includes two distinct domain KBs: Morphological KB and Physiological KB. The Morphological KB describes the observable ECG waveform knowledge such as the criteria for amplitudes, durations, and shape features of ECG waves. The Physiological KB maintains the knowledge which explains abnormalities in the cardiac conduction system and interprets a rhythm disorder as a causal phenomenon in this sub-domain. The CAA knowledge base also includes the control knowledge for hypothesis activation and maintenance, and it may include strategic or therapeutic knowledge such as additional measurements and drug administration. CAA uses a frame-type representation of semantic networks, which evolved from ALVEN's formalism [Tsotsos 1980]. Thus the knowledge units are called classes and are used to abstract various concepts at each knowledge domain, e.g., class CELL-CYCLE of each portion of the heart muscle and class BEAT-PATTERN of consecutive heart beats. Each instantiation of the above class concepts generates a class-token or simply token. The above instance-of relation is extended to a relation between a class and a meta-class. For example, any statistical data about class BEAT itself cannot be the attributes of any specific beat instance. This is a rather important distinction that most medical expert system do not make: statistical Information, so commonly used in medical diagnosis systems, should be represented as attributes of the class rather than of instances of the class. Furthermore, this separation can be viewed as a default mechanism. In disease classes for which insufficient information is known to diagnose them categorically [Szolovitz & Pauker 1978] statistical information is usually contained either in the definition of the class, or with respect to a particular patient case. In such cases, however, metaclasses may be defined and used as the default reasoning mechanism. A class definition consists of component-class slots, organizational descriptors such as Is-a and Instance-of phrases, and link descriptors: causal links, projection links and similarity links. Similarity links are used as an aid in activating alternative or parallel hypotheses when exceptions occur in the recognition of the current class [Tsotsos 1981]. 3 REPRESENTATION OF CAUSAL UNKS Rleger's CSA system [Rieger and Grinsberg 1977] and Patll's ABEL system [Patil 1981] are examples of current causal representations. Rieger and Grinsberg Introduced several types of causal links and distinguished two types of causal flows: continuous causality and oneshot causality, with possible gating conditions. Our CAA system adopted the oneshot causal flows among events. Patll Introduced a multHevel causal network to explain the
aggregating process from basic physiological causal links to more global causal links among disorder events within a disease such as diarrhea. However, it does not seem to provide the causal links classified by the types of influence and temporality, which are essential to describe time varying phenomena as seen In the electrophyslology of the cardiac conduction system. We characterize a causal link by two features: (1)
the existential dependency of an effect event on its cause event(s), I.e., the feature that no effect events can exist or happen without cause events, and
(2)
the temporal constraints between the cause events and the effect events, with a possible delay time interval.
Thus, causal links of the CAA system are classified according to (1) types of Influence, and (2) types of temporal constraints. The type of influence of a causal link must be defined by the type of dependency and the roles of participating events such as cause, effect, and condition. The type of temporal constraints in a causal link is usually understood implicitly from the meaning of the link, which implies the temporal relationships between the participating events In the link. These constraints may be typed by Allen's "Interval Relations" classification [Allen 1981]. Some useful causal links In CAA are the following: (1)
TRANSFER, TRANSITION — causal links which describe state (or phase) change from the proceeding event to the following event in time involving a single subject; TRANSFER Indicates the subject normally completes the proceeding state (event) and changes into the following state; and TRANSITION means the subject is forced to terminate the current state and transition into a new state.
(2)
INITIATION, INTERRUPT, CAUSAL-BLOCK - causal links in which a causative event of one subject initiates or interrupts an effect event of another subject; In INITIATION, a causative starting (or ending) event of one subject triggers a new event of another subject, and in INTERRUPT, a causative event of one subject interrupts (and forces to terminate) an event of another subject and make it transition to a new state. CAUSAL-BLOCK describes no causal flow but still insists a specific (negative) condition among events.
The following class definition exemplifies the use of causal links In a normal conduction sequence In the heart. The dot "." notation Is used to specify the component of the referred slot.
244 T. Shibahara et al.
variables (such as event durations) with appropriately estimated values, namely modified statistical values defined through the corresponding metaclasses. Thus, the system refers to this kind of (default or statistical) metaclass knowledge when it faces lack of Information in the recognition process. The temporal locations and the waveforms above expected events must be confirmed if these have observable counterparts in the shape domain. the confirmation process from the event domain waveform (signal) domain. 4.2
of the events This is to the
Recognition Steps and Projection-links
Waveform recognition starts with picking up some prominent waveforms as the starting set of signal tokens. Gradually posing constraints to the most generic waveform class, the system tries to identify the token set to be a specialized waveform class. To find the best matching hypotheses to the original waveforms, a focus and change of attention mechanism is used [Tsotsos 1981]. This process goes over every set of signal tokens available.
The above causal links give the causative sequence of impulse propagation, where the depolarization phase of one part of the heart triggers the depolarization phase of the next part. 4 CONTROL STRATEGY FOR RECOGNITION
Based on the principles embodied in the ALVEN control structure, the control structure of the CAA system has been extended and developed for three purposes: (1) to exploit causal knowledge about events, (2) to provide a means of hypothesis projection across distinct KBs, and (3) to recognize repetitive event sequences and to detect beat to beat relationships. 4.1
Expectation and Confirmation through Causal Links
The task we are considering Is the recognition of complex time-varying events. The role of causal relationships in such complex events is to provide local context for their components or constituents, I.e., it is to produce expectations of the properties of this aggregated event. The system, therefore, must look ahead or look back for these causally linked component events, starting with one or more already-identified component events, to locate the temporal probable positions of "to-be-observed" events. The above linked component events, however, may not be observable as waveforms in the signal. In such a case, the system will supply these non-observable
Once the waveform hypotheses are established, the system must look for the source events in the physiology KB. This event seeking process associates each waveform with several underlying events through explicitly defined projection links in the projecting classes. Each projection link contains (1) a list of projected class names, and (2) a list of binding conditions to activate projected classes. Since the overall arrhythmia recognition process starts with one of the most global event hypotheses (usually beat patterns), the above projected classes may be instantiated only when the current global hypothesis requires them as its components and when they are contained in projection links at a proper temporal context. 4.3 Repetitive Beat Recognition The repetitive behavior of an arrhythmia is recursively defined in its class frame. Using the eligible hypotheses established at the initial stage, the system recursively generates and recognizes successive beat patterns. In this recognition, the similarity links between repetitive beat patterns enable the system to activate parallel or alternative hypotheses when exceptions or special conditions occurred as the recognition proceeds. Also, the causal links between consecutive BEAT classes verify the causal relationships among corresponding components on a beat-to-beat basis and estimate the periodicity of a series of beats as the whole. Examples to explain the above recognition stages are found in [Shibahara et al 1082].
T. Shibahara et al. 245
5 CONCLUSIONS A prototype CAA system has been completed with a knowledge base of a limited number of waveform abnormalities and rhythm disorders, and has so far yielded satisfactory results using independently sampled ECG data. The prototype includes all the basic control mechanisms such as causal links and projection links handlers, and has been implemented using a version of PSN (Procedural Semantic Network), previously developed at the Univ. of Toronto [Levesque & Mylopoulos 1979], [Patel-Schneider et al. 1982]. We may conclude that the inclusion of causal links as the amalgamation of causal relations and temporal constraints in a frame-type representation, along with the organizational primitives IS-A and PART-OF, has allowed us to tackle the problem of reconstructing complex electrophysiological event sequences from gross signal characteristics. This is accomplished by defining the semantics of causality and noting that it is these semantics that can be used for the generation of expected signal characteristics and other associated events. More generally, the inclusion of causal knowledge provides a context for the recognition and reconstruction of complex event sequences.
[Brachman, R.J. 1979] "On the Epistemological Status of Semantic Network", Associative Networks, Findler, N.V. (ed.), Associated Press, New York 1979 [Levesque, H. and Mylopoulos, J. 1979] "A Procedural Semantics for Semantic Associative Networks, ibid.
Networks",
[Patel-Schneider, P.F. et al 1982] "The Procedural Semantic Network: An Extensible Knowledge Representation Scheme", Proc. of 4th CSCSI/SCEIO Conference, May 1982 [Patil, R.S. 1981] "Causal Representation of Patient Illness for Electrolyte and Acid-Base Diagnosis". TR-267, Laboratory for Computer Science, MIT, Oct. 1981 [Rleger, C. and Grinsberg, M. 1977] The Declarative Representation and Procedural Simulation of Causality in Physical Mechanics", IJCAI 1977 [Shibahara, T. et al. 1982] "CAA: A Knowledge Based System with Causal Knowledge to Diagnose Rhythm Disorders in the Heart", Proc. of 4th CSCSI/SCEIO Conference, May 1982
ACKNOWLEDGEMENTS The authors would like to thank all the colleagues in the PSN group and the Laboratory for Computational Medicine (LCM) at the department, for providing the fertile environment for this research, and Dr. Menashe B. Waxman of Toronto General Hospital for his helpful discussions on ECG interpretation. The financial assistance of the Ontario Heart Foundation and Defence Research Establishment Atlantic is gratefully acknowledged.
REFERENCES
[Allen, J.F. 1981] "An Interval-Based Representation of Temporal Knowledge", Proc. of IJCAI-81, pp.221-228, 1981. [Birman, K.P. 1982] "Rule-Based Learning for More Accurate ECG Analysis", IEEE trans, of P.A.M.I., PAMI-4, No.4, July 1982 [Bonner, R.E. 1972] "A New Computer Program for Analysis of Scalar Electrocardiograms", Computers and Blomed. Res. 6, 1972
[Szolovitz, P., & Pauker, S.G. 1978] "Categorical and Probabilistic Reasoning in Medical Diagnosis", Artificiai Intelligence, 1978, 11, p.116-144 [Tsotsos, J.K. et al. 1980] "A Framework for Visual Motion understanding", PAMI-2, No.6, Nov. 1980 [Tsotsos, J.K. 1981] "Temporal Event Recognition: An Application to Left Ventricular Performance", IJCAI 1981