A Cognitive Architecture Theory of Comprehension and ... - CiteSeerX

8 downloads 0 Views 68KB Size Report
amounts of “cognitive activity” is not new [Lazarus. 1982]. The differences in processing requirements may in turn lead to timing effects. Other theories propose.
A Cognitive Architecture Theory of Comprehension and Appraisal Robert P. Marinier III, John E. Laird* University of Michigan 2260 Hayward Ann Arbor, Michigan 48109 USA {rmarinie,laird}@umich.edu

The authors acknowledge the funding support of the DARPA "Biologically Inspired Cognitive Architecture" program under the "Extending the Soar Cognitive Architecture" project award number FA8650-05-C-7253.

Em oti o

n

g

Responses to Emotion

Physiological Changes Thought-action urges Cognitive Changes Subjective feelings

Co m pr eh en sio n

Agent-Environment Transaction

Co pin *

Appraisal

ula tio n

The last 20 years has seen an explosion in emotion research. The advent of cognitive theories of emotion, especially appraisal theory, has made it possible to develop computational models of emotion systems. However, to date, most of these models have been developed in isolation from existing research on computational models of cognitive architectures. For example, the EMA system [Gratch and Marsella, 2004] implements a version of appraisal theory in the Soar cognitive architecture [Newell, 1990], but the appraisals are computed only because they are needed by the emotion theory as input. And instead of using psychologically plausible methods for generating the appraisals, EMA uses explicit representations and reasoning over probabilities and utilities intermixed with its task reasoning. Thus, emotion is not inherent to the architecture such that it can and must be used on every task. Instead, emotion is just like any other knowledge so when EMA responds to a situation, it “decides” to be emotional as opposed to just “being” emotional. We intend to pursue a research program that investigates building cognitive models where emotion is

Before presenting the core ideas behind how we model appraisal and responses to emotion, we define the framework in which we are conducting research. The emotion process is a cycle as shown in Figure 1. We begin at the agent-environment transaction. The agent-environment transaction is the relationship between an agent’s goals and its current situation [Lazarus, 1991]. Appraisal theory states that it is the evaluation of this relationship that leads to emotion. Note that there are many different appraisal theories (see Roseman and Smith [2001] for a review), and these theories are not all compatible. However, the distinctions among them are not generally important to our basic theory. Next, emotion, which we roughly define

Re g

1 Introduction

2 An Emotion Process Framework

eh en sio n

Current computational appraisal models do not include comprehensive theories for the origins of appraisals – why and how are they generated? We argue that an agent is constantly engaged in comprehension as it strives to make sense of its place in the world. Our hypothesis is that many appraisals are inherent to the comprehension process. We present a model for the comprehension process and describe how appraisals integrate with that process. We conclude with several predictions about the nature of comprehension and appraisal that our theory suggests.

inherent to the operation of all agents – it will be embedded in the underlying cognitive architecture, which in our case is also Soar. As such, ours is a theory of how emotion and cognition are integrated together, with emphasis on the generation of appraisals and the responding to emotions. It is not a neurological or physiological theory, but a theory at the level of cognitive architecture and as such, should extend to similar systems such as ACT-R [Anderson and Lebiere, 1998].

Co m pr

Abstract

Figure 1: The emotion process

as the results of appraisal, causes various automatic changes to take place. For example, there may be physiological changes, thought-action urges, cognitive changes, and the agent may experience feelings, which we roughly define as an agent’s perception of its emotional state. The agent can detect some of these changes and choose to respond in a variety of ways. One possibility is response-focused emotion regulation. This is when an agent tries to actively change its automatic responses; for example, an agent that tries to suppress its anger. Another possibility is coping. Coping is commonly divided into two categories – problem-focused and emotion-focused. Problem-focused is when an agent takes actions to change the environment part of the agent-environment transaction. For example, an agent may try to physically remove an obstacle to its goal. Emotion-focused coping is when an agent takes actions (usually internally) to change the agent part of the agent-environment transaction. For example, an agent may abandon an unreachable goal. Both of these result in changes in appraisal, which in turn leads to changes in emotion. To be clear, Figure 1 represents the flow of information and not discrete states. Parallelism allows many or all of these processes to be happening simultaneously. The key issue we will address here is, what underlying process supports appraisal? And then similarly, what underlying process supports coping? Our claim is that both of these are supported by comprehension.

3 Comprehension The interaction of an agent with an environment is not a simple monolithic event where the agent is presented with data and then immediately reacts. Instead, an agent’s processing can be decomposed into multiple steps. Some of these arise because of the inherent structure of the agent (such as separate perceptual and motor systems) and because of data dependencies between the separate phases. Newell [1990] proposes that the agent’s processing can be decomposed into eight stages, calling the overall process PEACTIDM (based on the first letter of each phase). 1) Perceive: Raw perceptual inputs. 2) Encode: Interpreting the raw inputs in a domainindependent way. 3) Attend: Attending to an input. 4) Comprehend: Understanding the input in the context of what the agent is doing. 5) Tasking: Performing maintenance on one’s goals. 6) Intend: Choosing a course of action. 7) Decode: Translating the action choice into motor commands. 8) Motor: Executing motor commands. Perception and motor are considered to be outside central cognition, and encode and decode are on the boundary. This sequence is mostly fixed because of the constraints between the steps (e.g. one can’t typically comprehend an input without first attending to it [Newell, 1990], etc.) There are some possible exceptions; for example, an agent may be able to Intend di-

rectly from some raw Perception. Tasking may also move around; for example, it may not be necessary to update one’s goals after every step, or it may make sense to perform goal maintenance before comprehension has taken place.

3.2

Prop ertie s of Co mprehension

Comprehension is at the center of the PEACTIDM process as it is the one place where knowledge of the environment (PEA) combines with knowledge of the agent’s goals (T) and desires to create a representation that can be the basis for action (IDM). Our hypothesis is that many appraisals are inherent to comprehension. The idea that emotion depends upon understanding what is happening goes back at least to Dewey [1895]. It may seem that we are merely relabeling “appraisal” and calling it comprehension, but our point is that independent of any theory of emotion, a cognitively capable agent must comprehend. Thus, comprehension is a nexus for the integration of cognition and emotion. The comprehension process is severely constrained by the fact that it is embedded in a knowledge-rich agent with an ongoing existence that must interact with a continually changing environment. We also consider additional constraints that arise from what is generally known about human psychology. Many of these restrictions are borrowed from NL-Soar [Lewis, 1993], a psychologically plausible natural language processing system based on a comprehension model of language. Below is a list of these constraints: •





• •

Domain independent: The agent should not need separate comprehension systems for each domain (although there may be domain-specific information which aids the process). Limited working memory: The agent should not require an arbitrary amount of working memory. NL-Soar’s approach (and ours as well) is to only represent one interpretation at a time. Incremental: The agent should not have to wait until all inputs necessary for a complete understanding are available. It should be able to build up its comprehension from smaller pieces. Happens over time: Situations unfold over time, so the agent should be able to construct and refine its comprehension over time. Supports immediate comprehension: The agent should be able to create some understanding of the situation from the very first moments of the situation. If the agent waits, it is wasting time that it could use to do some processing, no matter how speculative. Furthermore, the agent may want to respond to the situation immediately. Finally, unlike language, which contains markers that indicate logical stopping points (e.g. periods, commas, pauses in speech, etc.) for comprehending words and phrases, situations do not. The world unfolds as an endless stream of events. We adopt Talmy’s [1975] definition of an event as a period of time in which the primary figure, motion, path and ground do not change.





Supports hierarchical comprehension: Comprehending a series of events requires not only understanding the individual events and the connections between individual events, but also how they fit together at more abstract levels. Thus, it is not just that an agent is getting into the car, starting the engine, driving down the road, etc. Rather, it is that this agent is driving to work, which can provide context for future comprehension and prediction. Supports prediction: A true understanding of the situation includes a prediction of what is going to happen next. Prediction aids future comprehension for familiar situations because it allows the agent to confirm if an event is consistent with the current prediction instead of attempting to process the event in isolation. Prediction also allows the agent to prepare for future events.

The property of “immediate comprehension” leads the agent to require at least two supporting mechanisms: •



3.3

Immediate ambiguity resolution: Many events will be ambiguous when encountered in isolation. One approach is to defer commitment until a unique interpretation is clear; however, this can result in a combinatorial blowup in processing and memory usage as multiple ambiguous events are encountered. An alternative is for the agent to commit immediately to a “best” interpretation which avoids the combinatorial blowup and supports immediate comprehension. Error recovery: Since the best interpretation is ambiguous, the agent may choose a poor one. Thus, the agent must have the capability to recover when it discovers it has made a bad assumption. As in NL-Soar, we expect that recovery will usually involve a local repair to its interpretation, but can also require a complete reinterpretation in “garden path” situations.

Building Blo cks of Co mprehension

Newell [1990] described comprehension as producing a data structure that can be interpreted by other parts of the cognitive architecture. The data structure we have chosen is a schema [Rumelhart, 1980]. A schema is actually more than just a data structure – it is a concept that consists of a variablized data structure and Perceive

Encode

Attend

knowledge about how that data structure should be instantiated. For example, a schema for the concept “buy” might look like this: Schema: Buy Seller: Buyer: Price:

The items in brackets are variables with ranges. Once some of the variables have been set, the agent can infer the values of some of the others (or at least possible values). For example, if

is known to be greater than $1 million, then the agent might infer that the list of buyers could include people like Bill Gates, but not Bob Marinier. Not all elements in a schema are necessarily variablized; some may be concrete values that are inherent to the concept. Our comprehension system is built on event schemas [Zacks and Tversky, 2001]. The reason for focusing on events is that situations seem to be composed of series of events [Zacks and Tversky, 2001], and it is the understanding of these events that let an agent choose actions to initiate its own events. Others have also focused on events as the primary stimulus (e.g. Ortony et al. [1988]). Let’s consider an example. Suppose an agent has a schema for the “crossing the street” event: Schema: Crossing the street Event1: Step down from curb Event2: Walk across street Event3: Step up onto curb Thus, the abstract event “crossing the street” is defined in terms of more concrete events. These may also be broken down further. At some point, however, an abstract event will be composed of ground events which are the most basic units. These ground events are the events the agent recognizes when it encodes raw inputs. They also correspond to actions that the agent can directly execute in the world. For simplicity, we have not shown the explicit figures, motions, etc., which would typically be variablized, in this example. Now suppose the agent observes someone stepping down from a curb. It can now recognize that this is the first step of the “crossing the street” schema. If it commits to this interpretation, it can make predictions about what will happen next (e.g. that the person will walk across the street and step up onto the other curb). Comprehend Event

Determine Discrepancy from Expectation

Determine Goal/Need Conduciveness

Update Interpretation

Reinterpret events

Figure 2: The event comprehension process

It can also infer something about the other agent’s goals (e.g. that it wants to cross the street). Perhaps most importantly, the agent can infer what the meaning of this event is with respect to its own goals – is it ok for me if this person crosses the street? When the next event occurs, this may confirm the predictions or the agent may have to reinterpret using some other event schema. The initial choice of event schema may be impacted by several factors; for example, if the agent wants the person to cross the street, then it may be biased to interpret the initial event that way.

3.4

Th e Co mpre hension Pro cess

The comprehension process creates and maintains event schemas which represent the agent’s understanding of what is currently going on. Suppose that an agent already has a schema it has committed to based on prior processing. It perceives and encodes an event and then attends to it. Since it is an event, the agent executes a “comprehend event” operator. This operator breaks down into two steps: determine the discrepancy from expectation, and update the current interpretation (i.e. the event schema). Determining the discrepancy from expectation is necessary in order to decide how the interpretation should be updated. For example, if the event exactly matches the agent’s prediction, updating the interpretation may just be a matter of denoting that event has occurred. On the other hand, if the event is slightly different (i.e. of the same type but with different variable bindings), then the event may need to be swapped in for the prediction, and the future predictions updated to reflect the latest information. Finally, if the event is completely different than what was expected, the current interpretation may need to be entirely replaced with a reinterpretation that is consistent with its observations. This is the first example of how some appraisals may “fall out” of the agent’s processing. The “determine discrepancy from expectation” operator is an appraisal from Scherer’s [2001] theory, but similar ones exist in other appraisal theories. Figure 2 shows the comprehension aspects of the PEACTIDM process. As depicted, once the agent has comprehended the event, it can generate another appraisal, “goal/need conduciveness.” This appraisal is different in that it does not “fall out” of the comprehension process, but rather follows it. Since it isn’t required to comprehend, the agent may only generate it if it has enough resources to do so (e.g. time). In general, we can divide appraisals into at least 3 groups: 1) automatic appraisals (e.g. novelty may be computed automatically by the high-level vision or long-term memory systems), 2) deliberate but required appraisals (i.e. those appraisals which “fall out” of the comprehension process), and 3) deliberate but optional appraisals (i.e. those which the agent may do if it has time). This is not to imply that the agent can choose not to do optional appraisals. Rather, continued comprehension may take precedence over these appraisals, so they may be skipped if events are occurring rapidly. Table 1 shows some possible appraisals and what types they might have.

Table 1: Possible types for common appraisals. Appraisal Type Appraisal Automatic Novelty Intrinsic pleasantness Causality Deliberate-required Goal/Need relevance Discrepancy from expectation Outcome probability Deliberate-optional Goal/need conduciveness Coping potential These particular appraisals are merely exemplars. Rather than take a firm stance on the exact set of appraisals at this point in our research, we view the comprehension process as providing constraints on what the set of appraisals should include. That is, some appraisals may fit more naturally than others into the comprehension process. Of course, the constraint goes both ways in that the comprehension process must allow for many common appraisals. Novelty and intrinsic pleasantness are likely properties that arise during the encoding process, which is mostly automatic. The agent may get causality automatically when it chooses a schema. That is, the schema itself may include the causality information in its structure. That is not to say that determining causality in general is not a complex process that is not automatic; presumably the agent must sometimes go through that process when it is generating a new schema to understand a novel set of events. The distinction between required and optional deliberate appraisals may lie primarily in what is required to understand what is happening in order to act immediately vs. what is required to understand the relationship of what is happening to the agent’s goals. This distinction is slightly muddled because the agent’s interpretation is colored by its goals. Thus, we consider goal/need relevance to be required; it may occur as part of the attend process (i.e. the agent may ignore events irrelevant to its goals). Goal/need conduciveness and coping potential, on the other hand, do not seem to be critical to the immediate understanding of what is happening, but clearly it is in the agent’s interest to do these if it can.

3.5

Prop ertie s revi sit ed

Does the comprehension process fulfill each of the requirements described earlier? •

• •

Domain independent: The system works on events, which are a domain-independent representation of what is happening. There can be domain-dependent knowledge used in comprehension, but the basic process can be used in any situation. Limited working memory: The system only represents one interpretation at a time. Incremental: The system can attend to and process one event at a time,



• • •





Happens over time: Since events occur over time, the agent processes them over time. The delay between events is actually necessary in order for the agent to complete its processing without becoming overwhelmed. The agent may be able to utilize any extra time it has to generate additional appraisals. Immediate comprehension: The agent commits to an interpretation starting with the first event. Supports hierarchical comprehension: The event schemas describe events at multiple levels, directly supporting hierarchical comprehension. Supports prediction: The hierarchical nature of comprehension and event schemas directly support prediction. Once a high-level schema has been recognized, the subsequent events that it contains are predictions. Immediate ambiguity resolution: If multiple interpretations are available, the agent will pick one; possible methods include the one that best fits its goal, its prediction, or its current emotional state. Error recovery: There are at least two levels of error recovery: a simple level, in which the schema structures are correct but the variable bindings need to be updated, and a complex level in which the structure itself is incorrect and the agent needs to choose a new schema. The simple level may be implemented via a truth maintenance system that automatically updates the variable bindings used in the next predicted event. The complex level may utilize arbitrary processing. For example, the agent can recall the most recent event sequence from episodic memory and try to find a schema that matches that set.

The properties that we have described thus far are properties of the comprehension process itself, regardless of its realization. However, since the comprehension process is embedded in a cognitive architecture, we get the additional property that the process is impacted by emotion (and other processes) via the impact of those processes on the cognitive architecture. For example, research indicates that one’s emotional state can impact memory retrievals [Forgas, 1999]. Thus, those aspects of the comprehension process that are impacted by memory (i.e. nearly every step) are also impacted by emotional state.

3.6

Fe eling , co mpreh ensi on, and co ping

Although we traditionally think of the external world as being the total environment in which the agent must behave, for the cognitive system, the environment also includes internal perceptions such as feelings [Frijda, 2005]. Feelings serve as feedback about the meanings derived via the comprehension process. Just like external events, these “internal events” are processed through PEACTIDM – that is, they must be perceived, attended to, and comprehended. It is by understanding its feelings that the agent can put its action urges in context and actually Intend them, as opposed to simply

reacting [Frijda, 2005]. This also allows the agent to generate novel responses to its feelings, such as when it tries to cope. Finally, it allows for the possibility of metaemotions, or emotions about one’s feelings (e.g. shame at feeling fear) [Lambie and Marcel, 2002].

4

Predictions

This theory entails a number of predictions. First, because the comprehension process results in immediate comprehension, if the agent’s perception of a stream of events is interrupted, the agent should have an understanding of the situation up to that point (which will often include predictions as to what might happen next and what other agents are trying to do). Thus, an interrupted agent will behave based on the information received so far, but that behavior may be flawed in predictable ways due to mistakes in the agent’s interpretation (similar to garden path phenomena in language processing). Furthermore, the comprehension process imposes a partial ordering constraint on appraisal generation. For example, as shown in Figure 2, the “Discrepancy from Expectation” appraisal occurs before the “Goal/Need Conduciveness” appraisal. This is consistent with Scherer’s [2001] theory which also hypothesizes a sequential ordering. However, the reasons for the hypotheses differ. Scherer’s reasoning is that it would be a waste of processing resources to do some appraisals when the results of others show that they are irrelevant. For example, “Discrepancy from Expectation” comes after “Goal Relevance,” because if an event is not relevant, then the agent should ignore it. By contrast, our theory imposes an ordering because of the functional requirements of the comprehension process. A corollary of this is that our model allows for the possibility that different emotions will inherently require different amounts of processing. If some emotions only require appraisals that occur earlier in the comprehension process, then those emotions will take less processing than those that require more appraisals. The idea that different emotions may require different amounts of “cognitive activity” is not new [Lazarus 1982]. The differences in processing requirements may in turn lead to timing effects. Other theories propose that some basic emotions are refined into more complex emotions [Ortony et al, 1988; Weiner, 1986], which is compatible with our theory. Given that comprehension takes time, if time constraints are tight, some appraisals may not get generated, meaning that under certain time pressures, some emotions may not occur, or may be based on appraisals that were generated in earlier situations. Under extreme time constraints, comprehension itself may not be possible, leading to purely reactive behavior. In between, it may be possible that an agent “misses” some of the events, leading to flawed interpretations. Our model also supports appraisals that can happen at different time scales. Some appraisals may be based on comprehension in novel circumstances that require multiple retrievals from long-term memory, or even significant internal problem solving to understand the

situation, while others could be based on comprehension in well practiced situations where essentially reactive comprehension is possible. These differences in comprehension processing can lead to very different time scales for generating appraisals. Combining this with the previous two points, we predict that the complete appraisal (and thus the emotional reaction) can change over time as the comprehension of the situation evolves over time, which is also consistent with other theories (e.g. Ortony et al. [1988] and Weiner [1986]). Finally, we can also make a prediction regarding the coping process. Since Intending only occurs as part of problem-focused coping, and since Intending is the only route to deliberate action, one expected outcome is that if someone engages in emotion-focused coping first (e.g. to form a new goal), then there should be an extra delay before the agent’s next action. This is hardly surprising since the agent essentially needs to do a task switch internally before it can proceed. However, our model shows explicitly why and how the delay is incurred, which may lead to empirical predictions regarding the timing of this in humans.

5

Conclusion

In conclusion, a comprehensive model of emotion based on appraisal theory needs a detailed theory about how appraisals are generated. Our hypothesis is that comprehension plays a central role in generating many appraisals. Furthermore, by theorizing about appraisals in the context of comprehension and vice versa, we impose important constraints on our theories which help guide us to plausible solutions. Finally, comprehension may provide the framework in which the agent does all of its processing, including postappraisal functions like coping. The next steps for our own research are to create and evaluate a complete computational model of the above theory of comprehension, as well as the other components of our theory of emotion as shown in Figure 1. By doing this, we will gain a greater understanding of how comprehension (and thus appraisal generation and coping) integrates with and guides the agent’s general processing and also develop new criteria for determining which appraisals are appropriate for emotion theory (i.e. which ones fit into the comprehension process). Finally, we will produce specific testable predictions that can be used to help guide future research in emotion theory.

References [Anderson and Lebiere, 2004] John Anderson and Christian Lebiere. The Atomic Components of Thought. Erlbaum, Mahwah, New Jersey, 1998. [Dewey, 1895] John Dewey. The Theory of Emotion. (2) The Significance of Emotions. Psychological Review, 2:13-32, 1895. [Forgas, 1999] Joseph Forgas. Network Theories and Beyond. In Tim Dalgleish and Mick Power, editors, Handbook of Cognition and Emotion, pages 591611, 1999. Wiley & Sons, Chichester, England.

[Frijda, 2005] Nico Frijda. Emotion Experience. Cognition and Emotion, 19(4):473-497, 2005. [Gratch and Marsella, 2004] Jonathan Gratch and Stacy Marsella. A Domain-independent Framework for Modeling Emotion. Cognitive Systems Research, 5:269-306, 2004. [Lambie and Marcel, 2002] John Lambie and Anthony Marcel. Consciousness and the Varieties of Emotion Experience: A Theoretical Framework. Psychological Review, 109(2):219-259, 2002. [Lazarus, 1982] Richard Lazarus. Thoughts on the Relations Between Emotions and Cognition. American Psychologist, 37:1019-1024, 1982. [Lazarus, 1991] Richard Lazarus. Progress on a Cognitive-Motivational-Relational Theory of Emotion. American Psychologist, 46(8):819-834, 1991. [Lewis, 1993] Richard Lewis. An Architecturallybased theory of Human Sentence Comprehension. PhD thesis. Carnegie Mellon University, 1993. [Newell, 1990] Allen Newell. Unified Theories of Cognition. Harvard University Press, Cambridge, Massachusetts, 1990. [Ortony et al, 1988] Andrew Ortony, Gerald Clore, and Allan Collins. The Cognitive Structure of Emotions. Cambridge University Press, Massachusetts, 1988. [Rumelhart, 1980] David Rumelhart. Schemata: The building blocks of cognition. In Rand Spiro, Bertram Bruce, & William Brewer, editors, Theoretical issues in reading comprehension: Perspectives from cognitive psychology, linguistics, artificial intelligence, and education, pages 33-58. Erlbaum, New Jersey, 1980. [Scherer, 2001] Klaus Scherer. Appraisal considered as a process of multi-level sequential checking. In Klaus Scherer, Angela Schorr, & Tom Johnstone (editors), Appraisal processes in emotion: Theory, Methods, Research, pages 92-120. Oxford University Press, New York, 2001. [Roseman and Smith, 2001] Ira Roseman and Craig Smith. Appraisal Theory: Overview, Assumptions, Varieties, Controversies. In Klaus Scherer, Angela Schorr, & Tom Johnstone (editors), Appraisal processes in emotion: Theory, Methods, Research, pages 3-19. Oxford University Press, New York, 2001. [Talmy, 1975] Leonard Talmy. Semantics and syntax of motion. In John Kimball, editor, Syntax and semantics Vol. 4, pages 181-238. Academic Press, New York, 1975. [Weiner, 1986] Bernard Weiner. An Attributional Theory of Motivation and Emotion. Springer-Verlag, New York, 1986. [Zacks and Tversky, 2001] Jeffrey Zacks and Barbara Tversky. Event Structure in Perception and Conception. Psychological Bulletin, 127(1):3-21, 2001.

Suggest Documents