connotation as a form of inference - Association for Computational ...

3 downloads 0 Views 392KB Size Report
moon is made of. As Barthes then remarks, con- notations may be based on a set of .... verb like 'fiction' or 'joke'. A difficulty (which we provisionnally avoided).
CONNOTATION JB BERTHELIN,

AS

A

University of Essex

FORM

OF

I ~qFERENC

E

(language and l i n g u i s t i c s ) , Wivenhoe Pk, COLCHESTER C043Sq, GREAT BRITAIN,

and INSTITUT DE PROGRAMMATION, Universit~ Paris 6, 75230 PARIS

CEDEX05,

FRANCE.

Abstract Story-processing systems have to deal, or avoid dealing, with INFERENCE CONTROL' ' 6 ' I i , 13.

posed to ("fromage" RC("cheese")) RC("french"),

when designing one such system 2, we were great-

RC("cheese as geological

ly helped by the "(ERC)RC" expression for con-

in the processing of a robot who asked what the

notation I. Our system is specialised in multi -

moon is made of. As Barthes then remarks,

con-

faceted descriptions of characters

notations may be based on a set of i n i t i a l

ex-

ever, in the most d i f f i c u l t

but also s i t u a t i o n - s p e c i f i c

(not, how-

problems of beliefs

ones, as

stuff")

("cheese"

RC("bizarre"),

pressions rather than on a single one, which is

about beliefs 4 and 8): here we present another aspect of the story character processing, name-

expressed by (EiRCl, E2RC2 . . . . . EnRCn) R C This seems to be an essential feature of conno-

ly the recursive EXPLANATION of inconsistencies

t a t i o n s , since i t allows emphasis on 'connoted'

appearing in the description of a character.

contents by means of an accumulation of

signi-

f i e r s . Another feature, elegantly i l l u s t r a t e d We give a very schematic system overview,

then

in 'l'envers des signes ' i 0

by

the

two steps

some details about the CONNOTATIONrules and an

( ( " v o i l e " RC("navire"))RC("po~sie"))RC("rh~to-

example of t h e i r application to a story.

r i q u e " ) , is t h e i r r e c u r s i v i t y . As a beginning we made an attempt to

express

this in AI terms by writing a program, BAQUIL, which finds the connotations with structure(E I

Introduction

RCi, E2RC2)RC in a recursive way. The i n t e r e s t of such connotations can be shown by15:

The allusiveness of human languages, in addition to being quite convenient in social l i f e , fies the use of variable amounts of 1 i g e n c e

.a doctor asks:"how is he feeling ?"

justi-

.the nurse answers:"he is groaning."

i n t e 1 -

where the nurse means, by connotation,that the

in processing a sentencer accor-

ding to the number of reasoning steps

patient is suffering:

leading

("groaning" RC("groaning")

to a position where a s a t i s f a c t o r y reaction be,14 comes possible, l i k e "to redefine 'substance

RC("suffering)),

when reading Spinoza". Let us represent a f i r s t

definitive

step by

be ( " f e e l i n g " R C ( " f e e l i n g " ) ,

"ERC" i . e .

but our understanding is direc-

ted by the previous interrogation,

"an expression is related

so that

the

result we expect from our system w i l l "groaning"RC("groa-

to a content ''1, which Barthes calls the denota-

ning") )RC("suffering").

tion: the second reasoning step w i l l

reached by consulting a semantic network and ob-

be repre-

This result shall

be

sented in the connotation formula: " (ERC) RC".

serving that 'groaning' is not exactly a case of

I t does not determine exactly where the reaso-

'feeling',

ning step leads to: i f we write E's content as

nurse means he is suffering"

C(E), we may have quite general

tation and an inference, and seemingly a

connotations

l i k e ("cheese"RC("cheese"))RC("english")

but an expression of i t . is

Thus both

" the

a connouseful

one: compare with Charniak's 6 "demon-demon

as op-

228

in-

teraction,

'want to buy candies' and 'shake

can run without an elaborate model 5 of persona-

piggy-bank' together t r i g g e r 'need money'; but

lities,

BAQUIL has no such extended world-knowledge.

about foxes and sparrows, which would perhaps

is specialised in DISCOURSE ATTITUDES l i k e rors',

'lies',

'jokes' etc. This also means

It 'er-

fail

it

and so deal with fables and folktales

to have goals 'common to most people', al-

though they are meant as human in a way.

System overview

Before explaining how i t works, we give an idea of BAQUIL's construction ( i . e .

i t s hierarchy).

HAlt

/

14, ~ , .

j,~

,,

,.

,,

".,

/ INPUT

o u'rP~JT

/

¢~/Al~Acr~l~,V'-"

\ ~

~-scmrr'lo~ \

--229--

ou~vr

Schema.tic explanation o f how the system runs

consistency. The metarules are represented in BAQUIL by i n s t r u c t i o n s : Mi in the NOTATION proce-

i) the dictionary input procedure builds, from a

dure, M2 and M3 in the CONNOTATIONprocedure.

f i l e whose structure is shown in the ' sample session ' , a classical semantic network 7 ' 9 " ' '

The l a t t e r also contain the instruction corresponding to the inference rules, while the

com-

parison rules form the comparison procedure.

2) the story s p e c i a l i s t submits sentences to a parser 12 and the resulting case structures

Discussion of the metarules

to the story character s p e c i a l i s t , whose actions include the c o m p a r i s o n s i n f e r e n c e s

and M1expresses the hypothesis that many interes-

detailed here.

ting antinomies can be detected during the 3) output procedures express the inferences English and, on request, detail the

pairwise matching of predicates concerning

in

one story character."

repre-

sentation of each character in the story. M2 aims to expres a more or less syntactic finding about the description of a character (co-presence of two antinomic ' n o t a t i o n s ' )

Metarules of BAQUIL

in terms of the 'notations' ( t h i s page and the following three w i l l what rules are applied by the story specialist:

themselves.

develop M3 means "use the lexical taxonomy when trying

character

to recognize a s i t u a t i o n " ;

this includes METARULES, RULES OF

as a result i t in-

troduce a d i s t i n c t i o n between natural languages,

COMPARISON and INFERENCE RULES.)

for the subcategories of ( f o r instance) "inconMI BAQUIL starts a connotation or inference only when a comparison rule has been applied

are not the same in d i f f e r e n t dic-

tionariesl

Consider French and English, "error"

and "mistake" vs. "erreur" and "m@rise", or

to a pair of predicates which are related to the set of descriptors of one character: the predicates are versions of the semantic case structure in terms of the current

sistency"

worse: the two cases of "to t e l l

a l i e " in Rus-

sian, i . e .

Still

" v r a t ' " vs. " l ' g a t ' " .

perhaps acceptable to

character

i t is

allow for important

pragmatic divergences between languages (and,

description, and are called 'NOTATIONS'

indeed, dialects or sociolects.) Moreover, we did not represent the vocabulary of other lan-

M2 Except when specified otherwise (inference

guages than French and English in our system,

rules R6, R7), the inference is expressed

so we lack precisions about how "whorfian" i t

by a 'notation' whose verb is a subcatego-

would turn out.

ry of either CHANGE or INCONSISTENCY. M3 Those subcategories are examined in the dictionary order and the f i r s t

Although a similar discussion of the comparison

one which

and inference rules would be necessary, we w i l l

permits the application of an inference

simply present them here along with some exam-

rule is selected.

ples. The examples are taken from a set of 20 stories (4 to 200 sentences)which were dealt

To sum up: (M1) comparison and then inference

with by the system at Essex in 1979-1980.

about (M3) subcat(gories of (M2) change or in-

--230--

Compari son rules

Exagpl e s.

Their object is to t e l l whether an inference must be started, or not. C1 the predicates d i f f e r only by a negation in

C1 Confucius is handsome, Confucius is not hand-

one of them ( i . e . same environment, verb etc.

some (in which case 'handsome' need not be

and none was i n f e r r e d . )

a p r i o r i present in the l e x i c o n . ) .

C2 s i m i l a r to C1 but there is a hyponymy be-

C2 Confucius is h o r r i b l e , Confucius is not ugly.

tween the verbs.

C3 lexical exclusion between the verbs of two

C3 Confucius is r i c h , Confucius is broke.

affirmative predicates. C4 transgression of l e x i c a l i n t e r d i c t i o n , or l e x i c a l necessity ignored.

C4 Confucius is human and f l i e s away. (interdiction) Confucius f l i e s away, he does not e x i s t . (necessity)

C5 C1, C2 or C3 applies and the f i r s t

predicate

expresses an inference.

C6 a predicate confirms an inference.

C5 the systeminferred Confucius is l y i n g , the story reveals he is joking ( l i k e C3).

C6 in the s i t u a t i o n above, Lao-tsu says that Confucius is lying BEFORE the story disconfirms i t .

C7 a predicate confirm a discarded inference.

C7 Lao-tsu's remark comes AFTER the revision

( t h i s occurs after a C5 s i t u a t i o n led to

of the inference.

the application of the relevant inference rule. )

The lexicon element has following slots in

The predicate or ' n o t a t i o n ' structure, which

its

permits the comparison, is

structure:

( l i s t of subcategories, are-they-mutually-ex-

( a f f i r m a t . - o r - n e g . , case frame, char. descr.) and the connotation has the same structure aug-

c l u s i v e - o r - n o t , supercategory, property l i s t )

mented by reference to premises ( 2, 3 or a l i s t

and the property l i s t

contain references to

other lexical elements.

of pairs i f an inference has been confirmed.)

--231-

Connotation rules: exam~es

tween levels of discourse independently of which character the inference is about, R3 and

As the rules by themselves do not suggest the

R5 make use of some knowledge associated e i t h e r ,

s i t u a t i o n s which make them useful, l e t ' s

as in the examples, with the name of the charac-

have

t e r , or with previous sentences about i t ,

some examples f i r s t .

e.g.

'Krylov is an author' could be part of the beginning of the story. RI . . . t h e servant said: " t h a t cow is not going to eat you." ( . . . )

The next morning,

she

sees one of them is missing and says: "Oh my God ! the grey cow ate one of these men".

R3 and R5, which connect the inference with a

.INFERENCE ABOUT THE SERVANT'S VIEW OF THE

previously observed d e t a i l , are " c a u s a l i t y "

COW.

rules:

in addition to references to the two

premises whose comparison started i t ,

the in-

ference has one reference to the predicate R2 (same example as

RI)

that j u s t i f i e d verb l i k e

.ACCORDING TO THE SERVANT, THE COW UNDERWENT

the choice of a more precise

'fiction'

or ' j o k e ' .

A CHANGE. A difficulty

(which we p r o v i s i o n n a l l y avoided)

is that some l e x i c a l

R3 A teacher quoting Krylov said that God sent

connections could repre-

sent 'necessary t r u t h s '

a cheese to a raven; a c h i l d objects that

that are j u s t sometimes

there is no God.

true, as "teachers are sometimes i g o r a n t " ; what

.INFERENCE ABOUT AN INCONSISTENCY; USING THE

R5 should do with them is not clear. However, i f

LEXICAL CONNEXION BETWEEN THE TOPIC 'God' AND

one considers f o l k t a l e s and fables, the d i c t i o n 2

'religious

nary connection one uses are almost always

discourse', BAQUIL SELECTS THAT

KIND OF INCONSISTENCY. "foxes are s l y , f u l l R4 the peasant believes that the tree w i l l h i t by other rabbits.. But i t

stop"

be so the question does not arise.

is not.

.INFERENCE: ERROROF THE PEASANT.

A difficulty

R5 (continuation of example f o r R3)

f o r R3 is that a ' t o p i c '

could be

the teacher answers that there is no cheese

connected both to ' r e l i g i o u s discourse' and

either.

' j o k e ' , so in the present state of the system

.INFERENCE ABOUT AN INCONSISTENCY; USING THE

one of the connections would be systematically

LEXICAL CONNECTION BETWEEN 'Krylov' AND 'au-

ignored: but f o r the time being, there are very

t h o r ' , AND ONE FROM 'author' TO ' f i c t i o n ' ,

few l e x i c a l

BAQUIL SELECTS THE LATTER SUBCATEGORYOF

we cannot decide which necessary truths r e a l l y

connections, the reason being that

belong in a lexicon.

INCONSISTENCY.

While the other rules e x p l o i t comparisons be232

-

BAQUIL's connotation rules

'references' are the two i n i t i a l inference;

Rules R6 and R7 are i l l u s t r a t e d by the story of the vegetarian wolf: given a d i c t i o n a r y

'hypotheses come from the l i s t

subcategories of the i n i t i a l

which connects ' w o l f '

premises of the

verb, i . e .

to ' l i e ' ,

of

hence

i n f e r s from the be-

ginning of the story, where the wolf is con-

the

t r a d i c t e d by someone, that i t

d i f f e r e n t cases of 'change' or 'inconsistency' as represented

to 'bad a c t i o n ' ,

the system f i r s t

the s t o r y - t e l l i n g

in the lexicon.

telling

explicitly

lies;

later,

warns the reader about i t , that i t does not. The in-

ference is NOT erased, but earmarked as f a l l a cious; so R7 applies when a character says:

R1 i f both references are about the same character descriptor,

this wolf is l y i n g ; but, i f the story had

the inference is also about

that descriptor.

not denied the p o s s i b i l i t y

( t h i s allows f o r s t o r i e s

of a l i e ,

R6 would

have applied. One can see that R6, R7 and R8

inside the story).

are not quite s a t i s f y i n g .

I suspect them of

being in need of some refinement. The set of situations

R2 i f the i n f e r e n c e ' s verb is not yet specified

or e t c . '

and the inference is about the d e s c r i p t o r of

' c l e v e r character or lucky guess

is not c l e a r l y defined, and I do not

know what to do with the L i a r ' s Paradox.

the second reference, BAQUIL t r i e s the cases of 'change'. R3 i f the verb of a hypothesis is connected to

On the contrary, R3 and R5 give less trouble.

the name of the character in a reference by

This is shown by the 'sample session' next

a 'be about' l i n k ,

page, representing approximately 1.8 second

the hypothesis is selec-

ted.

CPU (using about 25K core) on a PDP-iO.

R4 i f one reference comes from a ' b e l i e f ' the other from the ' s t o r y - t e l l i n g ' ,

and

the verb

of the inference should be ' e r r o r ' .

R5 i f

the verb of a hypothesis is connected to

i t s character's name by a 'tendency' l i n k ,

RULES APPLIED: C4 " l e x i c a l

necessity"

the hypothesis is selected. R3 " t o p i c " R5 "tendency"

R6 i f a character's discourse matches an i n f e rence, BAQUIL checks whether the character is clever or has made a lucky guess etc.

R71if the s t o r y - t e l l i n g

C3 " l e x i c a l exclusion"

contradicts an i n f e r e n -

and~ce, the inference is ' d i s c a r d e d ' ; i f the R8Ydiscarded inference is matched by a discourse, one looks f o r an ' e r r o r '

or ' t r i c k '

etc. -

233--

SamPle session

(STORY INPUT)

(DICTIONARY INPUT)

Food was short. A teacher quoted Krylov.

WORDS

He said that Krylov wrote:

FOOD ISA SUBSTANCE

'God sent a cheese to the raven.'

SHORT ISA STATE

A child said:

TEACHER ISA MAN

'There is no God ! '

QUOTE ISA DISCOURSE

but the teacher replied

AUTHOR ISA MAN

'there is no cheese,

KRYLOV ISA AUTHOR

either . . . '

WRITE ISA DISCOURSE

THE END

SAY ISA DISCOURSE GOD ISA PUTATIVE BEING CHEESE ISA FOOD

(INFERENCES)

RAVEN ISA BIRD CHILD ISA MAN PRODUCE INCONSISTENCY ISA DISCOURSE

(God sent a cheese 1(there is no God

(comment: abbreviated PRODINC)

C4 in description of God and R3/God:

LIE ISA PRODINC

"the child means Krylov has a r e l i g i o u s

MISTAKE ISA PRODINC

discourse".

JOKE ISA PRODINC ILLUSION ISA PRODINC

(Krylov has a r e l i g i o u s discourse

RELIGIOUS DISCOURSE ISA PRODINC

2(the teacher quotes Krylov

FICTION ISA PRODINC

C4 in description of Krylov and default:

BE ABOUT ISA DISCOURSE

"the child means the teacher has an inconsis-

MEAN ISA DISCOURSE

tent discourse".

IGNORANT ISA STATE END WORDS (God sent a cheese 3(there is no cheese C4 in description of cheese and R5/Krylov: "the teacher means Krylov's discourse is f i c LINKS

tion".

USUAL(AUTHOR, FICTION) USUAL(CHILD, IGNORANT) USUAL(IGNORANT, MISTAKE) BE ABOUT(RELIGIOUS DISCOURSE, GOD)

(Krylov has a r e l i g i o u s discourse 4 (Krylov's discourse is f i c t i o n

FORBIDE(RELIGIOUS DISCOURSE, QUOTE)

C3 in description of Krylov and R5/child:

comment: that is to account for the location

"the teacher means the child is mistaken"

being Soviet Russia. NECESSITY(ACTION, EXIST)

END OF INFERENCES

NECESSITY(ACTION, AVAILABLE)

END OF SESSION

END LINKS -234-

Conclusion

of a nationality adjective suggesting a character t r a i t (Cretan for l i a r , e t c . ) , which is

While not achieving much by i t s e l f , Baquil is an important component of the larger system 2

helpful in many cases: when the nationality

currently b u i l t at Paris 6, and could probably

guage (Chines texts about the ancient kingdoms), but also when there are several possible

also be integrated in a large AI MT system as an expert of indirect descriptions of characters, for instance i t could recognize the use

References (1) BARTHES: Elements of semiology, Hill and

adjective is not familiar in the target lan-

interpretations for the nationality adjective in terms of personality ( 'qua' ambiguity).

(i0~ GENETTE: Figures I , Paris, 1966. (11) HAYES: The logic of frames, in METZING, edo

Wang, New York. (2)

BERTHELIN and SABAH: Le traitement des personnages, congr~s AFCET-IRIA, Toulouse

Frames, conceptions and text understanding, De Gruyter, 1980. (12) SABAH: A contribution to story understan-

1979. (3)

BIEN: Toward a multiple environments model

ding, thesis at Universit~ Paris 6, 1978. (13) SCHANKand ABELSON: Scripts, plans, goals

of NL, IJCAI-4. (4) BIEN and WILKS: Speech acts and multiple

and understanding, LEA, 1977. (14) WILKS: Grammar, meaning and the computer

environments, IJCAI-6. (5) CARBONNELL: Computer model of human per-

analysis of language, RKP, 1972 (15) WITTGENSTEIN: Philosophical Investigations,

sonality t r a i t s , IJCAI-6. (6) CHARNIAK: Toward a model of children story understanding, thesis at MIT, 1972.

Blackwell, 1953. IJCAI-4 and IJCAI-6 refer to the Proceedings of the international Joint Conferences on A r t i f i cial Intelligence.

(7) CHOURAQUI: Sur un r~seau s~mantique a c t i f , congr~s AFCET-IRIA, Toulouse 1979. (8) CLARK and MARSHALL: Mutual knowledge and definite reference, mimeo, Stanford Univ.

The story in the sample session is adapted from "The Big Red Joke Book" by Benton and Loomes, Pluto Press, London, 1979.

(9) FAHLMAN: Representing everyday knowledge, MIT, 1977. --235--

Suggest Documents