A Neuro-Computational Model of Economic ... - Semantic Scholar

0 downloads 0 Views 4MB Size Report
Jun 10, 2015 - KongFatt Wong-Lin for helpful discussions, and Alberto Bernacchia, John Murray and Xiao-. 20. Jing Wang for comments on the manuscript.
Articles in PresS. J Neurophysiol (June 10, 2015). doi:10.1152/jn.00184.2015

1 2 3 4

A Neuro-Computational Model of Economic Decisions

5

Aldo Rustichini1 and Camillo Padoa-Schioppa2

6

1

Department of Economics, University of Minnesota, 1925 4th Street South 4-101, Minneapolis,

7

MN 55455. 2 Departments of Anatomy and Neurobiology, Economics, and Biomedical

8

Engineering, Washington University in St Louis, 660 South Euclid Avenue, St Louis, MO 63110

9 10 11 12 13

Running title: Wang's model for economic decisions

14

Manuscript information: 250 words in Abstract; 750 words in Introduction; 10 Figures; 1

15

Table

16

Correspondence to: A.R. ([email protected]) or C.P.-S. ([email protected])

17

Acknowledgments: This research was supported by the National Institute on Drug Abuse (grant

18

R01-DA032758 to C.P.-S.) and the National Science Foundation (grant SES-1357877 to A.R.).

19

We thank Nicolas Brunel, Xinying Cai, David Levine, John Murray, Xiao-Jing Wang and

20

KongFatt Wong-Lin for helpful discussions, and Alberto Bernacchia, John Murray and Xiao-

21

Jing Wang for comments on the manuscript. We also thank Xiao-Jing Wang and KongFatt

22

Wong-Lin for sharing their Matlab code. 1 Copyright © 2015 by the American Physiological Society.

23

Abstract

24

Neuronal recordings and lesion studies indicate that key aspects of economic decisions take

25

place in the orbitofrontal cortex (OFC). Previous work identified in this area three groups of

26

neurons encoding the offer value, the chosen value and the identity of the chosen good. An

27

important and open question is whether and how decisions could emerge from a neural circuit

28

formed by these three populations. Here we adapted a biophysically realistic neural network

29

previously proposed for perceptual decisions (Wang 2002; Wong and Wang 2006). The domain

30

of economic decisions is significantly broader than that for which the model was originally

31

designed: yet the model performed remarkably well. The input and output nodes of the network

32

were naturally mapped onto two groups of cells in OFC. Surprisingly, the activity of

33

interneurons in the network closely resembled that of the third group of cells, namely chosen

34

value cells. The model reproduced several phenomena related to the neuronal origins of choice

35

variability. It also generated testable predictions on the excitatory/inhibitory nature of different

36

neuronal populations and on their connectivity. Some aspects of the empirical data were not

37

reproduced, but simple extensions of the model could overcome these limitations. These results

38

render a biologically credible model for the neuronal mechanisms of economic decisions. They

39

demonstrate that choices could emerge from the activity of cells in the OFC, suggesting that

40

chosen value cells directly participate in the decision process. Importantly, Wang's model

41

provides a platform to investigate the implications of neuroscience results for economic theory.

42 43

Introduction

44

Economic choices are thought to entail two mental stages: subjective values are first assigned to

45

the available options, and decisions are made by comparing values. Evidence from lesion studies

46

(Fellows 2011; Rudebeck and Murray 2014), functional imaging (Bartra et al. 2013; Clithero and

47

Rangel 2013) and neurophysiology (Mainen and Kepecs 2009; Padoa-Schioppa 2011; Wallis

48

2011) indicates that choices, in particular choices between goods, involve the OFC. Neuronal

49

recordings in primates choosing between different juices identified three groups of neurons in

50

this area: offer value cells encoding the value of individual juices, chosen juice cells encoding the

2

51

binary decision outcome, and chosen value cells encoding the value of the chosen juice (Padoa-

52

Schioppa 2013; Padoa-Schioppa and Assad 2006). Prima facie, these groups of neurons appear

53

sufficient to characterize ‒ and possibly generate ‒ a decision. Indeed, offer value cells capture

54

the input to the decision process, while chosen juice and chosen value cells capture the identity

55

and value of the chosen good, and thus the decision outcome. A primary goal for decision

56

neuroscience is to formalize this intuition by building a biologically realistic model in which the

57

groups of cells found in OFC form a circuit that generates decisions. Ideally, such a model would

58

encompass all that is known about these neurons and, concurrently, make new and testable

59

predictions.

60

Our thinking about the mechanisms of economic decisions is influenced by work on motion

61

perception (perceptual decisions). In a simplified scheme, two brain regions play a primary role:

62

the middle temporal (MT) area, where neurons encode the momentary evidence, and the lateral

63

intraparietal (LIP) area, where cells represent the decision outcome in the form of a planned

64

saccade (Gold and Shadlen 2007; Parker and Newsome 1998). Notably, there is a natural

65

analogy between neurons in MT and offer value cells, and between neurons in LIP and chosen

66

juice cells. In contrast, chosen value cells do not have a known correspondent in perceptual

67

decisions.

68

Several models have been proposed to describe the neuronal mechanisms of perceptual decisions

69

(Bogacz et al. 2006; Drugowitsch and Pouget 2012; Gold and Shadlen 2007). At the biophysical

70

level, a leading proposal is Wang's model, in which decisions emerge from a balance of recurrent

71

excitation and pooled inhibition. Different variants of the model account for perceptual decisions

72

(Wang 2002; Wong and Wang 2006), similarity judgments (Engel and Wang 2011), probabilistic

73

inference (Soltani and Wang 2010), behavior in a competitive game (Soltani et al. 2006) and

74

flexible sensori-motor mapping (Fusi et al. 2007). More recently, the model has been adapted to

75

describe the activity in LIP during foraging tasks (Soltani and Wang 2006) and to fit aggregate

76

neural activity during value-based decisions (Hunt et al. 2012; Jocham et al. 2012). However, a

77

precise mapping between Wang's model and the activity of neurons in OFC (or any brain area)

78

during economic decisions has not yet been attempted. In this study, we examined the extent to

79

which Wang's model can reproduce neuronal activity in the OFC. From a modeling perspective,

80

this is a challenging test because – as we argue in detail below – the domain of economic 3

81

decisions is significantly broader than that for which the model was originally designed, and

82

because neuronal activity in the OFC during economic decisions only partly resembles that in

83

area LIP during perceptual decisions. Our deliberate goal was to test the model without changing

84

its structure. Importantly, all the parameters in the model represent biophysical quantities such as

85

synaptic efficacies and time constants and are derived, at least approximately, from empirical

86

measures (Wang 2002). In our investigation, we used the same parameters previously used to

87

model perceptual decisions (Wong and Wang 2006).

88

In most respects, Wang's model reproduced the activity of different groups of cells in OFC

89

remarkably well. The two layers corresponding to the input and to the categorical output of the

90

network were identified with offer value cells and chosen juice cells, respectively. Most

91

surprisingly, the activity of inhibitory interneurons in the network closely resembled that of the

92

third group of neurons found in OFC, namely chosen value cells. The model also reproduced

93

several phenomena related to the neuronal origins of choice variability, namely choice hysteresis,

94

the "predictive activity" of chosen juice cells and the "activity overshooting" of chosen value

95

cells (Padoa-Schioppa 2013). Two aspects of the empirical data were not reproduced. First, the

96

model did not include neurons with negative encoding. Second, a significant baseline in the

97

activity of offer value cells introduced distortions in the behavior of the network. However,

98

simple extensions of the model could overcome these limitations. The results of this study render

99

a biologically credible model for the neuronal mechanisms of economic decisions.

100 101

Materials and Methods

102

Structure of the model

103

In its extended form (Brunel and Wang 2001; Wang 2002), the model is a recurrent network of

104

2,000 spiking neurons, of which 80% (NE) are excitatory pyramidal cells and 20% (NI) are

105

inhibitory interneurons. All neurons in the network are leaky integrate-and-fire cells endowed

106

with biophysically realistic parameters. Two external stimuli provide the primary input, and each

107

stimulus activates a fraction f = 0.15 of pyramidal cells. The remaining (1-2f) NE pyramidal cells

108

are non-responsive (untuned). The synaptic input to each neuron is both excitatory and inhibitory. 4

109

Excitatory inputs are through AMPA- and NMDA-mediated synapses, while inhibitory inputs

110

are through GABAA-mediated synapses. For each neuron, the input has an external component

111

and a recurrent component. For the two groups of selective pyramidal cells, the external

112

component includes the external stimulus. In addition, each neuron in the network receives an

113

external background noise distributed as a Poisson process. Stimulus current and external

114

background are through AMPA-mediated synapses. The recurrent component is provided by

115

other neurons in the network. In analogy to the Hebbian rule, synapses between neurons in a

116

given group (which fire together) are potentiated by a factor w+>1, whereas synapses between

117

neurons in different groups are depressed by a factor w–1) discussed above. In essence, we propose that the network responds to

21

571

the changes in value ranges by altering the synaptic efficacies between OV cells and CJ cells

572

(Eq.19). As previously shown, this synaptic plasticity can be achieved with a mechanism of

573

Hebbian learning (Padoa-Schioppa and Rustichini 2014). Framing effects may be reproduced if

574

this Hebbian learning lags range adaptation of OV cells.

575

Stable points of the dynamical system

576

One important question concerns the number steady states of the model. Wong and Wang (2006)

577

provided a bifurcation diagram, in which the number of attractors and the working-memory

578

regime were examined as a function of the input firing rate (μ0) and the parameter w+. Their

579

analysis was done for the reduced version of the model (W2) and for coh = 0. Generating a

580

bifurcation diagram for W11 is more difficult because the system is time dependent and high

581

dimensional. As a first step, we conducted a series of simulations to examine the end state of the

582

network for different sets of offer values (Fig.9). For these simulations we set ΔA = ΔB = [0 15]

583

and δJstim = (1, 1), such that ρ = 1. For comparison, although w+ defined in W11 is not identical

584

to w+ defined in W2, the value of w+ used here is comparable to values examined by Wong and

585

Wang (2006). In contrast, stimulus currents are substantially higher in our case. Specifically, in

586

our simulations we normally used ΔJ = JAMPA,input / JAMPA,ext,pyr = 30 (see Table 1). In such

587

conditions and for the data point at the center of Fig.3 (right panel), the product JAMPA,input rOV is

588

roughly equal to 100 JAMPA,ext,pyr (see Eq.19). In contrast, in the original W11, the equivalent

589

product was set to 40 JAMPA,ext,pyr when coh = 0.

590

Consider Fig.9, panel a2. In this scatter plot, x- and y-axes represent, respectively, the "final"

591

activity of CJA and CJB cells as measured in the time window 400-600 ms after stimulus on.

592

Each symbol + represents one trial and symbols are color coded according to the offer type. The

593

color legend is indicated in the leftmost panel in the same row. Rows 2 and 3 of the figure

594

illustrate the same data focusing on a subset of trials, with the color legend indicated in the

595

leftmost column. Several points can be noted. First, the network clearly separated between the

596

two end points when the value difference was sufficiently high (purple, blue and green symbols

597

in panels b1 and b2). It did not clearly separate between the end points when offer values were

598

close (panel b3). In separate simulations, we observed that extending the stimulus in time for 1 s

599

or increasing the levels of noise in the system did not affect these results (not shown). The 22

600

simulation in panels b1-b3 were based on our normal parameters w+ = 1.75 and ΔJ = 30. We then

601

repeated the simulations using parameters w+ = 1.85 and ΔJ = 30 (panels c1-c3), and w+ = 1.75

602

and ΔJ = 15 (panels d1-d3). The results obtained were qualitatively similar, although the

603

separation for difficult decisions tended to increase when w+ was raised (panel c2) and to

604

decrease when ΔJ was reduced (panel d2).

605

The formulation of W2 for economic decisions and a more formal analysis of the steady states

606

will be presented elsewhere.

607

The baseline activity of offer value cells

608

As illustrated in Fig.1a, offer value cells presented a baseline activity of ~6 sp/s, which is smaller

609

but comparable to the value-related dynamic range (~10 sp/s). Experimental work showed that

610

this baseline activity does not depend on the value range, on the juice preference or, for given

611

juice pair, on the relative value of the juices (Padoa-Schioppa 2009). As we examine the effects

612

of introducing a baseline in the activity of OV cells, two premises are in order. First, the original

613

W11 set the baseline equal to zero – a reasonable approximation since the baseline activity of

614

MT cells is indeed low. Second, the presence of a substantial baseline changes dramatically the

615

input to the network.

616

In a series of simulations, we tested W11 introducing a realistic baseline. If we consider the

617

situation in which all synaptic efficacies are balanced, the network is robust to the introduction of

618

a baseline. In this condition, even using the same parameters as in the initial simulations, we

619

obtained for the indifference function a straight line through the origin (not shown). Small

620

adjustments to the parameters provided realistic profiles for CJ cells and for CV cells (Fig.10a-f).

621

However, problems arise when one considers the fact that the relative value ρ between two goods

622

should be free to assume any value (see above section Imposing non-trivial relative values)

623

and/or the fact that (only) the value-dependent component of offer value cells activity adapts to

624

the range of offer values (see above section Range adaptation, context-dependent preferences

625

and Hebbian learning). The approach used in the baseline-subtracted case, namely to introduce

626

an imbalance in the synaptic efficacies no longer works. In other words, the indifference function

627

becomes non-homogenous even if the synaptic imbalance is limited to the input (Fig.10g-l). This

23

628

is because the synaptic efficacies δJstim multiply the entire activity of OV cells, including the

629

baseline. As a consequence, when δJstim, 1>δJstim, 2, the network chooses zero quantities of juice A

630

over small quantities of juice B. In conclusion, the baseline activity of offer value cells poses a

631

challenge for W11. Possible ways to address this issue are discussed below.

632 633

Discussion

634

We examined a biophysically realistic model previously proposed to describe the activity of area

635

LIP during perceptual decisions, namely W11, and we tested the extent to which it could

636

reproduce the activity recorded in the OFC during economic decisions. Our analysis represented

637

a challenging test for the model for at least four reasons. (1) Stimuli (i.e., offers) in economic

638

decisions are intrinsically two-dimensional, whereas stimuli in perceptual decisions are one-

639

dimensional. (2) The relative value between two given goods is arbitrary. Thus to accommodate

640

subjective preferences, the network must be flexible. (3) Offer value cells undergo range

641

adaptation but decisions should, in first approximation, not depend on the range of offer values.

642

In other words, the network must be adaptable. (4) The activity of neurons in the OFC only

643

partially resembles that of neurons in area LIP, for which the model was designed. Specifically,

644

in addition to offer value cells (analogous to MT) and chosen juice cells (analogous to LIP), there

645

are chosen value cells, which have no known correspondent in perceptual decisions. Moreover,

646

even the resemblance between chosen juice cells and neurons in LIP is weak, because the

647

representation of chosen juice cells is categorical while that of LIP is continuous, and because

648

chosen juice cells do not present the race-to-threshold and the working memory activity the

649

model was originally designed to reproduce. In spite of all these challenges, the spirit of this

650

study was not to design a new model. Rather, we tested Wang's model as much as possible off-

651

the-shelf, without modifying its structure or even adjusting its parameters. We did, however,

652

adapt the network in two important ways. First, we strengthened the connections between OV

653

cells and CJ cells (reflecting the fact that offer value cells and chosen juice cells are found in the

654

same area). Second, we introduced mechanisms of synaptic plasticity to account for the required

655

flexibility and adaptability of the model.

24

656

Several traits make W11 a particularly attractive model. First, the model is biophysically realistic.

657

Elements in the extended, spiking network are neurons endowed with realistic synaptic

658

connections and time constants, and the mean-field approach preserves the realism of the model.

659

Second, the model is nearly without free parameters. In other words, the model is very

660

constrained, especially when tested outside its original domain. Third, the model makes testable

661

predictions on the excitatory/inhibitory nature of different neuronal populations and on their

662

connectivity. These traits set W11 apart from other computational models of economic decisions

663

including the drift-diffusion models, mutual inhibition models and the leaky competing

664

accumulator model (Bogacz et al. 2006; Hare et al. 2011; Krajbich et al. 2010; Usher and

665

McClelland 2001). Furthermore, the explicit inclusion of inhibitory interneurons in W11

666

provides an account for chosen value cells, which are not explained in more schematic models

667

including the drift-diffusion model and the leaky competing accumulator model. Other studies

668

examined Wang's model in the context of value-based decisions. In particular, Behrens and

669

colleagues used the simplified W2 version to generate aggregate regressors for the analysis of

670

MEG data (Hunt et al. 2012; Jocham et al. 2012). In this respect, we note that the tests ran here

671

were significantly more stringent because we matched each node in the network with a specific

672

group of neurons, because the original analysis of neuronal data (Padoa-Schioppa and Assad

673

2006) tested a large number of variables and not only variables generated from the model, and

674

because we reproduced a variety of empirical findings.

675

We found that W11 provides a remarkably accurate account for the activity of neurons in the

676

OFC. The input node of the model (OV cells) can be identified with offer value cells. The model

677

naturally generates binary decisions, and the activity profile of the output node (CJ cells) is fairly

678

similar to that of chosen juice cells. Perhaps most surprisingly, the activity of inhibitory

679

interneurons (CV cells) is very similar to that of chosen value cells (more on this below). In

680

addition, W11 reproduces several phenomena related to the neuronal origins of choice variability,

681

namely choice hysteresis, the predictive activity of chosen juice cells and the activity

682

overshooting of chosen value cells. We examined what changes in the network are necessary to

683

ensure that relative values are arbitrary and to accommodate the fact that offer value cells

684

undergo range adaptation. We found that these two requirements are addressed only when

685

synaptic plasticity is introduced in the connections between OV cells and CJ cells. We also found

25

686

that W11 falls short of the experimental data in at least two ways. First, the model includes only

687

neurons with positive encoding (i.e., higher firing rate for higher values). In contrast, for each of

688

the three variables, a substantial fraction of the neurons in OFC presents negative encoding (i.e.,

689

higher firing rate for lower values) (Padoa-Schioppa 2013; 2009). Second, the model can provide

690

the flexibility necessary to accommodate any relative value, or it can handle a significant

691

baseline activity in the input node (OV cells), but it cannot do both of these things at the same

692

time. As discussed below, these limitations appear conceptually surmountable, although more

693

theoretical work is necessary in this respect.

694

Chosen value signals and interneurons in the OFC

695

One of the main results of this study is that interneurons of W11 were found to encode the

696

chosen value. This observation was unforeseen and somewhat extraordinary. Interneurons were

697

included in the progenitor models of W11 for biological realism and network stability (Amit and

698

Brunel 1997) – not to reproduce empirical observations analogous to chosen value cells.

699

Furthermore, chosen value cells have no known correspondent in perceptual decision tasks. In

700

contrast, chosen value signals have been observed in numerous studies of value-based decisions

701

and in multiple brain regions (Amemori and Graybiel 2012; Cai et al. 2011; Cai and Padoa-

702

Schioppa 2014; 2012; Grabenhorst et al. 2012; Lau and Glimcher 2008; Lee et al. 2012; Padoa-

703

Schioppa and Assad 2006; Roesch et al. 2009; Strait et al. 2014; Sul et al. 2010; Wunderlich et al.

704

2010). While it is clear that chosen value signals could inform a variety of mental functions

705

including associative learning, visual attention, emotion and others, the possible contributions of

706

chosen value signals to economic decisions have remained mysterious. The present results

707

suggest that chosen value cells in the OFC may be directly involved in the decision. This

708

suggestion is bolstered by the fact that CV cells in the model reproduce the activity overshooting

709

seen in chosen value cells. Importantly, the hypothesis that chosen value cells in the OFC

710

participate in the decision does not imply that all chosen value signals in the brain are the

711

signature of a decision process. Indeed, once computed and explicitly represented by a neuronal

712

population, the variable chosen value could be transmitted broadly to other brain regions and

713

thus facilitate various mental functions.

26

714

Considering the similarity between CV cells and chosen value cells from a different angle, W11

715

makes the strong prediction that all chosen value cells in the decision circuit are interneurons and

716

that all chosen juice cells are pyramidal cells. Future experiments will test these predictions.

717

Importantly, the present model provides an account for the decision circuit. However, OFC likely

718

comprises additional neuronal populations that do not directly intervene in the decision but

719

receive input from the decision circuit. Thus testing the predictions of the model will require

720

distinguishing between neurons that contribute to the decisions and other neurons within OFC.

721

Notably, the hypothesis that the variable chosen value is first computed in OFC and then

722

transmitted to other brain regions implies that this variable is passed from the decision circuit to

723

pyramidal cells in layers 5 and 6 of OFC, where cortico-cortical projections originate. In

724

principle, NS pyramidal cells in W11 (whose activity resembles that of chosen value cells with

725

low dynamic range) could serve that purpose, although alternate schemes are also possible.

726

In its current form, W11 presents two clear limitations – the fact that the model only includes

727

neurons with positive encoding and the challenge posed by the baseline activity of offer value

728

cells. Negative encoding is an intriguing phenomenon, partly because it is not observed in

729

sensory systems (to our knowledge). More detailed neurophysiology work is necessary to

730

establish the excitatory/inhibitory nature of neurons with negative encoding, whether these

731

neurons are preferentially located in specific cortical layers, and whether they present specific

732

patterns of connectivity. From a modeling perspective, the presence of cells with negative

733

encoding can be viewed as a degree of freedom present in the empirical data but not used by the

734

model. In principle, this degree of freedom could help resolve the challenge posed by the

735

baseline activity in offer value cells, because two inputs encoding the same variable with

736

opposite sign (excitatory and inhibitory) could be combined in a way that reduces or even

737

eliminates any baseline. In the light of these considerations, we view W11 as a benchmark and a

738

starting point for biophysically realistic models of economic decisions. The next steps are to

739

examine empirically the excitatory/inhibitory nature of different groups of cells in OFC, to assess

740

possible differences between cortical layers, and to establish the actual connectivity between the

741

different groups of neurons (a set of challenging tasks!). The results of these enquiries should

742

then inform new and more accurate neuro-computational models.

27

743

To conclude, Wang's model provides a biologically credible account for the neuronal

744

mechanisms of economic decisions. In the model, decisions are generated by three groups of

745

cells whose activity closely resembles the activity of the three groups of neurons previously

746

found in the OFC. This close resemblance does not demonstrate that economic decisions take

747

place in the OFC, or that the decision mechanisms are based on recurrent excitation and pooled

748

inhibition. However, the model does provide an important proof of concepts that economic

749

decisions could emerge from the activity of neurons identified in this area. In this framework,

750

W11 suggests that chosen value cells are a key component of the decision process. The model

751

makes several non-trivial predictions that require further empirical testing. Last but not least,

752

Wang's model represents an important bridge between neuroscience and economics, providing a

753

platform to investigate the implications of neuronal data for economic theory. We will examine

754

some of these implications in future work.

755 756 757

28

758

Table 1. Parameters used in simulations.

Network parameters (same as in W11) NE NI Cext F rext

1,600 400 800 0.15 3.0 sp/s

Total number of pyramidal cells Total number of interneurons Number of external connections per cell Fraction of pyramidal cells associated to each stimulus/good Mean firing rate of external cells

Time constants, synaptic efficacies and noise (same as in W11) τAMPA τNMDA τGABA JAMPA, ext, pyr JAMPA, rec, pyr JNMDA, rec, pyr JGABA, rec, pyr JAMPA, ext, in JAMPA, rec, in JNMDA, rec, in JGABA, rec, in γ ση

2 ms 100 ms 5 ms -0.1123 -0.0027 -0.00091979 0.0215 -0.0842 -0.0022 -0.00083446 0.0180 0.641 0.020 A

Time constant for AMPA-mediated synapses Time constant for NMDA-mediated synapses Time constant for GABA-mediated synapses Efficacy of external synapses, pyramidal cells Efficacy of recurrent AMPA synapses, pyramidal cells Efficacy of recurrent NMDA synapses, pyramidal cells Efficacy of recurrent GABA synapses, pyramidal cells Efficacy of external synapses, interneurons Efficacy of recurrent AMPA synapses, interneurons Efficacy of recurrent NMDA synapses, interneurons Efficacy of recurrent GABA synapses, interneurons Proportionality between input firing rate and gating variable Noise parameter of the Ornstein-Uhlenbeck process

Parameters of input-output function for integrate-and-fire neurons (same as in W11) IE gE cE II gI cI

125 0.16 310 177 0.087 615

Threshold current, pyramidal cells Gain factor, pyramidal cells Noise factor, pyramidal cells Threshold current, interneurons Gain factor, interneurons Noise factor, interneurons

Parameters used to model OV cells r0 Δr a b c d JAMPA, input

0 or 6 sp/s 8 sp/s toffer + 175 ms 30 ms toffer + 400 ms 100 ms 30*JAMPA, ext, pyr

Baseline activity Dynamic range See Eq.23 See Eq.23 See Eq.23 See Eq.23 Efficacy of AMPA-mediated synapses from OV to CJ cells

759 760

29

761

Figure legends

762

Figure 1. Summary of experimental results. a. Neuronal response encoding the offer value B. In

763

the left panel, the x-axis represents different offer types. Black dots represent the percent of trials

764

in which the animal chose juice B and red symbols indicate the neuronal firing rate. Each symbol

765

represents one trial type. Diamonds and circle indicate trials in which the animal chose juice A

766

and juice B, respectively. In the right panel, the same neuronal response (y-axis) is plotted

767

against the encoded variable (x-axis). It can be noted that the encoding is close to linear. b.

768

Activity profile for offer value cells. The plot illustrates the population activity. For each neuron,

769

trials were divided in three groups (tertiles) based on the value of the encoded juice (low,

770

medium, high). A trace was computed for each group of by averaging the activity across trials,

771

and the resulting traces were averaged across the population. Only cells with positive encoding

772

are shown here. c. Neuronal response encoding the chosen juice. Same conventions as in panel a.

773

In this case, the activity is roughly binary depending on the juice chosen by the animal. d.

774

Activity profile for chosen juice cells. In this case, trials were divided depending on whether the

775

animal chose the juice encoded by the cell (E chosen) or the other juice (O chosen). Here the

776

"encoded" juice was defined as that which elicited higher activity. Traces are population

777

averages. e. Neuronal response encoding the chosen value. Same conventions as in panel a. f.

778

Activity profile for chosen value cells. In this case, trials were divided in three groups (tertiles)

779

depending on the chosen value (low, medium, high). Traces are population averages. The figure

780

is adapted from (Padoa-Schioppa 2013).

781

Figure 2. Schematics of W11. Excitatory and inhibitory connections are indicated in blue and

782

red, respectively. Different arrow heads indicate synaptic efficacies between different groups of

783

cells (see figure legend and corresponding numerical values in Table 1).

784

Figure 3. Illustration of inputs for random dot task (left) and economic choice task (right).

785

Figure 4. Initial test of Wang's model. a. OVB cells, activity profile. Trials were divided in three

786

groups (tertiles) depending on the offer value B (same convention as in Fig.1b). b. OVB cells,

787

tuning curve. The firing rate (z-axis) is plotted as a function of the two offers (x- and y-axes).

788

Each data point represents one trial type. Red diamonds and blue circles represent trial types in

30

789

which the network chose juice A and juice B, respectively. c. OVB cells, reduced tuning curve.

790

Down-sampled version of the data shown in b, displayed in the format of Fig.1a. d. OVB cells,

791

activity versus encoded variable. Same firing rates shown in panel b plotted as a function of the

792

encoded variable (offer value B). efgh. Activity of CJB cells. In panel e, trials were divided in

793

two groups depending on the chosen juice (same convention as in Fig.1d). ijkl. activity of CV

794

cells. In panel i, trials were divided in three groups (tertiles) depending on the chosen value

795

(same conventions as in Fig.1f). In all panels, red diamonds (blue circles) represent choices of

796

juice A (juice B).

797

Figure 5. Choice patterns obtained for different balances of synaptic efficacies. a. Choice pattern

798

for the simulation illustrated in Fig.4. Each data point (gray dots) represents one offer type. The

799

percent of B choices (z-axis) is plotted as a function of the two offers (x- and y-axes). The

800

surface illustrates the result of a logistic fit including all second order terms (Eq.1). b. Same

801

surface as in panel a viewed from the z-axis. The indifference function, roughly corresponding to

802

the green pixels (see color legend) is a straight line through the origin. In this simulation, δJstim =

803

(2, 1) and all other synaptic weights were balanced. c. Choice pattern obtained with an

804

imbalanced NMDA-mediated reverberation. d. Choice pattern obtained with an imbalanced

805

inhibition. In both panels c and d, parameters were chosen such that the model was roughly

806

indifferent between 10A and 20B. In both cases, the indifference function did not cross the origin.

807

Figure 6. Robustness with respect to the strength of recurring synapses. abcd. Summary of

808

results obtained with w+ = 1.55. Panels a and b summarize the results obtained for CJB cells

809

(same format as in Fig.4ef). Panels c and d summarize the results obtained for CV cells (same

810

format as in Fig.4il). efgh. Summary of results obtained with w+ = 1.70. ijkl. Summary of results

811

obtained with w+ = 1.85.

812

Figure 7. Predictive activity and choice hysteresis. a. Choice hysteresis (experimental data). In

813

this representative session, trials were divided in two groups depending on whether the outcome

814

of the previous trial was juice A (A• trials) or juice B (B• trials). It can be observed that the

815

relative value of juice A was higher in A• trials (dark gray) compared to B• trials (light gray). In

816

other words, all other things equal, the animal had a tendency to choose on any given trial the

817

same juice as in the previous trial. b. Predictive activity of chosen juice cells (experimental data). 31

818

See main text. c. Predictive activity of CJ cells (model). Same simulation as in Fig.4. The

819

activity immediately before and after the offer is enlarged in the insert. It can be noted that the

820

predictive activity is significant but relatively modest compared to that measured experimentally

821

(see main text). d. Lack of choice hysteresis in initial simulations. Same simulation as in panel c

822

and Fig.4. In this simulation, the initial conditions were reset at the beginning of each trial. In the

823

analysis, trials were divided in A• trials and B• trials depending on the previous outcome, and the

824

two data sets were separately fitted with a logistic function. The two fits are completely

825

overlapping, indicating the lack of choice hysteresis. In other words, the predictive activity in

826

panel c corresponds to the "residual" predictive activity measured in OFC (see main text). ef.

827

Predictive activity in the presence of choice hysteresis. In this simulation, we set the initial

828

conditions in each trial equal to the final conditions in the previous trial (tail activity) and w+ =

829

1.82. To ensure that each juice was chosen in ~50% of trials, we set in this simulation ΔA = [0,

830

10] and ΔB = [0, 20]. Under these conditions, the predictive activity is much larger (compare

831

panels e and c) and there is a robust choice hysteresis (panel f). In panel f, the two logistic fit

832

refer to A• trials and B• trials, respectively. Panels a and b are reproduced from (Padoa-Schioppa

833

2013).

834

Figure 8. Activity overshooting. a. Overshooting of chosen value cells (experimental data). Only

835

trials in which the animal chose 1A are included in the figure. See main text for details. b.

836

Overshooting of CV cells (model). Same simulation as in Fig.4. For each quantity of A that

837

induced some split decisions (i.e., offer value A = 1...10), trials in which the network chose juice

838

A were divided in easy and split (see main text). The activity overshooting was observed for

839

each value of A. The main plot here illustrates the effect averaged across values of A. The insert

840

illustrates the effect for offer value A = 8. Panel a is reproduced from (Padoa-Schioppa 2013).

841

Figure 9. Stable points of W11. The leftmost column (panels a1-a3) represents the space of

842

possible offers (as in Fig.3, right panel). The other three columns represent the results of three

843

simulations, with w+ = 1.75 and ΔJ = 30 (normal parameters, panels b1-b3), w+ = 1.85 and ΔJ =

844

30 (panels c1-c3) and w+ = 1.75 and ΔJ = 15 (panels d1-d3). In each panel, the two axes

845

represent the "final" activity of CJA cells (y-axis) and CJB cells (x-axis) (time window 400-600

846

ms after the offer). Each data point represents one trial and data points are color-coded according

847

to the legend in the corresponding leftmost column (see main text). 32

848

Figure 10. Effects of non-zero baseline in OV cells. a-f. Symmetric network. Panel a shows the

849

simulated activity of OVB cells, panels bc show the activity of CJB cells, panels de show the

850

activity of CV cells and panel f shows the choice pattern. The presence of the baseline activity in

851

OV cells changes dramatically the nature of the input. However, with small adjustments, the

852

symmetric network is robust to this change. In this simulation, we set w+ = 1.65, JAMPA, rec, in = 0.9

853

JAMPA, rec, in, JNMDA, rec, in = 0.9 JNMDA, rec, in, JGABA, rec, in = 0.8 JGABA, rec, in. The activity profiles of CJ

854

and CV cells remain comparable to the empirical observations, and CV cells encode the chosen

855

value. g-l. Asymmetric network. In this simulation, we set δJstim = (1.2, 1) all other parameters as

856

in panels a-f. Although the activity profiles of CJ cells (panels hi) and CV cells (panels j) are

857

realistic, the indifference function is non-homogenous. Tests in which we tried to recover this

858

departure from homogeneity introducing imbalances at other stages of the network did not

859

provide successful results.

860 861

33

862

References

863

Abbott LF, and Chance FS. Drivers and modulators from push-pull and balanced synaptic

864

input. Prog Brain Res 149: 147-155, 2005.

865

Amemori K, and Graybiel AM. Localized microstimulation of primate pregenual cingulate

866

cortex induces negative decision-making. Nat Neurosci 15: 776-785, 2012.

867

Amit DJ, and Brunel N. Model of global spontaneous activity and local structured activity

868

during delay periods in the cerebral cortex. Cereb Cortex 7: 237-252, 1997.

869

Ariely D, Loewenstein G, and Prelec D. 'Coherent arbitrariness': stable demand curves without

870

stable preferences. Q J Econ 118: 73-105, 2003.

871

Bartra O, McGuire JT, and Kable JW. The valuation system: a coordinate-based meta-

872

analysis of BOLD fMRI experiments examining neural correlates of subjective value.

873

Neuroimage 76: 412-427, 2013.

874

Bogacz R, Brown E, Moehlis J, Holmes P, and Cohen JD. The physics of optimal decision

875

making: A formal analysis of models of performance in two-alternative forced-choice tasks.

876

Psychological Review 113: 700-765, 2006.

877

Brunel N, and Wang XJ. Effects of neuromodulation in a cortical network model of object

878

working memory dominated by recurrent inhibition. J Comput Neurosci 11: 63-85, 2001.

879

Cai X, Kim S, and Lee D. Heterogeneous coding of temporally discounted values in the dorsal

880

and ventral striatum during intertemporal choice. Neuron 69: 170-182, 2011.

881

Cai X, and Padoa-Schioppa C. Contributions of orbitofrontal and lateral prefrontal cortices to

882

economic choice and the good-to-action transformation. Neuron 81: 1140-1151, 2014.

883

Cai X, and Padoa-Schioppa C. Neuronal encoding of subjective value in dorsal and ventral

884

anterior cingulate cortex. J Neurosci 32: 3791-3808, 2012.

885

Clithero JA, and Rangel A. Informatic parcellation of the network involved in the computation

886

of subjective value. Soc Cogn Affect Neurosci 2013.

887

Compte A, Brunel N, Goldman-Rakic PS, and Wang XJ. Synaptic mechanisms and network

888

dynamics underlying spatial working memory in a cortical network model. Cereb Cortex 10:

889

910-923, 2000.

890

Conen K, Cai X, and Padoa-Schioppa C. unpublished observations.

34

891

Drugowitsch J, and Pouget A. Probabilistic vs. non-probabilistic approaches to the

892

neurobiology of perceptual decision-making. Curr Opin Neurobiol 22: 963-969, 2012.

893

Engel TA, and Wang XJ. Same or different? A neural circuit mechanism of similarity-based

894

pattern match decision making. J Neurosci 31: 6982-6996, 2011.

895

Fellows LK. Orbitofrontal contributions to value-based decision making: evidence from humans

896

with frontal lobe damage. Ann N Y Acad Sci 1239: 51-58, 2011.

897

Fusi S, Asaad WF, Miller EK, and Wang XJ. A neural circuit model of flexible sensorimotor

898

mapping: learning and forgetting on multiple timescales. Neuron 54: 319-333, 2007.

899

Gold JI, and Shadlen MN. The neural basis of decision making. Annu Rev Neurosci 30: 535-

900

574, 2007.

901

Grabenhorst F, Hernadi I, and Schultz W. Prediction of economic choice by primate

902

amygdala neurons. Proc Natl Acad Sci U S A 109: 18950-18955, 2012.

903

Hare TA, Schultz W, Camerer CF, O'Doherty JP, and Rangel A. Transformation of stimulus

904

value signals into motor commands during simple choice. P Natl Acad Sci USA 108: 18120-

905

18125, 2011.

906

Hunt LT, Kolling N, Soltani A, Woolrich MW, Rushworth MFS, and Behrens TEJ.

907

Mechanisms underlying cortical activity during value-guided choice. Nat Neurosci 15: 470-U169,

908

2012.

909

Jocham G, Hunt LT, Near J, and Behrens TE. A mechanism for value-guided choice based

910

on the excitation-inhibition balance in prefrontal cortex. Nat Neurosci 15: 960-961, 2012.

911

Kobayashi S, Pinto de Carvalho O, and Schultz W. Adaptation of reward sensitivity in

912

orbitofrontal neurons. J Neurosci 30: 534-544, 2010.

913

Krajbich I, Armel C, and Rangel A. Visual fixations and the computation and comparison of

914

value in simple choice. Nat Neurosci 13: 1292-1298, 2010.

915

Lau B, and Glimcher PW. Value representations in the primate striatum during matching

916

behavior. Neuron 58: 451-463, 2008.

917

Lee H, Ghim JW, Kim H, Lee D, and Jung M. Hippocampal neural correlates for values of

918

experienced events. J Neurosci 32: 15053-15065, 2012.

919

Mainen ZF, and Kepecs A. Neural representation of behavioral outcomes in the orbitofrontal

920

cortex. Curr Opin Neurobiol 2009.

35

921

Murray JD, Bernacchia A, Freedman DJ, Romo R, Wallis JD, Cai X, Padoa-Schioppa C,

922

Pasternak T, Seo H, Lee D, and Wang XJ. A hierarchy of intrinsic timescales across primate

923

cortex. Nat Neurosci 17: 1661-1663, 2014.

924

Newsome WT. The King Solomon lectures in neuroethology. Deciding about motion: linking

925

perception to action. J Comp Physiol [A] 181: 5-12, 1997.

926

Padoa-Schioppa C. Neurobiology of economic choice: a good-based model. Annu Rev Neurosci

927

34: 333-359, 2011.

928

Padoa-Schioppa C. Neuronal origins of choice variability in economic decisions. Neuron 80:

929

1322-1336, 2013.

930

Padoa-Schioppa C. Range-adapting representation of economic value in the orbitofrontal cortex.

931

J Neurosci 29: 14004-14014, 2009.

932

Padoa-Schioppa C, and Assad JA. Neurons in orbitofrontal cortex encode economic value.

933

Nature 441: 223-226, 2006.

934

Padoa-Schioppa C, and Assad JA. The representation of economic value in the orbitofrontal

935

cortex is invariant for changes of menu. Nat Neurosci 11: 95-102, 2008.

936

Padoa-Schioppa C, and Rustichini A. Rational attention and adaptive coding: a puzzle and a

937

solution. American Economic Review: Papers and Proceedings 104: 507-513, 2014.

938

Parker AJ, and Newsome WT. Sense and the single neuron: probing the physiology of

939

perception. Annu Rev Neurosci 21: 227-277, 1998.

940

Raghuraman AP, and Padoa-Schioppa C. Integration of multiple determinants in the neuronal

941

computation of economic values. J Neurosci 34: 11583-11603, 2014.

942

Renart A, Brunel N, and Wang X. Mean-field theory of recurrent cortical networks: from

943

irregularly spiking neurons to working memory. In: Computational Neuroscience: A

944

Comprehensive Approach, edited by Feng J. Boca Raton: CRC Press, 2003.

945

Roesch MR, Singh T, Brown PL, Mullins SE, and Schoenbaum G. Ventral striatal neurons

946

encode the value of the chosen action in rats deciding between differently delayed or sized

947

rewards. J Neurosci 29: 13365-13376, 2009.

948

Rudebeck PH, and Murray EA. The orbitofrontal oracle: cortical mechanisms for the

949

prediction and evaluation of specific behavioral outcomes. Neuron 84: 1143-1156, 2014.

950

Savage LJ. The foundations of statistics. New York,: Dover Publications, 1972, p. xv, 310 p.

36

951

Soltani A, Lee D, and Wang XJ. Neural mechanism for stochastic behaviour during a

952

competitive game. Neural Netw 19: 1075-1090, 2006.

953

Soltani A, and Wang XJ. A biophysically based neural model of matching law behavior:

954

melioration by stochastic synapses. J Neurosci 26: 3731-3744, 2006.

955

Soltani A, and Wang XJ. Synaptic computation underlying probabilistic inference. Nat

956

Neurosci 13: 112-119, 2010.

957

Strait CE, Blanchard TC, and Hayden BY. Reward value comparison via mutual inhibition in

958

ventromedial prefrontal cortex. Neuron 82: 1357-1366, 2014.

959

Sul JH, Kim H, Huh N, Lee D, and Jung MW. Distinct roles of rodent orbitofrontal and

960

medial prefrontal cortex in decision making. Neuron 66: 449-460, 2010.

961

Tversky A, and Kahneman D. The framing of decisions and the psychology of choice. Science

962

211: 453-458, 1981.

963

Usher M, and McClelland JL. The time course of perceptual choice: the leaky, competing

964

accumulator model. Psychol Rev 108: 550-592, 2001.

965

Wallis JD. Cross-species studies of orbitofrontal cortex and value-based decision-making. Nat

966

Neurosci 2011.

967

Wang XJ. Probabilistic decision making by slow reverberation in cortical circuits. Neuron 36:

968

955-968, 2002.

969

Wong KF, and Wang XJ. A recurrent network mechanism of time integration in perceptual

970

decisions. J Neurosci 26: 1314-1328, 2006.

971

Wunderlich K, Rangel A, and O'Doherty JP. Economic choices can be made using only

972

stimulus values. Proc Natl Acad Sci U S A 107: 15005-15010, 2010.

973

37

Figure 1

1A = 3.1B

2

R = 0.92 30

10

10

0

0

0B:1A 1B:3A 1B:2A 1B:1A 2B:1A 3B:1A 4B:1A 6B:1A 10B:1A 2B:0A

20

0%

20

5

2B:0A

6B:1A

4B:1A

3B:1A

2B:1A

0

1B:1A

0 1B:2A

10 0%

10

A

0%

0B:1A 1B:3A 1B:2A 1B:1A 2B:1A 3B:1A 4B:1A 6B:1A 10B:1A 3B:0A

Average firing rate (sp/s)

Firing rate (sp/s)

100%

20

10 0

12

10

20

f

2

20

14

offer

R = 0.90 30

16

8

40

30

E chosen O chosen

B

chosen juice

e Chosen value cells

10

8

18

Average firing rate (sp/s)

40

20

1B:3A

d 2

20

1A = 3.2B

10

offer

R = 0.95 30

40

12

10

50

30

0B:1A

Firing rate (sp/s)

40

14

offer value B

100%

1A = 1.9B

high medium low

6 0

c Chosen juice cells 50

16

b Average firing rate (sp/s)

30

100%

Firing rate (sp/s)

a Offer value cells

5

chosen value

10

high medium low

18 16 14 12 10 8

offer

500 ms

Figure 2

CV chosen value

NS non selective

CJA

CJB

chosen juice A

chosen juice B

OVA

OVB

offer value A

offer value B

Synaptic efficacies JGABA, rec, int

JAMPA, rec, pyr JAMPA, ext, pyr

JNMDA, rec, int

JAMPA, input

JAMPA, rec, int

Time constants

JAMPA, ext, int JGABA, rec, pyr JNMDA, rec, pyr

WGABA WNMDA WAMPA

Figure 3

economic choice task

random dot task 'A

offer A

motion strenght right



0

í í

0 motion strenght left



0

'B

0 offer B

Figure 4

OVB cells high medium low

6

b

c

A chosen B chosen

5

5

4

4

3

3

3

2

2

2

1

1

1

0

0

4 4

d

5

2

0 20

0 -500

0

500

0 10

1000

0

offer A

offer

20

10

0

offer B

0B:1A 1B:20A 1B:16A 1B:12A 1B:8A 1B:4A 4B:1A 8B:1A 12B:1A 16B:1A 20B:1A 1B:0A

Firing rate (sp/s)

a

0

5

10

15

20

offer value B

CJB cells A chosen B chosen

25

f

g

A chosen B chosen

h

15

15

10

10

10

5

5

5

0

0

15 20 15 10 5 0 20

0 -500

0

500

10

1000

0

offer A

offer

20

10

0

0B:1A 1B:20A 1B:16A 1B:12A 1B:8A 1B:4A 4B:1A 8B:1A 12B:1A 16B:1A 20B:1A 1B:0A

Firing rate (sp/s)

e

offer B

A chosen

B chosen

CV cells high medium low

18

j

k

A chosen B chosen

l

16

16

16

14

14

14

12

12

10

10

16

14 12 12

10 -500

10 20 0

offer

500

1000

10

offer A

0

0

5

10

offer B

15

20

0B:1A 1B:20A 1B:16A 1B:12A 1B:8A 1B:4A 4B:1A 8B:1A 12B:1A 16B:1A 20B:1A 1B:0A

Firing rate (sp/s)

i

0

10

20

30

chosen value (uB)

40

Figure 5

a

b

20

100 15

offer A

% B choice

75

50

10

25 5 0 20 20

15 15

10 5

d

0

5

0

0

10

15

20

15

20

offer B

5

offer A

100

0

10

offer B

c

20

20

90 80

15

15

50 40

offer A

60

offer A

70

10

10

30 20

5

5

10 0 0

5

10

offer B

15

20

0

0

5

10

offer B

Figure 6

CJB cells

a

A chosen B chosen

4

b

CV cells

c

A chosen B chosen

Firing rate (sp/s)

w+ = 1.55

3

í

0

14 14

9

4

6

12

12 2 20

3 -500

0

500

1000

10

offer A

0

0

10

20

10 -500

offer B

0

500

10 0

1000

offer

g

f 4

Firing rate (sp/s)

í

20

30

40

h

18

12

16

20 3

10

chosen value (uB)

e w+ = 1.70

A chosen B chosen

16

16

6 12

d

high medium low

15

16

0

8

15

14 14

10

4 12 12

5 0 20 0 -500

0

500

1000

10

offer A

0

0

10

20

offer B

10 -500

0

500

10 0

1000

10

20

30

40

chosen value (uB)

offer

i

j

k

30

l

Firing rate (sp/s)

w+ = 1.85

4

18

30 3

í

16

20

0

16 20

14 14

10 10

12 12 0 20

0 -500

0

offer

500

1000

10

offer A

0

0

10

offer B

20

10 -500

0

500

1000

10 0

10

20

30

chosen value (uB)

40

Figure 7

a

b

choice hysteresis 

$‡WULDOV B• trials

chosen juice cells E chosen, easy E chosen, split O chosen, split O chosen, easy

% of B choices

Firing rate (sp/s)

14

12



8 %$

%$

%$ %$ %$ %$

%$

%$ %$

%$



í







offer

d

c $‡WULDOV B• trials

CJB cells 

4

Firing rate (sp/s)

%B choice

     





RIIHU$

f

 





















offer

e

$‡WULDOV B• trials





 í

offer B





8 6



Firing rate (sp/s)

%B choice







3

 í í í

    

RIIHU$

 







offer B



4 2 í í í









  í



offer

Figure 8

a

Trials: A chosen easy split

12

Firing rate (sp/s)

b

chosen value cells

CV cells

15 16

14

14 12 í

0

500

1000

13

10

12 8 11 -500

0

offer

500

1000

-500

0

offer

500

1000

Figure 9

w+ = 1.75, 'J = 30 15

20

a1

w+ = 1.85, 'J = 30 b1

w+ = 1.75, 'J = 15 c1

15

d1

firing rate CJA cells

30

offer A

10

5

0

0

5

10

10 20 10

0 0

15

5

10

offer B

10

20

0 0

10

20

0 0

30

5

10

15

firing rate CJB cells

15

a2

20

b2

c2

15

d2

30 10

10 20 10 5

0

5

10

0

5

10

0 0

15

a3

15

10

20

20

0 0

10

20

0 0

30

b3

c3

5

10

15

15

d3

30 10

10 20 10 5

0

5

10

0

5

10

15

0 0

10

20

0 0

10

20

30

0 0

5

10

15

Figure 10

a

20

high medium low

12

b

CJB cells

8 10 6 4

5

0

500

0 í500

1000

offer

0

500

f

0

2 20

10

20

offer B

d

high medium low

12

0

12

20

10

0

offer B

e

12

high medium low

A chosen B chosen

10 10

8 í500

0

500

8 0

1000

5

offer

g

OVB cells

10

offer A

CVcells Firing rate (sp/s)

offer A

10

0

1000

offer

20

Firing rate (sp/s)

A chosen B chosen 10

15

0 500

10

15

20

chosen value (uB)

h

CJB cells 20

A chosen B chosen

i A chosen B chosen

9

15 6 8 10 3 4

5

0 500

0

500

1000

0 í500

offer

0

500

1000

offer

l 20

offer A

c

A chosen B chosen

0 20

10

offer A

j

CVcells 14

high medium low

0

0

20

10

offer B

k

12 A chosen B chosen

12

11

10

10

10

0

0

10

offer B

20

8 500

0

offer

500

1000

9 0B:1A 1B:20A 1B:16A 1B:12A 1B:8A 1B:4A 4B:1A 8B:1A 12B:1A 16B:1A 20B:1A 1B:0A

Firing rate (sp/s)

OVB cells

Suggest Documents