On Transferring Spatial Filters in a Brain Reading ... - Semantic Scholar

2 downloads 0 Views 769KB Size Report
One of the most crucial factors for the practical success ... compared to learning new spatial filters on the current session's calibration data. ... Pseudo-channel 4.
On Transferring Spatial Filters in a Brain Reading Scenario Jan Hendrik Metzen?, Su Kyoung Kim?,†, Timo Duchrow†, Elsa Andrea Kirchner?,†, Frank Kirchner?,† ? † University Bremen, Robotics Group, Robert-Hooke-Str. 5, 28359 Bremen, Germany DFKI - Robotics Innovation Center, Robert-Hooke-Str. 5, 28359 Bremen, Germany

1 Introduction

3 Methods Dataflow Segmentation

Window [0ms - 1000ms] after stimulus

Detection of the P300: Perception of an important, task-relevant information is accompanied by a particular event-related potential (ERP), the “P300”. This ERP is to be detected using machine learning techniques in the single-trial.

Cz

CSP filter 3

CP3 AF7

CPz

CP4

F7 FT9

Labyrinth Oddball Scenario P3

AF4

F8

C5

C3

C1

Cz

C2

C4

C6

P4

POz

T8

P7

P5 P3 P1 Pz P2 P4 P6 PO3 O1

POz Oz

PO4 O2

P8

0.03

Pseudo-channel 4

Pseudo-channel 5

0.06

Pseudo-channel 6

Pseudo-channel 7

0.09

The Figure shows for 8 exemplary channels and 8 pseudo-channels obtained using CSP filtering the median (bold) and the 10 and 90 percentiles of the single-trials for the ’target’ (red) and ’standard’ condition (blue). The middle column shows the weights of an exemplary CSP filter.

Targets (n = 120)

0.7 0.6 0.5 0.4 0.3 0.2 0.1

Channels: m=16

Channels: m=32

Channels: m=62

InterSubject LeaveOneSessionOut NoFilter FromScratch

200 400 600 800 200 400 600 800 200 400 600 800 Size calibration dataset (t)

Best approach for a given number of retained channels and calibration dataset size

PO8 PO10

Channels: m=8

Pseudo-channel 3

TP10

PO7

2

Pseudo-channel 2

FT10

0.00

Channels: m=4

Average F1-measure on target class for the four spatial filtering settings for different sizes of the calibration dataset and different numbers of retained channels after spatial filtering.

0.03

FT8 FC2 FC4 FC6

Channels: m=2

Pseudo-channel 1

0.06

AF8

F5 F3 F1 Fz F2 F4 F6

PO9

ms

AF3

Pseudo-channel 0

TP9

-2

600

Fp2

CP5 CP3 CP1 CPz CP2 CP4 CP6

Standards (n = 720)

300

Fp1

FT7 FC5 FC3 FC1 T7

Pz

Targets

!V ERP: Standards Averaged

0

(SVM)

0.7 0.6 0.5 0.4 0.3 0.2 0.1

Retained channels (m)

Operator Monitoring: Ensure that a telemanipulation operator has recognized an important message that is displayed to him. Avoid that a message is displayed too obtrusively (distracting!). Instead, display message unobtrusively and check in operator’s brain activity whether message was perceived. If not: Display message again.

4

Classification

CSP maps the data onto axes such that the variance for instances of the first class is maximized (c) and the variance for the second class is minimized (or vice versa). With Xi ∈ Rn×t being the i-th of the nc examples of band-pass filtered and centered EEG segments with t samples for class c, this is achieved by a simultaneous diagonalization of the two empirical intra-class covariance Pnc (c) (c) T −1 matrices Σc = nc X i=1 i (Xi ) , i.e. by solving Σ1W = ΛΣ2W where Λ is the vector of generalized eigenvalues and W is the corresponding matrix of generalized eigenvectors whose columns correspond to the learned spatial filters.

Filter Weights

Brain Reading: The decoding of a user’s mental state and intent based on detecting patterns in brain activity by external observation, e.g. by means of electroencephalography.

Pz

Feature Normalization [2.5,97.5] percentile [0,1]

Common Spatial Patterns (CSP)

2 Brain Reading

-4

4Hz

1000Hz to 25Hz

Time Domain features

CSP

Lowpass-Filter

Decimation

Detrend Mean

Feature Extraction

Spatial Filtering

Performance of different spatial filtering settings F-Measure (Targets) F-Measure (Targets)

Machine learning approaches are increasingly used in brain-machine-interfaces to allow automatic adaptation to user-specific brain patterns. One of the most crucial factors for the practical success of these systems is that this adaptation can be achieved with a minimum amount of training data since training data needs to be recorded during a calibration procedure prior to the actual usage session. To this end, one promising approach is to reuse models based on data recorded in preceding sessions of the same or of other users. We investigate under which conditions it is favorable to reuse models (more specifically spatial filters) trained on data from historic sessions compared to learning new spatial filters on the current session’s calibration data. We present an empirical study in a scenario in which Brain Reading is used to support robotic telemanipulation.

5 Results

26

LeaveOneSessionOut NoFilter FromScratch

25 24 23 22 21 0

100

Averaged ERP: Targets !V -4

Pz

2 P300

4 0

300

600

ms

R The subject plays a physical simulation of the BRIO labyrinth and has to respond to rare ’target’ stimuli by pressing a buzzer. Average event-related potentials (ERPs) evoked by ’target’ and more frequent ’standard’ stimuli are depicted.

Dataset: The dataset used in this paper consists of the labeled EEG data recorded in ten sessions from five (male) subjects; each subject performed two sessions and each session consisted of five repetitions (called “runs”) of the Labyrinth Oddball paradigm. Sessions have been recorded on different days and thus, the positioning of the EEG cap can vary between sessions of the same user. EEGs were recorded continuously from 64 electrodes (extended 10-20 system with reference at electrode FCz). Two of the 64 channels (replacing the electrodes TP7 and TP8) were used to record electromyography signals of muscles of the lower arm and have been discarded in this study.

• Historic spatial filters: One CSP filter matrix (consisting of 62 CSP filters) was computed for each of the 10 sessions. Each of these matrices was trained on the merged data of all 5 runs of the respective session. • Classification task: The dataflow was trained on a subset of the data recorded during the first run of a session (the calibration data) and was tested on each of the remaining 4 runs. Each of the 10 sessions has been considered once as ’current’ session with the 9 remaining sessions acting as ’historic’ sessions. • 4 Spatial Filtering Setups have been considered: no spatial filtering (’NoFilter’), learning a CSP filter matrix on the calibration data from scratch (’FromScratch’), reusing a CSP matrix from any of the historic sessions (’LeaveOneSessionOut’), and reusing a CSP matrix from any of the historic sessions excluding those from the current subject (’InterSubject’). • Filter selection: Which historic spatial filter is selected for a session was determined—along with the classifier’s meta-parameter–using five-fold cross-validation on the calibration data. • Performance measure: Because of the considerable class skew of 6 : 1, the F1 measure (the harmonic mean of precision and recall) on the ’target’ class was used as performance measure.

300 400 500 600 Size calibration dataset (t)

700

800

The Figure shows which method achieves the highest F1-measure for a given combination of retained channels and size of calibration dataset. Filled symbols denote settings where the “winner” method is significantly better than each of the other two methods (p < 0.05). The “InterSubject” setting is omitted since it is mostly on par with the “LeaveOneSessionOut” setting.

4 Setup

-2 2

200

6 Conclusion 1

2

3

4

Reusing spatial filters trained on historic sessions achieves superior results compared to learning a session-specific filter anew (’FromScratch’) in situations when the calibration dataset is small or when the number of channels must be reduced. Reusing spatial filters for novel users (’InterSubject’), for which no historic spatial filters exist and thus filters of other users need to be reused, is only slightly worse than for known users (’LeaveOneSessionOut’). If the calibration dataset is small but many channels can be retained, using no spatial filter (’NoFilter’) is preferable. If the calibration dataset becomes larger, learning a session-specific spatial filters is usually the best choice.

Contact: Jan Hendrik Metzen University Bremen, Robotics Group Director: Prof. Dr. Frank Kirchner www.dfki.de/robotics [email protected] Robotics Innovation Center

Suggest Documents