Noise Filtering

8 downloads 0 Views 1MB Size Report
12 dotted – raw data; thick line – Confidence Filter; thin line – Savitzky-Golay filter. 340 350 360 370 380 390 400 410 420 430 440 450 460 470 480 490 500.
The Best Method of Noise Filtering Yuri Kalambet, Sergey Maltsev, Ampersand Ltd., Moscow, Russia; Yuri Kozmin, Shemyakin Institute of Bioorganic Chemistry, Moscow, Russia [email protected]

1

History: Adaptive peak approximation

2

Rough slope width estimate • Evaluate baseline using default gap (minimum peak width Integration parameter) • Evaluate peak height using default gap • Count all points from peak apex to slope end with height bigger than halfheight of the peak. Count obtained is an estimate of the slope width. 3

Properties of adaptive peak approximation • • • •

Good noise suppression at each slope Minimal peak shape disturbances All peak parameters are resistant to oversampling Baseline approximation may be poor – either noisy (small gap) or disturbed (large gap). • No approximation outside of peaks • Does not improve formal signal/noise ratio • Baseline position is one of the most important sources of error 4

Improvement 1: Non-central approximation

x* G2

G1

5

Height

Confidence intervals

8 9 7

8

6

4

2 3 1

2

6 4 5

Conce ntration 0 0

1

2

3

4

5

6

7

8

9

10

6

Confidence interval estimate CY  t n(1/p2 )  S  u* where 1 (Y  Xβˆ )  (Y  Xβˆ ) 2   u  x ( X  X) x* * * S  n p n - number of data points used for polynomial approximation (gap of the filter); p - power of the polynomial; X - matrix of x power values on independent axis (time); Y - vector of detector response values; x*  {1, x* ,..., x*p } βˆ  ( X X) 1 X  Y t m - Student’s coefficient for confidence probability (1-δ) and m degrees of freedom x* - position at which smoothed (approximated) value is estimated. 7

Approximation using confidence intervals G1

G2

x*

x*

G 2 confidence interval

8

Algorithm of simple Confidence filter approximation • Evaluate points and confidence intervals for new (shifted) window

9

Algorithm of simple Confidence filter approximation • Evaluate points and confidence intervals for new (shifted) window • Compare new confidence interval with that for previously evaluated point. If the new one is smaller than previous, replace approximated point and its confidence interval.

10

Algorithm of simple Confidence filter approximation • Evaluate points and confidence intervals for new (shifted) window • Compare new confidence interval with that for previously evaluated point. If the new one is smaller than previous, replace approximated point and its confidence interval. • Computational complexity of Confidence filter is comparable to that of simple convolution, (e.g. Savitzky-Golay) and linearly depends on the product gap∙ (degree of the polynomial).

11

Bonus #1: Correct handling of baseline steps and array boundaries mv Original SG ASG 2000

1500

1000

500

0 340

350

360

370

380

390

400

410

420

430

440

450

460

470

480

490

500

Nmeas

dotted – raw data; thick line – Confidence Filter; thin line – Savitzky-Golay filter 12

Confidence filter algorithm improvement: Adaptive gap of the polynomial • Repeat confidential filter algorithm for approximations with different windows (gaps) • Computational complexity: degree∙gap∙(gap-1)/2 • Logarithmic step: next gap is k times smaller, than previous, e.g. gap2 = gap1/k, k>1; Computational complexity: degree∙gap∙k/(k-1)

13

Confidence interval estimate CY  t n(1/p2 )  S  u* where 1 (Y  Xβˆ )  (Y  Xβˆ ) 2   u  x ( X  X) x* * * S  n p n - number of data points used for polynomial approximation (gap of the filter); p - power of the polynomial; X - matrix of x power values on independent axis (time); Y - vector of detector response values; x*  {1, x* ,..., x*p } βˆ  ( X X) 1 X  Y t m - Student’s coefficient for confidence probability (1-δ) and m degrees of freedom x* - position at which smoothed (approximated) value is estimated. 14

t(df) for confidence probability 0.975

15

Confidence interval profiles for different slits (degree = 3) 1

0.9

0.8

0.7

0.6

0.5

0.4

0.3

0.2 -15

-14

-13

-12

-11

-10

-9

-8

-7

-6

-5

-4

-3

-2

-1

0

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

Confidence Interval profiles, 31 points, 0…5 degrees

17

σ evaluation problems: • Small gaps: accidental perfect fit • Large gaps: treating small peaks as a noise due to large number of degrees of freedom • Is pump pulsation a noise or a signal? • Small gaps: confidence interval depends on confidence level

σ evaluation solutions: • Evaluate in advance using the whole data array • Use the estimate for evaluation of confidence intervals 18

Handling σ estimate

CY  t

(1 / 2 )  n p

CY  t

½ n p

 S  u*

   u*

S 2   2 , Formula (a )  CY   2  2 S   , Formula ( b )  

SR2   2   2 ( , n  p)

19

Noise Filtering: How it works 1 23

AU

-0.004 280nm -0.005

22 26

S moo280

27

25 21

20

21

22

23

24

25

min

20 22

26 23

25

10 Gap

21 24 23

0 20

27

21

22

24

25

min

20 27 22 0

S hift

21

23

25

24

26

-20 20

21

22

23

24

25

min

20

Noise Filtering: How it works 2

21

0 .0 7 8 4

mV

ch 1

33

0 .0 7 3 3

36

37

38

39

40

41

min

Raw

34

35

34

35

36

37

38

39

40

41

m in

34

35

36

37

38

39

40

41

m i n

34

35

36

37

38

39

40

41

m in

34

35

36

37

38

39

40

41

m in

34

35

36

37

38

39

40

41

min

m V

ch 1

33

0 .0 7 5

0…3

Automatic selection of degree and gap of approximating polynomial

m V

ch 1

33

0 .0 7 7 3

1…3

m V

ch 1

33

0 .0 7 8 4

2...3

m V

ch 1

33

0 .0 7 8 4

3

mV

ch 1

33

3…10

22

Is pump pulsation a noise or a signal?

23

Conclusions: • Confidence filter introduces a measure of approximation quality • Confidence filter helps to select the best set of functions that approximate the data set • Confidence filter is metrologically the best noise filtering method and can be used in the fight with legal metrology

24

Thank you!

25

Suggest Documents