The Best Method of Noise Filtering Yuri Kalambet, Sergey Maltsev, Ampersand Ltd., Moscow, Russia; Yuri Kozmin, Shemyakin Institute of Bioorganic Chemistry, Moscow, Russia
[email protected]
1
History: Adaptive peak approximation
2
Rough slope width estimate • Evaluate baseline using default gap (minimum peak width Integration parameter) • Evaluate peak height using default gap • Count all points from peak apex to slope end with height bigger than halfheight of the peak. Count obtained is an estimate of the slope width. 3
Properties of adaptive peak approximation • • • •
Good noise suppression at each slope Minimal peak shape disturbances All peak parameters are resistant to oversampling Baseline approximation may be poor – either noisy (small gap) or disturbed (large gap). • No approximation outside of peaks • Does not improve formal signal/noise ratio • Baseline position is one of the most important sources of error 4
Improvement 1: Non-central approximation
x* G2
G1
5
Height
Confidence intervals
8 9 7
8
6
4
2 3 1
2
6 4 5
Conce ntration 0 0
1
2
3
4
5
6
7
8
9
10
6
Confidence interval estimate CY t n(1/p2 ) S u* where 1 (Y Xβˆ ) (Y Xβˆ ) 2 u x ( X X) x* * * S n p n - number of data points used for polynomial approximation (gap of the filter); p - power of the polynomial; X - matrix of x power values on independent axis (time); Y - vector of detector response values; x* {1, x* ,..., x*p } βˆ ( X X) 1 X Y t m - Student’s coefficient for confidence probability (1-δ) and m degrees of freedom x* - position at which smoothed (approximated) value is estimated. 7
Approximation using confidence intervals G1
G2
x*
x*
G 2 confidence interval
8
Algorithm of simple Confidence filter approximation • Evaluate points and confidence intervals for new (shifted) window
9
Algorithm of simple Confidence filter approximation • Evaluate points and confidence intervals for new (shifted) window • Compare new confidence interval with that for previously evaluated point. If the new one is smaller than previous, replace approximated point and its confidence interval.
10
Algorithm of simple Confidence filter approximation • Evaluate points and confidence intervals for new (shifted) window • Compare new confidence interval with that for previously evaluated point. If the new one is smaller than previous, replace approximated point and its confidence interval. • Computational complexity of Confidence filter is comparable to that of simple convolution, (e.g. Savitzky-Golay) and linearly depends on the product gap∙ (degree of the polynomial).
11
Bonus #1: Correct handling of baseline steps and array boundaries mv Original SG ASG 2000
1500
1000
500
0 340
350
360
370
380
390
400
410
420
430
440
450
460
470
480
490
500
Nmeas
dotted – raw data; thick line – Confidence Filter; thin line – Savitzky-Golay filter 12
Confidence filter algorithm improvement: Adaptive gap of the polynomial • Repeat confidential filter algorithm for approximations with different windows (gaps) • Computational complexity: degree∙gap∙(gap-1)/2 • Logarithmic step: next gap is k times smaller, than previous, e.g. gap2 = gap1/k, k>1; Computational complexity: degree∙gap∙k/(k-1)
13
Confidence interval estimate CY t n(1/p2 ) S u* where 1 (Y Xβˆ ) (Y Xβˆ ) 2 u x ( X X) x* * * S n p n - number of data points used for polynomial approximation (gap of the filter); p - power of the polynomial; X - matrix of x power values on independent axis (time); Y - vector of detector response values; x* {1, x* ,..., x*p } βˆ ( X X) 1 X Y t m - Student’s coefficient for confidence probability (1-δ) and m degrees of freedom x* - position at which smoothed (approximated) value is estimated. 14
t(df) for confidence probability 0.975
15
Confidence interval profiles for different slits (degree = 3) 1
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2 -15
-14
-13
-12
-11
-10
-9
-8
-7
-6
-5
-4
-3
-2
-1
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
Confidence Interval profiles, 31 points, 0…5 degrees
17
σ evaluation problems: • Small gaps: accidental perfect fit • Large gaps: treating small peaks as a noise due to large number of degrees of freedom • Is pump pulsation a noise or a signal? • Small gaps: confidence interval depends on confidence level
σ evaluation solutions: • Evaluate in advance using the whole data array • Use the estimate for evaluation of confidence intervals 18
Handling σ estimate
CY t
(1 / 2 ) n p
CY t
½ n p
S u*
u*
S 2 2 , Formula (a ) CY 2 2 S , Formula ( b )
SR2 2 2 ( , n p)
19
Noise Filtering: How it works 1 23
AU
-0.004 280nm -0.005
22 26
S moo280
27
25 21
20
21
22
23
24
25
min
20 22
26 23
25
10 Gap
21 24 23
0 20
27
21
22
24
25
min
20 27 22 0
S hift
21
23
25
24
26
-20 20
21
22
23
24
25
min
20
Noise Filtering: How it works 2
21
0 .0 7 8 4
mV
ch 1
33
0 .0 7 3 3
36
37
38
39
40
41
min
Raw
34
35
34
35
36
37
38
39
40
41
m in
34
35
36
37
38
39
40
41
m i n
34
35
36
37
38
39
40
41
m in
34
35
36
37
38
39
40
41
m in
34
35
36
37
38
39
40
41
min
m V
ch 1
33
0 .0 7 5
0…3
Automatic selection of degree and gap of approximating polynomial
m V
ch 1
33
0 .0 7 7 3
1…3
m V
ch 1
33
0 .0 7 8 4
2...3
m V
ch 1
33
0 .0 7 8 4
3
mV
ch 1
33
3…10
22
Is pump pulsation a noise or a signal?
23
Conclusions: • Confidence filter introduces a measure of approximation quality • Confidence filter helps to select the best set of functions that approximate the data set • Confidence filter is metrologically the best noise filtering method and can be used in the fight with legal metrology
24
Thank you!
25