Wavefront sensorless adaptive optics: a general ... - OSA Publishing

Wavefront sensorless adaptive optics: a general model-based approach Huang Linhai1,2,* and Changhui Rao1,2 1

The Laboratory on Adaptive Optics, Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu 610209, China 2 The Key Laboratory on Adaptive Optics, Chinese Academy of Sciences, Chengdu 610209, China *[email protected]

Abstract: Wavefront sensorless adaptive optics (AO) systems have been widely studied in recent years. To reach optimum results, such systems require an efficient correction method. In this paper, a general model-based correction method for a wavefront sensorless AO system is presented. The general model-based approach is set up based on a relationship wherein the second moments (SM) of the wavefront gradients are approximately proportionate to the FWHM of the far-field intensity distribution. The general model-based method is capable of taking various common sets of functions as predetermined bias functions and correcting the aberrations by using fewer photodetector measurements. Numerical simulations of AO corrections of various random aberrations are performed. The results show that the Strehl ratio is improved from 0.07 to about 0.90, with only N + 1 photodetector measurement for the AO correction system using N aberration modes as the predetermined bias functions. ©2010 Optical Society of America OCIS codes: (010.1080) Adaptive optics; (010.7350) Wavefront sensing.

References and links 1. 2. 3. 4. 5. 6. 7. 8. 9.

B. Wang and M. J. Booth, “Optimum deformable mirror modes for sensorless adaptive optics,” Opt. Commun. 282(23), 4467–4474 (2009). M. A. Vorontsov, G. W. Carhart, M. Cohen, and G. Cauwenberghs, “Adaptive optics based on analog parallel stochastic optimization: analysis and experimental demonstration,” J. Opt. Soc. Am. A 17(8), 1440–1453 (2000). P. Piatrou, and M. Roggemann, “Beaconless stochastic parallel gradient descent laser beam control: numerical experiments,” Appl. Opt. 46(27), 6831–6842 (2007). M. J. Booth, “Wavefront sensorless adaptive optics for large aberrations,” Opt. Lett. 32(1), 5–7 (2007). M. J. Booth, “Wave front sensor-less adaptive optics: a model-based approach using sphere packings,” Opt. Express 14(4), 1339–1352 (2006). M. Born and E. Wolf, Principles of Optics, 6th ed. (Pergamon, 1983). J. Braat, “Polynomial expansion of severely aberrated wave fronts,” J. Opt. Soc. Am. A 4(4), 643–650 (1987). R. Noll, “Zernike polynomials and atmospheric turbulence,” J. Opt. Soc. Am. 66(3), 207–211 (1976). W. J. Hardy, Adaptive Optics for Astronomical Telescopes, (Oxford Univ. Press, 1998).

1. Introduction Using wavefront sensorless adaptive optics (AO) is a better option than using distinct, separate wavefront sensor AOs for some applications that benefit from AO correction [1], such as inertial confinement fusion (ICF), optical tracking, free-space laser propagation, and microscopy. The most significant potential advantage of wavefront sensorless AOs is related to the fact that far-field intensity distribution is permitted to be used as the feedback signal, which in turn allows a wavefront sensorless AO to be used in poor illumination situations. Wavefront sensorless AO systems operate by sequentially modulating the AO corrector and maximizing a feedback signal according to particular optimization algorithms. These particular optimization algorithms include model-free methods and model-based methods. Model-free methods contain stochastic, local, or global search methods. Among model-free methods it is noticeable that works on stochastic gradient methods proposed by M. Vorontsov #133421 - $15.00 USD Received 17 Aug 2010; revised 11 Oct 2010; accepted 24 Nov 2010; published 24 Dec 2010

(C) 2011 OSA

3 January 2011 / Vol. 19, No. 1 / OPTICS EXPRESS 371

are verified to be the fastest search methods [2], whereas many measurements are still required for stochastic gradient methods. Taking Ref. [3] as an example, more than 100 measurements are still needed for a 25-element actuator deformable mirror (DM) in order to correct the aberrations [3], and the more measurements that are required, the more difficult it is to realize real-time AO systems. The model-based approach is a much better choice to reduce the measurements. M. J. Booth has proposed that the model-based approach is capable of correcting aberrations with a minimum of N + 1 photodetector measurements for N aberration modes [4,5]. However, the model-based method requires taking different sets of functions as the predetermined bias functions for aberrations of various magnitudes. The approach requires taking Zernike functions as the predetermined bias functions for small aberrations and Lukosz–Zernike (L–Z) functions for large aberrations [4,5]. Furthermore, the accuracy of this method relies upon factors such as bias values (the coefficients of the predetermined bias functions). In this paper we propose a general model-based approach. Unlike the model-based method proposed by M. J. Booth, the general approach is insensitive to the selection of sets of functions as well as the bias values. Besides the L-Z functions, the general model-based approach can also take other kind of modes, such as the Zernike functions, as the predetermined bias functions and correct aberrations effectively, not only for large aberrations but also for small aberrations. 2. The general model-based method 2.1 The wavefront sensorless AO correction system It is well known that a wavefront sensorless AO can be depicted as shown in Fig. 1, where the input wavefront is incident from the left. After aberration correction by the adaptive element (the DM), the input wavefront is focused onto the photodetector by a positive lens. A mark or a small pinhole is placed on the photodetector to generate a feedback signal, such as the encircled energy signal [4]. The encircled energy feedback signal is produced by using a pinhole, which filters energy outside the pinhole and allows energy within the pinhole to be obtained by the photodetector. The feedback signal is used to drive the AO element. The aberration of the input wavefront is described by function ( x, y) , where x and y are the rectangular coordinates in the pupil plane of the lens. The correcting aberration of the DM is depicted by function ( x, y) . The residual aberration of the input wavefront is represented by function R( x, y ) , and R( x, y)  ( x, y)  ( x, y) . The corresponding far-field intensity distribution I ( x' , y' ) is given by [6] I ( x ', y ')  I 0

  A( x, y)  exp[ jR( x, y)]e

jk [( x  x ' )2  ( y  y ' )2 ] 2z

2

dxdy ,

(1)

where x’ and y’ are the rectangular coordinates in the input plane of the photodetector; I0 is proportional to the incident light power; A is the amplitude of the input wavefront and is set to be uniform in this paper; k  2 /  ,  is the wavelength of the input wavefront; z is the focal length of the positive lens; and j is the imaginary unit. We assume that  and  can be represented by a series of M orthonormal functions and N orthonormal functions, respectively. In most cases, the number M is greater than N. The orthonormal functions are called modes in the following text, each mode denoted by Fi(x,y): M

( x, y )   i  Fi ( x, y ), i 1

(2)

N

 ( x, y)   i  Fi ( x, y). i 1

#133421 - $15.00 USD Received 17 Aug 2010; revised 11 Oct 2010; accepted 24 Nov 2010; published 24 Dec 2010

(C) 2011 OSA


In other words, aberration  of the input wavefront can be denoted by vector V, whose elements are the coefficients of  i . Similarly, the correcting aberration  and the residual aberration R can be represented by vectors U and Z, respectively. The elements of U and Z are  i and zi , and moreover, zi   i  i . Minimizing vector Z is the job of this wavefront sensorless AO system. In order to minimize vector Z efficiently, the relationship between the information about the far-field intensity distribution and the aberration need to be set up first.

Fig. 1. Schematic diagram of the adaptive system.

2.2 Relationship between the second moment (SM) of the aberration gradients and the FWHM of far-field intensity distribution As we know, the centroid of the far-field intensity distribution in geometric optics is related to the aberration of the input wavefront. When only the Seidel aberration of tilt is present, this relationship can be described by the following expression [6,7]:

  (3) ( x, y)]2  [ ( x, y)]2  ( x '2  y '2 ). x y When other aberrations are present, those aberrations can be considered as the compositions of many small pieces of Seidel aberrations of tilt. The relationship between tilt aberrations in each small piece and the centroid of the corresponding spot observes the rule in Eq. (3). Hence from this fact we may deduce that the sum of all small pieces have the following relationship: [

TL



{[ x  ( x, y)] i 0

2

i

[

TLF   i ( x, y )]2 }   I i ( x ', y ')( x '2  y '2 ), y i 0

(4)

where  i ( x, y) stands for the ith piece of tilt. TL is the total number of the small pieces. TLF is the total number of points in the far-field intensity distribution. Ii ( x ', y ') is the far-field intensity at point ( x' , y ' ) . Since the far-field intensity distributions of two or more small pieces of input wavefronts may locate at the same point, a variable related to the number of pieces is needed. On the other hand, it is well known that the magnitude of a certain far-field intensity distribution is positive proportional to the number of input small-piece wavefronts in geometric optics, on the condition that those input small-piece wavefronts have the same tilt aberrations. Therefore, Ii ( x ', y ') is present in Eq. (4) to represent the total number of input small-piece wavefronts that focus on point ( x' , y ' ) . When TL, TLF, Eq. (4) is 

  {[ x ( x, y)] x y

2

[

 ( x, y)]2 }dxdy    I ( x ', y ')( x '2  y '2 )dx ' dy ', y x' y'

(5)

where the left-hand side of Eq. (5) is the SM of the aberration gradients, and the right-hand side of Eq. (5) is the sum of the far-field intensity distribution multiplied by a mask [corresponding position ( x'2  y '2 ) ]. Since the total sum of far-field intensity distribution I ( x ', y ') is a constant, the right-hand side of Eq. (5) is changed to be


(C) 2011 OSA


  I ( x ', y ')( x '  y ' )dx ' dy ' R   I ( x ', y ') 2

2

2

x' y'

x' y'

( x '2  y '2 ) dx ' dy ' R2

 R 2{  I ( x ', y ')dx ' dy '    I ( x ', y ')[1  x' y'

x' y'

(6) 2

r ]dx ' dy '}. R2

Hence the mask becomes 1  r 2 / R2 for r  R and zero otherwise. r  x'2  y'2 , R is a suitable chosen detector radius, and R is weighted by the system’s diffraction limitation (DL). The significant advantage of the new mask is that the new mask is insensitive to the actual detected size. In practice, the new mask could be implemented by the weighting of pixels within the software when the photodetector is replaced by a CCD camera. Moreover, the whole expression of the right-hand side of Eq. (5) is also normalized by dividing by the sum of I ( x' , y' ) . For convenience, Eq. (7) is called the masked detector signal (MDS) in subsequent text. r2

MDS 

  I ( x ', y ')[1  R

2

]dx ' dy '

x' y'

  I ( x ', y ')dx ' dy '

.

(7)

x' y'

Therefore, Eq. (5) can be expressed as SM  c0 (1  MDS ) ,

(8)

where SM is the second moment of the aberration gradients, and c0 is the slop of the trend line, which is determined by the detector radius R.

Fig. 2. Signal response between SM of aberration gradient and MDS. 500 random aberrations are taken for study.

To verify the relationship between the SM of the aberration gradients and the MDS, 500 random atmospheric aberrations with various secondary moments are produced by the method proposed by Noll in [8]. By calculating the SM of the 500 random atmospheric aberration gradients as well as the MDS from the corresponding far fields, the signal responses are depicted in Fig. 2. The values of detector radius R are set to be 5 DL, 12 DL and 24 DL, respectively. An approximate linearity between the SM of the aberration gradient and the MDS is obtained in Fig. 2 when the detector radius R is selected to be 12 DL or 24 DL. The approximate linearity has some errors when detector radius R = 5 DL. The errors of approximate linearity result from the cases where the distributions of the far field go beyond the calculated area. Thus, when the detector radius R is suitably chosen, the MDS is considered to be related to the SM of the aberration gradients by using Eq. (8).


(C) 2011 OSA


Fig. 3. Signal response between aberration magnitude |V| and MDS. Each point in the figure stands for the mean MDS values of 100 random aberrations.

For comparison, the relationship used in [4] is numerically calculated by using random aberrations and are drawn in Fig. 3. The relationship between the MDS and aberration magnitude |V| is calculated from 5000 random aberrations, and each circle in Fig. 3 stands for the mean MDS of 100 random aberrations. Note that the MDS is essentially the same as the detected signal used in [4], except that the MDS is normalized by the sum of the far-field intensity. Obviously, our method results in a linear relationship between the MDS and the SM. 2.3 Modeling and analysis So far, the relationship between the MDS and the SM of the aberration gradient has been set up. We will build the general model-based method for a wavefront sensorless AO system according to the relationship. As we know that the aberration of an input wavefront can be expressed by a series of M orthonormal modes Fi ( x, y ) , M

( x, y)   i  Fi ( x, y). i 1

The input aberration  ’s gradients in the x and y axes are [9] M   ( x, y )   i  Fi ( x, y ), x x i 1 M   ( x, y )   i  Fi ( x, y ), y  y i 1

(9)

where ( x, y) / x and ( x, y) / y are the input aberration gradients in the x and y axes, respectively, and Fi ( x, y) / x and Fi ( x, y ) / y are the orthonormal mode gradients in the x and y axes, respectively. In order to find out the value of coefficients vi, the N orthonormal mode Fi ( x, y) is taken as the predetermined bias function and is added by the DM sequentially with coefficient α to the input aberrations. Then the detector measurements are recorded. The difference wi,0 between the SM of the gradients of aberration  and that after adding a predetermined bias function Fi ( x, y) with the coefficient α is declared as 



 {[ x ( x, y)   x F ( x, y)]

2

i

wi ,0  SM i  SM 0 

s

[

     ( x, y )   Fi ( x, y )]2 }  {[ ( x, y )]2  [ ( x, y )]2}dxdy y y x y , s

(10)


(C) 2011 OSA


where SM 0 and SM i are the SMs of the gradients of ( x, y) and ( x, y )  Fi ( x, y ) , respectively. By rearranging, we can get 





2

i

wi ,0 







 [  x F ( x, y)  2 x ( x, y)]dxdy  [  x F ( x, y)] dxdy  [  y F ( x, y)  2 y ( x, y)]dxdy  [  y F ( x, y)] dxdy (11) i



s

 2  [



s

s s     Fi ( x, y) ( x, y)  Fi ( x, y)  ( x, y)]dxdy x x y y s

2

i

s





i



s

2   {[ s

s

s   Fi ( x, y)]2  [ Fi ( x, y)]2 }dxdy x y s

s .

After N predetermined bias functions Fi ( x, y ) are added sequentially by the DM, we will obtain N equations 





i

wi ,0 



 2  [ x F ( x, y) x ( x, y)  y F ( x, y)  y ( x, y)]dxdy   i



s

s

s

2

{[

  Fi ( x, y )]2  [ Fi ( x, y )]2}dxdy x y (12) , i  1 ~ N. s

Replacing the gradients of aberration  by Eq. (9), it is convenient to use the vectors that represent the sampled values of the original functions. Equation (12) becomes W  2 * S *V   2 * S m ,

where

 s1,1   s 2,1 S  ...  s  N ,1

s1, 2 s 2, 2

...

...

s N , N 1

...

(13)

 1   s1,1  2    ... s2,2  s1, N ... s1, M   w1,0   , Sm    , V   , ... ... ...   ...   w2,0   N   W   ...  s N 1, N ...s N 1, M   ...    s  N ,N  s  s N , N ... s N , M    N ,0   M

    ,    

and 





n

sn ,m 



 {[ x F ( x, y) * x F ( x, y)]  [ y F ( x, y) * y F ( x, y)]}dxdy m

n

s

s

m

.

(14)

Generally, S is invertible, so the coefficient V of the modes can be calculated by

S 1 (W   2 * Sm ) . 2* According to Eq. (8) and Eq. (10), we have V

(15)

wi ,0  SMi  SM 0  c0 (1  MDSi ) - c0 (1  MDS0 )  -c0 (MDSi - MDS0 ) ,

where MDS0 and MDSi are the corresponding MDS of input wavefronts ( x, y ) and ( x, y )  Fi ( x, y ) , respectively. That is,

V where

W  c0 M , thus the vector V is estimated by

S 1 (c0 * M   2 * Sm ) , 2*

(16)

 MDS1  MDS0   .  MDS2  MDS0  M    ...    MDS  MDS  N 0 

Note that vector V can be resolved exactly by Eq. (15), no matter what sets of modes are adopted as the predetermined bias functions; hence, it can be concluded that the method is insensitive to the selected sets of modes as well as the choice of the bias value  , and the


(C) 2011 OSA


correction error comes only from Eq. (8). Furthermore, it is known that Eq. (8) is set up based on geometric optics and is satisfied with all kinds of aberrations as long as a suitable detector radius R is selected. 3. Numerical simulations and result analysis To verify the performance of the general method, 50 random aberrations with a normalized root-meaning-square value of 0.5 λ are produced as the input wavefront aberrations in the paper. The aberrations consist of 18 Zernike modes represented by random coefficients ui. The 18 Zernike modes and 18 L–Z modes are taken sequentially to be the biases of the modelbased method and added by the DM to the input wavefront. The 18 Zernike modes and 18 L–Z modes are drawn in Fig. 4(a) and Fig. 4(b), respectively. The corresponding inverses S1 are shown in Fig. 5(a) and Fig. 5(b) respectively.

Fig. 4. Set of aberration modes: (a) 18 Zernike modes (3–20); (b) 18 L–Z modes (3–20).

Fig. 5. Corresponding inverse S1 of sets of aberration modes in Fig. 4: The inverse S1 of (a) 18 Zernike modes (3-20), (b) 18 L–Z modes (3-20).

The biases Fi ( x, y) with coefficient   0.05 are added sequentially to the aberrations, given the total aberrations U  V , U T  [u1 , u2 ,..., u18 ],V T  [0,0,. ..,0] , and the corresponding MDSi are calculated. The detector radius of far-field intensity R equals 16DL , and the slope of the trend line c 0 is 194.1. After the total N biases are added, the correction vector V is calculated by Eq. (16): i = 1,2…,18. The Strehl Ratio (SR) is adopted to evaluate the correction results, and the SR is defined as

SR 

P[ I ( x, y)] , P[ I 0 ( x, y)]

(17)


(C) 2011 OSA


where P[] is an operation, which calculates the peak intensity. I is the actual intensity distribution, and I0 is the intensity distribution when no aberrations are present. Since the vector S1, Sm, and MDS0 are known in advance and do not need recalculation or measurement for each correction, the total detector measurements for one aberration correction would be 19. One aberration correction is considered as one iteration in the following results. Figure 6(a) and Fig. 6(b) show the corresponding results after correction by the method proposed by M. J. Booth and the method in this paper, respectively. The icons “method 1” and “method 2” in the figure correspond to the method proposed by M. J. Booth and our method, respectively. The method proposed by M. J. Booth is referred to as previous works in the following text. According to the correction results in Fig. 6(a) and Table 1, it is not difficult to find that our method is more efficient than previous works when the L–Z polynomials are taken as the predetermined bias functions. When using previous works to correct the aberrations, the values of SR rise slowly when the aberrations become smaller. However, our method works well both for small and large aberrations. Table 1. SR Results of AO Correction Using L–Z Modes by Our Method and Previous Works Iterations

Our Method

Previous Works

0

0.07

0.07

1

0.87

0.63

2

0.99

0.94

3

0.99

0.98

4

0.99

0.99

Fig. 6. Results of AO correction using (a) L–Z modes and (b) Zernike modes as the predetermined bias functions; the surfaces of the modes are drawn in Fig. 4. The icons “method 1” and “method 2” in the figure refer to the method proposed by M. J. Booth and the method proposed in this paper.


(C) 2011 OSA


Fig. 7. Results of AO correction using (a) L–Z and (b) Zernike modes when coefficient  vary from 0.02 to 0.4 λ. The icons “method 1” and “method 2” in the figure refer to the method proposed by M. J. Booth and the method proposed in this paper.

From Fig. 6(b), we see that the correction results of using the method of previous works decline when Zernike polynomials are taken as the predetermined bias functions. However, the correction results of our method keep consistent with those using L–Z polynomials as the biases. The correction results indicate that our method is insensitive to the selected sets of functions. The effect of bias values (the coefficient  ) on correction results is under consideration. We take the L–Z modes and Zernike functions as the predetermined bias functions and change the value of coefficient  . The curves of such iterations are show in Fig. 7. As we have expected in the conclusions of Section 2, our method is insensitive to the choice of bias values. 4. Conclusions A general model-based approach for wavefront sensorless adaptive optics is presented in the paper. The general model-based approach is set up based on the approximate linearity between the MDS and the SM of the wavefront gradient. Because the general method is insensitive to the selected modes, it permits correction of the aberrations by using all kinds of orthogonal aberrations as the predetermined bias functions. Numerical simulations of AO correction to the random aberrations have shown that the general method is a fast and stable wavefront sensorless AO correction method for all kind of modes. Acknowledgments The authors thank Wenham Jiang for providing good suggestions. The authors acknowledge helpful suggestions from the reviewers and help from the editors.


(C) 2011 OSA