Oct 11, 2016 - S. Aaronson, T. C. Ralph, and A. G. White, Science 339, 794. (2013). [15] J. B. Spring, B. J. Metcalf, P. C. Humphreys, W. S. Kolthammer,.
Learning an unknown transformation via a genetic approach Nicol`o Spagnolo,1, ∗ Enrico Maiorino,1 Chiara Vitelli,1 Marco Bentivegna,1 Andrea Crespi,2, 3 Roberta Ramponi,2, 3 Paolo Mataloni,1 Roberto Osellame,2, 3 and Fabio Sciarrino1, †
arXiv:1610.03291v1 [quant-ph] 11 Oct 2016
2
1 Dipartimento di Fisica, Sapienza Universit`a di Roma, Piazzale Aldo Moro 5, I-00185 Roma, Italy Istituto di Fotonica e Nanotecnologie, Consiglio Nazionale delle Ricerche (IFN-CNR), Piazza Leonardo da Vinci, 32, I-20133 Milano, Italy 3 Dipartimento di Fisica, Politecnico di Milano, Piazza Leonardo da Vinci, 32, I-20133 Milano, Italy
Recent developments in integrated photonics technology are opening the way to the fabrication of complex linear optical interferometers. The application of this platform is ubiquitous in quantum information science, from quantum simulation to quantum metrology, including the quest for quantum supremacy via the boson sampling problem. Within these contexts, the capability to learn efficiently the unitary operation of the implemented interferometers becomes a crucial requirement. In this letter we develop a reconstruction algorithm based on a genetic approach, which can be adopted as a tool to characterize an unknown linear optical network. We report an experimental test of the described method by performing the reconstruction of a 7-mode interferometer implemented via the femtosecond laser writing technique. Further applications of genetic approaches can be found in other contexts, such as quantum metrology or learning unknown general Hamiltonian evolutions.
Introduction.- Linear optical networks have recently received increasing attention in the quantum regime thanks to the enhanced capability of building complex interferometers made possible by integrated photonics. This experimental achievement opened new perspectives in the adoption of linear optical networks for different quantum tasks, including quantum walks and quantum simulation [1–10], quantum phase estimation [11–13], as well as the experimental implementation of the Boson Sampling problem [14–21]. Within these contexts, it becomes a crucial task to learn the action of a linear process. On one side, the capability of efficiently reconstructing an unknown transformation provides an analysis tool for integrated devices. Indeed, it allows to verify the quality of the fabrication by checking the adherence of an implemented transformation with the desired one. Conversely, on the fundamental side precise knowledge of the unitary process is required in several tasks to perform accurate tests on the experimental data. For instance, this holds in the case of Boson Sampling validation, where the adoption of statistical tests may require knowledge of the implemented unitary transformation [19, 20]. Furthermore, the task of learning an unknown transformation can be in principle embedded into a larger class of problems, whose objective is to learn physical evolutions from training sets of data [22]. While initial efforts have been dedicated to the characterization of generic quantum processes [23–30], different methods have been specifically adopted and tested to reconstruct an unknown linear transformation U [31–35]. Most of these approaches rely on single-photon and two-photon measurements. Intuitively, single-photon states can be used to obtain information on the square moduli of the unitary matrix, while two-photon interference provides knowledge of the complex phases of the elements of U. Different data analysis approaches have been proposed and adopted to convert the raw measured data in an estimated unitary U, exploiting conventional numerical minimization techniques [31] or by analytically inverting the relations between experimentally measured data and the elements of U [32]. Other methods exploit classical light as input in the interferometer [33]. In this case,
knowledge on the moduli is obtained by sending classical light on a single input, while knowledge on the phases is obtained by sending light on pairs of input modes and by measuring the interference fringes in the output intensities as a function of the relative phase. In this letter we discuss and test experimentally an approach for the reconstruction of linear optical interferometers based on the class of genetic algorithms [36–38]. The latter is a general method that exploits the principles of natural selection in the evolution of a biological system, and has found application to find the solution to optimization and search problems in several fields, including first applications in quantum information tasks [39, 40]. We first discuss the general principles of operations of the broad class of genetic algorithms. Then, we show how to adapt these principles of operations to the specific case of linear optical networks tomography. Finally, we test experimentally the genetic algorithm by performing the reconstruction of a m = 7 modes integrated interferometer built by the femtosecond laser-writing technique [41, 42]. Genetic reconstruction algorithm for unitary transformations.- Genetic algorithms are a broad class of algorithms inspired by the natural evolution of biological systems, which evolve following the principle of natural selection [36–38]. This principle can be briefly described as follows: within an ecosystem, individuals struggling for survival coexist within the same population. Genetically fittest individuals, e.g. individuals with highest adaption to environmental variables, are more likely to survive and reproduce. The fitness of an individual is determined by its genetic signature, the DNA, which is composed by a set of genes representing its fundamental units. Two individuals generate the offspring that inherits a combination of the genes belonging to both the parents by means of reproduction. Thus, at variance with the DNA as a whole, a single gene is or is not inherited but cannot be partially inherited. If the combination of inherited genes determines a better fitness than the parents’ one, the son will have higher survival probability. Since weaker individuals are more unlikely to survive, fittest genes are more likely to spread over the
2 (a)
P˜i,j
(b) Data set
Unknown unitary U?
Learned Unitary
Genetic Algorithm
P˜i,j V˜ij,pq
Ur
i
j U?
Principles of genetic evolution Step i E1
Step i+1 E30
E2
E20
ordering
¯3 E
E20
crossover
E300
E200
⌧
mutation
i j
f (El ) E3
E10
E4
E10
E40
E100
¯4 E ↵kl
El
DNA encoding of a unitary linear transformation Gl1
Gl2
V˜ij,pq
(c)
Gl3 Gl4
Gl5 Gl6
Gl7 Gl8 Gl9
Gl10 Gl11 Gl12
Gl13 Gl14 Gl15
Gl16 Gl17
Gl18 Gl19
q
E400
tlk
l k
Gl20
p U?
Glk
(d)
Gl21
FIG. 1. (a) Learning an unknown linear unitary transformation via a genetic approach. The input data set, measured from the unknown transformation, are processed by an algorithm based on the principles of biological systems. The unitary transformation is decomposed in elementary units, i.e. the genes Glk composing its DNA: beam-splitters (BSs) with transmittivity tlk and phase-shifts (PSs) αkl , βkl . Crossover and mutation mechanism rule the evolution for each step of the algorithm. (b) Schematic view of single-photon measurements corresponding to ˜ij,pq . (d) Internal structure of the actual characterized data set P˜i,j . (c) Schematic view of two-photon measurements corresponding to data set V m = 7 integrated linear interferometer. Blue regions indicate directional couplers, that is, integrated versions of beam-splitters, while cyan regions indicated phase shifts, introduced by modifying the optical path of the waveguides.
population and, consequently, a gradual improvement of the average fitness of the population is expected. The set of genes belonging to all the individuals of a given population is called genetic pool. The described evolution, however, would be destined to reach a local maximum since the evolved genetic pool would be composed of just a subset of the initial genetic pool. Indeed, the mechanism of reproduction, as said, allows for the recombination of existing genes, but not for the creation of new ones. This would imply that the maximum possible fitness reached by any individual of the population would strongly depend on the initial genetic pool. Hence, it is crucial to consider in this model also the mechanism of mutation [36–38]. The latter is a rare event that manifests when an inherited gene changes its form, i.e. mutates, in a random fashion. This mutated gene would likely not be present in any of the parents’ DNA and could possibly provide new advantageous features causing the increase of the individual’s probability of survival and reproduction. This will allow the mutated gene to spread over the population by reproduction, increasing the maximum fitness achievable within the given genetic pool. By adopting the mechanism of mutation, the evolution is no longer limited by the initial conditions and is thus more effective. The principles of genetic evolution can be applied to learning an unknown unitary linear transformation (Fig. 1a). The
goal is to find the unitary matrix Ur whose action best describes a set of experimental data. The latter are single-photon probabilities P˜i,j , describing the transition from input mode i to output mode j (Fig. 1b), and Hong-Ou-Mandel [43] visibilities V˜ij,pq , describing two-photon interference from input modes (i, j) to output modes (p, q) (Fig. 1c). The associated errors are ∆P˜i,j and ∆V˜ij,pq , respectively. Hong-Ou-Mandel q d d visibilities are defined as Vij,pq = (Pij,pq − Pij,pq )/Pij,pq , d where Pij,pq is the probability for two distinguishable pard ticles and Pij,pq is the probability for two indistinguishable photons. The visibilities can be measured experimentally by recording the input-output coincidence pattern as a function of the relative delay ∆τ between the input photons. We model the initial group of unitaries by the set Φ = {E1 , ..., EN } of N randomly-chosen individuals El . Every individual El is completely determined by a set of real parameters, that represent its DNA. At a first glance, one could consider the elements of the unitary transformation (moduli and phases) to compose the DNA of the individuals. However, this is not the most appropriate choice since the generation of new offsprings from the random recombination of the parents according to this mechanism can lead to a non-unitary matrix. A better approach is obtained by exploiting the result by Reck et al. [44], which showed that it is possible to decompose any linear m × m transformation in a network composed of phase
3 shifters (PS) and beam splitters (BS) (see Fig. 1a). Every kth PS-PS-BS set is defined by the transmittivity tk ∈ [0, 1), and by the phases αk , βk ∈ [0, π]. The DNA of the individual El is then represented by the vector El = {Gl1 , Gl2 , ..., GlM }, with the parameter triples Glk = {tlk , αkl , βkl } being the genes Pm−1 and M = g=1 g the total number of PS-PS-BS sets. The global unitary of the system U can be obtained by multiplying (m) the set of m × m unitary matrices Uk describing the action of the k-th gene. Each matrix is obtained starting from the m × m identity and replacing the elements corresponding to the involved modes with the ones of a PS-PS-BS 2 × 2 matrix. With such a parametrization, the unitariety of overall transformation is naturally guaranteed. This decomposition does not necessarily represent the actual internal structure of the system being studied, which may be in general unknown. Indeed, it represents a mathematical tool to reduce a unitary matrix as the combination of its independent unitary parts, those corresponding to the genes of the DNA. The genetic algorithm [45] requires the definition of three ingredients governing the genetic evolution: (i) the fitness function, which quantifies the survival probability of a given individual, (ii) the crossover function, which governs the reproduction mechanism and (iii) the mutation process, which enlarges the available set of genes and thus increases the variabililty of the individuals. The fitness function f (E) ∈ R : [0, ∞) is related to the survival probability of the individual E. In our case f (E) is chosen to be inversely proportional to the distance between experimental data and the data generated from the unitary matrix UE corresponding to the individual E E E. Assuming the matrices Pi,j and Vij,pq to be, respectively, one-photon and two-photons measurement predictions generated from UE , we define the fitness function as f (E) = 1/χ2 , where χ2 = χ2P + χ2V is the chi-square function composed by the two terms E E 2 X (V˜ij,pq − Vij,pq X (P˜i,j − Pi,j )2 ) 2 ; χ = . χ2P = V 2 2 ∆P˜i,j ∆V˜ij,pq i,j,p,q i,j (1) In other words, the fitness f (E) represents the quality of the solution E for the given problem. The crossover function corresponds to the reproduction mechanism described above. Two individuals EA and EB generate one child EC whose DNA is composed of half genes from parent EA and the other half from parent EB , randomly chosen. In the crossover mechanism, even if genes are randomly chosen they always occupy the same place in the child’s DNA sequence. This means that PS-PS-BS sets inherited by the parents always occupy the same position within offsprings’ corresponding linear optical networks. Finally, we establish that for any iteration of the algorithm any gene Glk has a probability γ (called mutation rate) of being replaced by a new random triple {t˜lk , α ˜ kl , β˜kl }. This probability must be carefully chosen. Indeed, an exceedingly high mutation frequency would reduce the search process to a random walk in the space of solutions, while an extremely small value would prevent the algorithm to reach the global maximum of f (E). One of the key
aspects of this algorithm is its computational efficiency. The price to pay is the reduction of the system governability, since the algorithm evolution is not deterministic. Furthermore, the optimal combination of parameters (mutation rate, population size, etc.) cannot be derived a priori and may depend on the dimension m of the network. Experimental results.- We tested the genetic algorithm by reconstructing the linear transformation induced by a 7-mode integrated interferometer, fabricated in a borosilicate glass substrate by means of the femtosecond laser waveguide writing [41, 42] technique. This approach exploits the permanent and localized increase in the refraction index obtained by nonlinear absorption of focused femtosecond pulses, thus directly writing waveguides in the material. The internal structure of the implemented interferometer is shown in Fig. 1d, and is composed by a network of symmetric 50 − 50 directional couplers and a phase pattern. We observe that the internal structure of the interferometer is different from the triangular structure adopted in the genetic algorithm to decompose the unitary matrices. Indeed, the adoption of the latter choice in the reconstruction algorithm is only a mathematical tool, which may not correspond to the actual structure of the transformation under analysis. Single-photon and two-photon input states, necessary to measure the data set of the algorithm, were prepared by a spontaneous parametric down conversion source and injected into the different input ports of the interferometer (see Supplementary Material [45]). The interference pattern necessary to measure the visibilities V˜ij,pq was obtained by a controlling photon indistinguishability through a variable temporal delay between the input particles. The reconstruction method based on the genetic approach has been applied to the 7-mode chip. The complete set of experimental measurements consists of d1 = 49 single photon transition probabilities P˜i,j and d2 = 441 two-photon HongOu-Mandel visibilities V˜ij,pq , thus corresponding to an overc22
2
c2
250 000
250 000 (a)
Ur
200 000
125 000
150 000 100 000
0 0
50 000 0 0
20 000
40 000
2000 60 000
4000 80 000
6000
Niter
Niter
FIG. 2. Evolution of the minimum χ2 in the genetic pool with respect to the experimental data through the running time of the genetic algorithm, plotted as a function of the number of iterations. Green solid lines: minimum χ2 in the genetic pool. Inset: highlight for Niter ∈ [0; 6600], showing the action of mutations (jumps) and crossover (smooth variations). Horizontal blue dashed line: best χ2 obtained from the analytic method, which acts as a staring point of the genetic algorithm.
4 complete set of d = d1 +d2 = 490 experimental data. The genetic algorithm maximizes the fitness function f (E) [Eq. (1)] El and between the experimental data set and the predictions Pi,j El Vij,pq obtained from the unitary UEl belonging to the population of the genetic algorithm. The starting point of the protocol is a population of s = 100 unitaries. As a modification to the recipe previously discussed, a subset of s1 = 20 unitaries at the initial step is chosen starting from the algorithm introduced in Ref. [32]. With that method, a minimal set of singleand two-photon data is exploited to retrieve analytically the elements of the unitary matrix. This approach can be extended by considering that a set of m2 independent estimates of U can be obtained by recording the full set of single- and twophoton measurements, and by permuting the mode indexes accordingly [17]. These operations correspond to selecting m2 independent minimal data sets. For the genetic algorithm, we then choose the s1 = 20 unitaries (among the set of 49 possible matrices for m = 7) presenting the lower values of the χ2 with respect to the full set of experimental data, that is, having higher fitnesses. This provides a reasonable starting point for the genetic pool. Finally, the remaining subset of s2 = 80 are randomly generated from the Haar measure. In Fig. 2 we report the evolution of the best χ2 in the genetic pool during the running time of the algorithm. We observe that an almost stable value of the χ2 is obtained after Niter ∼ 40000 iterations, corresponding to a computational time of t ∼ 1 h on a laptop. During the evolution, the decrease of the χ2 (the increase of the fitness) occurs with two different trends (see inset of Fig. 2). Smooth variations are due to the crossover mechanism between members of the population, converging to the best possible unitary given the available genetic pool. Conversely, fast jumps in the χ2 are due to random mutations in the genetic pool. The convergence of the genetic algorithm is confirmed by the decrease of the chi square χ2 from the starting value minU (a) χ2 ∼ 255000, r
(a) Ur
of the analytic approach, obtained from the best unitary to a final value of χ2r,(g) ∼ 17096, leading to an improvement of one order of magnitude. As an additional figure of (a) merit, we consider the similarities Sr between the experimental two-photon visibilities and the predictions obtained (a) from the analytic unitaries Ur , according to the definition P (a) r,(a) Sr = 1 − i,j,p,q |V˜ij,pq − Vij,pq |/(2d2 ) (and analogous definition for the output of the genetic approach). We observe (g) (g) that the similarity Sr obtained for the output unitary Ur (g) from the genetic algorithm, equal to Sr = 0.957 ± 0.001, clearly outperforms the maximum value obtained from the an(a) alytic algorithm: maxU (a) Sr = 0.920 ± 0.001. r The results for the obtained unitary are shown in Fig. 3, where the real and imaginary parts of the output unitary ma(g) trix of the genetic algorithm Ur are compared with the theoretical unitary Ut expected from the fabrication process. (g) (g) The gate fidelity between Ut and Ur , defined as Fr,t = (g)
(g)
|Tr[Ut† Ur ]|/m, reaches a value Fr,t = 0.975 ± 0.013. This parameter represents the quality of the implemented unitary in
the fabrication process indicating how close the implemented interferometer is with respect to the ideal one. The error on the gate fidelity has been estimated by a N = 10000 MonteCarlo simulation of unitary reconstruction with the analytic method. Further optimizations of the protocol can be envisaged. For instance, the χ2 function in the fitness may be replaced with a weighted function χ2w = wχ2P + (1 − w)χ2V . We then performed the reconstruction method for different values of the weight w, observing that for the present data the best choice is obtained for the symmetric case w = 0.5. The optimal weight w can nevertheless vary with the dimension m of the network. Additionally, the number of unitaries s1 taken at the initial step from the analytic algorithm can be optimized depending on the problem size. (a)