Nov 26, 2009 - (made by physicists/engineers based on many device .... A linear state space system is passive if its transfer function H(s) is positive real.
Model Order Reduction Wil Schilders NXP Semiconductors & TU Eindhoven November 26, 2009 Utrecht University Mathematics Staff Colloquium
Outline Introduction and motivation Preliminaries Model order reduction basics Challenges in MOR Conclusions
2 Utrecht Staff Colloquium, November 26, 2009
The work of mathematicians may not always be very transparant…….
3 Utrecht Staff Colloquium, November 26, 2009
But it is present everywhere in our business Collecting info
Partition 1
Collecting Partition 2 info
Mathematics Transient
Multirate
Partitioning
Mathematics
Mathematics Transient Multirate Partitioning
Invisible contribution, visible success
4 Utrecht Staff Colloquium, November 26, 2009
Moore’s law also holds for numerical methods!
5 Utrecht Staff Colloquium, November 26, 2009
Very advanced simulations
Subgridding mechanisms reduce the number of mesh points in a simulation. In this example (a) the full grid is 20 times larger (35e6 mesh nodes) than (b) the subgridded version (1.75e6 mesh nodes).
6 Utrecht Staff Colloquium, November 26, 2009
The impossible made possible….. A complete package layout used for full wave signal integrity analysis 8 metallization layers and 40,000 devices The FD solver used 27 million mesh nodes and 5.3 million tetrahedrons The transient solver model of the full package had 640 million mesh cells and 3.7 billion of unknowns
7 Utrecht Staff Colloquium, November 26, 2009
Avoiding brute force The behaviour of this MOS transistor can be simulated by solving a system of 3 partial differential equations Æ discrete system for at least 30000 unknowns Insight? Even worse: an electronic circuit consists of 104-106 MOS devices Solution: compact device model (made by physicists/engineers based on many device simulations, ~50 unk)
Can we construct such compact models in an automated way?
8 Utrecht Staff Colloquium, November 26, 2009
Another example Integrated circuits need 3-D structure for wiring 10 years ago
Present situation: – 8-10 layers of metal – Delay of signals, parasitic effects due to high frequencies
Gradual
– 1-2 layers of metal, no influence on circuit performance
3-D solution of Maxwell equations leads to millions of extra unknowns
Can we construct a compact model for the interconnect in an automated way?
circuit
9 Utrecht Staff Colloquium, November 26, 2009
Model Order Reduction is about capturing dominant features
Strong link to numerical linear algebra 10 Utrecht Staff Colloquium, November 26, 2009
And, talking about motivation…..
Mathematical Challenges!
11 Utrecht Staff Colloquium, November 26, 2009
Outline Introduction and motivation Preliminaries Model order reduction basics Challenges in MOR Conclusions
12 Utrecht Staff Colloquium, November 26, 2009
Overall picture Physical System Data Modeling ODEs
PDEs MOR
Reduced system of ODEs
Simulation Control
13 Utrecht Staff Colloquium, November 26, 2009
Dynamical systems
14 Utrecht Staff Colloquium, November 26, 2009
Problem statement
15 Utrecht Staff Colloquium, November 26, 2009
Approximation by projection
16 Utrecht Staff Colloquium, November 26, 2009
Special case: linear dynamical systems
17 Utrecht Staff Colloquium, November 26, 2009
Singular value decomposition
18 Utrecht Staff Colloquium, November 26, 2009
Optimal approximation in 2-norm
19 Utrecht Staff Colloquium, November 26, 2009
Operators and norms
Impulse response
Transfer function
20 Utrecht Staff Colloquium, November 26, 2009
Important observation Model Order Reduction methods approximate the transfer function In other words: not the entire internal characteristics of the problem, but only the relation between input and output (so-called external variables) No need to approximate the internal variables (“states”), but some of these may need to be kept to obtain good representation of input-output characteristics
matching some
Original
part of the frequency response
log H (ω )
log ω 21 Utrecht Staff Colloquium, November 26, 2009
Outline Introduction and motivation Preliminaries Model order reduction basics Challenges in MOR Conclusions
22 Utrecht Staff Colloquium, November 26, 2009
Overview of MOR methods
23 Utrecht Staff Colloquium, November 26, 2009
Balanced truncation
24 Utrecht Staff Colloquium, November 26, 2009
Interpretation of balanced truncation
25 Utrecht Staff Colloquium, November 26, 2009
Properties of balanced truncation
26 Utrecht Staff Colloquium, November 26, 2009
Simple example Mesh: 50 x 100 x 50 cells @ 1u x 1u x 1u cell dimensions
Termination (R = 10 K)
1u x 1u Voltage step (R = 50 ohm) over 1u gap
m 80
SiO2 Ide
al c
50 m icro n
ond
uct or
ip str l A on r c i
100
n cro i m
Voltage step: 0.05 ps delay + 0.25 ps risetime SINSQ Pstar
T = 5 ps 27 Utrecht Staff Colloquium, November 26, 2009
Mapping PDE’s (Maxwell Eqs.) to equivalent high order (200) ODE system
10-th order linear dynamic system fit to FDTD results
Singular value plot
28 Utrecht Staff Colloquium, November 26, 2009
Moment matching (AWE, 1991)
Arnoldi and Lanczos methods 29 Utrecht Staff Colloquium, November 26, 2009
Asymptotic Waveform Evaluation (AWE) Direct calculation of moments suffers from severe problems: – As #moments increases, the matrix in linear system becomes extremely illconditioned Æ at most 6-8 poles accurately – Approximate system often has instable poles (real part > 0) – AWE does not guarantee passivity
But: basis for many new developments in MOR! 30 Utrecht Staff Colloquium, November 26, 2009
Moment matching (AWE, 1991)
Arnoldi and Lanczos methods 31 Utrecht Staff Colloquium, November 26, 2009
32 Utrecht Staff Colloquium, November 26, 2009
The Arnoldi method
33 Utrecht Staff Colloquium, November 26, 2009
The Arnoldi method (2)
34 Utrecht Staff Colloquium, November 26, 2009
The Lanczos method
35 Utrecht Staff Colloquium, November 26, 2009
Two-sided Lanczos
36 Utrecht Staff Colloquium, November 26, 2009
Moment matching using Arnoldi (1)
37 Utrecht Staff Colloquium, November 26, 2009
Moment matching using Arnoldi (2)
38 Utrecht Staff Colloquium, November 26, 2009
Example: MOR for transmission line (RC)
Original
Reduced
Netlist size (Mb) # ports
0.53
0.003
22
22
# internal nodes # resistors
3231
12
5892
28
# capacitors
3065
97
39 Utrecht Staff Colloquium, November 26, 2009
Outline Introduction and motivation Preliminaries Model order reduction basics Challenges in MOR Conclusions
40 Utrecht Staff Colloquium, November 26, 2009
Start of a new era! The Pade-via-Lanczos algorithm was published in 1994-1995 by Freund and Feldmann, and was a very important invention Not only did the method (PVL) solve the problems associated with AWE It also sparked a multitude of new developments, and a true explosion regarding the field of Model Order Reduction
AWE 1990
PVL 1995
• Lanczos methods • Arnoldi methods • Laguerre methods • Vector fitting • …… 1995-2009 41 Utrecht Staff Colloquium, November 26, 2009
Topic 1: passivity A linear state space system is passive if its transfer function H(s) is positive real – H has no poles in C+ – H is real for real s – Real part of z*H(s)z non-negative for all s, z
Passivity is important in practice (no energy generated) Lanczos-based methods such as PVL turned out not to be passive! Æproblem!!! Search started for methods that guarantee (provable) passivity Passive MOR: – PRIMA (Passive Reduced-order Interconnect Macromodeling Algorithm) – SVD-Laguerre (Knockaert & Dezutter) – Laguerre with intermediate orthogonalisation (Heres & Schilders)
42 Utrecht Staff Colloquium, November 26, 2009
Recent developments w.r.t. passivity Several passivity-preserving methods have been developed However: if the original discretized system is not passive, or if the data originate from experiments, passivity of the reduced order model cannot be guaranteed This happens frequently in practical situations: EM software generating an equivalent circuit model that is not passive In the frequency domain, this may not be too problematic, just an accuracy issue (Sparameters may exceed 1) However, it may become really problematic in time domain simulations
Passivity enforcement methods Often: trading accuracy for passivity
43 Utrecht Staff Colloquium, November 26, 2009
Topic 2: Large resistor networks Obtained from extraction programs to model substrate and interconnect Networks are typically extremely large, up to millions of resistors and thousands of inputs/outputs Network typically contains: – Resistors – Internal nodes (“state variables”) – External nodes (connection to outside world, often to diodes)
Model Order Reduction needed to drastically reduce number of internal nodes and resistors
44 Utrecht Staff Colloquium, November 26, 2009
Reduction of resistor networks
45 Utrecht Staff Colloquium, November 26, 2009
Reduction of resistor networks
46 Utrecht Staff Colloquium, November 26, 2009
General idea: exploit structure
47 Utrecht Staff Colloquium, November 26, 2009
“Simple” network 4
71
152 1
274 external nodes (pads, in red)
38
36
149 34
40
32
42
30
44
28
46
26
212
48
24 215
5384 internal nodes
220
273 271
52
267 265
217
218
50
22 269
219
67
20
54
263
18 261
56 259
16
58
221
14 257
60
62
12 255
6 10
253
251
8007 resistors/branches
2
9
227 11
3
249
225
8
7
223
247
229 231
13
245 233 235
15
237
243
239 241
110
17
74 102
19 21 23
100 93
25
76
150
27
135
29 31
Can we reduce this network by deleting internal nodes and resistors, still guaranteeing accurate approximations to the path resistances between pads?
66
33 204
202 35
151 200
206
37 198 39
157 208
196
210 154 194
41
156
155
192
43
158
45 190
47 139
118
49
188
134
113
51 186
53 81 55
160
184
75
57 59
162 182
61
63
86
5
214 213
164
91 166 180
123
168 178
176
174
172
170
101
199
195
197
222 224
189
193 191
187
185
183
181
179
177
175
173
171
169
167
165
159 161
163
201
226 228
203 205
230
207 209
232
211 153
250
64
82 80
114
248
234 246
236
254
238
112 256
252
244
242
240
111
69
258
106 216
270
272
274
260 262
NOTE: there are strongly connected sets of nodes (independent subsets), so the problem is to reduce each of these strongly connected components individually
264
266
268 73
92 90 147 144
146
148
143
72
145
70 141 140
68
142
137
136
132 138
127
133
131 129
130
128
126
125 124 121
120 119
117 115
85
116
79
65
84
109 78
107 108
89 122
105
83 77
103
104
88 96 97
87 94
95
99
98
48 Utrecht Staff Colloquium, November 26, 2009
Two-Connected Components 2 50
49 48
1
52
3 51
53
43
44
42 41 39
5
6 38 40
4 46
47
45 29 28 30
26
54
25
27 7 10
8 9
11
12
35 36
31
34
37 32
33
can delete all internal nodes here
22
24
16
23
21 19
18 17
15
20
13
14
49 Utrecht Staff Colloquium, November 26, 2009
How to reduce the strongly connected components Graph theoretical techniques, such as graph dissection algorithms (but: rather time consuming) Numerical methods such as re-ordering via AMD (approximate minimum degree) “Commercially” available tools like METIS and SCOTCH (not satisfactory)
50 Utrecht Staff Colloquium, November 26, 2009
Border Block Diagonal Form
51 Utrecht Staff Colloquium, November 26, 2009
Block bordered diagonal form Algorithm was developed to put each of the strongly connected components into BBD form (see figure) Internal nodes can be deleted in the diagonal blocks, keeping only the external nodes and crucial internal nodes The reduced BBD matrix allows extremely fast calculation of path resistances (work of Duff et al)
52 Utrecht Staff Colloquium, November 26, 2009
Reduction results
53 Utrecht Staff Colloquium, November 26, 2009
Results are exact No loss of accuracy! ←Spectre New →
Remaining problem: how to drastically reduce the number of resistors
54 Utrecht Staff Colloquium, November 26, 2009
Topic 3: Nonlinear MOR One of the most popular methods is proper orthogonal decomposition (POD) It generates a matrix of snapshots (in time), then calculates the correlation matrix and its singular value decomposition (SVD) The vectors corresponding to the largest singular values are used to form a basis for solutions Drawback of POD: there is no model, only a basis for the solution space Researchers are also developing nonlinear balancing methods, but so far these can only be used for systems of a very limited size (