762
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 5, OCTOBER 2003
Voltage-Pulse Driven Harmonic Resonant Rail Drivers for Low-Power Applications Joong-Seok Moon, Member, IEEE, William C. Athas, Member, IEEE, Sigfrid D. Soli, Jeffrey T. Draper, Member, IEEE, and Peter A. Beerel, Member, IEEE
Abstract—We describe a new design technique for efficient harmonic resonant rail drivers. The proposed circuit implementation is coupled to a standard pulse source and uses only discrete passive components and no external dc power supply. It can thus be externally tuned to minimize the consumed power in the target IC. A new design technique based on current-fed voltage pulse-forming network theory is proposed to find the value of each discrete component for a target frequency and a given load capacitance. The proposed circuit topology can be used to generate any desired periodic 50% duty-cycle waveform by superimposing multiple harmonics of the desired waveform, however, this paper focuses on the generation of trapezoidal-wave clock signals. We have tested the driver with a capacitive load between 38.3 and 97.8 pF with clock frequency ranging between 0.8 and 15 MHz. The overall power dissipation for our second-order harmonic rail driver is 19% of 2 at 15 MHz and 97.8 pF load.
Fig. 1.
Single-rail resonant clock driver (flyback circuit).
Fig. 2.
All-resonant blip driver.
Index Terms—Clock generator, energy-recovery circuit, harmonic resonance, low power.
I. INTRODUCTION
L
OW POWER has become a critical feature of many CMOS-VLSI systems because of the increasing demand for a longer battery life and the high costs of heat removal. Because clocking circuitry is typically a significant source of power dissipation [1], reducing the power consumed by clock drivers and clock nets has become an important focus. Because clock nets are mostly capacitive, resonant charging techniques that recycle most of the energy stored in clock nets are increasingly promising. The simplest resonant charging technique uses the flyback circuit shown in Fig. 1 to generate a sinusoidal clock signal [2]. Although simple, if the nMOS transistor is driven nonresonantly, which has generally been the case, the energy efficiency of this clock driver is poor. The blip circuit [3], illustrated in Fig. 2, has much higher efficiency Manuscript received February 28, 2002; revised September 20, 2002. This work was supported in part by the House Ear Institute under Grant PHS-398, in part by DARPA under Grant F30602-98-2-0108, and in part by the National Science Foundation ITR Award. J.-S. Moon was with Information Sciences Institute, University of Southern California, Marina del Rey, CA 90292 USA. He is now with Apple Computer Inc., Cupertino, CA 95014 USA (e-mail:
[email protected];
[email protected]). S. D. Soli is with House Ear Institute, Los Angeles, CA 90057 USA (e-mail:
[email protected]). J. T. Draper is with the Information Sciences Institute, University of Southern California, Marina del Rey, CA 90292 USA (e-mail:
[email protected]). W. C. Athas is with Apple Computer Inc., Cupertino, CA 95014 USA (e-mail:
[email protected]). P. A. Beerel is with University of Southern California, Los Angeles, CA 90089 USA (e-mail:
[email protected]). Digital Object Identifier 10.1109/TVLSI.2003.814323
because it is all-resonant, i.e., the energy used to drive every transistor is recycled. This circuit successfully has been used as an efficient power source for drivers of large on-chip signal lines of microprocessors [2], [4] but can also be used to generate two-phase almost-nonoverlapping sinusoidal clocks. A common disadvantage of both these clock drivers is that the output signal frequency and magnitude depend heavily on the . Because the value of may be data-deload capacitances pendent and can thus vary from cycle to cycle, the clock frequency may also fluctuate, thereby decreasing performance and increasing design effort [5]. Of the two drivers, the frequency fluctuation in the blip driver is more pronounced because of the positive-feedback nature of the two outputs. Another disadvantage of both of these drivers is the need for a distinct dc power whose value is determined by the load capacitance supply and the target frequency. Lastly, while sinusoidal clock signals are well-suited for special adiabatic circuits [6], [7], the slow slew rates cause two problems for conventional clock nets. In particular, while adiabatic circuits have special circuitry that prevents the slow slew rates from causing high short-circuit current, conventional clock buffers, flip-flops, and latches do not have these features and thus, have relatively fast clock slew rate requirements. Secondly, slow slew rates cause increased variations on effective clock skew and clock-output delay which may considerably affect potential performance and system stability.
1063-8210/03$17.00 © 2003 IEEE
MOON et al.: VOLTAGE-PULSE DRIVEN HARMONIC RESONANT RAIL DRIVERS FOR LOW-POWER APPLICATIONS
763
Fig. 4. Trapezoidal-wave voltage signal with slope V =(2 T ).
Fig. 3. Harmonic resonant rail driver containing three harmonic terms.
Younis and Knight [8] developed an incremental design approach for a class of efficient harmonic rail drivers that solves these problems. Their drivers approximate a desired square wave (with 50% duty cycle) by superpositioning its first harmonics, as illustrated by the third-order driver in Fig. 3. These drivers, however, require distinct dc power supplies, which is prohibitive for most practical implementations. In this paper, we present a new systematic design approach for th-order harmonic resonant rail drivers that do not require additional dc power supplies. Linear network theory is normally applied to predict the waveform generated by a network of passive components. Our design approach applies it for the inverse problem. That is, we use linear network theory to systematically derive a network of passive components that generates th-order approximations of any given desired clock waveform with 50% duty cycle that can be expressed as a periodic trapezoid. In this way, we can achieve approximations of both ideal square waves and more practical waveforms with finite rise and fall times. In particular, we use linear network theory to develop a noniterative method for calculating the component values given the desired waveform shape and the nominal value of the load capacitance. The topology of our proposed driver is based on a modified current-fed voltage pulse-forming network (CFVPN) [9]. This network is traditionally connected to a constant current source, which internally consumes significant power. In contrast, we propose using a conventional pulse generator that consumes much less internal power and is readily available in most systems. Moreover, it requires no additional distinct dc voltage/current supply and reduces the impact of variations in load capacitance on fluctuations in output magnitude and frequency. Self-oscillating resonant circuits such as flyback and blip circuits cannot be trivially synchronized to an external clock signal connected to other blocks in the system. However, this can be easily achieved in our design because it is driven by an external pulse generator. Our proposed design approach has been implemented and tested for frequencies up to 15 MHz with various load capacitances. The worst case overall power dissipation of at 15 MHz with a the second-order driver is 19% of
97.8-pF load. Magnitude and frequency fluctuation due to a broad range of load capacitances variation are observed to be minimal. In addition, the power efficiency as a function of load capacitance and input pulse frequency variations is quantified. The remainder of this paper is organized as follows. In Section II, we briefly review the theory of waveform synthesis using current-fed voltage pulse-forming networks. Section III describes our systematic approach to identify the value of all driver components. Then, Section IV discusses practical implementations, Section V presents laboratory measurement results, and Section VI concludes with a discussion of potential applications and future work. II. CURRENT-FED VOLTAGE PULSE-FORMING NETWORK This section reviews standard implementations of Fourier series approximations of periodic trapezoidal waveforms using current-fed voltage pulse-forming networks. , shown in Fig. 4, can be defined by A trapezoidal wave the following:
where
is integer.
(1)
is an odd function, the Because the trapezoidal waveform contains only sine terms as follows: Fourier series for (2) where
where
(3)
In practice, only the first few terms are needed to yield a waveform that closely approximates an ideal trapezoidal wave. Notice that the model approximates a square wave as becomes zero.
764
Fig. 5.
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 5, OCTOBER 2003
CFVPN.
A CFVPN that can generate an output voltage consisting of the superposition of harmonics is shown in Fig. 5 [9]. To , first assume that switch S opens at and there analyze is no energy initially stored in the network. The voltage across section is shown in (4) the th
TABLE I COMPONENT VALUES FOR NETWORK OF FIG. 5
(4) Cascading :
such
sections in series yields the following for
(5) With this analysis, it is straightforward to determine the values of all network components to approximate a trapezoidal waveform defined by (2) and (3). In particular, by comparing (5) with , and for both square and trape(2), the values of , zoidal waveforms can be easily determined, as summarized in Table I. As is, however, this network cannot be directly used as a clock rail driver because none of the capacitances in the network represents a load capacitance that resides between the output node and ground. To meet this requirement, an equivalent network can be derived through mathematical transformations of impedance and admittance functions of the output, as shown in the following:
(6)
(7)
has zeros at and Notice that the impedance function , which in turn appear as poles in the admittance function . We, therefore, can rewrite (7) as follows:
(8)
where
(9) and the values for
and
are determined as follows:
(10) to be generated using the This transformation enables alternative circuit topology illustrated in Fig. 6, which is now suitable as a clock rail driver because it has an explicit clock that lies between the output and ground. load capacitance To find the values of other components of Fig. 6, we can use a [10], partial fraction expansion of the admittance function which is an iterative numerical procedure that provides little insight into the operation of the network. In Section III, we present a characteristic equation that constrains component values so that only desired frequency components are produced in the network and together with a set of linear equations provides a more insightful closed-form expression for component values. III. THEORETICAL ANALYSIS OF CFVPN There are three steps in our theoretical and algorithmic analysis of the CFVPN and its desired component values. First, we convert all of node voltage and branch current equations from time-domain to frequency-domain using the Laplace transform. Second, the branch current equations are simplified to find a
MOON et al.: VOLTAGE-PULSE DRIVEN HARMONIC RESONANT RAIL DRIVERS FOR LOW-POWER APPLICATIONS
Fig. 6.
765
An equivalent network of Fig. 5 containing a capacitance C that can represent an on-chip clock load.
characteristic equation whose roots are the product of and of each branch such that all unwanted frequency components are suppressed. Using these roots, the third step is to establish a set of linear equations that can be found by applying on the output node to identify all of the inductor values in the network. These values are combined with the roots of the characteristic equation to identify all of the capacitor values. A. Step 1: Convert Voltage and Current Equations to Frequency-Domain Representation To ease the tedium and complexity of solving the integral and differential equations, we use the Laplace transform to convert voltage and current equations into the frequency domain. Since two networks shown in Figs. 5 and 6 are equivalent, we write of Fig. 6 by the Laplace transform of the output voltage approximating (2) to the th order, i.e.,
B. Step 2: Simplify Branch Current Equations and Establish the Characteristic Equation As shown in (14), each branch introduces a new free oscillation frequency component at . This frequency component is unwanted because the output of the network should have only desired harmonics at to . By simplifying the branch current equation and finding a condition to suppress the frequency component at , we can establish the characteristic distinct values and thus, the equation whose roots are and . product of To simplify the branch current equation, we can rewrite (14) as follows:
(16)
where
(11)
By comparing th terms, two conditions for be found
and
By noting that voltages across all of the branches in Fig. 6 equal , it is straightforward to derive the Laplace transform for each branch current as follows:
can
(17)
(12)
(18) (13)
Equation (16) can now be simplified by replacing and collecting the terms together
with
(14) where
for
(15)
(19)
766
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 5, OCTOBER 2003
By applying on the output node of the network, the relationship of branch currents can be defined by (20) term exists in the right side of this equation, the Because no term in each branch current must evaluate to zero, implying the following additional constraint (21)
(25) Comparing both sides of (25), the linear equations shown in (26) . Note that these determine the inductor values values can be combined with (24) to calculate the capacitance values
Equations (18) and (21) are combined to produce the characteristic equation shown in (23),
(22)
(26)
where
(23)
Notice that the numerator of the characteristic equation is an ) polynomial of variable . The roots of this numerorder-( to , are the roots of the entire characator polynomial, teristic equation, which can be represented as follows:
As an example, consider the task of finding the value of all components of the second-order square-wave driver for a 1-MHz clock and a 100-pF load. From (23), we have
(27)
(24) Using this value, we can rewrite (26) as follows: and C. Step 3: Setup Linear Equations to Find a Set of Combine With the Roots of the Characteristic Equation to Find a Set of Using (24), we can substitute with the product in (22) and use the result to simplify (20) as follows:
and
(28) By solving these equations, we find inductor values, uH and uH. Lastly, since , it follows that pF. IV. VOLTAGE-PULSE DRIVEN PULSE-FORMING NETWORK Even though the CFVPN shown in Fig. 6 has an appropriate configuration for our target applications, two problems preclude the network from being directly applied as a clock rail driver. First, a dc current source is required to drive the network that in practice consumes large amounts of power internally, canceling out the benefits of the CFVPN clock rail driver. Second, the and as opposed to 0 waveform swings between required for driving CMOS clock nets. We propose a and unique solution that overcomes these impediments.
MOON et al.: VOLTAGE-PULSE DRIVEN HARMONIC RESONANT RAIL DRIVERS FOR LOW-POWER APPLICATIONS
767
Fig. 7. Equivalent network of Fig. 6 driven by a voltage pulse.
Fig. 8. Practical approximation of the network shown in Fig. 7.
A. Voltage-Pulse Driven Network for dc Current Source Elimination In theory, to eliminate the dc current source of the CFVPN we can use the equivalent network shown in Fig. 7, which is triggered by a voltage source generating a waveform identical to the desired output that provides no current, and thus consumes no power. However, it is impractical to build a voltage source that generates a waveform matching the desired th-order harmonic voltage waveform. Thus, we propose a solution that uses a more practical voltage pulse whose undesired harmonics are effectively absorbed using a series resistor. The cheapest source of the voltage pulses is a conventional clock oscillator that, given finite rise and fall times, approximates a trapezoidal wave. The first harmonics of the trapezoidal waveform should match that of the network. However, the trapezoidal-wave clock signal will also contain higher order harmonics than those generated by the network. This will cause significant current draw from the voltage source, reducing the power efficiency of the proposed rail driver. We propose to reject these higher order harmonics from the input pulse generator by placing a resistor between the input and output of the network as depicted in Fig. 8. To understand the benefits of adding this resistor we first write the impedance function of the original network as follows:
Note that (29) is the impedance function of the network shown in Fig. 5 that is the equivalent network of Fig. 6. Then, by adding the resistance , the overall impedance seen by the input pulse generator is
(30) The transfer function ratio of impedance
of the network, represented as the and , is as follows:
where
is real
(31)
. The magnitude and phase shift of this where transfer function can then be expressed as follows:
(32) (29)
Thus, for all frequencies other than the harmonic frequencies is finite), the magnitude approaches zero and the (where
768
Fig. 9.
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 5, OCTOBER 2003
Frequency response of the network shown in Fig. 8 (V (s)=V (s)).
phase shift approaches 90 as the value of increases. Moreover, the magnitude and phase of the transfer function at each of the harmonic frequencies can be calculated as follows:
(33) for . Thus, the value where of does not affect the phase or magnitude of any of the generated harmonics. A more detailed analytical proof of (33) for the second-order driver example is presented in the Appendix. Fig. 9 depicts the frequency response of the second-order driver for a 1-MHz clock signal with different resistance values. It clearly shows that no distortion is incurred at two resonant frequencies (1 and 3 MHz) for all resistance values. From this graph, it seems beneficial to increase the resistance to reject higher harmonics. However, the parasitic resistances of the components and wires unfortunately reduce the voltage level of the output signal as becomes larger because of the inherent voltage divider present between the parasitic resistances and . To understand this more clearly, the driver is redrawn in Fig. 10 with a parasitic dc resistor associated with each inductor for the second-order. Other parasitic components whose values are negligible compared with components used are not considered on the output node, to simplify the analysis. If we apply we can write the following:
(34)
Fig. 10.
Proposed network with the parasitic resistance of inductors.
By arranging for (35) at the first two Fig. 11 shows the magnitude and phase of and are used harmonic frequencies. for the inductor parasitic resistances, as specified by the data sheet for the inductor used in our implementation [11]. Though there is negligible change in the phase, the magnitude decreases from 0.96 to 0.69 when increases from 1 to 10 k . This is in sharp contrast to the ideal network whose frequency response at the harmonic frequencies is not affected by the resistance value as demonstrated in Fig. 9. Consequently, it is important to use an adequate resistance value while maintaining proper voltage levels of the output signal for low-power dissipation. For driving 97.8-pF load capacitance at 1 MHz, our test measureis dissipated with ment demonstrates that only 15% of a 2-k resistance with negligible degradation in output voltage.
MOON et al.: VOLTAGE-PULSE DRIVEN HARMONIC RESONANT RAIL DRIVERS FOR LOW-POWER APPLICATIONS
769
(a)
(b) Fig. 11.
Frequency response of the network shown in Fig. 10 for the first two harmonic frequencies. (a) Magnitude (b) Phase.
B. A Tank Capacitor for a Positive-Swing Waveform The output of the network in Fig. 8 swings between and because one branch between the output and ground contains a single inductor . To redesign the network to swing we must introduce a dc offset to the output. We from 0 to propose accomplishing this by introducing a dc offset at the
pulse generator input and adding a tank capacitance ries with .
in se-
To understand how a dc offset from the input pulse generator affects the network, two circuits that differ only in an existence at the dc steady-state condition are shown in Fig. 12. of , the induced output dc voltage is zero because the Without
770
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 5, OCTOBER 2003
Fig. 13. (a)
(b) Fig. 12.
dc steady-state for the network. (a) Without C . (b) With C .
output and ground node are shorted by the branch as shown flows into in Fig. 12(a). As a result, the dc current the network, creating significant unwanted dc power. For the connected network shown in Fig. 12(b), the tank capacitance in series induces a matching dc offset voltage at the output to node, eliminating the dc current into the network. Moreover, the introduction of the tank capacitor has a negligible impact on the overall frequency response of the rail driver. To see this, notice that the impedance functions for the branches can be written as follows: in Fig. 12 that contain
(36) is very large, the impedance of Assuming gibly affected by
when
is negli-
(37)
In our laboratory test, a 10 nF off-the-shelf capacitor was sufficient to achieve the desired dc offset voltage within an 0.8to 15-MHz frequency range. The final proposed voltage-pulse driven positive-swing driver is shown in Fig. 13. V. MEASUREMENT The proposed harmonic resonant square-wave rail drivers containing up to four terms (i.e., fourth-order) were designed and tested on a wire-wrap board that included tunable inductors and capacitors. We varied the frequency from 0.8 to 15 MHz by setting these components to theoretical values we calculated using (23) and (26). We then tuned each component to achieve minimum measured power dissipation and compared them with their theoretical value. Testing at higher frequencies was
Voltage-pulse driven harmonic resonant rail driver.
limited by the test setup and equipments that are available to the authors. Table II summarizes the laboratory measurement results for various configurations. In most cases, the measured values of the components are within 7% of the theoretical values. Deviation between the theoretical and tuned capacitance values is larger than for the inductors presumably because of the large parasitic capacitances in our wire-wrapped board. As reported in Table II, approximately 19% of the calculated conventional was dissipated for the second-order power dissipation driver at 15 MHz to drive 97.8 pF load capacitance. Power dissipation increases as the order of the driver increases. This effect appears to be due to more parasitic components in the test board. In addition, tuning the circuit for minimum measured power dissipation is increasingly error prone since more design variables are involved. Note that as we increase the order of the driver, we and for the must include additional capacitance such as second-order driver. However, this does not increase the power dissipation significantly because only a small fraction of the current is drawn from the input pulse generator. In particular, the pulse generator needs to provide only a very small current sufficient to compensate the energy loss due to the parasitics of the components. The last row in Table II shows the measurement data of the second-order driver for different load capacitances at 1 MHz. Resistance values are reduced to achieve 10% rising and falling times of the total cycle time. Power dissipation is increased by approximately 7% for this case while rising and falling times are shortened by 3% from the minimal power dissipation mode. This result suggests that by changing resistance value, we can control the rising and falling times at the expense of power dissipation. Fig. 14 illustrates the measured power dissipation as we changed the resistance value for 1 MHz and 100 pF. The transition time with 2-k resistance was measured as 110 ns, which is 11% of the total cycle time. Notice that transition times in Fig. 14 are normalized to this value. At 285 , the transition time drops to 50 ns (45%) while the power dissipation increases . from 15% to 57.9% of Figs. 15 and 16 show oscilloscope traces of the output signal of the driver for the second- and third-order harmonics. To see how the output signal is synchronized, the input pulse is also shown. A fast Fourier (FFT)-enabled oscilloscope trace for the fourth-order driver output is presented in Fig. 17. The figure shows that only four harmonic frequencies are present in the output signal. Fig. 18 presents the trace of the output signal of the second-order harmonic driver for 10-MHz frequency. When the driver is directly connected to the clock network, the nonlinear characteristic of the transistors can cause load capacitance variations. To measure power dissipation as
MOON et al.: VOLTAGE-PULSE DRIVEN HARMONIC RESONANT RAIL DRIVERS FOR LOW-POWER APPLICATIONS
771
TABLE II MEASURED DATA OF SECOND, THIRD, AND FOURTH SQUARE-WAVE HARMONIC RESONANT RAIL DRIVER FOR VARIOUS CLOCK FREQUENCIES AND LOAD CAPACITANCES. THE FIRST THREE ROWS ARE DATA FOR DRIVING 97.8 pF LOAD CAPACITANCE AT DIFFERENT CLOCK FREQUENCIES AND THE LAST ROW SHOWS DATA FOR DIFFERENT LOAD CAPACITANCES AT 1 MHz. THEORETICAL AND MEASURED VALUES OF EACH COMPONENT ARE ALSO SHOWN FOR COMPARISON
Fig. 14. Normalized power dissipation (P=f C V ) and transition time versus resistance R. f C V is the theoretical conventional power dissipation to drive load capacitance C .
a function of the load capacitance variation, we varied from 30% to 30% of the nominal value while keeping all other components the same. The power was then measured. The results for a 1-MHz clock and a 100-pF load capacitance are plotted in Fig. 19. Normalized power dissipation in the graph is the ratio between the measured power dissipation . Power dissipation at 100 pF is minimum beand cause the circuit is designed to harmonically resonate at this value. No frequency variation was noticed for this range of capacitances as is expected for any externally-driven driver.
Unlike the self-oscillating rail drivers whose frequency varies proportional to the square root of variations in capacitance [5], this beneficial characteristic of our drivers significantly increases system stability. For capacitance greater than 130%, however, significant voltage-level degradation is observed. On the other hand, if we reduce the load capacitance below 70% of nominal, the power dissipation increases rapidly because current from the input pulse generator mostly charges the load capacitance instead of it being charged resonantly. In addition to the increased power dissipation by the load ca-
772
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 5, OCTOBER 2003
Fig. 15.
Scope trace of output waveform for the second-order driver at 1 MHz.
Fig. 16.
Scope trace of output waveform for the third-order driver at 1 MHz.
Fig. 17.
FFT-enabled scope trace of waveform for the fourth-order driver at 1 MHz.
pacitance variation, the phase shift between the input pulse and the generated output causes clock jitter. To quantify this effect, we measured the delay time of the output with respect voltage level while varying to the input pulse at the
the load capacitance from 30% to 30% of the nominal value. The measurement results are shown in Fig. 20. We observed clock jitter ranging from 47 to 43 ns for 1-MHz clock frequency. This relatively high clock jitter can be com-
MOON et al.: VOLTAGE-PULSE DRIVEN HARMONIC RESONANT RAIL DRIVERS FOR LOW-POWER APPLICATIONS
Fig. 18.
Scope trace of output waveform for the second-order driver at 10 MHz.
Fig. 19.
Normalized power dissipation versus load capacitance (C ). All components except C are kept same as designed for 100 pF C .
pensated by an increased clock cycle time. Therefore, 5% to 10% performance loss is expected for applications with high load capacitance variation. Another experiment was carried out to measure power dissipation as a function of frequency change of the input pulse generator. We varied the frequency of the input pulse gener) from 10% to 10% from its nominal value then the ator ( power was measured. Fig. 21 shows the measurement results of the power dissipation. At the nominal frequency (1 MHz), the . When is normalized power dissipation is 14% of reduced by 10% from its nominal value, the power dissipation . 36% of was measured is increased to 39% of
773
when is increased by 10%. For 2% frequency fluctuation, . the power dissipation is increased to 15% of VI. CONCLUSION In this paper, we presented a new algorithm and a prototype implementation of a harmonic resonant rail driver. The design goal was to produce an energy-efficient harmonic resonant clock signal using a simple network topology requiring no additional dc power supply. The experiment result shows that a significant amount of energy for driving the clock load can be recycled and saved by the resonant characteristic of the proposed driver.
774
Fig. 20.
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 5, OCTOBER 2003
Clock jitter versus load capacitance (C ).
Fig. 21. Normalized power dissipation versus clock cycle variation.
Depending on the number of harmonics in the driver, we were able to save 70% to 85% of the conventional power dissipation. Moreover, our driver is much more robust than previously reported resonant clock drivers, in that it greatly reduces the frequency variation caused by changes in load capacitance. Implementation with integrated inductor and capacitor requires further work. The straightforward procedure for imple-
menting the external clock driver is to measure the load capacitance by driving it with a conventional clock driver and tune other components correspondingly. Therefore, high-accuracy capacitance extraction is crucial to integrate the driver inside ICs. A phase comparator and varactors to compensate error caused by capacitance estimation and load capacitance variation could be a feasible solution, but power dissipation will be
MOON et al.: VOLTAGE-PULSE DRIVEN HARMONIC RESONANT RAIL DRIVERS FOR LOW-POWER APPLICATIONS
Fig. 22.
775
Waveform plot for v (t) in (46).
increased by these additional circuitries. In addition, the low-Q characteristic of the internal inductor may cause low power efficiency of the harmonic clock driver. For high-frequency applications, the parasitic components of each inductor and capacitor will significantly affect the output waveform and power dissipation. Therefore, a simulation-based optimization strategy is needed to achieve optimum performance and power dissipation. Nevertheless, the theoretically-optimal circuit derived from our analysis can be the initial circuit for a more detailed nonlinear simulation-based optimization strategy. Providing such a good initial condition can dramatically reduce the run time of the simulation-based approach. In addition, the theoretic analysis may also be useful in guiding the search strategy.
can be rewritten as shown in (39) at the bottom of the page. For the second-order square-wave driver, all of the components and as follows: can be represented with
(40) Using (40), we can simplify (39) into (41)
APPENDIX PROOF OF (33) FOR THE SECOND-ORDER DRIVER By applying on the output node of Fig. 8 (for the second-order driver), we can write the following:
(38)
(41) To show how input harmonic signals are not affected by the series resistor , let us examine the output when sine is applied. Note that ideal wave square-wave is the sum of the sine waves harmonically related
(39)
776
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 5, OCTOBER 2003
( ). Using partial fraction expansion, the following can be established
Therefore, when the circuit reaches steady-state condition, only the first two harmonics are present at the output node and no power is dissipated if the input is composed of these two harmonics. REFERENCES
(42) By comparing (41) and (42), we find
(43) Solving for
and (44)
and 3, is forced to zero which in turn Notice that when simplifies other terms as shown below
(45) Therefore, for
or 3, (42) becomes
(46) From this equation, we can conclude the first two harmonics from the input pulse generator are not affected by the series resistor and thus appear at the output node with the same magnitude and phase. Fig. 22 shows the time-domain waveform of for various values. As shown in the figure, the magnialways converges to zero regardless of value. tude of
[1] J. Rabaey, Digital Integrated Circuits: A Design Perspective. Englewood Cliffs, NJ: Prentice-Hall, 1995. [2] W. Athas, N. Tzartzanis, L. Svensson, and L. Peterson, “A low-power microprocessor based on resonant energy,” IEEE J. Solid-State Circuits, vol. 32, pp. 1693–1701, Nov. 1997. [3] W. Athas, L. Svensson, and N. Tzartzanis, “A resonant signal driver for two-phase, almost-nonoverlapping clocks,” in Proc. Int. Symp. Circuits and Systems, 1996, pp. 129–132. [4] W. Athas, N. Tzartzanis, W. Mao, L. Peterson, R. Lal, K. Chong, J.-S. Moon, L. Svensson, and M. Bolotski, “The design and implementation of a low-power clock-powered microprocessor,” IEEE J. Solid-State Circuits, vol. 35, pp. 1561–1570, Nov. 2000. [5] N. Tzartzanis, “Energy-recovery technique for CMOS microprocessor design,” Ph.D. dissertation, Dept. Elect. Eng., Univ. of Southern California, Los Angeles, CA, 1998. [6] W. Athas, L. Svensson, J. Koller, N. Tzartzanis, and Y. Chou, “Lowpower digital systems based on adiabatic-switching principles,” IEEE Trans. VLSI Syst., vol. 2, pp. 398–406, Dec. 1994. [7] J. Denker, “A review of adiabatic computing,” in Proc. 1994 Symp. LowPower Electronics, 1994, pp. 94–97. [8] S. Younis and T. Knight, “Non-dissipative rail drivers for adiabatic circuits,” in Proc. 16th Conf. Advanced Research VLSI, 1995, pp. 404–414. [9] G. Glasoe and J. Lebacqz, Eds., Pulse Generator. New York: McGraw-Hill, 1948, pp. 175–224. [10] M. Van Valkenburg, Network Analysis. Englewood Cliffs, NJ: Prentice-Hall, 1974. [11] Coilcraft Document 112-1, Slot Seven 7mm Tunable Inductor Data Sheet. Cary, IL: Coilcraft, Inc., 2001.
Joong-Seok Moon (S’97–M’03) received the B.S.E.E. and M.S.E.E. degrees from the Seoul National University, Korea, in 1993 and 1995, respectively, and the Ph.D. degree in electrical engineering from the University of Southern California, Los Angeles, in 2003. Currently, he is a VLSI Application Engineering with Apple Computer, Inc, Cupertino, CA. From 1995 to 1996, he was a VLSI Engineer with Samsung Electronics, Korea. He was also with Information Sciences Institutes, University of Southern California, Marina del Rey, CA where he was involved in several VLSI research projects such as low–power microprocessor, feedback cancellation DSP chip for hearing-aid, low-power FIFO SRAM, and floating–point unit for DIVA processor-in-memory processor. His research interests include are low-power circuit and system design for general purpose and DSP architecture.
William C. Athas (M’99) received the B.S. degree in computer science from the University of Utah, Salt Lake City, in 1978, and the Ph.D. degree from the California Institute of Technology, Pasadena, CA, in 1987. In 2000, he joined Apple Computer, Inc., Cupertino, CA, where he is a VLSI Systems Architect and Principal Engineer in the VLSI-Hardware Engineering Department. At Apple he works in the areas of high-speeding signaling, mixedsignal integrated-circuit designs, microprocessor speed and power modeling, and advanced microarchitecture concepts and designs. From 1991 to 2000, he was a Research Scientist and Senior Project Leader with the Information Sciences Institute, University of Southern California, Marina del Rey, CA. During this time he was the Team Leader for the design and implementation of the AC1 microprocessor, which demonstrated a new approach to low-power computing based on the principle of energy recovery. He has 23 publications in the areas of the theory and design of low-power circuits and high-performance computer architecture. He has received patents for low-power and asynchronous circuits, high-performance memory subsystems, and network hardware.
Sigfrid D. Soli photograph and biography not available at the time of publication.
MOON et al.: VOLTAGE-PULSE DRIVEN HARMONIC RESONANT RAIL DRIVERS FOR LOW-POWER APPLICATIONS
Jeffrey T. Draper (S’85–M’89) received the B.S. degree in electrical engineering from Texas A&M University, College Station, TX, in 1987, and the M.S.E. and Ph.D. degrees in electrical engineering from University of Texas, Austin, TX, in 1990 and 1993, respectively. He is a Research Assistant Professor at the University of Southern California (USC), Los Angeles, and is also a Project Leader with the Information Sciences Institute, USC, where he has been working in research since 1993. He has led the VLSI effort on several DARPA-funded projects and has produced a number of complex ICs, with the most recent chip fabricated in 0.18 m technology and containing 55 million transistors.
777
Peter A. Beerel (S’88–M’95) received the B.S.E. degree in electrical engineering from Princeton University, Princeton, NJ, in 1989, and the M.S. and Ph.D. degrees in electrical engineering from Stanford University, Stanford, CA, in 1991 and 1994, respectively. Currently, he is the Vice-President of Asynchronous Computer-Aided Design and Verification at Fulcrum Microsystems, Inc., Calabasas Hills, CA, a startup company developing and commercializing asynchronous designs. He is also an Associate Professor, currently on leave, from the Department of Electrical Engineering-Systems at the University of Southern California (USC), Los Angeles. He has consulted for Yuni Networks and AMCC in the areas of networking chip design, Intel, and asynchronous digital design in the areas of asynchronous design and computer-aided design, and TrellisWare Technologies in the area of communication chip design. His research interests include a variety of topics in CAD and VLSI. Dr. Beerel received the Outstanding Teaching Award in 1997 and the Junior Research Award in 1998, from the School of Engineering, USC. He received a National Science Foundation (NSF) Career Award and a 1995 Zumberge Fellowship. He was also co-recipient of the Charles E. Molnar Award for two papers published in ASYNC’97 that best bridged theory and practice of asynchronous system design and was a co-recipient of the best paper award in ASYNC’99. He is coauthor of four patents in the area of asynchronous circuits. He has been a Member of the Technical Program Committee for the International Symposium on Advanced Research in Asynchronous Circuits and Systems since 1997 and was Program Co-chair for ASYNC’98. He has served on the Technical Program Committee for the International Conference on Computer-Aided Design in 2000 and 2001.