Reduction of Testing Power with Pulsed Scan Flip-flop ... - IEEE Xplore

2 downloads 0 Views 2MB Size Report
Abstract- In this paper, a new scan flip-flop is proposed for low power testing. Different flip-flops (Master-slave, hybrid, pulse triggered) are reviewed and ...
Proceedings of 2011 International Conference on Signal Processing, Communication, Computing and Networking Technologies (ICSCCN 2011)

Reduction of Testing Power with Pulsed Scan Flip-flop for Scan Based Testing l 2 3 D.Satya Valibaba , S.Sivanantham , P.S.Mallick

J.Raja Paul Perinbam

12 , School of Electronics Engineering 3 School of Electrical Engineering

Department of ECE R.M.K. Engineering College

VIT University

Chennai, Tamilnadu, India

Vellore, Tamilnadu, India 23 , {ssivanantham, psmallick}@vit.ac.in

[email protected]

Abstract- In this paper, a new scan flip-flop is proposed for low

unnecessary transitIOns in the cores under test (CUT). This

power testing. Different flip-flops (Master-slave, hybrid, pulse

excessive power dissipation causes the CUT fail or damage

triggered) are reviewed and evaluated their performance using ELDO simulator with TSMC 180 nm CMOS technology. Based on this evaluation, pulsed triggered flip-flop is selected as scan flip-flop

because

of

lower

transition

power.

Comparison

of

proposed scan flip-flop with existing mux based master-slave

during testing period itsel£ Therefore, the demand of low power testing technique is necessary. Many DFT modification techniques have been proposed for scan shift power reduction, which can be broadly categorized into two types:

scan flip-flop is performed at the layout level. Experimental results on ISCAS89 benchmark circuit show that the proposed scan flip-flop can be used to reduce the test power.

Vector-independent techniques: Controlled gates insertion [6] in partitioned scan chain can reduce scan shift power with

Keywords- low power testing, DFT, scan-based testing, tests power reduction, scan flip-flop, pulsed flip-flop.

the

degradation

of

functional

performance

due

to

the

additional gate delay. The above techniques need not to be particularly designed according to their test vectors.

I.

INTRODUCTION

Vector-dependent techniques: By scan chain redesign [8-

Adding more features into a single chip also increases the

10] to reduce the number of switching activities on scan

circuitries.

chains leads to decrease the scan shift power. This technique

Therefore, inserting scan chain throughout the entire chip is a

includes test vector reordering, X-filling and compression.

technique which cannot be ignored anymore During the test

These above techniques need to be particularly applied for

mode a known set of test vectors are loaded into the scan

each design according to their different test vectors, their

chain to test the combinational logic between pipeline stages.

efficiency

If there was no scan chain in the circuit the entire chip would

independent technique and less performance degradation.

complexity

which

demands

highly

testable

are

usually

much

higher than

that

of

vector­

have had to run for multiple cycles to send the test results to the

chip

boundary. Stitching flops

makes

it

possible

to

II.

serialize the test results and send them through the scan chain. The timing of a design significantly depends on the speed of these flip-flops and also has a major contribution in the total power consumption of the design.

normal

function

mode

[7],

because:

Successive

functional input vectors usually have significant correlations than the correlations between consecutive test patterns. Test engineers usually use parallel testing in the system to reduce the test application time thus cut the test cost, which leads to excessive power dissipation in the long duration of shift phase. Scan-based test techniques dominate the current Design for Testability (DFT) market, but they suffer from increasing power dissipation caused by excessive transitions during test vector

shift

and

Power Dissipation during Test Application

The ever increasing demand for portable computing devices and wireless communication systems requires low power

Testing power may be twice as high as the power consumed during

A.

BACKGROUND

capturing processes, which will

978-1-61284-653-8/11/$26.00 ©2011 IEEE

create

VLSI circuits [12]. Minimizing power dissipation during the VLSI design flow increases lifetime and reliability of the circuit [13,14]. Numerous techniques for low power VLSI circuit design were reported [13] for CMOS technology where the dominant factor of power dissipation is dynamic power dissipation caused by switching activity [14]. While these techniques

have

successfully

reduced

the

circuit

power

dissipation during functional operation, testing of such low power circuits has recently

become

an

area of

concern.

Therefore, addressing the problems associated with testing low power VLSI circuits has become an important issue [12].

526

Proceedings of 2011 International Conference on Signal Processing, Communication, Computing and Networking Technologies (ICSCCN 2011)

B.

Review offlip-flops

We have reviewed various flip-flops proposed in literature. Among these five flip-flops are chosen based on the possibility of implementation. The Table I shows the performance of these four flip-flops under consideration. A conventional SET flip-flop (Figure I) is triggered either at the rising edge or the falling edge of a clock cycle. In case of a rising SET flip-flop, the rising edge determines the output of the flip-flop. Similarly, in case of a falling SET flip-flop, the value of the input at the falling edge is transferred to the output port. However, for the correct operation of the flip-flop depends on, the input value has to be maintained constant just before (tset-up ) and just after (thold) the triggering edge of the clock. In SET D flip-flop (Figure 1) with 16 transistors [1,4], the Master section consists of a D latch which is functional on the positive level of the clock and transfers the logic level at input D to the intermediate node P. There is a feedback loop that maintains the logic level at the node P when the clock . INV�

goes to logic level 'LOW'. Similarly, the Slave section consists of a D-Iatch, which is functional on the negative level of the clock and transfers the logic level at intermediate node P to the output node Q. Again, there is a feedback loop that maintains the logic level at node Q while clock is at logic level 'HIGH' state. Figure 2 shows a SET D Flip-flop using pass transistor. Even though it has fewer transistors but there is no much improvement in performance and in power consumption. In the negative edge triggered SETFF (Figure 3) even if the clock is stopped (permanently grounded) the circuit [5] is able to maintain the logic levels, which proves the fact that the this flop is static in nature. The main advantage of the circuit is the reduced transistor count, which is only ten. Hence, it can be used to increase the chip density while maintaining a lower manufacturing cost at the same time.

INVo INV2

INV�

-� � "

elK .elKt .

elK*,

Fig 2. SET Flip-Flop with Pass Transistor

Figure 1. Conventional SET D Flip-Flop with TG

elK

Q

ClK* INVI

N� '

INI'S

INV2

o

Figure 3. SET D Flip-flop proposed in [5]

978-1-61284-653-8/11/$26.00 ©2011 IEEE

Figure 4. Hybrid Flip-Flops

527

Proceedings of 2011 International Conference on Signal Processing, Communication, Computing and Networking Technologies (ICSCCN 2011)

TABLE I.

SIMULATION RESULTS OF SELECTED FLIP-FLOPS WITH TSMC 180NM CMOS TECHNOLOGY AT 1.8V

Tsu

Thold

TcIk-Q

Power

SET (PTL )

16

170.15

333.5

163.6

129.58p

12

190.25

325

360

1.228n

SET Prop

10

152.5

267.5

276

149.980

Hybrid

14

180.9

126.3

42.2

298.48p

Pulsed

8

NA

195.3

145.2

47.69p

#Tr

Flip-flop

SET (TG)

aos

(ps)

(ps)

(ps)

(W)

* T C&.p"., -57.1398 ps and T P""o-Q -87.9 ps

• setup time(ps)

400 300

• hold time(ps)

200 -HI--::I 100 o -t-LA.,.-.......---. ..

• tclk-q (ps) • power

dissipation (pW)

Figure 5. Comparative study of different flip-flops

Another topology is Hybrid FF (Figure 4). It is the fastest FF when compared all other flip-flops. But only disadvantage is that it consumes more power than convention FF. If power is not primary concern then Hybrid FF is the best choice for High Performance applications. The comparison on various metrics of all flip-flops discussed here are shown in Figure 5

III.

PROPOSED

SCAN FLIP-FL op

From Table I, Transmission gate - latch based flip-flops have low power application when speed is not a primary important. So these can serve for low power testing. Hybrid flip-flops are high speed flip-flops with relatively higher power consumption. So these have less scope in low power testing. Pulsed flip-flops are more attracted towards high speed and low power testing due to its negative setup time and due to less area occupied. Based on our evaluation, we desire to use pulsed flip-flop as scanned flip-flop. The proposed pulsed scan flip-flop in Fig. 8 consumes less power, occupies lower area while achieving higher performance compared to conventional muxed scan flip-flop cell (Figure 7).The layout of muxed scan flip-flop and pulsed scan flip-flops are given in Figure 9 and Figure 10 respectively. Traditionally, flip-flops are made up of a master-slave latch, with data being latched at the master and delivered to the slave at the sampling edge of the clock. Such an implementation has a positive setup time and has high figure of merit.

978-1-61284-653-8/11/$26.00 ©2011 IEEE

Figure a/merit

=

TCLK.q+Tsli

(1)

This figure of merit for a flip-flop added with the combinational logic delay determines the operating frequency of a design. The desire to reduce this figure of merit (Tcq+TslI) motivates to develop a pulse based flip-flop. A pulse generator circuit is shown in Figure 8(a). The pulse width can be easily controlled by appropriately sizing the inverters. In Figure 8(b), the latch becomes transparent for the short time duration in which the PULSE signal is high. The pulse is derived from the input clock edge and hence is generated after the clock edge. This allows the data to arrive later than the clock edge, hence making Tsu negative. This fact helps reduce the figure of merit. The pulse generator circuit can be shared across several flip-flops, amortizing its area and power cost. Mux based master-slave scan flip-flop has 28 transistors. This flip-flop is suitable for low power application when speed is not a primary important. So these can serve for low power testing. The proposed Pulsed scan flip-flop requires 12 transistor counts. As pulse occurs after clock triggers, there is no setup violation. The power consumption is less when compared to muxed SFF and can be used for high speed applications. So by using Proposed Pulsed SFF is more efficient for high speed and low power testing applications. POWER REDUCTION METHODOLOGY Power

IV.

dissipation in CMOS circuits consists of two types: dynamic power and static power. Dynamic power dissipation occurs during output switching because of short-circuit current, and charging and discharging of load capacitance, while static power dissipation is caused by leakage current or other current continuously drawn from the power supply. For CMOS technology, dynamic power is the dominant source of power dissipation. Dynamic power dissipation of CMOS circuit can be calculated as: PdynamIC .

=

2"lXCload

X

V�D X f X a

(2)

where C'oad is the load capacitance (including gate input and interconnect capacitances), f is the clock frequency, a is the expected number of output transitions in a cock period, and Vdd is the supply voltage of the circuit. From (2) we can see that, under given C'oad. VDD and f, which are usually decided by given technology and design, the most efficient and easy way to reduce dynamic power is to reduce the value of a, which has linearly correlation with the Pdynamic.

528

Proceedings of 2011 International Conference on Signal Processing, Communication, Computing and Networking Technologies (ICSCCN 2011)

VDD

INV2

Sl

o c:::::> ---+

PULS�

M�I Figure 6. Pulsed Flip-Flop

Figure 7. Mux Based Master-Slave Scan Flip-Flop

fI';'M

�Pll.SE I-f'>Pll.SE-

Figure 8(a). Pulse Generator VDD

3I

Figure 9. Layout ofMuxed scan Flip-Flop

D

Figure 8 (b). Proposed Pulsed scan Flip-Flop Figure 10. Layout ofproposed scan flip-flop

Scan-based testing schemes include two phases: shift and

performance metrics of Muxed Master-Slave scan flip-flop

capture, where capture cycles appear periodically among shift

and Pulsed scan flip-flop is carried at layout using IC Station

We mainly focus on to reduce power dissipation

layout designer tool with TSMC 180nm CMOS Technology

during shifting the sequence of vectors by scan cells, though instantaneous power dissipation during capture cycles are

is given in Table II. The area of Muxed scan FF is 124.859 2 2 um whereas area of pulsed scan FF is 93.774 um . The area

usually higher than that of shift cycles, the duration of shift

reduction of24.09% is achieved.

cycles.

mode is usually much longer than that of capture mode. The

978-1-61284-653-8/11/$26.00 ©2011 IEEE

529

Proceedings of 2011 International Conference on Signal Processing, Communication, Computing and Networking Technologies (ICSCCN 2011)

III-I

By using eldo simulator we have calculated the transition

S

powers (H----+L , and L----+H ). Table III gives the transition power of both scan FFs. The

frequency, rise/fall time and with F04 as load.

#Trans

flop

(3)

i)(bi $ bitl)

The average scan power dissipated during test application is by the

above

formula over all scan chains and all test patterns. The same rules apply to unload transitions observed when scanning out test responses.

The Table V presents the summary of the

Tsu

Thld\

Tclk-q

Power

( ps)

(ps)

(ps)

(pW)

experimental results which show that the proposed pulsed SFF inserted circuit can save �25% of the power when compare to

Muxed

28

145

126.37

181.75

201.44

Pulsed

18

NA

82.28

105.41

57.24

the conventional muxed SDFF.

VI.

T ABLE Ill. TRANSITION POWER MEASUREMENT AT 500MHz, 1.8V

CONCLUSION

The new pulsed SFF is presented in this paper for low

Type of Scan Flip-Flops

Transitions

-

obtained by summing up results provided

PERFORMANCE METRICS OF MUXED SCAN FF AND PROPOSED PULSED SCAN FF

Scan Flip-

2[m(m + l)J;=� (m

Table IV provides the the

performance analysis of various cases like varying the clock

T ABLE II.

=

MuxedScan FF n W

PulsedScan FF n W

OlEI

427.77

281.25

I lEO

213.99

212.50

power VLSI testing. This proposed scan FF not only saving the power and can also operate with higher speed. The area overhead due to pulse generator can be tolerated if it shares with multiple pulsed SDFF. This Pulsed SFF can be included into the standard cell library so that any timing violation

TABLE V.

EXPERIMENTAL RESULTS ON BENCHMARK CIRCUITS

# of

0/0

Testing power(uW)

itself

Power

Test

# of

Circuits

Vecto

Toggle

Muxed SFF

PulsedSFF

Reduc

S298

23

2158

1464.33

1112.66

24.01

S344

13

1379

980.30

740.13

24.4

S386

64

1189

884.38

660.260

25.34

rs

caused by this FF can be solved in the RTL synthesis level

REFERENCES

tion

[1]

Yu Chien-Cheng "Design of Low-Power Double Edge-Triggered Flip­ Flop Circuit "IEEE conference on industrial electronics and applications , vol. 34, No.3, pp.168-173, 2008.

[2]

V. Stojanovic and V. G. Oklobdzija, "Comparative analysis of master­ slave

latches

and

flip-flops

for

high-performance

and

low-power

systems," IEEE J. Solid-State Circuits, vol. 34, pp. 536-548, Apr. 1999.

[3] V.

[4]

Darwish,

and

M.

flip-flops.

Bayoumi.

The

Low

45 th Midwest

power

and

Symposium

high on

The test patterns generated b y

ATALANTA ATPG are applied t o proposed pulsed SDFF

Rasouli, S.H., Khademzadeh, A, Afrali-Kusha, A, and Nourani, M.," Low power single and double edge-triggered flip-flop for high speed applications," lEE Proc.-Circuits Devices Syst., Vol. 152, No. 2, April

pulsed scan flip-flop, experiments are conducted on full scan

2005.

[5]

Manoj sharma, Dr Arti Noor , Shatish Chandra Tiwari, Kunwar Singh ,

"An Area and Power Efficient design of Single EdgeTriggered D-Flip

based scan inserted netlist and muxed SDFF based scan

Flop",

inserted netlist.

2009

International

Conference

on

Advances

in

Recent

Technologies in Communication and Computing.

[6]

Symposium, San Diego, 2008, pp.147-154.

[7]

switching activity when applying test patterns produced b y

Proc. IEEE VLSI Test Symposium (VTS), May 2005, pp. 265-270.

[8]

and DFT Techniques", Proc. International Test Conference, Charlotte,

activity is estimated by the weighted transition metric (WTM)

2004, pp.355-364.

[9]

scan cells, while taking into account their relative positions. =

NBadereddine,

P.Girard,

S.Pravossoudovitch,

AVirazel,

and

C.

Landrault, "Scan Cell Reordering for Peak Power Reduction during Scan Test Cycles", IFIP Internatinal Federation for Information

blb2 ... bm

Processing, Vol. 240, VLSI-SoC:

represent a test vector with bit bk scanned in before bk+ 1. The normalized form of the metric is then defined as follows:

KM.ButIer, J .Saxena, AJain, T .Fryars, J .Lewis, and G.Hetherington, "Minimizing Power Consumption in Scan Testing-Pattern Generation

chains and other parts of the CUT. The resultant switching that counts the number of invoked transitions in successive

X Wen, Y. Yamashita, S. Kajihara, L.-T. Wang, K. K. Saluja, and K. Kinoshita, "On Low-Capture-Power Test Generation for Scan Testing,"

ATPG. Scan shift operations dissipate power which depends directly on the number of transitions that occur in the scan

Y.-Z.Wu, M.C.-T.Chao, "Scan-Chain Reordering for Minimizing Scan­ Shift Power Based on Non-Specified Test Cubes", Proc. VLSI Test

The primary focus of the experiments is to measure the amount of toggling, and to compare shifting power based on

Let m be the length of a scan chain, and T

T.

Circuits and Systems Conf, 11477 - 11480,2002.

In order to validate the effectiveness of the proposed ISCAS89 benchmark circuit.

P. Zhao,

speedexplicit-pulsed

EXPERIMENTAL RESULTS

From Systems

to

Silicon, 2007,

pp.267-28I

[10]

C. Shi and R. Kapur, "How Power Aware Test Improves Reliability and Y ield", IEEDesign.com, September 15, 2004.

[11]

Nicolici,N., A1-Hashimi,B.M.m "Power-Constrained Testing of VLSI Circuits ", Kluwer Academic Publishers, Dordrecht (2003) ISBN140207235-X 2003.

978-1-61284-653-8/11/$26.00 ©2011 IEEE

530

Proceedings of 2011 International Conference on Signal Processing, Communication, Computing and Networking Technologies (ICSCCN 2011)

[12]

[13]

M.

Pedram .

Power

minimization

in

IC

design:

Principles

and

[14]

D. Czysz, M. Kassab, X. Lin, G. Mrugalski, J. Rajski, J. Tyszer, "Low

applications. ACM Transactionson Design Automation of Electronic

Power Scan Shift and Capture in the EDT Environment" Internationa

Systems (TODAES), 1(1):3-56, January 1996.

Test Conference(ITC). 2008

K Roy and S. Prasad. Low-Power CMOS VLSI Circuit Design. John

Wiley & Sons, 2000. TABLE IV .

POWER DISSIPATION OF TGMS SCAN FLIP-FLOP AT 500MHZ AND 250MHz FOR D IFFERENT RISE &F ALL TIMES Rise & fall time

Frequency

Transitons

20% ofTC/k

15% ofTC/k

10% ofTC/k

5% ofTC/k

Peak

Average

Peak

Average

Peak

Average

Peak

Average

Power

Power

Power

Power

Power

Power

Power

Power

Muxed SDFF

OIEI

701.58

561.32

534.37

429.5

549.75

439.80

579.37

468.5

@500MHz

I lEO

426.66

341.33

460.125

368.1

497.71

397.73

613.12

490.5

Pulsed SDFF

OIEI

464.09

278.45

391.95

235.17

402.75

241.65

477.00

286.20

@500MHz

I lEO

742.63

445.57

859.5

512.37

888.75

533.25

962.00

481.50

Muxed SDFF

OIEI

544..33

435.46

536.25

426.60

388.12

296.87

534.37

374.06

@250MHz

I lEO

417.46

313.10

393.75

295.31

308.02

214.37

385.87

293.26

Pulsed SDFF

OIEI

425.06

261.45

441.76

287.14

479.25

277.96

420.75

260.86

@250MHz

I lEO

724.26

579.41

729.65

423.20

684.90

376.69

650.65

390.39

978-1-61284-653-8/11/$26.00 ©2011 IEEE

531

Suggest Documents