Security Camera Processor using One-bit Motion Detection

0 downloads 0 Views 781KB Size Report
power and the LSI consumes 6.34µW under a network environment. ... MAC, which includes S-MAC [3], Low Power Listening. (LPL) [4] and ..... 6.92µW (max.).
A Single-Chip Sensor Node LSI with Synchronous MAC Protocol and Divided Data-Buffer SRAM Takashi Takeuchi, Shintaro Izumi, Takashi Matsuda, Hyeokjong Lee, Toshihiro Konishi, Koh Tsuruda, Yasuhiro Sakai, Hiroshi Kawaguchi, Chikara Ohta, and Masahiko Yoshimoto Kobe University, 1-1 Rokkodai, Nada, Kobe, Hyogo, 657-8501 Japan Abstract — This paper presents an ultra-low-power singlechip sensor-node VLSI with a synchronous MAC protocol and divided data-buffer SRAM for wireless-sensor-network applications. One of the most challenging issues in wireless sensor networks is extension of the overall network lifetime. So a communication centric design approach has been introduced to reduce the power consumption of the RF circuits and the entire sensor network system, through a vertical cooperative design among circuits, architecture, and communication protocols. A transceiver, i8051 microcontroller, and dedicated MAC processor with divided SRAM are integrated in a single chip. The test chip occupies 3.0 x 1.7 mm2 in a 180-nm CMOS process, including 0.63 M transistors. Divided data-buffer reduces 18.6% of average power and the LSI consumes 6.34μW under a network environment. Index Terms — Cross-layer design, sensor network, sensor node, MAC protocol, time synchronization, low power

I. INTRODUCTION Recent advances in micro-sensors, integrated circuits, and wireless communication technologies realize wireless sensor networks (WSNs). Applications of sensor networks comprising numerous such sensor nodes include remote environmental monitoring, smart spaces, military surveillance, and precision agriculture. A WSN consists of many wireless sensor nodes, each of which is driven by a small battery. The sensor nodes obtain environmental information and send it to a base station with a multi-hopping scheme, under the severe energy constraint. As the number of sensor nodes increases to hundreds or to thousands, the persistent necessity of changing batteries would be a considerable burden. For that reason, the most important issues on the WSNs are to extend an available period, say, network lifetime as long as possible. On the other words, it is highly desirable to reduce the power being used by each sensor node. Another issue is low cost realization of the WSNs. Thus a single chip implementation of the sensor node is also necessary. According to Moore’s law, a power of digital portions is scaled down with the progress of process technology. On the other hand, the power consumption of analog RF

978-1-4244-5035-0/09/$26.00 ©2009 IEEE

-202-

circuits will not scale at the same rate. In the entire system of the sensor networks, the power consumption of the RF circuits depends on the amount of its communications. Thus, the communication centric design is strongly requested to reduce the power consumption of the RF circuits. So a sensor node should be operated at a low duty cycle (less than 1%, [1]) to reduce the dynamic RF power consumption, which results in the increase in sleeping time. In the sleep period, sensor nodes have to hold the routing information and observed data from sensor, for instance. Therefore, it is also important to reduce a leakage power of buffer SRAM during the sleep period. In this paper, we propose an ultralow-power sensor node processor using an Isochronous-MAC (I-MAC) protocol [2] and a divided data-buffer SRAM architecture, which reduces both the dynamic power during the operating period and the leakage power during the sleep period. The rest of this paper is organized as follows: The communication protocol and its implementation are explained in Sect. II. The architecture of proposed databuffer SRAM is addressed in Sect. III. Section IV and V describe VLSI implementation and experimental results. These are followed by conclusion in Sect. VI. II. COMMUNICATION CENTRIC DESIGN To achieve communication centric design, the cross layer design between the hardware (node processor architecture) and the algorithm (communication protocol) is indispensable. In this work, we focus on MAC protocol which is a communication protocol of the data link layer. While a node communicates, MAC protocol is always executed since the function of MAC protocol is establishing communication links for data transfer. The active time of the RF circuit mainly depends on the MAC protocol. Here, we introduced an Isochronous-MAC (I-MAC) protocol [2], which has a periodic wakeup time synchronized with the Flooding Time Synchronization

ISOCC 2009

Protocol (FTSP). I-MAC reduces the active time of the RF circuits significantly resulting low power characteristics. A. Isochronous-MAC (I-MAC) An effective way to reduce the energy in MAC is shorten the idle listening, in which the receiver is activated even when no packet is received. To reduce the power of this idle listening, a type of MAC named Cycled Receiver MAC, which includes S-MAC [3], Low Power Listening (LPL) [4] and WiseMAC [5], has been developed. Using Cycled Receiver MAC, each node enters a receiving mode only during a specific wakeup duration time that occurs in every wakeup period. Reducing the duty cycle ratio, the power used for idle listening is also decreasing. In general, with a cycled receiver MAC, the longer the wakeup period, the longer the delay time for connection establishment. Therefore, under the condition of the same duty cycle ratio, the shorter wakeup period, the more advantageous it can be in terms of the delayed time. The proposed Isochronous-MAC (I-MAC) [2][6] is based on LPL that has a periodic wakeup time. Since a sender can predicts a next wakeup time of an intended receiver with high accuracy, we can minimize the duration of a preamble on the sender. As well as on the receiver side, the receiving time for the preamble can be reduced thanks to the short duration of the preamble. I-MAC

TICER Sender Sleep

Ton Tl

Sender Tl

Tl Data

Tl

Preamble Sleep

Ton

B. Time synchronization In [2][6], the time on sensor nodes are adjusted to the actual time using the long-wave standard time code. However, external hardware is required for this method, and it does not work indoors. In contrast, packet exchanging methods do not require external hardware or signal, although a communication overhead is required. Flooding Time Synchronization Protocol (FTSP) [9] is one of the packet exchanging method and it correct the time by using time stamp packet communication. As the time synchronous technique of the proposal sensor node, we chose FTSP in consideration of the cost of hardware and power overhead. Figure 2 shows the synchronization method of the FTSP algorithm. First, the synchronous packet is flooding from the root node. At this time, the time stamp of T1 is send to the receiver node. Next, the receiver node records the time stamp of T2 at the end of the synchronous packet. Then the propagation delay is calculated from T2 - T1. However, note that, until the synchronous packet is recognized by a system, there is a time of Tb as a byte alignment time. This can be calculated by the data rate, and after that, the time error is corrected with T2 - T1 - Tb in the receiver node. The time lag among sensor nodes is suppressed within 250 μ seconds, when the nodes synchronize 50 times, in a day. From [2], time lag of 250 μ seconds is sufficiently small value in order to operate IMAC.

Data

T1

RTS Receiver Ton

Receiver

CTS T

Ton

Ton

T

Sender

preamble sync

T1

data

Ton

RX T: Wake-up period Ton: Carrier sense time Tl: CTS waiting time RTS: Request to send CTS: Clear to send

TX data

Activation ratio of sensor node

Propagation delay TX preamble

1

T2

TICER I-MAC

0.1

Receiver

preamble sync

T1

data

0.01

Byte alignment time (Tb)

84.7 % reduction at T = 100 ms

0.001

0.0001 0

0.5 1 1.5 Wake-up period; T [sec.]

2

T1

T2 Value of time error :

θ = T2 - T1 - Tb

Fig. 1 Comparison of the TICER and the I-MAC. In [7], TICER [8] is implemented as a MAC, in which a sender node periodically repeats wakeups and sleeps in a preamble operation when establishing communication with receiver nodes (Fig. 1). In the figure, T is a receiver’s wakeup period, and Ton is a carrier sense time; as T becomes larger, the power of TICER becomes larger, since a number of preambles must be tried. Refer to [2] for more detail on the I-MAC.

-203-

Fig. 2 Synchronization method of the FTSP. In addition to the above, there are two other reasons why we chose the FTSP for our system. The first reason is that the I-MAC does not need to synchronize the time of the entire network. Only by synchronizing the time between a node and its one-hop

ISOCC 2009

neighbor, the I-MAC can organize communication. Moreover, although the power consumption of the I-MAC is dependent on a preamble length, its length can be determined only by the time drift over a one-hop neighbor [2][6]. The second reason is that we assume data collection type application in this research. In order to collect sensing data, it is necessary to construct routes from a base station to each node in the network. In many routing protocols, the route is constructed by using flooding. (e.g. Directed Diffusion [10] and Tiny Diffusion [11]). Therefore, since the time synchronization by using FTSP and the construction of a route can be executed simultaneously, the flooding operation does not become overhead. Fig. 3 Data-buffer SRAM divided with equal ratio manner III. DIVIDED DATA-BURRER SRAM Conventional sensor nodes primarily use either SRAM or flash memory for data storage. Both types of memory have their advantages and disadvantages. SRAM, for example, is easily implemented as one chip, it has low cost, and its active power consumption is small. However, SRAM must be continually driven to retain data, and, as a result, its leak current is significant. Flash memory, in contrast, consumes no leak current while the sensor node is sleeping, but the cost of making a one-chip sensor node with flash memory is higher than that with SRAM. In addition, the active power consumption of flash memory is high, and it cannot provide high-speed memory access. Moreover, flash memory has a finite number of erase-write cycles which does not suit its application to the packet buffer, which must be written and read many times. Therefore, this paper considers an energy-efficient SRAM architecture that can be implemented in a one-chip sensor node at low cost. To reduce the leakage power in the sleep period, we propose divided data-buffer SRAM, which is based on that leak current of SRAM increases in proportion to SRAM capacity. Figure 3 shows the architecture of divided data-buffer SRAM. Here, we assume 64 kbits of memory capacity. The data-buffer SRAM is divided by same ratio, and memory controller manages the power gating for each memory block. Memory controller extracts the data size from a header of received data and it provides a proper combination of memory block with “power on” signal, which minimizes leakage current in SRAM portion. The memory controller is activated only when the memory is accessed; therefore, the power overhead of the memory controller is quite small.

-204-

To implement the divided data-buffer SRAM, several registers must be added to memorize the states of the blocks. The area overhead and power overhead for the registers are, however, negligible because their number is much smaller than that of SRAM cells. Other than the registers, a pMOS switch is necessary to power on and off each block; the divided data-buffer SRAM has a 10% area overhead for the pMOS switch compared to the conventional SRAM, when the number of memory blocks is eight. The pMOS switch consumes 0.423 pJ for switching, so that its power becomes as small as 0.423 pW even if the switch is turned on and off once per second. The area and power overheads were estimated by using the process model of CMOS 180-nm. IV. IMPLEMENTATION In the conventional sensor nodes [12][13][14], all of the communication protocol (e.g. MAC layer, network layer, and application layer) is processed by micro controller (Fig. 4 (a)). Particularly, the micro controller is dealing with the process of the MAC layer in most of the time, because the sensor networks usually employ a simple transmission method and a lower data rate to reduce power consumption of the RF circuits. Hence the processing power during data communication becomes relatively high. To overcome the above problem, we propose a communication control processor which consists of a MAC processor, i8051 microcontroller, and data memory (Fig. 4 (b)). The MAC processor is a dedicated hardware for the communication processing in the I-MAC. Then i8051 deals with only upper layers than the MAC layer (e.g. time synchronization, rooting, and sensor input).

ISOCC 2009

(a) Conventional RF

Micro processor

Data-buffer SRAM

(b) Proposal Communication control processor

RF

MAC processor

Micro processor Memory controller Data-buffer SRAM

-205-

3 mm Power management module TX/RX SW

CPU (i8051)

TX

1.7 mm

Memory controller

MAC processor

RX (digital part)

The overall block diagram of the sensor node LSI is shown in Fig.5. The proposed sensor node has a power management module which controls the power gating for each function blocks, based on the state transition of MAC processor. The state of the power supply in each functional block depends on the state transition of I-MAC. For this reason, the sequencer specified for power supply management has been designed so that efficient power supply management can be realized. The power management module consists of a clock manager, timer, and MAC state register. The state of the MAC processor is stored in the MAC-state register. According to this state register, a clock and a power supply of each block are cut off. The timer is operated at 32.768 kHz and counts up the internal time. In the sleep state, only the timer is activated. By receiving an interrupt signal from the timer, the node moves to the RX state. A 433MHz RF transceiver [15] is also implemented in the same chip. The transceiver is comprised of transmitter (TX), receiver (RX), and TX/RX switch. The RX is based on a low-IF architecture and it contains an image rejection filter. Figure 6 shows the chip photomicrograph. The VLSI is comprised of a transceiver (TX, RX, PLL), i8051 micro controller, power management module, dedicated MAC processor, and divided data-buffer SRAM for data storage. The test chip was fabricated in a 180-nm CMOS process and the area is 3.0 x 1.7 mm2.

Fig. 5 Block diagram of sensor node LSI

Data RAM (Divided SRAM) 64k bit

Fig. 4 Communication processing method of a conventional sensor node (a), and proposed sensor node (b).

RX (analog part) ROM

Test circuit

Fig. 6 Chip photomicrograph. V. EXPERIMENTAL RESULT Figure 7 shows a leakage power of divided data-buffer SRAM. The leak current during static data-hold period is linearly increases as memory capacity increases. According to the memory block size, the leakage power is varied from 1.29μW to 2.73μW. Table 1 summarizes the power consumption of each functional block and total power for every operating mode. The power of each functional block is measured value and the total power in each state is also summarized. We evaluated the average energy in data-gathering operation on a network level using network simulator: QualNet [16]. In the network simulation, the wake-up period is 100 ms, and the data rate is 20 kbps. Randomlydeployed 100 nodes in the area of 100 x 100 m2 send data to a base station at the center. The interval of data gathering is 60 minutes; means 24 times per day. The operation of a sensor node assumed on this network simulation is illustrated in Fig. 8. Sensor nodes are sleeping at most of the data gathering period by I-MAC

ISOCC 2009

Leakage power [μW]

protocol. In our previous work [15], the average power during data gathering time was 58.0μW. This time, the average power during data gathering time is reduced to 44.2μW using divided-data buffer SRAM.

Sleeping

3 Data gathering period (Avg. 84.8 sec., 44.2 μW)

2 1

Wake-up period; T (100 msec.)

Gathering interval (60 min., 6.34 μW)

0

Fig. 8 Operation of sensor node on network.

Block size (kbit)

32 16 8 4 2 1 0.5 0.5 Total size (kbit)

ON ON ON ON ON ON ON ON

OFF ON ON ON ON ON ON ON

OFF OFF ON ON ON ON ON ON

OFF OFF OFF ON ON ON ON ON

OFF OFF OFF OFF ON ON ON ON

64

32

16

8

4

OFF OFF OFF OFF OFF ON ON ON 2

OFF OFF OFF OFF OFF OFF ON ON

OFF OFF OFF OFF OFF OFF OFF ON

1

0.5

Table 2. Average operation time of sensor nodes in a day. Gathering interval [min.] Gathering times per day TX active time [sec.] RX active time [sec.] Network processing time [sec.] Sleep time [sec.] Sleep time [hour] Average power with divided SRAM [uW] Average power with normal SRAM[uW]

Fig. 7 Leakage power of divided data-buffer SRAM

Table 1. Power consumption of each functional block and total power for every operating mode Dynamic [μW]

Leak [nW]

State of a sensor node Sleep

Transmitting

Receiving

Network processing

TX

4260

-

OFF

ON

OFF

OFF

RX

7214

-

OFF

OFF

ON

OFF

PLL

3280

-

OFF

ON

ON

OFF

MAC Processor

11.7

-

OFF

ON

ON

ON

Memory controller

14.4

88.3

clk gating

ON

ON

ON

ON

OFF

OFF

ON

read

265

write

280

1287 to 2732 (depend on operating block size)

clk gating

ON (r/w)

i8051

787

-

OFF

OFF

OFF

ON

Power management

4.53

-

OFF

ON

ON

ON

Power management (operating with 32.768kHz crystal)

0.485

-

ON

OFF

OFF

OFF

5.46μW (min.) 6.92μW (max.)

10.72mW

13.69mW

3.97mW

Total power

60 24 0.54 5.12 0.11 86394.22 23.998 6.34 7.79

10 Average power of sensor node [uW] μ

Block power Block name

Data RAM (proposed)

I-MAC operation

Wake-up period; T (100 msec.)

Sleep Network processing RX TX

8

6

18.6% reduction

4

2

0 Normal SRAM

The average operation time of each component in a day is shown in Table 2. It is estimated that the sensor node operates at 7.79μW of one-day average power supposing the normal SRAM buffer. A comparison of the average power between normal SRAM and divided data-buffer SRAM is shown in Fig. 9. Divided SRAM can reduce the average power of 18.6 %, which realizes 6.34μW of oneday average power.

-206-

Divided SRAM

Fig. 9 Comparison of the average power between normal SRAM and divided data-buffer SRAM. Figure 10 shows the effect of the power reduction by the divided data-buffer which depends on the number of data gathering times per day. The average power in a day is also indicated in Fig.10. This result shows, the reduction ratio of leakage power is decreased when the number of data gathering time per day is increasing. It is because sleep time is decreasing. Therefore, this divided databuffer SRAM architecture is suitable for the application which gathers the data less frequently.

ISOCC 2009

Sleep Network processing RX TX

12

REFERENCES

25

[1] 20

10 8

15

6

10

4 5 2

Power reduction ratio by divided SRAM [%]

Average power in a day [μW]

14

[2]

[3]

[4] 0

0 1

12

24

48

72

144

[5]

The number of data gathering per day

Fig. 10 Average power in a day and power reduction ratio. [6]

VI. CONCLUSION We propose a single chip sensor node processor with communication centric design. The sensor node is comprised of a transceiver, i8051 micro controller, power management module, dedicated MAC processor, and divided data-buffer SRAM. The test chip occupies 3 x 3 mm2 in a 180-nm CMOS process, including 0.63 M transistors. The sensor node LSI operates at 6.34μW of one-day average power, supposing 24 of gathering times per day. I-MAC and FTSP reduce the operation time of RF, and make up long sleep time. Power management module reflects I-MAC state transition and decrease the active power. Moreover, divided data-buffer SRAM architecture reduces the power of sleep state. It is estimated that 18.6% of average power is reduced by divided data-buffer SRAM. This architecture is useful for long gathering interval applications.

[7]

[8]

[9]

[10]

[11]

[12]

ACKNOWLEDGMENT

[13]

This research was partially supported by the Strategic Information and Communications R&D Promotion Programme (SCOPE), the Ministry of Internal Affairs and Communications, Japan, and also supported by the Grantin-Aid for JSPS Fellows, No. 21000333, the Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan. This research was also supported by VLSI Design and Education Center (VDEC), the University of Tokyo in collaboration with Synopsys, incorporated, Cadence Design Systems, incorporated, Mentor Graphics, incorporated, and Agilent Technologies Japan, Ltd.

-207-

[14]

[15]

[16]

“http://panasonic.co.jp/corp/news/official.data/data.dir/jn0505191/jn050519-1.html”. M. Ichien, T. Takeuchi, S. Mikami, H. Kawaguchi, C. Ohta and M. Yoshimoto, “Isochronous MAC using Long-Wave Standard Time Code for Wireless Sensor Networks,” Proceedings of International Conference on Communications and Electronics, pp. 172-177, Oct. 2006. W.Ye, J.Heidemann and D.Estrin, “An Energy-Efficient MAC Protocol for Wireless Sensor Networks,” Proc. of the 21st International Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM 2002), June 2002. W.Ye, J.Heidemann and D.Estrin, “Medium Access Control With Coordinated Adaptive Sleeping for Wireless Sensor Networks,” In IEEE Transactions on Networking, April 2004. J. Elson, L. Girod and D. Estrin: “Fine-Grained Network Time Synchronization using Reference Broadcasts,” Proceedings of the 5th Symposium on Operating Systems Design and Implementation (OSDI’02), Boston, Massachusetts, 2002. T. Takeuchi, Y. Otake, M. Ichien, A. Gion, H. Kawaguchi, C. Ohta and M. Yoshimoto, “Cross-Layer Design for Low-Power Wireless Sensor Node Using Wave Clock,” IEICE Transactions on Communications, vol. E91-B, no. 11, pp. 3480-3488, Nov. 2008. M. Sheets, F. Burghardt, T. Karalar, J. Ammer, Y. H. Chee, J. Rabaey and A. Functionality, “A Power-Managed Protocol Processor for Wireless Sensor Networks,” Digest of Technical Papers 2006 Symposium on VLSI Circuits, pp. 212-213, June. 2006. En-Yi A.Lin, J. M. Rabaey and A. Wolisz, “Power-Efficient Rendez-vous Schemes for Dense Wireless Sensor Networks,” Proceedings of IEEE International Conference (ICC), vol.7, pp. 3769-3776, June. 2004. S. Ganeriwal, R. Kumar and M. B. Srivastava: “Timing-sync Protocol for Sensor Networks,” Proceedings of the 1st ACM Conference on Embedded Network Sensor Systems (SenSys), pp. 138–149, 2003. C. Intanagonwiwat, R. Godinvan and D. Estrin, “Directed diffusion for wireless sensor networking,” IEEE/ACM Transactions on Net-working, Vol. 11, No. 1, pp. 2–16, 2003. J. Heidemann, F. Silva and D. Estrin, “Matching data dissemination algorithm to application requirements,” Proceedings of the 1st ACM Conference on Embedded Network Sensor Systems (SenSys’03), pp. 218–229, 2003. J. Hill and D. Culler, “Mica: a wireless platform for deeply embedded networks,” IEEE Micro, pp. 12-24, 2002. J. Rabaey, J. Ammer, T. Karalar, S. Li, B. Otis, M. Sheets and T. Tuan, “PicoRadios for Wireless Sensor Networks: The Next Challenge in Ultra-Low-Power Design,” Proceedings of International Solid-State Circuits Conference (ISSCC), 2002. N. Yamauchi, I. Urushibara, A. Aizawa, H. Sato, H. Hosaka, K. Sasaki and K. Itao, “Nature interfacer version 3 (Ni3): A wearable wireless sensor module with exible protocol con gurability for ubiquitous sensor networks,” Proceedings of 1st International Workshop on Networked Sensing Systems (INSS), 2004. T. Takeuchi, S. Izumi, T. Matsuda, H. Lee, Y. Otake, T. Konishi, K. Tsuruda, Y. Sakai, H. Fujiwara, C. Ohta, H. Kawaguchi and M. Yoshimoto, “58-μW Single-Chip Sensor Node Processor Using Synchronous MAC Protocol,” Digest of Technical Papers 2009 Symposium on VLSI Circuits, pp. 290-291, June. 2009. “http://www.qualnet.com/”, Scalable Network Technologies.

ISOCC 2009

Suggest Documents