Nothing Special   »   [go: up one dir, main page]

Academia.eduAcademia.edu

vlsi design

INSTITUTE OF PHYSICS PUBLISHING SMART MATERIALS AND STRUCTURES Smart Mater. Struct. 14 (2005) 569–574 doi:10.1088/0964-1726/14/4/015 Dynamic bootstrapped shift register for smart sensor arrays Leo Lee, Said Al-Sarawi and Derek Abbott Centre for Biomedical Engineering (CBME) and The Centre for High Performance Integrated Technologies and Systems (CHiPTec) and Department of Electrical and Electronic Engineering, The University of Adelaide, SA 5005, Australia Received 24 September 2003, in final form 27 September 2004 Published 4 May 2005 Online at stacks.iop.org/SMS/14/569 Abstract In this paper we demonstrate the operation of a dynamic serial-to-parallel shift register, with only four transistors per stage. A bootstrap capacitor is used to overcome the problem of transistor threshold voltage drop. We will refer to this new logic family as non-ratioed bootstrap logic (NRBL). Simulation results are presented showing the operation of the shift register along with Monte Carlo analysis to demonstrate the robustness of the circuit. A key application area for this novel shift register is in the addressing and read out of high-density smart sensor arrays. 1. Introduction The shift register is perhaps the most common structure for enabling the movement of a data bit in VLSI systems [1]. Shift registers have many applications ranging from producing time delay by using serial-in serial-out shift registers, or converting serial data to parallel data using a serial-in parallel-out shift register. Serial-in parallel-out shift registers are commonly used in image sensors to control the analogue multiplexer, which transmits data from the photodiode pixel to the A/D converter where they are then processed into an image, as shown in figure 1. There has been recent interest in using capacitors to reduce the size of VLSI circuits [2]. Here we present a different capacitive approach to produce a very compact serial– parallel shift register. We demonstrate functionality with four transistors per stage and no DC power rails. There are a number of applications where both low power and compactness are required. These include dense optical smart sensors [3] and gate control of solid-state quantum computers [4]. A special feature of this shift register is that the output amplitude can be externally controlled by adjusting the height of the two-phase non-overlapping clock; this feature is important for these target applications. As shown in figure 2, each stage of the low power serialto-parallel dynamic shift register consists of four non-ratioed minimum sized n-channel transistors and can be implemented in any standard CMOS process. A bootstrapping capacitor is used to overcome the threshold voltage drop of the nchannel pass transistor. The register has no DC power supply and is driven entirely by the two-phase clock, leading to low 0964-1726/05/040569+06$30.00 © 2005 IOP Publishing Ltd power dissipation. The clocks should be non-overlapping, and if this requirement is not met, the register will not perform correctly [5]. Non-overlapping clocks are advantageous as fixed pattern noise (fpn), due to capacitive clock feedthrough, can be more easily controlled [6]. Power consumption and dissipation is an important parameter in modern VLSI design, as more electronic devices increasingly become portable [7]. Power consumption therefore needs to be minimized, keeping the battery size practical for portable electronic devices. 2. Operation of the shift register With reference to figure 2, the operation of the circuit is as follows. The drain of each enable transistor is pulsed every clock period. The output of each shift element remains low until a high is enabled by the previous stage or the serial I/P. The output is charged up by the clock pulse when a high appears at the gate of the enable transistor. The output pulse does not drop, by the threshold voltage Vt of the n-transistor, due to the bootstrap action of the capacitor Cb . Moreover, Cb pulls up the enable transistor by Vboot , holding it at Vclk + Vboot , Vboot = Cp Vclk Cb + Cp (1) where Cp is the parasitic capacitance to ground at the input and Vclk is the clock high level. As the output is charged up to Vclk , Vclk − Vt appears at the gate of the reset transistor and the following enable transistor via the pass transistor. As Clk 1 starts going low, the output drops accordingly. When the output is 2Vt below VClk , the reset transistor turns on and Printed in the UK 569 L Lee et al Figure 1. Photodiode array. The row select register is required to drive the gates of the photodiode address transistors on each row, therefore it has a larger load to drive than the column select register, which is only required to drive the gate of one transistor in the analogue multiplexer. Output Reset Shift in Shift out Cb Pass As the circuit is driven by the two non-overlapping clocks, to ensure the circuit functions as designed, the minimum nonoverlapping period of the clocks should be equal to the fall time of the current output stage and the rise time of the following output stage. Furthermore, to ensure the circuit functions, the input pulse should be pulsed in while Clk 2 is high, to charge the bootstrap capacitor. 3. Structure of the shift register Discharge Cp Enable Clk1 Clk2 Figure 2. One stage of the shift register, with only four transistors and a capacitor Cb to provide bootstrapping action to overcome the MOSFET threshold voltage drop in the enable transistor. The register has no DC supply and is driven entirely by two non-overlapping clocks. resets the bootstrap capacitor. This causes the enable transistor to turn off when the output is at this level. When the next clock phase comes on (Clk 2), the discharge transistor turns on and completes the discharge of the output. The bootstrap capacitor of the next stage is not charged as the gate of the pass transistor is now at a lower potential than its source. The same cycle repeats for the subsequent stages of the shift register. Note that the connection of the two clocks alternates between each stage of the shift register. A summary of the functions of each transistor is as follows. • Enable transistor: connects clock to output when enabled by the previous stage. • Reset transistor: resets the bootstrap capacitor. • Pass transistor: (a) passes initial current to charge up the bootstrap capacitor of the next stage, (b) when output discharges, it turns off so that the bootstrap capacitor of the next stage does not discharge, (c) provides a threshold voltage drop so that the reset transistor is kept hard off until the output potential drops. 570 Each stage of the shift register is non-inverting and consists of four transistors with an extra transistor at the input and output of the shift register, as shown in figure 3. The added transistor at the input stage is required to clock the serial input pulse. The added transistor at the serial output stage is required to discharge the last bootstrap capacitor. The layout of the shift register, shown in figure 4, displays four stages of the shift register. The two clock lines were designed with an interdigitated structure such that the contacts for both clock lines are at the same level in the layout. This symmetry is important as it balances the parasitic capacitance on the clock lines. This allows us to have the same bootstrap capacitance value for each stage and also allows easier clock buffer designs. Furthermore, if the shift register is used to address an imaging array, as in figure 1, the balanced clock lines will minimize odd/even type fixed-pattern noise on the output. Note that the bootstrap capacitance is formed by a gate oxide layer sandwiched between poly and diffusion layers. So there is the question of which way around to place the capacitor. Figure 4 shows the poly terminal connected to the gate of the enable transistor and the diffusion terminal connected to an output stage. This orientation has two advantages: (a) it reduces the number of contact cuts by maintaining contiguous layers and (b) the diffusion is kept away from the sensitive node at the gate of the enable transistor (this is important for applications where the shift register might possibly be exposed to light). 4. Bootstrap analysis Here we show that a first-order circuit analysis of the bootstrapped enabled transistor is fairly straightforward and Dynamic bootstrapped shift register for smart sensor arrays Output: Serial I/P Clk1 Clk2 Figure 3. Multiple stages of the NRBL shift register, driven by two non-overlapping clocks. The shift register also requires two extra transistors, one before the first stage for the serial input, and one at the last stage to discharge the last bootstrap capacitance. Note the alternate clock connections between the stages. O/P Clock Phase2 metal Clock Phase1 n-diffusion VS Cboot CL ib polysilicon S G Discharge Transistor ids D Enable Transistor Cp Bootstrap Capacitor VClk 0V t=0 Reset and Pass Transistor Slot Output 0 Output 1 Output 2 Figure 5. Circuit analysis of the enable transistor, when it receives a rising clock edge. Equations (2) and (4) give Output 3 Cp V˙G = Cboot (V˙G − V˙S ) Figure 4. Layout of the dynamic shift register in a standard 0.25 µm CMOS process, where only n-channel transistors are required. Four shift stages are shown and each stage occupies a pitch of only 2.04 µm. (This figure is in colour only in the electronic version) ∴ V˙S = γ V˙G roughly agrees with SPICE simulations. We begin the analysis by considering the rising clock edge in figure 5. Now, VG = Vclk + Vboot Cp . Cp +Cboot where Vboot = ηVclk and η = At t → ∞, VGS = VG −VS = (Vclk +Vboot )−Vclk = Vboot . Due to the bootstrapping action, VS follows VD and so VDS is small. This is maintained because η is designed such that Vboot  Vt to keep the transistor on. As saturation is defined when VDS > (VGS − Vt ), we can assume nonsaturation due to the small VDS and hence we must use V2 i ds = K {(VGS − Vt )VDS − 2DS }. The circuit equations are: i b = Cp V˙G (2) i ds − i b = C L V˙S (3) i b = Cboot (V˙G − V˙S ). (4) Assuming VDS is very small, we can make the following linear approximation: g(VG − VS − Vt ) = i ds , and choose a constant clk . g = K ×V 2 where γ = 1 − Cp . Cboot Now, on substituting equation (2) and the linear approximation of i ds into equation (3), we obtain g(VG − VS − Vt ) − Cp V˙G = C L V˙S . Then differentiating with respect to t gives C L V̈S + Cp V¨G = g V˙G − g V˙S . Substituting V˙S = γ V˙G gives γ C L V¨G + Cp V¨G = g V˙G − γ g V˙G , g − γg ˙ VG . ∴ V¨G = γ C L + Cp Now, solving this second-order differential equation gives the gC circuit time constant τ as τ1 = CL Cboot −CL Cp p +Cp Cboot .   Cboot C L CL ∴τ = − +1 . (5) g Cp Cboot From equation (5) the expected operating frequency of the shift register can be calculated using f = 2π1 τ . The calculated frequency was 116 MHz, at an example 128 transistor load with bootstrap capacitance of 15.46 fF. This frequency is slightly higher than the 100 MHz frequency the circuit operates at. This could be due to g being slightly overestimated, as we picked a mid-point value rather than calculating an integrated average value. Finding the integrated average value of g gives  Vclk 1 K Vclk g= K (VGS − Vt ) dVGS = − K Vt . Vclk 0 2 571 L Lee et al Clock (V) 3 2 1 0 0 10 20 30 40 50 60 70 80 0 10 20 30 40 50 60 70 80 0 10 20 30 40 TIME (ns) 50 60 70 80 Input (V) 3 2 1 0 3 Output (V) 2 1 0 –1 Figure 6. Simulation results of NRBL shift register. The top graph shows the 2.5 V non-overlapping clock pulses at 100 MHz after passing through the clock distribution circuit. The second graph shows a 2.5 V input pulse when Clk 2 is pulsed. The last graph shows the operation of ten shift register stages, with a bootstrap capacitance of 15.46 fF, and parasitic capacitance of 3 fF. The register drives a load of 128 minimum size transistors for 0.25 µm technology. Using this integrated average value g the calculated frequency with 128 transistor load and bootstrap capacitance of 15.46 fF is 106 MHz. This is closer to the 100 MHz clock frequency we expect the shift register to operate at, by simulation, as shown in the following sections. 5. Simulation Simulation of the dynamic shift register was performed with HSPICE using an industrial 0.25 µm CMOS process, using Level 49 model parameters with a 2.5 V clock voltage. The value of the bootstrap capacitance is determined by the value of Vboot required as shown in equation (1), and it was chosen to be 15.46 fF. Using a bootstrap capacitance value of 15.46 fF, η is calculated to be 0.185; therefore η × 2.5 = 0.46 V. This provides the necessary voltage to overcome the threshold voltage drop 0.41 V of the transistor. The shift register was simulated with a load inverter with size equivalent to 128 minimum size load transistors. Figure 6 shows the operation of the shift register, with two non-overlapping clocks at 100 MHz. When the input pulse is clocked at the serial I/P, the output pulses of each stage are as shown in figure 6. 5.1. Distributed clock tree When one counts up all the capacitances in the registers of a large CMOS design, it often leads to large power dissipation when driven in a small time and at high repetition rate [8]. Long clock lines will also introduce substantial resistance [9]. This introduces an RC delay in the clock lines. To make the simulations more realistic a distributed clock tree is used to distribute the capacitance required to be driven and to also minimize clock skew. A cascade inverter design was used to size the inverter stages of the clock tree. In order to minimize 572 the total delay a width factor of e (base of natural logarithms) is used. 5.2. Power dissipation The power dissipation of the dynamic shift register is minimal as only one stage is on at a time. The total power dissipation is therefore mainly due to the charging and discharging of the capacitance that each stage drives, C L . This is equal to 2 f , where f is the clock frequency. From figure 7 C L Vclk it can be observed that the power dissipation of the circuit varies linearly with the clock frequency. The calculated power dissipation at 100 MHz with the load is 0.29 mW; however, this does not take account of the power dissipation due to the distributed clock circuit. Figure 7 shows simulated results of power dissipation versus varying voltage of operation and power dissipation versus varying frequency of operation, with a load of 128 minimum sized transistors. Table 1 shows a comparison of the NRBL shift register with two other shift registers: a standard dynamic shift register [1] and Shibata’s shift register [10]. The standard dynamic shift register is a simple pass transistor and inverter arrangement; however, two stages are required for a noninverting output. The shift register proposed by Shibata uses a D flip-flop arrangement. From table 1 it can been seen that the NRBL shift register has the lowest transistor count and a low power dissipation. 5.3. Temperature simulation The temperature dependence of transistors on the shift register performance must be considered. The simulations were performed using the industrially specified temperature range of −40 to 85 ◦ C. The simulations were performed with a 2.5 V Dynamic bootstrapped shift register for smart sensor arrays 3.4 100 3.2 frequency 60 3 Voltage 40 2.8 20 2.6 0 0 1 2 3 4 5 Voltage (V) Frequency (MHz) 80 Table 1. Comparison of different shift register designs. Note that the NRBL shift register case contains only four transistors, no DC power supply and a low power dissipation. The rail voltage for the NRBL shift register is indicated by (—) in the table, as it is driven entirely by the 2.5 V clock. The power dissipation value obtained for the NRBL shift register was under a load of 128 minimum sized transistors, at 100 MHz and a clock voltage of 2.5 V. The simulations were performed using level 49 industrial 0.25 µm CMOS technology. The dynamic shift register in column two shows the simulated results, with the same loading value, frequency and clock voltage of the NRBL shift register. Care should be taken when comparing with Shibata’s shift register, as it uses a 0.6 µm CMOS process with 3.3 V power supply and a mean capacitance load of 0.04 pF. This capacitive load is about 0.01 pF less than for the first two cases. 2.4 6 Power Dissipation (mV) Figure 7. Simulation results of frequency versus power dissipation on the left y-axis and clock voltage versus power dissipation on the right y-axis. Both simulations presented in this diagram were simulated with 128 transistor loading. The frequency plot with clock voltage of 2.5 V indicates that the power dissipation of one stage of the shift register varies linearly with the clock frequency, as expected from the power dissipation formula in section 5.2. The voltage plot shows the power dissipation of one stage of the shift register as the clock voltage is varied with clock frequency of 100 MHz. At a clock voltage of 2.5 V the power dissipation is 1.75 mV. clock voltage and a frequency of 100 MHz under 128 minimum sized transistor loading. From the simulations, the shift register was found to be functioning correctly within the specified temperature range. Even though the circuit was shown to work within the industrially specified temperature range, the temperature range was further increased in later simulations. Simulations showed that the circuit would still function correctly for a temperature range of −75 to 180 ◦ C, which demonstrates the robustness of the shift register to temperature variations. This temperature range is greater than the standard military specified temperature range of −55 to 125 ◦ C. 5.4. Process variation simulations The CMOS fabrication process is a long sequence of chemical steps that result in device characteristics that follow a normal or Gaussian distribution [8]. Since the dynamic shift register discussed in this paper only consists of n-type transistors, only the fast and slow parameters are considered, as skew parameters are not relevant. From the simulated results the shift register was shown to be functioning correctly when simulated with one sigma process variations. The simulated outputs are similar to those shown in figure 6. Temperature simulation was repeated for process variation simulations. The simulated results showed the circuit functioning as designed for the extended temperature range as described in section 5.3. 5.5. Monte Carlo analysis The design goal of this shift register is to minimize power dissipation and also area. The bootstrap capacitor is therefore designed to provide enough bootstrapping to overcome the threshold drop of the transistor, but at the same time keeping the area to a minimum. A Monte Carlo analysis of threshold voltages of the transistors in the circuit was performed to Max speed (MHz) Technology (µm) Transistor/stage No. power supplies Rail voltage (V) Power dissipation per stage (mW) NRBL shift reg Dynamic shift reg [1] Shibata’s shift reg [10] 100 0.25 4 0 — 1.75 100 0.25 6 1 2.5 2.98 100 0.6 D-FF(∼15) 1 3.3 1.5 analyse the overall robustness of the shift register and its sensitivity to process parameters. The Monte Carlo analysis was performed with a Gaussian distribution with a standard deviation of 10 mV in threshold voltage. The 10 mV standard deviation was chosen from the relation between transistor area and threshold voltage mismatch, given in the manufacturer’s data sheets. This voltage is added to the threshold voltage as calculated by the HSPICE simulator. The simulations for the Monte Carlo analysis were performed with a clock voltage of 2.5 V, clock frequency of 100 MHz and with a load of 128 transistors. The simulated mean rise time is 1.64 ns with a standard deviation of 90.54 ps. The simulated mean fall time of the circuit is 0.300 ns with a standard deviation of 3.62 ps. Compared with the rise and fall time of the circuit without the 10 mV standard deviation added to the threshold voltage, the circuit is shown to still function correctly and to not be sensitive to a standard deviation variation in threshold voltage. 6. Conclusion A four-transistor-per-stage serial-to-parallel dynamic shift register has been designed and simulated with HSPICE using a 2.5 V, 0.25 µm CMOS process. It has been shown to drive a fan out of 128 with a very low power dissipation. The circuit has also been simulated with varying temperature, process variation and also a Monte Carlo analysis to test for the robustness of the design to manufacturing processes and applications. From the results presented, the shift register operates as designed and is robust to temperature and process parameter variation. Although the proposed dynamic shift register works with a low clock speed of 100 MHz, the performance requirement of the shift register is low power and small area. These requirements can be traded to increase the register speed. The present clock speed is suitable for application in the addressing and readout of smart sensor arrays. 573 L Lee et al Acknowledgments Funding from the ARC and Sir Ross and Sir Keith Smith Fund is gratefully acknowledged. References [1] Mead C and Conway L 1980 Introduction to VLSI Systems 1st edn (Reading, MA: Addison-Wesley) [2] Celinski P, López J F, Al-Sarawi S and Abbott D 2001 Low power, high speed, charge recycling CMOS threshold logic gate Electron. Lett. 37 1067–77 [3] Moini A, Bouzerdoum A, Eshraghian K, Yakovleff A, Nguyen X T, Blanksby A, Beare R, Abbott D and Bogner R E 1997 An insect vision-based motion detection chip IEEE J. Solid-State Circuits 32 279–84 [4] Ng J and Abbott D 2002 Introduction to solid-state quantum computation for engineers Microelectron. J. 33 171–7 574 [5] Lu S 1988 A safe single-phase clocking scheme for CMOS circuits IEEE J. Solid-State Circuits 23 280–3 [6] Ohba S, Nakai M, Ando H, Hanamura S, Satoh K, Takahashi K, Kubo M, Shimada S and Fujita T 1980 MOS area sensor II low-noise MOS area sensor with antiblooming photodiodes IEEE Trans. Electron Devices 27 1682–7 [7] Sklavos N, Kitsos P, Zervas N and Koufopavlou O 2000 A new low and high speed bidirection shift register architecture Technical Report University of Patras, Patras, Greece [8] Weste N H E and Eshraghian K 1993 Principles of CMOS VLSI Design, A System Perspective 2nd edn (Reading, MA: Addison-Wesley) [9] Rabaey J M 1996 Digital Integrated Circuits, A Design Perspective 1st edn (Upper Saddle River, NJ: Prentice-Hall) [10] Shibata N, Watanabe M and Tanabe Y 2002 A current-sensed high-speed and low-power first-in-first-out using a wordline/bitline-swapped dual-port SRAM cell IEEE J. Solid-State Circuits 37 735–50