# Variation-aware Subthreshold Logic Circuit Design

Hiroshi Fuketa<sup>1\*</sup>, Ryo Takahashi<sup>1</sup>, Makoto Takamiya<sup>1</sup>, Masahiro Nomura<sup>2</sup>, Hirofumi Shinohara<sup>2</sup>, and Takayasu Sakurai<sup>1</sup>

<sup>1</sup> Institute of Industrial Science, University of Tokyo, Japan
<sup>2</sup> Semiconductor Technology Academic Research Center (STARC), Japan
\* Email: fuketa@iis.u-tokyo.ac.jp

## Abstract

Subthreshold logic circuits are one of promising solutions to achieve ultra-low power operation. However, subthreshold circuits are significantly sensitive to manufacturing and environmental variability. In this paper, we will discuss design challenges in subthreshold logic circuits, such as rise in a minimum operating voltage, signal integrity degradation, and large delay variations.

#### 1. Introduction

Subthreshold logic circuits, which operate at lower supply voltage than threshold voltage (V<sub>TH</sub>), are one of promising solutions to achieve ultra-low power operation [1]. Fig. 1 shows simulated delay, power, and energy (power delay product) of a CMOS logic circuit [2]. As the supply voltage (V<sub>DD</sub>) is reduced, the delay increases, whereas the power significantly decreases. Therefore, subthreshold logic circuits are suitable for low-power applications in which circuit speed is not a primary concern or can be compensated by parallelization. For such applications, energy is a key metric. As shown in Fig. 1, the energy has its optimum value (E<sub>opt</sub>) at V<sub>DD</sub>=V<sub>opt</sub>. Thus, V<sub>opt</sub> is the target supply voltage for ultra-low-power applications. V<sub>opt</sub> is around 0.3V for typical logic circuits [2].

Subthreshold circuits are, however, extremely sensitive to manufacturing and environmental (e.g. temperature) variability, because I-V characteristics of MOSFET's in the subthreshold region have an exponential dependence on  $V_{TH}$ . In this paper, the following three challenges in subthreshold circuit design induced by such variability are discussed; 1) rise in a minimum operating voltage  $(V_{DDmin})$ , 2) signal integrity degradation due to crosstalk noise, and 3) large delay variations.

# 2. Minimum operating voltage (V<sub>DDmin</sub>)

Reducing  $V_{DD}$  could cause functional errors. The supply voltage at which the first functional error occurs is defined as a minimum operating voltage ( $V_{DDmin}$ ). The



Fig. 1. Simulated dependence of power, delay, and energy of CMOS logic circuit.



Fig. 2. Closed-form expression of  $V_{DDmin}$  of logic gates when pMOS and nMOS are not perfectly balanced.

minimum operating voltage ( $V_{DDmin}$ ) is defined as the supply voltage where the first functional error occurs. If  $V_{DDmin}$  is higher than  $V_{opt}$ , the minimum-energy operation cannot be attained. Therefore, investigations on  $V_{DDmin}$  are essential for subthreshold circuits.

Fig. 2 shows a closed-form expression for estimating  $V_{DDmin}$  of logic gates [3].  $V_{DDmin}$  can be expressed as a linear function of square-root of logarithm of the number of logic gates. This means that  $V_{DDmin}$  rises as the number of logic gates increases. Fig. 3 illustrates  $V_{DDmin}$ 's of an inverter chain with various configurations of the within-die  $V_{TH}$  variation obtained by Monte Carlo SPICE simulations and the closed-form expression in a



Fig. 3. Simulated  $V_{DDmin}$ 's of inverter chain in 65nm process.  $\sigma_{pn}$ 's are normalized by nominal value of this process.

65nm CMOS process. V<sub>DDmin</sub>'s obtained by the expression agree with the simulated results.

The slope of the expression of  $V_{DDmin}$  (Fig. 2) is proportional to the standard deviation of the within-die variation in the  $V_{TH}$  difference between pMOS and nMOS transistors ( $\sigma_{pn}$ ). In contrast, the intercept of the expression depends on the balance between intrinsic strengths of pMOS and nMOS transistors ( $\beta$ ). This indicates that the slope depends on the within-die  $V_{TH}$ variation, while the intercept depends on the die-to-die  $V_{TH}$  variation. Although body-biasing can compensate the die-to-die variation, that is, can reduce the intercept, it has no effects on reducing the slope. Therefore, the  $V_{DDmin}$  increase due to the within-die  $V_{TH}$  variation could be a critical problem in large-scale circuits.

Fig. 4 shows simulated  $V_{DDmin}$ 's of the inverter and NAND chains.  $V_{DDmin}$ 's of logic gates, which contain the larger number of stacked/paralleled transistors, are much higher. This is because stacking and paralleling transistors contained in the NAND gate worsen the balance of the strength of pMOS and nMOS, which corresponds to the increase in parameter |b| in the closed-form expression (Fig. 2). Although  $V_{DDmin}$  can be reduced by adjusting the balance between the strength of pMOS and nMOS, it is usually impractical, since the gate sizes of either pMOS or nMOS transistors must be significantly enlarged for the adjustment, and hence it is not acceptable due to the area constraint [4]. Therefore, the logic gates with a lot of inputs should not be used in the design of subthreshold logic circuits.



Fig. 4. Simulated  $V_{DDmin}$ 's of inverter and 2-, 3-, 4-input NAND chains in 65nm process.



Fig. 5. Crosstalk-noise on 1.5-mm coupled wire lines discussed in this paper.

#### 3. Increase of crosstalk noise due to $V_{TH}$ imbalance

In subthreshold circuits, the dependence of crosstalk noise on manufacturing variability should be also taken into account. In this paper, crosstalk noise on a 1.5-mm coupled wire line shown in Fig. 5 is investigated. The approximation model of crosstalk noise in the subthreshold region has been proposed in [5, 6]. The peak voltage of crosstalk noise ( $V_{PEAK}$ ) in Fig. 5 in the

subthreshold region is approximated as

$$V_{PEAK,approx} = V_{DD} \left( \frac{r}{r+0.5} \cdot \frac{c}{c+0.5} \right)^{1.3},$$
 (1)

where  $c=C_C/C_W$  ( $C_W$  and  $C_C$  are the wire capacitance and the coupling capacitance, respectively.) and r is the equivalent resistance ratio. r is expressed as

$$r \propto e^{\frac{V_{TH,N} - |V_{TH,P}|}{nU_T}},$$
 (2)

for rising noise, and

$$r \propto e^{\frac{|V_{TH,P}| - V_{TH,N}}{nU_T}},$$
(3)

for falling noise, where  $V_{TH,P}$  and  $V_{TH,N}$  are  $V_{TH}$ 's of pMOS and nMOS, respectively. These equations indicate that the peak noise voltage is sensitive to the  $V_{TH}$  balance between pMOS and nMOS. Here, we focus on the  $V_{TH}$  imbalance induced by the die-to-die  $V_{TH}$  variation, since within-die  $V_{TH}$  variation can be reduced by employing wider MOSFET's.

Fig. 6 shows the simulated peak noise voltage under a typical process condition (TT) and the corner conditions (slow-nMOS/fast-pMOS (SF) and fast-nMOS/slowpMOS (FS)) for the rising noise and falling noise, respectively. V<sub>PEAK</sub>/V<sub>DD</sub> at 0.3V is 1.1 times larger than  $V_{PEAK}/V_{DD}$  at 1.1V for the rising noise under the SF corner condition and 1.4 times larger for the falling noise under the FS corner condition. The approximated peak noise (V<sub>PEAK,approx</sub>) calculated using (1) is also shown, and it agrees with the simulation results. Fig. 6 indicates that peak noise voltage in the subthreshold region is significantly sensitive to the  $V_{TH}$  balance, since peak noise voltage depends on the equivalent resistance ratio (r), which has an exponential dependence on the  $V_{TH}$ difference between nMOS and pMOS, as indicated in (2) and (3). Therefore, designers must consider the process corners, such as the SF and FS corner conditions, as the worst-case conditions for signal integrity in subthreshold circuits.

In [5, 6], the peak voltage of crosstalk noise is measured in a 40nm CMOS process, and the measurement results agrees with the simulation results.

#### 4. Gate delay variation in subthreshold region

In the subthreshold region, the gate delay is significantly



Fig. 6. Peak voltage of crosstalk noise on 1.5-mm coupled wire line with two aggressors and one victim at TT, FS, and SF process corners in 40nm CMOS process.

sensitive to V<sub>TH</sub>, because the subthreshold current has an exponential dependence on V<sub>TH</sub>. Therefore, the gate delay is fluctuated by environmental variability such as temperature. Fig. 7 shows the measured dependence of the average gate delay (= $\mu$ ) on temperature of NAND and NOR gates with different V<sub>DD</sub> [7]. As temperature is reduced from 80°C to -40°C, the average gate delay slightly decreases at V<sub>DD</sub>=1.1V, whereas it increases at V<sub>DD</sub>=0.3V by a factor of more than 10. The worst-case (=largest)  $\mu$  is observed at -40°C.

In addition, the gate delay variation  $(=\sigma/\mu)$  also depends on temperature. Fig. 8 shows the measured dependence of  $\sigma/\mu$  on temperature [7]. The worst-case (=largest)  $\sigma/\mu$ is also observed at -40°C. In the NAND gate at



Fig. 7. Measured dependence of average gated delay ( $\mu$ ) on temperature in 40nm CMOS process.

 $V_{DD}$ =0.3V,  $\sigma/\mu$  at -40°C is 1.8 times larger than that at 85°C. This indicates that the temperature dependence of  $\sigma/\mu$  as well as  $\mu$  must be taken into account for subthreshold logic circuits.

As shown in Figs 7 and 8, the gate delay variation in the subtreshold region is much larger than that in the nominal supply voltage region. This indicates that large timing margins are required in the worst case design. Since such large timing margins spoil energy-efficient operation achieved by the reduction of  $V_{DD}$ , an adaptive performance control, such as dynamic  $V_{DD}$  control and/or body-bias control, is indispensable for subtreshold circuits.

In order to attain the adaptive performance control, a performance monitor is required, and conventionally, replica circuits have been used for monitoring [8]. However, the critical path replica is inadequate for subthreshold circuits, because the delay mismatch between the replica and the actual critical path is remarkably large due to the within-die variation. In order to overcome this mismatch, in-situ techniques, such as Canary FF [9] and PEPD [10], which can predict the occurrence of timing errors, have been proposed. These techniques reduce the large timing margins, and consequently the energy-efficient operation can be achieved.

## 5. Conclusion

In this paper, design challenges in subthreshold logic circuits were discussed. Subthreshold circuits are



Fig. 8. Measured dependence of gated delay variation ( $\sigma/\mu$ ) on temperature in 40nm CMOS process.

sensitive to manufacturing and environmental variability. Due to  $V_{TH}$  variation,  $V_{DDmin}$  increases and signal integrity is degraded in subthreshold region. In addition, the gate delay is fluctuated by environmental variability such as temperature, which requires the large timing margins. Therefore, the adaptive performance control, which can reduce the timing margins, is effective for subthreshold logic circuits.

## Acknowledgments

This work was carried out as a part of the Extremely Low Power (ELP) project supported by the Ministry of Economy, Trade and Industry (METI) and the New Energy and Industrial Technology Development Organization (NEDO).

#### References

- [1] D. Bol, et al., ISSCC, pp.490-491 (2012).
- [2] H. Fuketa, et al., IEDM, pp. 559-562 (2011).
- [3] H. Fuketa, et al., DAC, pp. 984-989 (2011).
- [4] T. Yasufuku, et al., ISLPED, pp. 21-26 (2011).
- [5] H. Fuketa, et al., CICC, 20-3 (2012).
- [6] H. Fuketa, et al., JSSC, in press.
- [7] R. Takahashi, et al., TCAS-II, 12, pp. 918-921 (2012).
- [8] J. Kao, et al., JSSC, 11, pp. 1545-1554 (2002).
- [9] H. Fuketa, et al., TVLSI, 2, pp. 333-343 (2012).
- [10] K. Hirairi, et al., ISSCC, pp. 486-487 (2012).